Skip to main content

Showing 1–3 of 3 results for author: Knoertzer, M

.
  1. arXiv:2009.12922  [pdf, other

    cs.DC cs.DB cs.LG cs.PF

    Seagull: An Infrastructure for Load Prediction and Optimized Resource Allocation

    Authors: Olga Poppe, Tayo Amuneke, Dalitso Banda, Aritra De, Ari Green, Manon Knoertzer, Ehi Nosakhare, Karthik Rajendran, Deepak Shankargouda, Meina Wang, Alan Au, Carlo Curino, Qun Guo, Alekh Jindal, Ajay Kalhan, Morgan Oslake, Sonia Parchani, Vijay Ramani, Raj Sellappan, Saikat Sen, Sheetal Shrotri, Soundararajan Srinivasan, Ping Xia, Shize Xu, Alicia Yang , et al. (1 additional authors not shown)

    Abstract: Microsoft Azure is dedicated to guarantee high quality of service to its customers, in particular, during periods of high customer activity, while controlling cost. We employ a Data Science (DS) driven solution to predict user load and leverage these predictions to optimize resource allocation. To this end, we built the Seagull infrastructure that processes per-server telemetry, validates the data… ▽ More

    Submitted 16 October, 2020; v1 submitted 27 September, 2020; originally announced September 2020.

    Comments: Technical report for the paper in VLDB 2021

  2. arXiv:2006.05469  [pdf, other

    cs.CL cs.LG

    Examination and Extension of Strategies for Improving Personalized Language Modeling via Interpolation

    Authors: Liqun Shao, Sahitya Mantravadi, Tom Manzini, Alejandro Buendia, Manon Knoertzer, Soundar Srinivasan, Chris Quirk

    Abstract: In this paper, we detail novel strategies for interpolating personalized language models and methods to handle out-of-vocabulary (OOV) tokens to improve personalized language models. Using publicly available data from Reddit, we demonstrate improvements in offline metrics at the user level by interpolating a global LSTM-based authoring model with a user-personalized n-gram model. By optimizing thi… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

    Comments: ACL Natural Language Interface Workshop 2020, short paper

  3. arXiv:1810.08744  [pdf, other

    cs.LG cs.AI cs.DC stat.ML

    MMLSpark: Unifying Machine Learning Ecosystems at Massive Scales

    Authors: Mark Hamilton, Sudarshan Raghunathan, Ilya Matiach, Andrew Schonhoffer, Anand Raman, Eli Barzilay, Karthik Rajendran, Dalitso Banda, Casey Jisoo Hong, Manon Knoertzer, Ben Brodsky, Minsoo Thigpen, Janhavi Suresh Mahajan, Courtney Cochrane, Abhiram Eswaran, Ari Green

    Abstract: We introduce Microsoft Machine Learning for Apache Spark (MMLSpark), an ecosystem of enhancements that expand the Apache Spark distributed computing library to tackle problems in Deep Learning, Micro-Service Orchestration, Gradient Boosting, Model Interpretability, and other areas of modern computation. Furthermore, we present a novel system called Spark Serving that allows users to run any Apache… ▽ More

    Submitted 21 June, 2019; v1 submitted 19 October, 2018; originally announced October 2018.