-
Seagull: An Infrastructure for Load Prediction and Optimized Resource Allocation
Authors:
Olga Poppe,
Tayo Amuneke,
Dalitso Banda,
Aritra De,
Ari Green,
Manon Knoertzer,
Ehi Nosakhare,
Karthik Rajendran,
Deepak Shankargouda,
Meina Wang,
Alan Au,
Carlo Curino,
Qun Guo,
Alekh Jindal,
Ajay Kalhan,
Morgan Oslake,
Sonia Parchani,
Vijay Ramani,
Raj Sellappan,
Saikat Sen,
Sheetal Shrotri,
Soundararajan Srinivasan,
Ping Xia,
Shize Xu,
Alicia Yang
, et al. (1 additional authors not shown)
Abstract:
Microsoft Azure is dedicated to guarantee high quality of service to its customers, in particular, during periods of high customer activity, while controlling cost. We employ a Data Science (DS) driven solution to predict user load and leverage these predictions to optimize resource allocation. To this end, we built the Seagull infrastructure that processes per-server telemetry, validates the data…
▽ More
Microsoft Azure is dedicated to guarantee high quality of service to its customers, in particular, during periods of high customer activity, while controlling cost. We employ a Data Science (DS) driven solution to predict user load and leverage these predictions to optimize resource allocation. To this end, we built the Seagull infrastructure that processes per-server telemetry, validates the data, trains and deploys ML models. The models are used to predict customer load per server (24h into the future), and optimize service operations. Seagull continually re-evaluates accuracy of predictions, fallback to previously known good models and triggers alerts as appropriate. We deployed this infrastructure in production for PostgreSQL and MySQL servers across all Azure regions, and applied it to the problem of scheduling server backups during low-load time. This minimizes interference with user-induced load and improves customer experience.
△ Less
Submitted 16 October, 2020; v1 submitted 27 September, 2020;
originally announced September 2020.
-
Examination and Extension of Strategies for Improving Personalized Language Modeling via Interpolation
Authors:
Liqun Shao,
Sahitya Mantravadi,
Tom Manzini,
Alejandro Buendia,
Manon Knoertzer,
Soundar Srinivasan,
Chris Quirk
Abstract:
In this paper, we detail novel strategies for interpolating personalized language models and methods to handle out-of-vocabulary (OOV) tokens to improve personalized language models. Using publicly available data from Reddit, we demonstrate improvements in offline metrics at the user level by interpolating a global LSTM-based authoring model with a user-personalized n-gram model. By optimizing thi…
▽ More
In this paper, we detail novel strategies for interpolating personalized language models and methods to handle out-of-vocabulary (OOV) tokens to improve personalized language models. Using publicly available data from Reddit, we demonstrate improvements in offline metrics at the user level by interpolating a global LSTM-based authoring model with a user-personalized n-gram model. By optimizing this approach with a back-off to uniform OOV penalty and the interpolation coefficient, we observe that over 80% of users receive a lift in perplexity, with an average of 5.2% in perplexity lift per user. In doing this research we extend previous work in building NLIs and improve the robustness of metrics for downstream tasks.
△ Less
Submitted 9 June, 2020;
originally announced June 2020.
-
MMLSpark: Unifying Machine Learning Ecosystems at Massive Scales
Authors:
Mark Hamilton,
Sudarshan Raghunathan,
Ilya Matiach,
Andrew Schonhoffer,
Anand Raman,
Eli Barzilay,
Karthik Rajendran,
Dalitso Banda,
Casey Jisoo Hong,
Manon Knoertzer,
Ben Brodsky,
Minsoo Thigpen,
Janhavi Suresh Mahajan,
Courtney Cochrane,
Abhiram Eswaran,
Ari Green
Abstract:
We introduce Microsoft Machine Learning for Apache Spark (MMLSpark), an ecosystem of enhancements that expand the Apache Spark distributed computing library to tackle problems in Deep Learning, Micro-Service Orchestration, Gradient Boosting, Model Interpretability, and other areas of modern computation. Furthermore, we present a novel system called Spark Serving that allows users to run any Apache…
▽ More
We introduce Microsoft Machine Learning for Apache Spark (MMLSpark), an ecosystem of enhancements that expand the Apache Spark distributed computing library to tackle problems in Deep Learning, Micro-Service Orchestration, Gradient Boosting, Model Interpretability, and other areas of modern computation. Furthermore, we present a novel system called Spark Serving that allows users to run any Apache Spark program as a distributed, sub-millisecond latency web service backed by their existing Spark Cluster. All MMLSpark contributions have the same API to enable simple composition across frameworks and usage across batch, streaming, and RESTful web serving scenarios on static, elastic, or serverless clusters. We showcase MMLSpark by creating a method for deep object detection capable of learning without human labeled data and demonstrate its effectiveness for Snow Leopard conservation.
△ Less
Submitted 21 June, 2019; v1 submitted 19 October, 2018;
originally announced October 2018.