-
Experimental Performance Evaluation of Cloud-Based Analytics-as-a-Service
Authors:
Francesco Pace,
Marco Milanesio,
Daniele Venzano,
Damiano Carra,
Pietro Michiardi
Abstract:
An increasing number of Analytics-as-a-Service solutions have recently emerged in the landscape of cloud-based services. These services allow flexible composition of compute and storage components to create powerful data ingestion and processing pipelines. This work is a first attempt at an experimental evaluation of the performance of analytic applications executed using a wide range of storage service configurations. We present an intuitive notion of data locality, which we use as a proxy to rank different service compositions in terms of expected performance. Through an empirical analysis, we dissect the performance achieved by analytic workloads and unveil problems due to the impedance mismatch that arises in some configurations. Our work paves the way to a better understanding of modern cloud-based analytic services and their performance, for both their end-users and their providers.
Submitted 15 March, 2017; v1 submitted 25 February, 2016;
originally announced February 2016.
-
Tagging with DHARMA, a DHT-based Approach for Resource Mapping through Approximation
Authors:
Luca Maria Aiello,
Marco Milanesio,
Giancarlo Ruffo,
Rossano Schifanella
Abstract:
We introduce collaborative tagging and faceted search on structured P2P systems. Since a trivial, brute-force mapping of an entire folksonomy onto a DHT-based system may reduce scalability, we propose an approximated graph maintenance approach. Evaluations on real data from Last.fm show that such strategies reduce vocabulary noise (i.e., overfitting of the representation) and hotspot issues.
Submitted 19 January, 2011;
originally announced January 2011.
-
Collaborative Filtering without Explicit Feedbacks for Digital Recorders
Authors:
Alessandro Basso,
Marco Milanesio,
André Panisson,
Giancarlo Ruffo
Abstract:
Recommendation is usually reduced to a prediction problem over the function $r(u_a, e_i)$, which returns the expected rating of element $e_i$ for user $u_a$. In the IPTV domain, we deal with an environment where the definitions of all the parameters involved in this function (i.e., user profiles, feedback ratings and elements) are controversial. To our knowledge, this paper represents the first attempt to run collaborative filtering algorithms without inner assumptions: we start our analysis from an unstructured set of recordings and perform a data pre-processing phase to extract useful information. We then experiment with a real Digital Video Recorder system in which no EPG was provided to the user for selecting event timings and no explicit feedback was collected.
Submitted 17 January, 2011;
originally announced January 2011.