-
Scheduling Data Intensive Workloads through Virtualization on MapReduce based Clouds
Abstract: MapReduce has become a popular programming model for running data-intensive applications on the cloud. Completion-time goals or deadlines set by users for MapReduce jobs are becoming crucial in existing cloud-based data processing environments like Hadoop. There is a conflict between scheduling MR jobs to meet deadlines and "data locality" (assigning tasks to nodes that contain their input data…
Submitted 9 August, 2012; originally announced August 2012.
Journal ref: International Journal of Distributed and Parallel Systems (IJDPS), Vol. 3, No. 4, Pages 99-110, July 2012
-
Performance Issues of Heterogeneous Hadoop Clusters in Cloud Computing
Abstract: Nowadays most cloud applications process large amounts of data to provide the desired results. Data volumes to be processed by cloud applications are growing much faster than computing power. This growth demands new strategies for processing and analyzing information. Dealing with large data volumes requires two things: 1) Inexpensive, reliable storage 2) New tools for analyzing unstructured…
Submitted 4 July, 2012; originally announced July 2012.
Comments: 6 Pages
Journal ref: Global Journal of Computer Science and Technology, Volume XI Issue VIII May 2011
-
Survey on Improved Scheduling in Hadoop MapReduce in Cloud Environments
Abstract: Cloud Computing is emerging as a new computational paradigm shift. Hadoop-MapReduce has become a powerful Computation Model for processing large data on distributed commodity hardware clusters such as Clouds. In all Hadoop implementations, the default FIFO scheduler is available where jobs are scheduled in FIFO order with support for other priority based schedulers also. In this paper we study var…
Submitted 3 July, 2012; originally announced July 2012.
Comments: 5 Pages, 2 figures; International Journal of Computer Applications, November 2011
-
Dimensionality Reduction: An Empirical Study on the Usability of IFE-CF (Independent Feature Elimination- by C-Correlation and F-Correlation) Measures
Abstract: The recent increase in the dimensionality of data has posed a great challenge to existing dimensionality reduction methods in terms of their effectiveness. Dimensionality reduction has emerged as one of the significant preprocessing steps in machine learning applications and has been effective in removing inappropriate data, increasing learning accuracy, and improving comprehensibility. Feature…
Submitted 5 February, 2010; originally announced February 2010.
Comments: International Journal of Computer Science Issues, IJCSI, Vol. 7, Issue 1, No. 1, January 2010, http://ijcsi.org
Journal ref: International Journal of Computer Science Issues, IJCSI, Vol. 7, Issue 1, No. 1, January 2010, http://ijcsi.org/articles/Dimensionality-Reduction-An-Empirical-Study-on-the-Usability-of-IFE-CF-(Independent-Feature-Elimination-by-C-Correlation-and-F-Correlation)-Measures.php