Showing 1–2 of 2 results for author: Mokhtar, H M O
-
Ontology Based Document Clustering Using MapReduce
Authors:
Abdelrahman Elsayed,
Hoda M. O. Mokhtar,
Osama Ismail
Abstract:
Nowadays, document clustering is considered as a data intensive task due to the dramatic, fast increase in the number of available documents. Nevertheless, the features that represent those documents are also too large. The most common method for representing documents is the vector space model, which represents document features as a bag of words and does not represent semantic relations between…
▽ More
Nowadays, document clustering is considered as a data intensive task due to the dramatic, fast increase in the number of available documents. Nevertheless, the features that represent those documents are also too large. The most common method for representing documents is the vector space model, which represents document features as a bag of words and does not represent semantic relations between words. In this paper we introduce a distributed implementation for the bisecting k-means using MapReduce programming model. The aim behind our proposed implementation is to solve the problem of clustering intensive data documents. In addition, we propose integrating the WordNet ontology with bisecting k-means in order to utilize the semantic relations between words to enhance document clustering results. Our presented experimental results show that using lexical categories for nouns only enhances internal evaluation measures of document clustering; and decreases the documents features from thousands to tens features. Our experiments were conducted using Amazon Elastic MapReduce to deploy the Bisecting k-means algorithm.
△ Less
Submitted 12 May, 2015;
originally announced May 2015.
-
Spatio-Temporal Queries for moving objects Data warehousing
Authors:
Leila Esheiba,
Hoda M. O. Mokhtar,
Mohamed El-Sharkawi
Abstract:
In the last decade, Moving Object Databases (MODs) have attracted a lot of attention from researchers. Several research works were conducted to extend traditional database techniques to accommodate the new requirements imposed by the continuous change in location information of moving objects. Managing, querying, storing, and mining moving objects were the key research directions. This extensive i…
▽ More
In the last decade, Moving Object Databases (MODs) have attracted a lot of attention from researchers. Several research works were conducted to extend traditional database techniques to accommodate the new requirements imposed by the continuous change in location information of moving objects. Managing, querying, storing, and mining moving objects were the key research directions. This extensive interest in moving objects is a natural consequence of the recent ubiquitous location-aware devices, such as PDAs, mobile phones, etc., as well as the variety of information that can be extracted from such new databases. In this paper we propose a Spatio-Temporal data warehousing (STDW) for efficiently querying location information of moving objects. The proposed schema introduces new measures like direction majority and other direction-based measures that enhance the decision making based on location information.
△ Less
Submitted 11 July, 2013;
originally announced July 2013.