-
Open Research Issues and Tools for Visualization and Big Data Analytics
Abstract: The new age of digital growth has marked all fields. This technological evolution has impacted data flows which have witnessed a rapid expansion over the last decade that makes the data traditional processing unable to catch up with the rapid flow of massive data. In this context, the implementation of a big data analytics system becomes crucial to make big data more relevant and valuable. Therefo… ▽ More
Submitted 18 April, 2024; originally announced April 2024.
Comments: 28 pages, 4 figures
Journal ref: International Journal of Computing and Digital Systems, 15, 1103-1117, 2024
-
Towards Big Data Modeling and Management Systems: From DBMS to BDMS
Abstract: To succeed in a Big Data strategy, you have to arm yourself with a wide range of data skills and best practices. This strategy can result in an impressive asset that can streamline operational costs, reduce time to market, and enable the creation of new products. However, several Big Data challenges may take place in enterprises when it comes to moving initiatives of boardroom discussions to effec… ▽ More
Submitted 15 September, 2023; originally announced September 2023.
Comments: 6 pages, 9 Figures
Journal ref: 2023 IEEE International Conference on Advanced Systems and Emergent Technologies (IC_ASET)
-
Dimensionality reduction with missing values imputation
Abstract: In this study, we propose a new statical approach for high-dimensionality reduction of heterogenous data that limits the curse of dimensionality and deals with missing values. To handle these latter, we propose to use the Random Forest imputation's method. The main purpose here is to extract useful information and so reducing the search space to facilitate the data exploration process. Several ill… ▽ More
Submitted 2 July, 2017; originally announced July 2017.
Comments: 6 pages, 2 figures, The first Computer science University of Tunis El Manar, PhD Symposium (CUPS'17), Tunisia, May 22-25, 2017
-
Classification non supervisée des données hétérogènes à large échelle
Abstract: When it comes to cluster massive data, response time, disk access and quality of formed classes becoming major issues for companies. It is in this context that we have come to define a clustering framework for large scale heterogeneous data that contributes to the resolution of these issues. The proposed framework is based on, firstly, the descriptive analysis based on MCA, and secondly, the MapRe… ▽ More
Submitted 2 July, 2017; originally announced July 2017.
Comments: 6 pages, in French, 8 figures
Journal ref: Conférence Internationale Francophone sur la Science de Données - Les 23èmes Rencontres annuelles de la Société Francophone de Classification (AAFD & SFC), Marrakech, Maroc, pp. 37-42, 2016
-
Mining Semi-structured Data
Abstract: The need for discovering knowledge from XML documents according to both structure and content features has become challenging, due to the increase in application contexts for which handling both structure and content information in XML data is essential. So, the challenge is to find an hierarchical structure which ensure a combination of data levels and their representative structures. In this wor… ▽ More
Submitted 15 April, 2015; originally announced April 2015.
Journal ref: The 5th International Conference on Web and Information Technologies (ICWIT), pp. 51-60, 2013
-
arXiv:1312.1860 [pdf, ps, other]
Flexible queries in XML native databases
Abstract: To date, most of the XML native databases (DB) flexible querying systems are based on exploiting the tree structure of their semi structured data (SSD). However, it becomes important to test the efficiency of Formal Concept Analysis (FCA) formalism for this type of data since it has been proved a great performance in the field of information retrieval (IR). So, the IR in XML databases based on FCA… ▽ More
Submitted 6 December, 2013; originally announced December 2013.
Comments: 5 Pages, 1 Figure
Journal ref: International Conference on Control, Engineering & Information Technology (CEIT), Proceedings Engineering & Technology, Vol. 4, pp. 100-104, 2013