Skip to main content

Showing 1–20 of 20 results for author: Aouiche, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:1701.08029  [pdf

    cs.DB

    Index and Materialized View Selection in Data Warehouses

    Authors: Kamel Aouiche, Jérôme Darmont

    Abstract: The aim of this article is to present an overview of the major families of state-of-the-art index and materialized view selection methods, and to discuss the issues and future trends in data warehouse performance optimization. We particularly focus on data mining-based heuristics we developed to reduce the selection problem complexity and target the most pertinent candidate indexes and materialize… ▽ More

    Submitted 27 January, 2017; originally announced January 2017.

    Journal ref: Handbook of Research on Innovations in Database Technologies and Applications, II, pp.693-700, 2009

  2. Web 2.0 OLAP: From Data Cubes to Tag Clouds

    Authors: Kamel Aouiche, Daniel Lemire, Robert Godin

    Abstract: Increasingly, business projects are ephemeral. New Business Intelligence tools must support ad-lib data sources and quick perusal. Meanwhile, tag clouds are a popular community-driven visualization technique. Hence, we investigate tag-cloud views with support for OLAP operations such as roll-ups, slices, dices, clustering, and drill-downs. As a case study, we implemented an application where users… ▽ More

    Submitted 15 March, 2016; v1 submitted 16 May, 2009; originally announced May 2009.

    Comments: Software at https://github.com/lemire/OLAPTagCloud. arXiv admin note: substantial text overlap with arXiv:0710.2156

    Journal ref: Lecture Notes in Business Information Processing Vol. 18, pages 51-64, 2009

  3. Sorting improves word-aligned bitmap indexes

    Authors: Daniel Lemire, Owen Kaser, Kamel Aouiche

    Abstract: Bitmap indexes must be compressed to reduce input/output costs and minimize CPU usage. To accelerate logical operations (AND, OR, XOR) over bitmaps, we use techniques based on run-length encoding (RLE), such as Word-Aligned Hybrid (WAH) compression. These techniques are sensitive to the order of the rows: a simple lexicographical sort can divide the index size by 9 and make indexes several times f… ▽ More

    Submitted 29 July, 2016; v1 submitted 23 January, 2009; originally announced January 2009.

    Journal ref: Data & Knowledge Engineering, Volume 69, Issue 1, 2010, Pages 3-28

  4. arXiv:0809.2687  [pdf, ps, other

    cs.DB

    Frequent itemsets mining for database auto-administration

    Authors: Kamel Aouiche, Jérôme Darmont, Le Gruenwald

    Abstract: With the wide development of databases in general and data warehouses in particular, it is important to reduce the tasks that a database administrator must perform manually. The aim of auto-administrative systems is to administrate and adapt themselves automatically without loss (or even with a gain) in performance. The idea of using data mining techniques to extract useful knowledge for adminis… ▽ More

    Submitted 16 September, 2008; originally announced September 2008.

    Comments: in 7th International Database Engineering and Application Symposium (IDEAS 03), Hong-Kong : Chine (2003)

  5. arXiv:0809.1981  [pdf

    cs.DB

    A Join Index for XML Data Warehouses

    Authors: Hadj Mahboubi, Kamel Aouiche, Jérôme Darmont

    Abstract: XML data warehouses form an interesting basis for decision-support applications that exploit complex data. However, native-XML database management systems (DBMSs) currently bear limited performances and it is necessary to research for ways to optimize them. In this paper, we propose a new join index that is specifically adapted to the multidimensional architecture of XML warehouses. It eliminate… ▽ More

    Submitted 11 September, 2008; originally announced September 2008.

    Comments: 2008 International Conference on Information Resources Management (Conf-IRM 08), Niagra Falls : Canada (2008)

  6. arXiv:0809.1965  [pdf

    cs.DB

    Dynamic index selection in data warehouses

    Authors: Stéphane Azefack, Kamel Aouiche, Jérôme Darmont

    Abstract: Analytical queries defined on data warehouses are complex and use several join operations that are very costly, especially when run on very large data volumes. To improve response times, data warehouse administrators casually use indexing techniques. This task is nevertheless complex and fastidious. In this paper, we present an automatic, dynamic index selection method for data warehouses that i… ▽ More

    Submitted 11 September, 2008; originally announced September 2008.

    Journal ref: 4th International Conference on Innovations in Information Technology (Innovations 07), Dubai : Émirats arabes unis (2006)

  7. arXiv:0809.1963  [pdf

    cs.DB

    Materialized View Selection by Query Clustering in XML Data Warehouses

    Authors: Hadj Mahboubi, Kamel Aouiche, Jérôme Darmont

    Abstract: XML data warehouses form an interesting basis for decision-support applications that exploit complex data. However, native XML database management systems currently bear limited performances and it is necessary to design strategies to optimize them. In this paper, we propose an automatic strategy for the selection of XML materialized views that exploits a data mining technique, more precisely th… ▽ More

    Submitted 11 September, 2008; originally announced September 2008.

    Journal ref: 4th International Multiconference on Computer Science and Information Technology (CSIT 06), Amman : Jordanie (2006)

  8. arXiv:0808.2083  [pdf, other

    cs.DB

    Histogram-Aware Sorting for Enhanced Word-Aligned Compression in Bitmap Indexes

    Authors: Owen Kaser, Daniel Lemire, Kamel Aouiche

    Abstract: Bitmap indexes must be compressed to reduce input/output costs and minimize CPU usage. To accelerate logical operations (AND, OR, XOR) over bitmaps, we use techniques based on run-length encoding (RLE), such as Word-Aligned Hybrid (WAH) compression. These techniques are sensitive to the order of the rows: a simple lexicographical sort can divide the index size by 9 and make indexes several times… ▽ More

    Submitted 19 January, 2009; v1 submitted 14 August, 2008; originally announced August 2008.

    Comments: To appear in proceedings of DOLAP 2008

    ACM Class: H.3.2; E.1

  9. arXiv:0805.3339  [pdf, other

    cs.DB

    Tri de la table de faits et compression des index bitmaps avec alignement sur les mots

    Authors: Kamel Aouiche, Daniel Lemire, Owen Kaser

    Abstract: Bitmap indexes are frequently used to index multidimensional data. They rely mostly on sequential input/output. Bitmaps can be compressed to reduce input/output costs and minimize CPU usage. The most efficient compression techniques are based on run-length encoding (RLE), such as Word-Aligned Hybrid (WAH) compression. This type of compression accelerates logical operations (AND, OR) over the bit… ▽ More

    Submitted 14 August, 2008; v1 submitted 21 May, 2008; originally announced May 2008.

    Comments: to appear at BDA'08

  10. arXiv:0710.2156  [pdf, other

    cs.DB

    Collaborative OLAP with Tag Clouds: Web 2.0 OLAP Formalism and Experimental Evaluation

    Authors: Kamel Aouiche, Daniel Lemire, Robert Godin

    Abstract: Increasingly, business projects are ephemeral. New Business Intelligence tools must support ad-lib data sources and quick perusal. Meanwhile, tag clouds are a popular community-driven visualization technique. Hence, we investigate tag-cloud views with support for OLAP operations such as roll-ups, slices, dices, clustering, and drill-downs. As a case study, we implemented an application where users… ▽ More

    Submitted 15 March, 2016; v1 submitted 11 October, 2007; originally announced October 2007.

    Comments: Software at https://github.com/lemire/OLAPTagCloud

  11. arXiv:0707.1548  [pdf, other

    cs.DB

    Data Mining-based Materialized View and Index Selection in Data Warehouses

    Authors: Kamel Aouiche, Jérôme Darmont

    Abstract: Materialized views and indexes are physical structures for accelerating data access that are casually used in data warehouses. However, these data structures generate some maintenance overhead. They also share the same storage space. Most existing studies about materialized view and index selection consider these structures separately. In this paper, we adopt the opposite stance and couple mater… ▽ More

    Submitted 10 July, 2007; originally announced July 2007.

  12. arXiv:0707.1534  [pdf, other

    cs.DB

    An Architecture Framework for Complex Data Warehouses

    Authors: Jérôme Darmont, Omar Boussaid, Jean-Christian Ralaivao, Kamel Aouiche

    Abstract: Nowadays, many decision support applications need to exploit data that are not only numerical or symbolic, but also multimedia, multistructure, multisource, multimodal, and/or multiversion. We term such data complex data. Managing and analyzing complex data involves a lot of different issues regarding their structure, storage and processing, and metadata are a key element in all these processes.… ▽ More

    Submitted 10 July, 2007; originally announced July 2007.

  13. arXiv:0707.1306  [pdf, other

    cs.DB

    Sélection simultanée d'index et de vues matérialisées

    Authors: Nora Maiz, Kamel Aouiche, Jérôme Darmont

    Abstract: Indices and materialized views are physical structures that accelerate data access in data warehouses. However, these data structures generate some maintenance overhead. They also share the same storage space. The existing studies about index and materialized view selection consider these structures separately. In this paper, we adopt the opposite stance and couple index and materialized view se… ▽ More

    Submitted 9 July, 2007; originally announced July 2007.

  14. arXiv:0707.1304  [pdf, other

    cs.DB

    Un index de jointure pour les entrepôts de données XML

    Authors: Hadj Mahboubi, Kamel Aouiche, Jérôme Darmont

    Abstract: XML data warehouses form an interesting basis for decision-support applications that exploit heterogeneous data from multiple sources. However, XML-native database systems currently bear limited performances and it is necessary to research ways to optimize them. In this paper, we propose a new index that is specifically adapted to the multidimensional architecture of XML warehouses and eliminate… ▽ More

    Submitted 9 July, 2007; originally announced July 2007.

  15. arXiv:0707.1288  [pdf, other

    cs.DB

    Espaces de représentation multidimensionnels dédiés à la visualisation

    Authors: Riadh Ben Messaoud, Kamel Aouiche, Cécile Favre

    Abstract: In decision-support systems, the visual component is important for On Line Analysis Processing (OLAP). In this paper, we propose a new approach that faces the visualization problem due to data sparsity. We use the results of a Multiple Correspondence Analysis (MCA) to reduce the negative effect of sparsity by organizing differently data cube cells. Our approach does not reduce sparsity, however… ▽ More

    Submitted 9 July, 2007; originally announced July 2007.

  16. arXiv:0704.3520  [pdf, ps, other

    cs.DB

    Vers l'auto-administration des entrepôts de données

    Authors: Kamel Aouiche, Jérôme Darmont

    Abstract: With the wide development of databases in general and data warehouses in particular, it is important to reduce the tasks that a database administrator must perform manually. The idea of using data mining techniques to extract useful knowledge for administration from the data themselves has existed for some years. However, little research has been achieved. The aim of this study is to search for… ▽ More

    Submitted 26 April, 2007; originally announced April 2007.

    Comments: Version courte de 4 pages

    Journal ref: XXXVèmes Journées de Statistique, Session spéciale Entreposage et Fouille de Données, Lyon (02/06/2003) 105-108

  17. arXiv:cs/0703114  [pdf, ps, other

    cs.DB

    Clustering-Based Materialized View Selection in Data Warehouses

    Authors: Kamel Aouiche, Pierre-Emmanuel Jouve, Jerome Darmont

    Abstract: Materialized view selection is a non-trivial task. Hence, its complexity must be reduced. A judicious choice of views must be cost-driven and influenced by the workload experienced by the system. In this paper, we propose a framework for materialized view selection that exploits a data mining technique (clustering), in order to determine clusters of similar queries. We also propose a view mergin… ▽ More

    Submitted 23 March, 2007; originally announced March 2007.

  18. arXiv:cs/0703113  [pdf, ps, other

    cs.DB

    Automatic Selection of Bitmap Join Indexes in Data Warehouses

    Authors: Kamel Aouiche, Jerome Darmont, Omar Boussaid, Fadila Bentayeb

    Abstract: The queries defined on data warehouses are complex and use several join operations that induce an expensive computational cost. This cost becomes even more prohibitive when queries access very large volumes of data. To improve response time, data warehouse administrators generally use indexing techniques such as star join indexes or bitmap join indexes. This task is nevertheless complex and fast… ▽ More

    Submitted 23 March, 2007; originally announced March 2007.

  19. arXiv:cs/0703058  [pdf, other

    cs.DB cs.PF

    A Comparison of Five Probabilistic View-Size Estimation Techniques in OLAP

    Authors: Kamel Aouiche, Daniel Lemire

    Abstract: A data warehouse cannot materialize all possible views, hence we must estimate quickly, accurately, and reliably the size of views to determine the best candidates for materialization. Many available techniques for view-size estimation make particular statistical assumptions and their error can be large. Comparatively, unassuming probabilistic techniques are slower, but they estimate accurately… ▽ More

    Submitted 8 December, 2008; v1 submitted 13 March, 2007; originally announced March 2007.

  20. arXiv:cs/0703056  [pdf, other

    cs.DB cs.PF

    Unasssuming View-Size Estimation Techniques in OLAP

    Authors: Kamel Aouiche, Daniel Lemire

    Abstract: Even if storage was infinite, a data warehouse could not materialize all possible views due to the running time and update requirements. Therefore, it is necessary to estimate quickly, accurately, and reliably the size of views. Many available techniques make particular statistical assumptions and their error can be quite large. Unassuming techniques exist, but typically assume we have independe… ▽ More

    Submitted 8 December, 2008; v1 submitted 12 March, 2007; originally announced March 2007.

    Comments: Published in ICEIS 2007