Skip to main content

Showing 1–7 of 7 results for author: Nadif, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2412.14867  [pdf, other

    cs.CL

    Graph-Convolutional Networks: Named Entity Recognition and Large Language Model Embedding in Document Clustering

    Authors: Imed Keraghel, Mohamed Nadif

    Abstract: Recent advances in machine learning, particularly Large Language Models (LLMs) such as BERT and GPT, provide rich contextual embeddings that improve text representation. However, current document clustering approaches often ignore the deeper relationships between named entities (NEs) and the potential of LLM embeddings. This paper proposes a novel approach that integrates Named Entity Recognition… ▽ More

    Submitted 19 December, 2024; originally announced December 2024.

    Comments: 11 pages, 4 figures

  2. arXiv:2402.12890  [pdf, other

    cs.CL cs.LG

    More Discriminative Sentence Embeddings via Semantic Graph Smoothing

    Authors: Chakib Fettal, Lazhar Labiod, Mohamed Nadif

    Abstract: This paper explores an empirical approach to learn more discriminantive sentence representations in an unsupervised fashion. Leveraging semantic graph smoothing, we enhance sentence embeddings obtained from pretrained models to improve results for the text clustering and classification tasks. Our method, validated on eight benchmarks, demonstrates consistent improvements, showcasing the potential… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted in EACL 2024

  3. arXiv:2402.04794  [pdf, other

    cs.LG

    Scalable Multi-view Clustering via Explicit Kernel Features Maps

    Authors: Chakib Fettal, Lazhar Labiod, Mohamed Nadif

    Abstract: A growing awareness of multi-view learning as an important component in data science and machine learning is a consequence of the increasing prevalence of multiple views in real-world applications, especially in the context of networks. In this paper we introduce a new scalability framework for multi-view subspace clustering. An efficient optimization strategy is proposed, leveraging kernel featur… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  4. arXiv:2402.04732  [pdf, other

    cs.LG

    Graph Cuts with Arbitrary Size Constraints Through Optimal Transport

    Authors: Chakib Fettal, Lazhar Labiod, Mohamed Nadif

    Abstract: A common way of partitioning graphs is through minimum cuts. One drawback of classical minimum cut methods is that they tend to produce small groups, which is why more balanced variants such as normalized and ratio cuts have seen more success. However, we believe that with these variants, the balance constraints can be too restrictive for some applications like for clustering of imbalanced dataset… ▽ More

    Submitted 4 October, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: Published in Transactions on Machine Learning Research

  5. arXiv:2401.10825  [pdf, other

    cs.CL cs.LG

    Recent Advances in Named Entity Recognition: A Comprehensive Survey and Comparative Study

    Authors: Imed Keraghel, Stanislas Morbieu, Mohamed Nadif

    Abstract: Named Entity Recognition seeks to extract substrings within a text that name real-world objects and to determine their type (for example, whether they refer to persons or organizations). In this survey, we first present an overview of recent popular approaches, including advancements in Transformer-based methods and Large Language Models (LLMs) that have not had much coverage in other surveys. In… ▽ More

    Submitted 20 December, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: 42 pages

    MSC Class: 68T50; 68Q32

  6. arXiv:1901.02291  [pdf, other

    cs.LG stat.ML

    Spectral Clustering via Ensemble Deep Autoencoder Learning (SC-EDAE)

    Authors: Severine Affeldt, Lazhar Labiod, Mohamed Nadif

    Abstract: Recently, a number of works have studied clustering strategies that combine classical clustering algorithms and deep learning methods. These approaches follow either a sequential way, where a deep representation is learned using a deep autoencoder before obtaining clusters with k-means, or a simultaneous way, where deep representation and clusters are learned jointly by optimizing a single objecti… ▽ More

    Submitted 12 June, 2019; v1 submitted 8 January, 2019; originally announced January 2019.

    Comments: Revised manuscript

  7. arXiv:1305.6451  [pdf, other

    cs.SI physics.soc-ph

    Data Leak Aware Crowdsourcing in Social Network

    Authors: Iheb Ben Amor, Athman Bougetteya, Mourad Ouziri, Salima Benbernou, Mohamed Nadif

    Abstract: Harnessing human computation for solving complex problems call spawns the issue of finding the unknown competitive group of solvers. In this paper, we propose an approach called Friendlysourcing to build up teams from social network answering a business call, all the while avoiding partial solution disclosure to competitive groups. The contributions of this paper include (i) a clustering based app… ▽ More

    Submitted 28 May, 2013; originally announced May 2013.

    Journal ref: Springer 7652 (2012) pp 226-236