Skip to main content

Showing 1–11 of 11 results for author: Xypolopoulos, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.19494  [pdf, ps, other

    cs.CL cs.LG

    Graph Linearization Methods for Reasoning on Graphs with Large Language Models

    Authors: Christos Xypolopoulos, Guokan Shang, Xiao Fei, Giannis Nikolentzos, Hadi Abdine, Iakovos Evdaimon, Michail Chatzianastasis, Giorgos Stamou, Michalis Vazirgiannis

    Abstract: Large language models have evolved to process multiple modalities beyond text, such as images and audio, which motivates us to explore how to effectively leverage them for graph reasoning tasks. The key question, therefore, is how to transform graphs into linear sequences of tokens, a process we term "graph linearization", so that LLMs can handle graphs naturally. We consider that graphs should be… ▽ More

    Submitted 25 June, 2025; v1 submitted 25 October, 2024; originally announced October 2024.

  2. arXiv:2410.13517  [pdf, other

    cs.CL cs.AI

    Bias in the Mirror: Are LLMs opinions robust to their own adversarial attacks ?

    Authors: Virgile Rennard, Christos Xypolopoulos, Michalis Vazirgiannis

    Abstract: Large language models (LLMs) inherit biases from their training data and alignment processes, influencing their responses in subtle ways. While many studies have examined these biases, little work has explored their robustness during interactions. In this paper, we introduce a novel approach where two instances of an LLM engage in self-debate, arguing opposing viewpoints to persuade a neutral vers… ▽ More

    Submitted 5 November, 2024; v1 submitted 17 October, 2024; originally announced October 2024.

  3. arXiv:2403.01535  [pdf, other

    cs.LG cs.SI

    Neural Graph Generator: Feature-Conditioned Graph Generation using Latent Diffusion Models

    Authors: Iakovos Evdaimon, Giannis Nikolentzos, Christos Xypolopoulos, Ahmed Kammoun, Michail Chatzianastasis, Hadi Abdine, Michalis Vazirgiannis

    Abstract: Graph generation has emerged as a crucial task in machine learning, with significant challenges in generating graphs that accurately reflect specific properties. Existing methods often fall short in efficiently addressing this need as they struggle with the high-dimensional complexity and varied nature of graph properties. In this paper, we introduce the Neural Graph Generator (NGG), a novel appro… ▽ More

    Submitted 18 September, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

  4. arXiv:2304.00869  [pdf, other

    cs.CL

    GreekBART: The First Pretrained Greek Sequence-to-Sequence Model

    Authors: Iakovos Evdaimon, Hadi Abdine, Christos Xypolopoulos, Stamatis Outsios, Michalis Vazirgiannis, Giorgos Stamou

    Abstract: The era of transfer learning has revolutionized the fields of Computer Vision and Natural Language Processing, bringing powerful pretrained models with exceptional performance across a variety of tasks. Specifically, Natural Language Processing tasks have been dominated by transformer-based language models. In Natural Language Inference and Natural Language Generation tasks, the BERT model and its… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  5. arXiv:2112.00566  [pdf, ps, other

    cs.CL

    NLP Research and Resources at DaSciM, Ecole Polytechnique

    Authors: Hadi Abdine, Yanzhu Guo, Moussa Kamal Eddine, Giannis Nikolentzos, Stamatis Outsios, Guokan Shang, Christos Xypolopoulos, Michalis Vazirgiannis

    Abstract: DaSciM (Data Science and Mining) part of LIX at Ecole Polytechnique, established in 2013 and since then producing research results in the area of large scale data analysis via methods of machine and deep learning. The group has been specifically active in the area of NLP and text mining with interesting results at methodological and resources level. Here follow our different contributions of inter… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

  6. arXiv:2109.10234  [pdf, other

    cs.CL

    BERTweetFR : Domain Adaptation of Pre-Trained Language Models for French Tweets

    Authors: Yanzhu Guo, Virgile Rennard, Christos Xypolopoulos, Michalis Vazirgiannis

    Abstract: We introduce BERTweetFR, the first large-scale pre-trained language model for French tweets. Our model is initialized using the general-domain French language model CamemBERT which follows the base architecture of RoBERTa. Experiments show that BERTweetFR outperforms all previous general-domain French language models on two downstream Twitter NLP tasks of offensiveness identification and named ent… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: Accepted at the Seventh Workshop on Noisy User-generated Text (W-NUT 2021)

  7. arXiv:2105.01990  [pdf, other

    cs.CL

    Evaluation Of Word Embeddings From Large-Scale French Web Content

    Authors: Hadi Abdine, Christos Xypolopoulos, Moussa Kamal Eddine, Michalis Vazirgiannis

    Abstract: Distributed word representations are popularly used in many tasks in natural language processing. Adding that pretrained word vectors on huge text corpus achieved high performance in many different NLP tasks. This paper introduces multiple high-quality word vectors for the French language where two of them are trained on massive crawled French data during this study and the others are trained on a… ▽ More

    Submitted 10 March, 2022; v1 submitted 5 May, 2021; originally announced May 2021.

  8. arXiv:2102.07836  [pdf, other

    cs.CL

    How COVID-19 Is Changing Our Language : Detecting Semantic Shift in Twitter Word Embeddings

    Authors: Yanzhu Guo, Christos Xypolopoulos, Michalis Vazirgiannis

    Abstract: Words are malleable objects, influenced by events that are reflected in written texts. Situated in the global outbreak of COVID-19, our research aims at detecting semantic shifts in social media language triggered by the health crisis. With COVID-19 related big data extracted from Twitter, we train separate word embedding models for different time periods after the outbreak. We employ an alignment… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

  9. arXiv:2006.06251  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Performance in the Courtroom: Automated Processing and Visualization of Appeal Court Decisions in France

    Authors: Paul Boniol, George Panagopoulos, Christos Xypolopoulos, Rajaa El Hamdani, David Restrepo Amariles, Michalis Vazirgiannis

    Abstract: Artificial Intelligence techniques are already popular and important in the legal domain. We extract legal indicators from judicial judgment to decrease the asymmetry of information of the legal system and the access-to-justice gap. We use NLP methods to extract interesting entities/data from judgments to construct networks of lawyers and judgments. We propose metrics to rank lawyers based on thei… ▽ More

    Submitted 9 July, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

  10. Unsupervised Word Polysemy Quantification with Multiresolution Grids of Contextual Embeddings

    Authors: Christos Xypolopoulos, Antoine J. -P. Tixier, Michalis Vazirgiannis

    Abstract: The number of senses of a given word, or polysemy, is a very subjective notion, which varies widely across annotators and resources. We propose a novel method to estimate polysemy, based on simple geometry in the contextual embedding space. Our approach is fully unsupervised and purely data-driven. We show through rigorous experiments that our rankings are well correlated (with strong statistical… ▽ More

    Submitted 12 February, 2021; v1 submitted 23 March, 2020; originally announced March 2020.

    Comments: Equal contribution by Christos Xypolopoulos and Antoine J.-P. Tixier

  11. arXiv:1810.06694  [pdf, ps, other

    cs.CL

    Word Embeddings from Large-Scale Greek Web Content

    Authors: Stamatis Outsios, Konstantinos Skianis, Polykarpos Meladianos, Christos Xypolopoulos, Michalis Vazirgiannis

    Abstract: Word embeddings are undoubtedly very useful components in many NLP tasks. In this paper, we present word embeddings and other linguistic resources trained on the largest to date digital Greek language corpus. We also present a live web tool for testing the Greek word embeddings, by offering "analogy", "similarity score" and "most similar words" functions. Through our explorer, one could interact w… ▽ More

    Submitted 26 October, 2018; v1 submitted 8 October, 2018; originally announced October 2018.