Showing 1–14 of 14 results for author: Lampos, V

Searching in archive cs.
  1. arXiv:2505.15370

    cs.SI

    Prediction of Reposting on X

    Authors: Ziming Xu, Shi Zhou, Vasileios Lampos, Ingemar J. Cox

    Abstract: There have been considerable efforts to predict a user's reposting behaviour on X (formerly Twitter) using machine learning models. The problem has previously been cast as a supervised classification task, where Tweets are randomly assigned to a test or training set. The random assignment helps to ensure that the test and training sets are drawn from the same distribution. In practice, we would like to…

    Submitted 21 May, 2025; originally announced May 2025.

  2. arXiv:2505.15312

    cs.LG

    Sonnet: Spectral Operator Neural Network for Multivariable Time Series Forecasting

    Authors: Yuxuan Shu, Vasileios Lampos

    Abstract: Multivariable time series forecasting methods can integrate information from exogenous variables, leading to significant prediction accuracy gains. Transformer architecture has been widely applied in various time series forecasting models due to its ability to capture long-range sequential dependencies. However, a naïve application of transformers often struggles to effectively model complex relat…

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: The code is available at https://github.com/ClaudiaShu/Sonnet

  3. arXiv:2502.15654

    cs.CL cs.LG

    Machine-generated text detection prevents language model collapse

    Authors: George Drayson, Emine Yilmaz, Vasileios Lampos

    Abstract: As Large Language Models (LLMs) become increasingly prevalent, their generated outputs are proliferating across the web, risking a future where machine-generated content dilutes human-authored text. Since online data is the primary resource for LLM pre-training, subsequent models could be trained on an unknown portion of synthetic samples. This will lead to model collapse, a degenerative process w…

    Submitted 19 May, 2025; v1 submitted 21 February, 2025; originally announced February 2025.

  4. arXiv:2406.07438

    cs.LG

    DeformTime: Capturing Variable Dependencies with Deformable Attention for Time Series Forecasting

    Authors: Yuxuan Shu, Vasileios Lampos

    Abstract: In multivariable time series (MTS) forecasting, existing state-of-the-art deep learning approaches tend to focus on autoregressive formulations and often overlook the potential of using exogenous variables in enhancing the prediction of the target endogenous variable. To address this limitation, we present DeformTime, a neural network architecture that attempts to capture correlated temporal patte…

    Submitted 1 April, 2025; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: Published in Transactions on Machine Learning Research (04/2025). The code is available at https://github.com/ClaudiaShu/DeformTime

  5. arXiv:2401.02594

    cs.CL

    Unsupervised hard Negative Augmentation for contrastive learning

    Authors: Yuxuan Shu, Vasileios Lampos

    Abstract: We present Unsupervised hard Negative Augmentation (UNA), a method that generates synthetic negative instances based on the term frequency-inverse document frequency (TF-IDF) retrieval model. UNA uses TF-IDF scores to ascertain the perceived importance of terms in a sentence and then produces negative samples by replacing terms with respect to that. Our experiments demonstrate that models trained…

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: The code and pre-trained models are available at https://github.com/ClaudiaShu/UNA

  6. arXiv:2212.09306

    cs.CL

    E-NER -- An Annotated Named Entity Recognition Corpus of Legal Text

    Authors: Ting Wai Terence Au, Ingemar J. Cox, Vasileios Lampos

    Abstract: Identifying named entities such as a person, location or organization, in documents can highlight key information to readers. Training Named Entity Recognition (NER) models requires an annotated data set, which can be a time-consuming, labour-intensive task. Nevertheless, there are publicly available NER data sets for general English. Recently there has been interest in developing NER for legal tex…

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: 5 pages, 3 figures, submitted to NLLP workshop in EMNLP 2022

  7. arXiv:2105.12433

    cs.LG

    Estimating the Uncertainty of Neural Network Forecasts for Influenza Prevalence Using Web Search Activity

    Authors: Michael Morris, Peter Hayes, Ingemar J. Cox, Vasileios Lampos

    Abstract: Influenza is an infectious disease with the potential to become a pandemic, and hence, forecasting its prevalence is an important undertaking for planning an effective response. Research has found that web search activity can be used to improve influenza models. Neural networks (NN) can provide state-of-the-art forecasting accuracy but do not commonly incorporate uncertainty in their estimates, so…

    Submitted 26 May, 2021; originally announced May 2021.

  8. arXiv:2007.11821

    cs.IR cs.CY

    Providing early indication of regional anomalies in COVID19 case counts in England using search engine queries

    Authors: Elad Yom-Tov, Vasileios Lampos, Ingemar J. Cox, Michael Edelstein

    Abstract: COVID19 was first reported in England at the end of January 2020, and by mid-June over 150,000 cases were reported. We assume that, similarly to influenza-like illnesses, people who suffer from COVID19 may query for their symptoms prior to accessing the medical system (or in lieu of it). Therefore, we analyzed searches to Bing from users in England, identifying cases where unexpected rises in rele…

    Submitted 23 July, 2020; originally announced July 2020.

  9. Tracking COVID-19 using online search

    Authors: Vasileios Lampos, Maimuna S. Majumder, Elad Yom-Tov, Michael Edelstein, Simon Moura, Yohhei Hamada, Molebogeng X. Rangaka, Rachel A. McKendry, Ingemar J. Cox

    Abstract: Previous research has demonstrated that various properties of infectious diseases can be inferred from online search behaviour. In this work we use time series of online search query frequencies to gain insights about the prevalence of COVID-19 in multiple countries. We first develop unsupervised modelling techniques based on associated symptom categories identified by the United Kingdom's Nationa…

    Submitted 10 February, 2021; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: Published in Nature Digital Medicine. Please note that the published version differs from this preprint

    Journal ref: Nature Digital Medicine 4, 17 (2021)

  10. arXiv:1712.08076

    cs.CY

    Assessing public health interventions using Web content

    Authors: Vasileios Lampos

    Abstract: Public health interventions are a fundamental tool for mitigating the spread of an infectious disease. However, it is not always possible to obtain a conclusive estimate for the impact of an intervention, especially in situations where the effects are fragmented in population parts that are under-represented within traditional public health surveillance schemes. To this end, online user activity c…

    Submitted 21 December, 2017; originally announced December 2017.

  11. arXiv:1612.03494

    cs.AI cs.CL cs.SI

    Flu Detector: Estimating influenza-like illness rates from online user-generated content

    Authors: Vasileios Lampos

    Abstract: We provide a brief technical description of an online platform for disease monitoring, titled Flu Detector (fludetector.cs.ucl.ac.uk). Flu Detector, in its current version (v.0.5), uses either Twitter or Google search data in conjunction with statistical Natural Language Processing models to estimate the rate of influenza-like illness in the population of England. Its back-end is a live ser…

    Submitted 11 December, 2016; originally announced December 2016.

  12. arXiv:1304.5507

    cs.SI physics.soc-ph

    Analysing Mood Patterns in the United Kingdom through Twitter Content

    Authors: Vasileios Lampos, Thomas Lansdall-Welfare, Ricardo Araya, Nello Cristianini

    Abstract: Social Media offer a vast amount of geo-located and time-stamped textual content directly generated by people. This information can be analysed to obtain insights about the general state of a large population of users and to address scientific questions from a diversity of disciplines. In this work, we estimate temporal patterns of mood variation through the use of emotionally loaded words contain…

    Submitted 19 April, 2013; originally announced April 2013.

  13. arXiv:1208.2873

    cs.LG cs.CL cs.IR cs.SI stat.AP stat.ML

    Detecting Events and Patterns in Large-Scale User Generated Textual Streams with Statistical Learning Methods

    Authors: Vasileios Lampos

    Abstract: A vast amount of textual web streams is influenced by events or phenomena emerging in the real world. The social web forms an excellent modern paradigm, where unstructured user generated content is published on a regular basis and on most occasions is freely distributed. The present Ph.D. Thesis deals with the problem of inferring information - or patterns in general - about events emerging in rea…

    Submitted 13 August, 2012; originally announced August 2012.

    Comments: PhD thesis, 238 pages, 9 chapters, 2 appendices, 58 figures, 49 tables

  14. arXiv:1204.0423

    cs.SI physics.soc-ph

    On voting intentions inference from Twitter content: a case study on UK 2010 General Election

    Authors: Vasileios Lampos

    Abstract: This is a report, where preliminary work regarding the topic of voting intention inference from Social Media - such as Twitter - is presented. Our case study is the UK 2010 General Election and we are focusing on predicting the percentages of voting intention polls (conducted by YouGov) for the three major political parties - Conservatives, Labour and Liberal Democrats - during a 5-month period b…

    Submitted 21 May, 2012; v1 submitted 2 April, 2012; originally announced April 2012.

    Comments: 11 pages, 2 figures, 7 tables

    ACM Class: J.4; I.7.0; I.2.6