
Showing 1–16 of 16 results for author: Cox, I J

  1. arXiv:2505.15370  [pdf, ps, other]

    cs.SI

    Prediction of Reposting on X

    Authors: Ziming Xu, Shi Zhou, Vasileios Lampos, Ingemar J. Cox

    Abstract: There have been considerable efforts to predict a user's reposting behaviour on X (formerly Twitter) using machine learning models. The problem is previously cast as a supervised classification task, where Tweets are randomly assigned to a test or training set. The random assignment helps to ensure that the test and training sets are drawn from the same distribution. In practice, we would like to…

    Submitted 21 May, 2025; originally announced May 2025.

  2. arXiv:2401.15061  [pdf, other]

    cs.NE cs.ET physics.optics

    Digital-analog hybrid matrix multiplication processor for optical neural networks

    Authors: Xiansong Meng, Deming Kong, Kwangwoong Kim, Qiuchi Li, Po Dong, Ingemar J. Cox, Christina Lioma, Hao Hu

    Abstract: The computational demands of modern AI have spurred interest in optical neural networks (ONNs) which offer the potential benefits of increased speed and lower power consumption. However, current ONNs face various challenges, most significantly a limited calculation precision (typically around 4 bits) and the requirement for high-resolution signal format converters (digital-to-analogue conversions (…

    Submitted 26 January, 2024; originally announced January 2024.

  3. arXiv:2212.09306  [pdf, other]

    cs.CL

    E-NER -- An Annotated Named Entity Recognition Corpus of Legal Text

    Authors: Ting Wai Terence Au, Ingemar J. Cox, Vasileios Lampos

    Abstract: Identifying named entities such as a person, location or organization, in documents can highlight key information to readers. Training Named Entity Recognition (NER) models requires an annotated data set, which can be a time-consuming labour-intensive task. Nevertheless, there are publicly available NER data sets for general English. Recently there has been interest in developing NER for legal tex…

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: 5 pages, 3 figures, submitted to NLLP workshop in EMNLP 2022

  4. arXiv:2105.12433  [pdf, other]

    cs.LG

    Estimating the Uncertainty of Neural Network Forecasts for Influenza Prevalence Using Web Search Activity

    Authors: Michael Morris, Peter Hayes, Ingemar J. Cox, Vasileios Lampos

    Abstract: Influenza is an infectious disease with the potential to become a pandemic, and hence, forecasting its prevalence is an important undertaking for planning an effective response. Research has found that web search activity can be used to improve influenza models. Neural networks (NN) can provide state-of-the-art forecasting accuracy but do not commonly incorporate uncertainty in their estimates, so…

    Submitted 26 May, 2021; originally announced May 2021.

  5. arXiv:2007.11821  [pdf, other]

    cs.IR cs.CY

    Providing early indication of regional anomalies in COVID19 case counts in England using search engine queries

    Authors: Elad Yom-Tov, Vasileios Lampos, Ingemar J. Cox, Michael Edelstein

    Abstract: COVID19 was first reported in England at the end of January 2020, and by mid-June over 150,000 cases were reported. We assume that, similarly to influenza-like illnesses, people who suffer from COVID19 may query for their symptoms prior to accessing the medical system (or in lieu of it). Therefore, we analyzed searches to Bing from users in England, identifying cases where unexpected rises in rele…

    Submitted 23 July, 2020; originally announced July 2020.

  6. arXiv:2007.02603  [pdf]

    cs.CY

    Go local: The key to controlling the COVID-19 pandemic in the post lockdown era

    Authors: Isabel Bennett, Jobie Budd, Erin M. Manning, Ed Manley, Mengdie Zhuang, Ingemar J. Cox, Michael Short, Anne M. Johnson, Deenan Pillay, Rachel A. McKendry

    Abstract: The UK government announced its first wave of lockdown easing on 10 May 2020, two months after the non-pharmaceutical measures to reduce the spread of COVID-19 were first introduced on 23 March 2020. Analysis of reported case rate data from Public Health England and aggregated and anonymised crowd level mobility data shows variability across local authorities in the UK. A locality-based approach t…

    Submitted 6 July, 2020; originally announced July 2020.

    Comments: 6 pages, 3 figures

  7. Tracking COVID-19 using online search

    Authors: Vasileios Lampos, Maimuna S. Majumder, Elad Yom-Tov, Michael Edelstein, Simon Moura, Yohhei Hamada, Molebogeng X. Rangaka, Rachel A. McKendry, Ingemar J. Cox

    Abstract: Previous research has demonstrated that various properties of infectious diseases can be inferred from online search behaviour. In this work we use time series of online search query frequencies to gain insights about the prevalence of COVID-19 in multiple countries. We first develop unsupervised modelling techniques based on associated symptom categories identified by the United Kingdom's Nationa…

    Submitted 10 February, 2021; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: Published in Nature Digital Medicine. Please note that the published version differs from this preprint

    Journal ref: Nature Digital Medicine 4, 17 (2021)

  8. arXiv:1802.06833  [pdf, other]

    cs.IR

    Seasonal Web Search Query Selection for Influenza-Like Illness (ILI) Estimation

    Authors: Niels Dalum Hansen, Kåre Mølbak, Ingemar J. Cox, Christina Lioma

    Abstract: Influenza-like illness (ILI) estimation from web search data is an important web analytics task. The basic idea is to use the frequencies of queries in web search logs that are correlated with past ILI activity as features when estimating current ILI activity. It has been noted that since influenza is seasonal, this approach can lead to spurious correlations with features/queries that also exhibit…

    Submitted 19 February, 2018; originally announced February 2018.

  9. arXiv:1702.07326  [pdf, other]

    cs.IR q-bio.QM stat.AP

    Time-Series Adaptive Estimation of Vaccination Uptake Using Web Search Queries

    Authors: Niels Dalum Hansen, Kåre Mølbak, Ingemar J. Cox, Christina Lioma

    Abstract: Estimating vaccination uptake is an integral part of ensuring public health. It was recently shown that vaccination uptake can be estimated automatically from web data, instead of slowly collected clinical records or population surveys. All prior work in this area assumes that features of vaccination uptake collected from the web are temporally regular. We present the first ever method to remove t…

    Submitted 23 February, 2017; originally announced February 2017.

  10. arXiv:1608.06253  [pdf, other]

    cs.IR cs.LG stat.ML

    Multi-Dueling Bandits and Their Application to Online Ranker Evaluation

    Authors: Brian Brost, Yevgeny Seldin, Ingemar J. Cox, Christina Lioma

    Abstract: New ranking algorithms are continually being developed and refined, necessitating the development of efficient methods for evaluating these rankers. Online ranker evaluation focuses on the challenge of efficiently determining, from implicit user feedback, which ranker out of a finite set of rankers is the best. Online ranker evaluation can be modeled by dueling bandits, a mathematical model for…

    Submitted 22 August, 2016; originally announced August 2016.

  11. arXiv:1608.00788  [pdf, other]

    cs.IR

    An Improved Multileaving Algorithm for Online Ranker Evaluation

    Authors: Brian Brost, Ingemar J. Cox, Yevgeny Seldin, Christina Lioma

    Abstract: Online ranker evaluation is a key challenge in information retrieval. An important task in the online evaluation of rankers is using implicit user feedback for inferring preferences between rankers. Interleaving methods have been found to be efficient and sensitive, i.e. they can quickly detect even small differences in quality. It has recently been shown that multileaving methods exhibit similar…

    Submitted 2 August, 2016; originally announced August 2016.

  12. arXiv:1409.7291  [pdf, other]

    physics.soc-ph cs.AI cs.SI q-bio.PE

    Optimizing Hybrid Spreading in Metapopulations

    Authors: Changwang Zhang, Shi Zhou, Joel C. Miller, Ingemar J. Cox, Benjamin M. Chain

    Abstract: Epidemic spreading phenomena are ubiquitous in nature and society. Examples include the spreading of diseases, information, and computer viruses. Epidemics can spread by local spreading, where infected nodes can only infect a limited set of direct target nodes and global spreading, where an infected node can infect every other node. In reality, many epidemics spread using a hybrid mixture of both…

    Submitted 31 March, 2015; v1 submitted 25 September, 2014; originally announced September 2014.

    Journal ref: Scientific Reports. 2015 Apr 29;5:9924

  13. arXiv:1307.4980  [pdf, other]

    cs.GT cs.IR

    Multi-keyword multi-click advertisement option contracts for sponsored search

    Authors: Bowei Chen, Jun Wang, Ingemar J. Cox, Mohan S. Kankanhalli

    Abstract: In sponsored search, advertisement (abbreviated ad) slots are usually sold by a search engine to an advertiser through an auction mechanism in which advertisers bid on keywords. In theory, auction mechanisms have many desirable economic properties. However, keyword auctions have a number of limitations including: the uncertainty in payment prices for advertisers; the volatility in the search engin…

    Submitted 9 December, 2015; v1 submitted 18 July, 2013; originally announced July 2013.

    Comments: Chen, Bowei and Wang, Jun and Cox, Ingemar J. and Kankanhalli, Mohan S. (2015) Multi-keyword multi-click advertisement option contracts for sponsored search. ACM Transactions on Intelligent Systems and Technology, 7 (1). pp. 1-29. ISSN: 2157-6904

  14. FindZebra: A search engine for rare diseases

    Authors: Radu Dragusin, Paula Petcu, Christina Lioma, Birger Larsen, Henrik L. Jørgensen, Ingemar J. Cox, Lars Kai Hansen, Peter Ingwersen, Ole Winther

    Abstract: Background: The web has become a primary information resource about illnesses and treatments for both medical and non-medical users. Standard web search is by far the most common interface for such information. It is therefore of interest to find out how well web search engines work for diagnostic queries and what factors contribute to successes and failures. Among diseases, rare (or orphan) disea…

    Submitted 13 March, 2013; originally announced March 2013.

    Journal ref: International Journal of Medical Informatics, Available online 23 February 2013, ISSN 1386-5056

  15. arXiv:0903.0687  [pdf, ps, other]

    physics.soc-ph physics.data-an

    Second-Order Assortative Mixing in Social Networks

    Authors: Shi Zhou, Ingemar J. Cox, Lars K. Hansen

    Abstract: In a social network, the number of links of a node, or node degree, is often assumed as a proxy for the node's importance or prominence within the network. It is known that social networks exhibit the (first-order) assortative mixing, i.e. if two nodes are connected, they tend to have similar node degrees, suggesting that people tend to mix with those of comparable prominence. In this paper, we re…

    Submitted 23 October, 2017; v1 submitted 3 March, 2009; originally announced March 2009.

    Comments: Cite as: Zhou S., Cox I.J., Hansen L.K. (2017) Second-Order Assortative Mixing in Social Networks. In: Goncalves B., Menezes R., Sinatra R., Zlatic V. (eds) Complex Networks VIII. CompleNet 2017. Springer Proceedings in Complexity. Springer, Cham. https://doi.org/10.1007/978-3-319-54241-6_1

  16. arXiv:cs/0703043  [pdf, other]

    cs.DL

    A Comparison of On-Line Computer Science Citation Databases

    Authors: Vaclav Petricek, Ingemar J. Cox, Hui Han, Isaac G. Councill, C. Lee Giles

    Abstract: This paper examines the difference and similarities between the two on-line computer science citation databases DBLP and CiteSeer. The database entries in DBLP are inserted manually while the CiteSeer entries are obtained autonomously via a crawl of the Web and automatic processing of user submissions. CiteSeer's autonomous citation database can be considered a form of self-selected on-line surv…

    Submitted 9 March, 2007; originally announced March 2007.

    Comments: ECDL 2005

    ACM Class: H.3.7