Showing 1–13 of 13 results for author: Zappella, G

  1. arXiv:2407.21424  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    Cost-Effective Hallucination Detection for LLMs

    Authors: Simon Valentin, Jinmiao Fu, Gianluca Detommaso, Shaoyuan Xu, Giovanni Zappella, Bryan Wang

    Abstract: Large language models (LLMs) can be prone to hallucinations - generating unreliable outputs that are unfaithful to their inputs, external facts or internally inconsistent. In this work, we address several challenges for post-hoc hallucination detection in production settings. Our pipeline for hallucination detection entails: first, producing a confidence score representing the likelihood that a ge…

    Submitted 9 August, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

    Comments: Accepted to GenAI Evaluation Workshop at KDD 2024

  2. arXiv:2207.06940  [pdf, other]

    cs.LG stat.ML

    PASHA: Efficient HPO and NAS with Progressive Resource Allocation

    Authors: Ondrej Bohdal, Lukas Balles, Martin Wistuba, Beyza Ermis, Cédric Archambeau, Giovanni Zappella

    Abstract: Hyperparameter optimization (HPO) and neural architecture search (NAS) are methods of choice to obtain the best-in-class machine learning models, but in practice they can be costly to run. When models are trained on large datasets, tuning them with HPO or NAS rapidly becomes prohibitively expensive for practitioners, even when efficient multi-fidelity methods are employed. We propose an approach t…

    Submitted 8 March, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: Accepted at ICLR 2023

  3. arXiv:2203.04640  [pdf, other]

    cs.CL cs.AI stat.ML

    Memory Efficient Continual Learning with Transformers

    Authors: Beyza Ermis, Giovanni Zappella, Martin Wistuba, Aditya Rawal, Cedric Archambeau

    Abstract: In many real-world scenarios, data to train machine learning models becomes available over time. Unfortunately, these models struggle to continually learn new concepts without forgetting what has been learnt in the past. This phenomenon is known as catastrophic forgetting and it is difficult to prevent due to practical constraints. For instance, the amount of data that can be stored or the computa…

    Submitted 13 January, 2023; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: This paper was published at NeurIPS 2022

  4. arXiv:2004.13576  [pdf, other]

    stat.ML cs.LG

    A Linear Bandit for Seasonal Environments

    Authors: Giuseppe Di Benedetto, Vito Bellini, Giovanni Zappella

    Abstract: Contextual bandit algorithms are extremely popular and widely used in recommendation systems to provide online personalised recommendations. A recurrent assumption is the stationarity of the reward function, which is rather unrealistic in most of the real-world applications. In the music recommendation scenario for instance, people's music taste can abruptly change during certain events, such as H…

    Submitted 28 April, 2020; originally announced April 2020.

  5. arXiv:2004.13106  [pdf, other]

    cs.LG stat.ML

    Learning to Rank in the Position Based Model with Bandit Feedback

    Authors: Beyza Ermis, Patrick Ernst, Yannik Stein, Giovanni Zappella

    Abstract: Personalization is a crucial aspect of many online experiences. In particular, content ranking is often a key component in delivering sophisticated personalization results. Commonly, supervised learning-to-rank methods are applied, which suffer from bias introduced during data collection by production systems in charge of producing the ranking. To compensate for this problem, we leverage contextua…

    Submitted 27 April, 2020; originally announced April 2020.

  6. arXiv:1807.02089  [pdf, other]

    stat.ML cs.LG

    Linear Bandits with Stochastic Delayed Feedback

    Authors: Claire Vernade, Alexandra Carpentier, Tor Lattimore, Giovanni Zappella, Beyza Ermis, Michael Brueckner

    Abstract: Stochastic linear bandits are a natural and well-studied model for structured exploration/exploitation problems and are widely used in applications such as online marketing and recommendation. One of the main challenges faced by practitioners hoping to apply existing algorithms is that usually the feedback is randomly delayed and delays are only partially observable. For example, while a purchase…

    Submitted 2 March, 2020; v1 submitted 5 July, 2018; originally announced July 2018.

  7. arXiv:1608.03544  [pdf, other]

    cs.LG cs.AI cs.IR stat.ML

    On Context-Dependent Clustering of Bandits

    Authors: Claudio Gentile, Shuai Li, Purushottam Kar, Alexandros Karatzoglou, Evans Etrue, Giovanni Zappella

    Abstract: We investigate a novel cluster-of-bandit algorithm CAB for collaborative recommendation tasks that implements the underlying feedback sharing mechanism by estimating the neighborhood of users in a context-dependent manner. CAB makes sharp departures from the state of the art by incorporating collaborative effects into inference as well as learning processes in a manner that seamlessly interleaving…

    Submitted 27 February, 2017; v1 submitted 6 August, 2016; originally announced August 2016.

  8. arXiv:1401.8257  [pdf, other]

    cs.LG stat.ML

    Online Clustering of Bandits

    Authors: Claudio Gentile, Shuai Li, Giovanni Zappella

    Abstract: We introduce a novel algorithmic approach to content recommendation based on adaptive clustering of exploration-exploitation ("bandit") strategies. We provide a sharp regret analysis of this algorithm in a standard stochastic noise setting, demonstrate its scalability properties, and prove its effectiveness on a number of artificial and real-world datasets. Our experiments show a significant incre…

    Submitted 6 June, 2014; v1 submitted 31 January, 2014; originally announced January 2014.

    Comments: In E. Xing and T. Jebara (Eds.), Proceedings of 31st International Conference on Machine Learning, Journal of Machine Learning Research Workshop and Conference Proceedings, Vol.32 (JMLR W&CP-32), Beijing, China, Jun. 21-26, 2014 (ICML 2014), Submitted by Shuai Li (https://sites.google.com/site/shuailidotsli)

  9. arXiv:1306.0811  [pdf, other]

    cs.LG cs.SI stat.ML

    A Gang of Bandits

    Authors: Nicolò Cesa-Bianchi, Claudio Gentile, Giovanni Zappella

    Abstract: Multi-armed bandit problems are receiving a great deal of attention because they adequately formalize the exploration-exploitation trade-offs arising in several industrially relevant applications, such as online advertisement and, more generally, recommendation systems. In many cases, however, these applications have a strong social component, whose integration in the bandit algorithm could lead t…

    Submitted 4 November, 2013; v1 submitted 4 June, 2013; originally announced June 2013.

    Comments: NIPS 2013

  10. arXiv:1301.5112  [pdf, ps, other]

    cs.LG stat.ML

    Active Learning on Trees and Graphs

    Authors: Nicolo Cesa-Bianchi, Claudio Gentile, Fabio Vitale, Giovanni Zappella

    Abstract: We investigate the problem of active learning on a given tree whose nodes are assigned binary labels in an adversarial way. Inspired by recent results by Guillory and Bilmes, we characterize (up to constant factors) the optimal placement of queries so as to minimize the mistakes made on the non-queried nodes. Our query selection algorithm is extremely efficient, and the optimal number of mistakes on…

    Submitted 22 January, 2013; originally announced January 2013.

  11. arXiv:1301.4769  [pdf, other]

    cs.LG cs.DS stat.ML

    A Correlation Clustering Approach to Link Classification in Signed Networks -- Full Version --

    Authors: Nicolo Cesa-Bianchi, Claudio Gentile, Fabio Vitale, Giovanni Zappella

    Abstract: Motivated by social balance theory, we develop a theory of link classification in signed networks using the correlation clustering index as measure of label regularity. We derive learning bounds in terms of correlation clustering within three fundamental transductive learning settings: online, batch and active. Our main algorithmic contribution is in the active setting, where we introduce a new fa…

    Submitted 28 February, 2013; v1 submitted 21 January, 2013; originally announced January 2013.

  12. arXiv:1301.4767  [pdf, other]

    cs.LG cs.SI stat.ML

    A Linear Time Active Learning Algorithm for Link Classification -- Full Version --

    Authors: Nicolo Cesa-Bianchi, Claudio Gentile, Fabio Vitale, Giovanni Zappella

    Abstract: We present very efficient active learning algorithms for link classification in signed networks. Our algorithms are motivated by a stochastic model in which edge labels are obtained through perturbations of an initial sign assignment consistent with a two-clustering of the nodes. We provide a theoretical analysis within this model, showing that we can achieve an optimal (to within a constant facto…

    Submitted 28 February, 2013; v1 submitted 21 January, 2013; originally announced January 2013.

  13. arXiv:1212.5637  [pdf, other]

    cs.LG stat.ML

    Random Spanning Trees and the Prediction of Weighted Graphs

    Authors: Nicolò Cesa-Bianchi, Claudio Gentile, Fabio Vitale, Giovanni Zappella

    Abstract: We investigate the problem of sequentially predicting the binary labels on the nodes of an arbitrary weighted graph. We show that, under a suitable parametrization of the problem, the optimal number of prediction mistakes can be characterized (up to logarithmic factors) by the cutsize of a random spanning tree of the graph. The cutsize is induced by the unknown adversarial labeling of the graph no…

    Submitted 21 December, 2012; originally announced December 2012.

    Comments: Appeared in ICML 2010