Showing 1–14 of 14 results for author: Kilcher, Y

Searching in archive cs.
  1. arXiv:2304.07327  [pdf, other]

    cs.CL cs.AI

    OpenAssistant Conversations -- Democratizing Large Language Model Alignment

    Authors: Andreas Köpf, Yannic Kilcher, Dimitri von Rütte, Sotiris Anagnostidis, Zhi-Rui Tam, Keith Stevens, Abdullah Barhoum, Nguyen Minh Duc, Oliver Stanley, Richárd Nagyfi, Shahul ES, Sameer Suri, David Glushkov, Arnav Dantuluri, Andrew Maguire, Christoph Schuhmann, Huu Nguyen, Alexander Mattick

    Abstract: Aligning large language models (LLMs) with human preferences has proven to drastically improve usability and has driven rapid adoption as demonstrated by ChatGPT. Alignment techniques such as supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) greatly reduce the required skill and domain knowledge to effectively harness the capabilities of LLMs, increasing their acce…

    Submitted 31 October, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: Published in NeurIPS 2023 Datasets and Benchmarks

    Report number: V-02; ACM Class: I.2

  2. arXiv:2201.10936  [pdf, other]

    cs.SD cs.LG eess.AS stat.ML

    FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control

    Authors: Dimitri von Rütte, Luca Biggio, Yannic Kilcher, Thomas Hofmann

    Abstract: Generating music with deep neural networks has been an area of active research in recent years. While the quality of generated samples has been steadily increasing, most methods are only able to exert minimal control over the generated sequence, if any. We propose the self-supervised description-to-sequence task, which allows for fine-grained controllable generation on a global level. We do so by…

    Submitted 22 February, 2024; v1 submitted 26 January, 2022; originally announced January 2022.

    Comments: Published in ICLR 2023

  3. arXiv:2109.00527  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Boosting Search Engines with Interactive Agents

    Authors: Leonard Adolphs, Benjamin Boerschinger, Christian Buck, Michelle Chen Huebscher, Massimiliano Ciaramita, Lasse Espeholt, Thomas Hofmann, Yannic Kilcher, Sascha Rothe, Pier Giuseppe Sessa, Lierni Sestorain Saralegui

    Abstract: This paper presents first successful steps in designing search agents that learn meta-strategies for iterative query refinement in information-seeking tasks. Our approach uses machine reading to guide the selection of refinement terms from aggregated search results. Agents are then empowered with simple but effective search operators to exert fine-grained and transparent control over queries and s…

    Submitted 7 June, 2022; v1 submitted 1 September, 2021; originally announced September 2021.

    Comments: Published in Transactions on Machine Learning Research (06/2022)

  4. arXiv:2103.12685  [pdf, other]

    cs.LG cs.AI

    Generative Minimization Networks: Training GANs Without Competition

    Authors: Paulina Grnarova, Yannic Kilcher, Kfir Y. Levy, Aurelien Lucchi, Thomas Hofmann

    Abstract: Many applications in machine learning can be framed as minimization problems and solved efficiently using gradient-based techniques. However, recent applications of generative models, particularly GANs, have triggered interest in solving min-max games for which standard optimization techniques are often not suitable. Among known problems experienced by practitioners is the lack of convergence guar…

    Submitted 23 March, 2021; originally announced March 2021.

  5. arXiv:2102.03313  [pdf, other]

    cs.LG

    Rethinking Neural Networks With Benford's Law

    Authors: Surya Kant Sahu, Abhinav Java, Arshad Shaikh, Yannic Kilcher

    Abstract: Benford's Law (BL) or the Significant Digit Law defines the probability distribution of the first digit of numerical values in a data sample. This Law is observed in many naturally occurring datasets. It can be seen as a measure of naturalness of a given distribution and finds its application in areas like anomaly and fraud detection. In this work, we address the following question: Is the distrib…

    Submitted 22 October, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: Short version accepted to NeurIPS 2021 ML4PS Workshop

  6. arXiv:1911.04156  [pdf, other]

    cs.CL cs.AI

    Meta Answering for Machine Reading

    Authors: Benjamin Borschinger, Jordan Boyd-Graber, Christian Buck, Jannis Bulian, Massimiliano Ciaramita, Michelle Chen Huebscher, Wojciech Gajewski, Yannic Kilcher, Rodrigo Nogueira, Lierni Sestorain Saralegui

    Abstract: We investigate a framework for machine reading, inspired by real world information-seeking problems, where a meta question answering system interacts with a black box environment. The environment encapsulates a competitive machine reader based on BERT, providing candidate answers to questions, and possibly some context. To validate the realism of our formulation, we ask humans to play the role of…

    Submitted 30 April, 2020; v1 submitted 11 November, 2019; originally announced November 2019.

  7. arXiv:1906.01527  [pdf, other]

    cs.LG stat.ML

    Adversarial Training is a Form of Data-dependent Operator Norm Regularization

    Authors: Kevin Roth, Yannic Kilcher, Thomas Hofmann

    Abstract: We establish a theoretical link between adversarial training and operator norm regularization for deep neural networks. Specifically, we prove that $\ell_p$-norm constrained projected gradient ascent based adversarial training with an $\ell_q$-norm loss on the logits of clean and perturbed inputs is equivalent to data-dependent (p, q) operator norm regularization. This fundamental connection confi…

    Submitted 23 October, 2020; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: NeurIPS 2020

  8. arXiv:1902.04818  [pdf, other]

    cs.LG stat.ML

    The Odds are Odd: A Statistical Test for Detecting Adversarial Examples

    Authors: Kevin Roth, Yannic Kilcher, Thomas Hofmann

    Abstract: We investigate conditions under which test statistics exist that can reliably detect examples, which have been adversarially manipulated in a white-box attack. These statistics can be easily computed and calibrated by randomly corrupting inputs. They exploit certain anomalies that adversarial attacks introduce, in particular if they follow the paradigm of choosing perturbations optimally under p-n…

    Submitted 9 May, 2019; v1 submitted 13 February, 2019; originally announced February 2019.

  9. arXiv:1711.05475  [pdf, other]

    cs.LG cs.CR

    The best defense is a good offense: Countering black box attacks by predicting slightly wrong labels

    Authors: Yannic Kilcher, Thomas Hofmann

    Abstract: Black-Box attacks on machine learning models occur when an attacker, despite having no access to the inner workings of a model, can successfully craft an attack by means of model theft. The attacker will train an own substitute model that mimics the model to be attacked. The substitute can then be used to design attacks against the original model, for example by means of adversarial samples. We pu…

    Submitted 15 November, 2017; originally announced November 2017.

  10. arXiv:1710.11386  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Parametrizing filters of a CNN with a GAN

    Authors: Yannic Kilcher, Gary Becigneul, Thomas Hofmann

    Abstract: It is commonly agreed that the use of relevant invariances as a good statistical bias is important in machine-learning. However, most approaches that explicitly incorporate invariances into a model architecture only make use of very simple transformations, such as translations and rotations. Hence, there is a need for methods to model and extract richer transformations that capture much higher-lev…

    Submitted 31 October, 2017; originally announced October 2017.

  11. arXiv:1710.11383  [pdf, other]

    cs.LG stat.ML

    Flexible Prior Distributions for Deep Generative Models

    Authors: Yannic Kilcher, Aurelien Lucchi, Thomas Hofmann

    Abstract: We consider the problem of training generative models with deep neural networks as generators, i.e. to map latent codes to data points. Whereas the dominant paradigm combines simple priors over codes with complex deterministic models, we argue that it might be advantageous to use more flexible code distributions. We demonstrate how these distributions can be induced directly from the data. The ben…

    Submitted 7 January, 2018; v1 submitted 31 October, 2017; originally announced October 2017.

    Comments: arXiv admin note: text overlap with arXiv:1707.09241

  12. arXiv:1710.11381  [pdf, other]

    cs.LG stat.ML

    Semantic Interpolation in Implicit Models

    Authors: Yannic Kilcher, Aurelien Lucchi, Thomas Hofmann

    Abstract: In implicit models, one often interpolates between sampled points in latent space. As we show in this paper, care needs to be taken to match-up the distributional assumptions on code vectors with the geometry of the interpolating paths. Otherwise, typical assumptions about the quality and semantics of in-between points may not be justified. Based on our analysis we propose to modify the prior code…

    Submitted 2 February, 2018; v1 submitted 31 October, 2017; originally announced October 2017.

  13. arXiv:1707.09241  [pdf, other]

    stat.ML cs.LG

    Generator Reversal

    Authors: Yannic Kilcher, Aurélien Lucchi, Thomas Hofmann

    Abstract: We consider the problem of training generative models with deep neural networks as generators, i.e. to map latent codes to data points. Whereas the dominant paradigm combines simple priors over codes with complex deterministic models, we propose instead to use more flexible code distributions. These distributions are estimated non-parametrically by reversing the generator map during training. The…

    Submitted 28 July, 2017; originally announced July 2017.

  14. arXiv:1611.06652  [pdf, other]

    stat.ML cs.LG

    Scalable Adaptive Stochastic Optimization Using Random Projections

    Authors: Gabriel Krummenacher, Brian McWilliams, Yannic Kilcher, Joachim M. Buhmann, Nicolai Meinshausen

    Abstract: Adaptive stochastic gradient methods such as AdaGrad have gained popularity in particular for training deep neural networks. The most commonly used and studied variant maintains a diagonal matrix approximation to second order information by accumulating past gradients which are used to tune the step size adaptively. In certain situations the full-matrix variant of AdaGrad is expected to attain bet…

    Submitted 21 November, 2016; originally announced November 2016.

    Comments: To appear in Advances in Neural Information Processing Systems 29 (NIPS 2016)