Showing 1–7 of 7 results for author: Vouitsis, N

  1. arXiv:2506.10060

    cs.LG cs.AI stat.ML

    Textual Bayes: Quantifying Uncertainty in LLM-Based Systems

    Authors: Brendan Leigh Ross, Noël Vouitsis, Atiyeh Ashari Ghomi, Rasa Hosseinzadeh, Ji Xin, Zhaoyan Liu, Yi Sui, Shiyi Hou, Kin Kwan Leung, Gabriel Loaiza-Ganem, Jesse C. Cresswell

    Abstract: Although large language models (LLMs) are becoming increasingly capable of solving challenging real-world tasks, accurately quantifying their uncertainty remains a critical open problem, which limits their applicability in high-stakes domains. This challenge is further compounded by the closed-source, black-box nature of many state-of-the-art LLMs. Moreover, LLM-based systems can be highly sensiti…

    Submitted 11 June, 2025; originally announced June 2025.

  2. arXiv:2411.08954

    cs.LG cs.AI

    Inconsistencies In Consistency Models: Better ODE Solving Does Not Imply Better Samples

    Authors: Noël Vouitsis, Rasa Hosseinzadeh, Brendan Leigh Ross, Valentin Villecroze, Satya Krishna Gorti, Jesse C. Cresswell, Gabriel Loaiza-Ganem

    Abstract: Although diffusion models can generate remarkably high-quality samples, they are intrinsically bottlenecked by their expensive iterative sampling procedure. Consistency models (CMs) have recently emerged as a promising diffusion model distillation method, reducing the cost of sampling by generating high-fidelity samples in just a few iterations. Consistency model distillation aims to solve the pro…

    Submitted 15 November, 2024; v1 submitted 13 November, 2024; originally announced November 2024.

    Comments: NeurIPS 2024 ATTRIB Workshop

  3. arXiv:2410.12916

    cs.CL

    MSc-SQL: Multi-Sample Critiquing Small Language Models For Text-To-SQL Translation

    Authors: Satya Krishna Gorti, Ilan Gofman, Zhaoyan Liu, Jiapeng Wu, Noël Vouitsis, Guangwei Yu, Jesse C. Cresswell, Rasa Hosseinzadeh

    Abstract: Text-to-SQL generation enables non-experts to interact with databases via natural language. Recent advances rely on large closed-source models like GPT-4 that present challenges in accessibility, privacy, and latency. To address these issues, we focus on developing small, efficient, and open-source text-to-SQL models. We demonstrate the benefits of sampling multiple candidate SQL generations and p…

    Submitted 16 February, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: Published at NAACL 2025

  4. arXiv:2401.13744

    cs.LG cs.HC stat.ML

    Conformal Prediction Sets Improve Human Decision Making

    Authors: Jesse C. Cresswell, Yi Sui, Bhargava Kumar, Noël Vouitsis

    Abstract: In response to everyday queries, humans explicitly signal uncertainty and offer alternative answers when they are unsure. Machine learning models that output calibrated prediction sets through conformal prediction mimic this human behaviour; larger sets signal greater uncertainty while providing alternatives. In this work, we study the usefulness of conformal prediction sets as an aid for human de…

    Submitted 9 June, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Published at ICML 2024. Code available at https://github.com/layer6ai-labs/hitl-conformal-prediction

  5. arXiv:2312.10144

    cs.LG cs.AI cs.CV

    Data-Efficient Multimodal Fusion on a Single GPU

    Authors: Noël Vouitsis, Zhaoyan Liu, Satya Krishna Gorti, Valentin Villecroze, Jesse C. Cresswell, Guangwei Yu, Gabriel Loaiza-Ganem, Maksims Volkovs

    Abstract: The goal of multimodal alignment is to learn a single latent space that is shared between multimodal inputs. The most powerful models in this space have been trained using massive datasets of paired inputs and large-scale computational resources, making them prohibitively expensive to train in many practical scenarios. We surmise that existing unimodal encoders pre-trained on large amounts of unim…

    Submitted 10 April, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: CVPR 2024 (Highlight)

  6. arXiv:2304.13742

    cs.LG cs.AI stat.ML

    TR0N: Translator Networks for 0-Shot Plug-and-Play Conditional Generation

    Authors: Zhaoyan Liu, Noel Vouitsis, Satya Krishna Gorti, Jimmy Ba, Gabriel Loaiza-Ganem

    Abstract: We propose TR0N, a highly general framework to turn pre-trained unconditional generative models, such as GANs and VAEs, into conditional models. The conditioning can be highly arbitrary, and requires only a pre-trained auxiliary model. For example, we show how to turn unconditional models into class-conditional ones with the help of a classifier, and also into text-to-image models by leveraging CL…

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: Accepted at ICML 2023

  7. arXiv:2203.15086

    cs.CV

    X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval

    Authors: Satya Krishna Gorti, Noel Vouitsis, Junwei Ma, Keyvan Golestan, Maksims Volkovs, Animesh Garg, Guangwei Yu

    Abstract: In text-video retrieval, the objective is to learn a cross-modal similarity function between a text and a video that ranks relevant text-video pairs higher than irrelevant pairs. However, videos inherently express a much wider gamut of information than texts. Instead, texts often capture sub-regions of entire videos and are most semantically similar to certain frames within videos. Therefore, for…

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: CVPR 2022