Skip to main content

Showing 1–11 of 11 results for author: Patel, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.02144  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Towards Interpretable Soft Prompts

    Authors: Oam Patel, Jason Wang, Nikhil Shivakumar Nayak, Suraj Srinivas, Himabindu Lakkaraju

    Abstract: Soft prompts have been popularized as a cheap and easy way to improve task-specific LLM performance beyond few-shot prompts. Despite their origin as an automated prompting method, however, soft prompts and other trainable prompts remain a black-box method with no immediately interpretable connections to prompting. We create a novel theoretical framework for evaluating the interpretability of train… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: 9 pages, 8 figures

    MSC Class: 68T50 ACM Class: I.2.0; G.3

  2. arXiv:2503.18182  [pdf, other

    cs.CL

    Exploring Topic Trends in COVID-19 Research Literature using Non-Negative Matrix Factorization

    Authors: Divya Patel, Vansh Parikh, Om Patel, Agam Shah, Bhaskar Chaudhury

    Abstract: In this work, we apply topic modeling using Non-Negative Matrix Factorization (NMF) on the COVID-19 Open Research Dataset (CORD-19) to uncover the underlying thematic structure and its evolution within the extensive body of COVID-19 research literature. NMF factorizes the document-term matrix into two non-negative matrices, effectively representing the topics and their distribution across the docu… ▽ More

    Submitted 23 March, 2025; originally announced March 2025.

  3. arXiv:2501.14249  [pdf, other

    cs.LG cs.AI cs.CL

    Humanity's Last Exam

    Authors: Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, Sean Shi, Michael Choi, Anish Agrawal, Arnav Chopra, Adam Khoja, Ryan Kim, Richard Ren, Jason Hausenloy, Oliver Zhang, Mantas Mazeika, Dmitry Dodonov, Tung Nguyen, Jaeho Lee, Daron Anderson, Mikhail Doroshenko, Alun Cennyth Stokes , et al. (1084 additional authors not shown)

    Abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90\% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of… ▽ More

    Submitted 19 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 29 pages, 6 figures

  4. arXiv:2501.03277  [pdf, other

    cs.CL

    HonkaiChat: Companions from Anime that feel alive!

    Authors: Yueze Liu, Yichi Zhang, Shaan Om Patel, Zhaoyang Zhu, Shilong Guo

    Abstract: Modern conversational agents, including anime-themed chatbots, are frequently reactive and personality-driven but fail to capture the dynamic nature of human interactions. We propose an event-driven dialogue framework to address these limitations by embedding dynamic events in conversation prompts and fine-tuning models on character-specific data. Evaluations on GPT-4 and comparisons with industry… ▽ More

    Submitted 5 January, 2025; originally announced January 2025.

    Comments: 5 pages, 4 figures. This is a preprint. Not yet submitted to a journal or conference. More iterated versions to be updated

  5. arXiv:2407.03518  [pdf

    cs.CL cs.AI

    Improving LLM Abilities in Idiomatic Translation

    Authors: Sundesh Donthi, Maximilian Spencer, Om Patel, Joon Doh, Eid Rodan, Kevin Zhu, Sean O'Brien

    Abstract: For large language models (LLMs) like NLLB and GPT, translating idioms remains a challenge. Our goal is to enhance translation fidelity by improving LLM processing of idiomatic language while preserving the original linguistic style. This has a significant social impact, as it preserves cultural nuances and ensures translated texts retain their intent and emotional resonance, fostering better cros… ▽ More

    Submitted 22 January, 2025; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: Preprint for LoResLM Workshop at COLING 2025

  6. arXiv:2406.07882  [pdf, other

    cs.CL cs.AI cs.HC

    Designing a Dashboard for Transparency and Control of Conversational AI

    Authors: Yida Chen, Aoyu Wu, Trevor DePodesta, Catherine Yeh, Kenneth Li, Nicholas Castillo Marin, Oam Patel, Jan Riecke, Shivam Raval, Olivia Seow, Martin Wattenberg, Fernanda Viégas

    Abstract: Conversational LLMs function as black box systems, leaving users guessing about why they see the output they do. This lack of transparency is potentially problematic, especially given concerns around bias and truthfulness. To address this issue, we present an end-to-end prototype-connecting interpretability techniques with user experience design-that seeks to make chatbots more transparent. We beg… ▽ More

    Submitted 14 October, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: Project page: https://bit.ly/talktuner-project-page, 38 pages, 23 figures

  7. arXiv:2403.05030  [pdf, other

    cs.CR cs.AI cs.LG

    Defending Against Unforeseen Failure Modes with Latent Adversarial Training

    Authors: Stephen Casper, Lennart Schulze, Oam Patel, Dylan Hadfield-Menell

    Abstract: Despite extensive diagnostics and debugging by developers, AI systems sometimes exhibit harmful unintended behaviors. Finding and fixing these is challenging because the attack surface is so large -- it is not tractable to exhaustively search for inputs that may elicit harmful behaviors. Red-teaming and adversarial training (AT) are commonly used to improve robustness, however, they empirically st… ▽ More

    Submitted 21 August, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  8. arXiv:2403.03218  [pdf, other

    cs.LG cs.AI cs.CL cs.CY

    The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

    Authors: Nathaniel Li, Alexander Pan, Anjali Gopal, Summer Yue, Daniel Berrios, Alice Gatti, Justin D. Li, Ann-Kathrin Dombrowski, Shashwat Goel, Long Phan, Gabriel Mukobi, Nathan Helm-Burger, Rassin Lababidi, Lennart Justen, Andrew B. Liu, Michael Chen, Isabelle Barrass, Oliver Zhang, Xiaoyuan Zhu, Rishub Tamirisa, Bhrugu Bharathi, Adam Khoja, Zhenqi Zhao, Ariel Herbert-Voss, Cort B. Breuer , et al. (32 additional authors not shown)

    Abstract: The White House Executive Order on Artificial Intelligence highlights the risks of large language models (LLMs) empowering malicious actors in developing biological, cyber, and chemical weapons. To measure these risks of malicious use, government institutions and major AI labs are developing evaluations for hazardous capabilities in LLMs. However, current evaluations are private, preventing furthe… ▽ More

    Submitted 15 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: See the project page at https://wmdp.ai

  9. arXiv:2306.03341  [pdf, other

    cs.LG cs.AI cs.CL

    Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

    Authors: Kenneth Li, Oam Patel, Fernanda Viégas, Hanspeter Pfister, Martin Wattenberg

    Abstract: We introduce Inference-Time Intervention (ITI), a technique designed to enhance the "truthfulness" of large language models (LLMs). ITI operates by shifting model activations during inference, following a set of directions across a limited number of attention heads. This intervention significantly improves the performance of LLaMA models on the TruthfulQA benchmark. On an instruction-finetuned LLa… ▽ More

    Submitted 26 June, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 spotlight; code: https://github.com/likenneth/honest_llama

  10. arXiv:2204.11835  [pdf, other

    q-bio.QM cs.AI cs.LG

    A Novel Scalable Apache Spark Based Feature Extraction Approaches for Huge Protein Sequence and their Clustering Performance Analysis

    Authors: Preeti Jha, Aruna Tiwari, Neha Bharill, Milind Ratnaparkhe, Om Prakash Patel, Nilagiri Harshith, Mukkamalla Mounika, Neha Nagendra

    Abstract: Genome sequencing projects are rapidly increasing the number of high-dimensional protein sequence datasets. Clustering a high-dimensional protein sequence dataset using traditional machine learning approaches poses many challenges. Many different feature extraction methods exist and are widely used. However, extracting features from millions of protein sequences becomes impractical because they ar… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

  11. A Comparative Study of Histogram Equalization Based Image Enhancement Techniques for Brightness Preservation and Contrast Enhancement

    Authors: Omprakash Patel, Yogendra P. S. Maravi, Sanjeev Sharma

    Abstract: Histogram Equalization is a contrast enhancement technique in the image processing which uses the histogram of image. However histogram equalization is not the best method for contrast enhancement because the mean brightness of the output image is significantly different from the input image. There are several extensions of histogram equalization has been proposed to overcome the brightness preser… ▽ More

    Submitted 16 November, 2013; originally announced November 2013.

    Comments: 15 pages, 5 figures, 4 tables, Signal & Image Processing : An International Journal (SIPIJ)

    Journal ref: Signal & Image Processing : An International Journal (SIPIJ) Vol.4, No.5, October 2013