Showing 1–12 of 12 results for author: Hinck, M

  1. arXiv:2505.15970  [pdf, ps, other]

    cs.CV cs.LG

    Analyzing Hierarchical Structure in Vision Models with Sparse Autoencoders

    Authors: Matthew Lyle Olson, Musashi Hinck, Neale Ratzlaff, Changbai Li, Phillip Howard, Vasudev Lal, Shao-Yen Tseng

    Abstract: The ImageNet hierarchy provides a structured taxonomy of object categories, offering a valuable lens through which to analyze the representations learned by deep vision models. In this work, we conduct a comprehensive analysis of how vision models encode the ImageNet hierarchy, leveraging Sparse Autoencoders (SAEs) to probe their internal representations. SAEs have been widely used as an explanati…

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: (Oral) CVPR 2025 Workshop on Mechanistic Interpretability for Vision. Authors 1 and 2 contributed equally

  2. arXiv:2502.10928  [pdf, other]

    cs.LG cs.AI cs.CL

    Probing Semantic Routing in Large Mixture-of-Expert Models

    Authors: Matthew Lyle Olson, Neale Ratzlaff, Musashi Hinck, Man Luo, Sungduk Yu, Chendi Xue, Vasudev Lal

    Abstract: In the past year, large (>100B parameter) mixture-of-expert (MoE) models have become increasingly common in the open domain. While their advantages are often framed in terms of efficiency, prior work has also explored functional differentiation through routing behavior. We investigate whether expert routing in large MoE models is influenced by the semantics of the inputs. To test this, we design t…

    Submitted 21 May, 2025; v1 submitted 15 February, 2025; originally announced February 2025.

    Comments: 16 pages, 5 figures, 5 tables

  3. arXiv:2502.08395  [pdf, other]

    cs.CL

    IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance

    Authors: Paul Röttger, Musashi Hinck, Valentin Hofmann, Kobi Hackenburg, Valentina Pyatkin, Faeze Brahman, Dirk Hovy

    Abstract: Large language models (LLMs) are helping millions of users write texts about diverse issues, and in doing so expose users to different ideas and perspectives. This creates concerns about issue bias, where an LLM tends to present just one perspective on a given issue, which in turn may influence how users think about this issue. So far, it has not been possible to measure which issue biases LLMs ac…

    Submitted 12 February, 2025; originally announced February 2025.

    Comments: under review

  4. arXiv:2412.06060  [pdf, other]

    cs.CL cs.AI

    Steering Large Language Models to Evaluate and Amplify Creativity

    Authors: Matthew Lyle Olson, Neale Ratzlaff, Musashi Hinck, Shao-yen Tseng, Vasudev Lal

    Abstract: Although capable of generating creative text, Large Language Models (LLMs) are poor judges of what constitutes "creativity". In this work, we show that we can leverage this knowledge of how to write creatively in order to better judge what is creative. We take a mechanistic approach that extracts differences in the internal states of an LLM when prompted to respond "boringly" or "creatively" to pr…

    Submitted 8 December, 2024; originally announced December 2024.

    Comments: (Spotlight) NeurIPS 2024 Workshop on Creativity & Generative AI. Authors 1 and 2 contributed equally

  5. arXiv:2411.12590  [pdf, other]

    cs.CV cs.LG

    Debias your Large Multi-Modal Model at Test-Time via Non-Contrastive Visual Attribute Steering

    Authors: Neale Ratzlaff, Matthew Lyle Olson, Musashi Hinck, Estelle Aflalo, Shao-Yen Tseng, Vasudev Lal, Phillip Howard

    Abstract: Large Multi-Modal Models (LMMs) have demonstrated impressive capabilities as general-purpose chatbots able to engage in conversations about visual inputs. However, their responses are influenced by societal biases present in their training datasets, leading to undesirable differences in how the model responds when presented with images depicting people of different demographics. In this work, we p…

    Submitted 13 March, 2025; v1 submitted 15 November, 2024; originally announced November 2024.

    Comments: 10 pages, 6 Figures, 8 Tables. arXiv admin note: text overlap with arXiv:2410.13976

  6. arXiv:2410.13976  [pdf, other]

    cs.CV cs.CL cs.LG

    Debiasing Large Vision-Language Models by Ablating Protected Attribute Representations

    Authors: Neale Ratzlaff, Matthew Lyle Olson, Musashi Hinck, Shao-Yen Tseng, Vasudev Lal, Phillip Howard

    Abstract: Large Vision Language Models (LVLMs) such as LLaVA have demonstrated impressive capabilities as general-purpose chatbots that can engage in conversations about a provided input image. However, their responses are influenced by societal biases present in their training datasets, leading to undesirable differences in how the model responds when presented with images depicting people of different dem…

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: NeurIPS workshop on SafeGenAI, 10 pages, 2 figures

  7. AutoPersuade: A Framework for Evaluating and Explaining Persuasive Arguments

    Authors: Till Raphael Saenger, Musashi Hinck, Justin Grimmer, Brandon M. Stewart

    Abstract: We introduce AutoPersuade, a three-part framework for constructing persuasive messages. First, we curate a large dataset of arguments with human evaluations. Next, we develop a novel topic model to identify argument features that influence persuasiveness. Finally, we use this model to predict the effectiveness of new arguments and assess the causal impact of different components to provide explana…

    Submitted 11 March, 2025; v1 submitted 11 October, 2024; originally announced October 2024.

    Comments: Published in Proceedings of EMNLP 2024. The official version is available in the ACL Anthology at https://aclanthology.org/2024.emnlp-main.913/

  8. arXiv:2408.15993  [pdf, other]

    cs.CV cs.LG physics.ao-ph

    ClimDetect: A Benchmark Dataset for Climate Change Detection and Attribution

    Authors: Sungduk Yu, Brian L. White, Anahita Bhiwandiwalla, Musashi Hinck, Matthew Lyle Olson, Yaniv Gurwicz, Raanan Y. Rohekar, Tung Nguyen, Vasudev Lal

    Abstract: Detecting and attributing temperature increases driven by climate change is crucial for understanding global warming and informing adaptation strategies. However, distinguishing human-induced climate signals from natural variability remains challenging for traditional detection and attribution (D&A) methods, which rely on identifying specific "fingerprints" -- spatial patterns expected to emerge f…

    Submitted 10 March, 2025; v1 submitted 28 August, 2024; originally announced August 2024.

  9. arXiv:2407.02333  [pdf, other]

    cs.CL cs.CV

    Why do LLaVA Vision-Language Models Reply to Images in English?

    Authors: Musashi Hinck, Carolin Holtermann, Matthew Lyle Olson, Florian Schneider, Sungduk Yu, Anahita Bhiwandiwalla, Anne Lauscher, Shaoyen Tseng, Vasudev Lal

    Abstract: We uncover a surprising multilingual bias occurring in a popular class of multimodal vision-language models (VLMs). Including an image in the query to a LLaVA-style VLM significantly increases the likelihood of the model returning an English response, regardless of the language of the query. This paper investigates the causes of this loss with a two-pronged approach that combines extensive ablatio…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Pre-print

  10. arXiv:2404.01331  [pdf, other]

    cs.CL cs.AI

    LLaVA-Gemma: Accelerating Multimodal Foundation Models with a Compact Language Model

    Authors: Musashi Hinck, Matthew L. Olson, David Cobbley, Shao-Yen Tseng, Vasudev Lal

    Abstract: We train a suite of multimodal foundation models (MMFM) using the popular LLaVA framework with the recently released Gemma family of large language models (LLMs). Of particular interest is the 2B parameter Gemma model, which provides opportunities to construct capable small-scale MMFMs. In line with findings from other papers in this space, we test the effect of ablating three design features: pre…

    Submitted 10 June, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

    Comments: CVPR 2024, MMFM workshop. Authors 1 and 2 contributed equally. Models available at https://huggingface.co/intel/llava-gemma-2b/ and https://huggingface.co/intel/llava-gemma-7b/. Training code at https://github.com/IntelLabs/multimodal_cognitive_ai/tree/main/LLaVA-Gemma

  11. arXiv:2402.16786  [pdf, other]

    cs.CL cs.AI

    Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models

    Authors: Paul Röttger, Valentin Hofmann, Valentina Pyatkin, Musashi Hinck, Hannah Rose Kirk, Hinrich Schütze, Dirk Hovy

    Abstract: Much recent work seeks to evaluate values and opinions in large language models (LLMs) using multiple-choice surveys and questionnaires. Most of this work is motivated by concerns around real-world LLM applications. For example, politically-biased LLMs may subtly influence society when they are used by millions of people. Such real-world concerns, however, stand in stark contrast to the artificial…

    Submitted 5 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted at ACL 2024 (Main Conference)

  12. arXiv:2306.04746  [pdf, other]

    stat.ME cs.CL cs.LG stat.ML

    Using Imperfect Surrogates for Downstream Inference: Design-based Supervised Learning for Social Science Applications of Large Language Models

    Authors: Naoki Egami, Musashi Hinck, Brandon M. Stewart, Hanying Wei

    Abstract: In computational social science (CSS), researchers analyze documents to explain social and political phenomena. In most scenarios, CSS researchers first obtain labels for documents and then explain labels using interpretable regression analyses in the second step. One increasingly common way to annotate documents cheaply at scale is through large language models (LLMs). However, like other scalabl…

    Submitted 14 January, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)