Skip to main content

Showing 1–50 of 245 results for author: Hwang, D

.
  1. arXiv:2506.05211  [pdf

    cs.CY cs.AI

    Intentionally Unintentional: GenAI Exceptionalism and the First Amendment

    Authors: David Atkinson, Jena D. Hwang, Jacob Morrison

    Abstract: This paper challenges the assumption that courts should grant First Amendment protections to outputs from large generative AI models, such as GPT-4 and Gemini. We argue that because these models lack intentionality, their outputs do not constitute speech as understood in the context of established legal precedent, so there can be no speech to protect. Furthermore, if the model outputs are not spee… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

  2. arXiv:2506.00195  [pdf, ps, other

    cs.CL cs.AI cs.HC

    Let Them Down Easy! Contextual Effects of LLM Guardrails on User Perceptions and Preferences

    Authors: Mingqian Zheng, Wenjia Hu, Patrick Zhao, Motahhare Eslami, Jena D. Hwang, Faeze Brahman, Carolyn Rose, Maarten Sap

    Abstract: Current LLMs are trained to refuse potentially harmful input queries regardless of whether users actually had harmful intents, causing a tradeoff between safety and user experience. Through a study of 480 participants evaluating 3,840 query-response pairs, we examine how different refusal strategies affect user perceptions across varying motivations. Our findings reveal that response strategy larg… ▽ More

    Submitted 30 May, 2025; originally announced June 2025.

  3. arXiv:2505.08891  [pdf, other

    cs.CL

    Clicking some of the silly options: Exploring Player Motivation in Static and Dynamic Educational Interactive Narratives

    Authors: Daeun Hwang, Samuel Shields, Alex Calderwood, Shi Johnson-Bey, Michael Mateas, Noah Wardrip-Fruin, Edward F. Melcer

    Abstract: Motivation is an important factor underlying successful learning. Previous research has demonstrated the positive effects that static interactive narrative games can have on motivation. Concurrently, advances in AI have made dynamic and adaptive approaches to interactive narrative increasingly accessible. However, limited work has explored the impact that dynamic narratives can have on learner mot… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: 8 pages, 3 figures, 1 table, 1 appendix. Workshop paper, CHI 2025 Augmented Educators and AI

  4. arXiv:2505.06271  [pdf, other

    cs.LG cs.AI cs.SD

    Tri-MTL: A Triple Multitask Learning Approach for Respiratory Disease Diagnosis

    Authors: June-Woo Kim, Sanghoon Lee, Miika Toikkanen, Daehwan Hwang, Kyunghoon Kim

    Abstract: Auscultation remains a cornerstone of clinical practice, essential for both initial evaluation and continuous monitoring. Clinicians listen to the lung sounds and make a diagnosis by combining the patient's medical history and test results. Given this strong association, multitask learning (MTL) can offer a compelling framework to simultaneously model these relationships, integrating respiratory s… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: Accepted to EMBC 2025

  5. arXiv:2504.11393  [pdf, other

    cs.LG cs.CL

    DataDecide: How to Predict Best Pretraining Data with Small Experiments

    Authors: Ian Magnusson, Nguyen Tai, Ben Bogin, David Heineman, Jena D. Hwang, Luca Soldaini, Akshita Bhagia, Jiacheng Liu, Dirk Groeneveld, Oyvind Tafjord, Noah A. Smith, Pang Wei Koh, Jesse Dodge

    Abstract: Because large language models are expensive to pretrain on different datasets, using smaller-scale experiments to decide on data is crucial for reducing costs. Which benchmarks and methods of making decisions from observed performance at small scale most accurately predict the datasets that yield the best large models? To empower open exploration of this question, we release models, data, and eval… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

  6. arXiv:2504.10861  [pdf, other

    cs.CL

    Ai2 Scholar QA: Organized Literature Synthesis with Attribution

    Authors: Amanpreet Singh, Joseph Chee Chang, Chloe Anastasiades, Dany Haddad, Aakanksha Naik, Amber Tanaka, Angele Zamarron, Cecile Nguyen, Jena D. Hwang, Jason Dunkleberger, Matt Latzke, Smita Rao, Jaron Lochner, Rob Evans, Rodney Kinney, Daniel S. Weld, Doug Downey, Sergey Feldman

    Abstract: Retrieval-augmented generation is increasingly effective in answering scientific questions from literature, but many state-of-the-art systems are expensive and closed-source. We introduce Ai2 Scholar QA, a free online scientific question answering application. To facilitate research, we make our entire pipeline public: as a customizable open-source Python package and interactive web app, along wit… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: 7 pages

  7. arXiv:2504.04659  [pdf, ps, other

    econ.TH

    Competitive Information Disclosure with Heterogeneous Consumer Search

    Authors: Dongjin Hwang, Ilwoo Hwang

    Abstract: We study a model of competitive information design in an oligopoly search market with heterogeneous consumer search costs. A unique class of equilibria -- upper-censorship equilibria -- emerges under intense competition. In equilibrium, firms balance competitive pressure with local monopoly power granted by search frictions. Notably, firms disclose only partial information even as the number of fi… ▽ More

    Submitted 6 April, 2025; originally announced April 2025.

  8. arXiv:2503.10003  [pdf, other

    cs.AI cs.LG

    A New Benchmark for Few-Shot Class-Incremental Learning: Redefining the Upper Bound

    Authors: Shiwon Kim, Dongjun Hwang, Sungwon Woo, Rita Singh

    Abstract: Class-incremental learning (CIL) aims to continuously adapt to emerging classes while retaining knowledge of previously learned ones. Few-shot class-incremental learning (FSCIL) presents an even greater challenge which requires the model to learn incremental classes with only a limited number of samples. In conventional CIL, joint training is widely considered the upper bound, serving as both a be… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

  9. arXiv:2412.12511  [pdf, other

    cs.CV

    Invisible Watermarks: Attacks and Robustness

    Authors: Dongjun Hwang, Sungwon Woo, Tom Gao, Raymond Luo, Sunghwan Baek

    Abstract: As Generative AI continues to become more accessible, the case for robust detection of generated images in order to combat misinformation is stronger than ever. Invisible watermarking methods act as identifiers of generated content, embedding image- and latent-space messages that are robust to many forms of perturbations. The majority of current research investigates full-image attacks against ima… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

    Comments: YouTube link for the presentation: https://www.youtube.com/watch?v=0vwFG1HSrUE

  10. arXiv:2411.17574  [pdf, ps, other

    math.AG math.DG

    Toric Fano manifolds that do not admit extremal Kähler metrics

    Authors: DongSeon Hwang, Hiroshi Sato, Naoto Yotsutani

    Abstract: We show that there exists a toric Fano manifold of dimension $10$ that does not admit an extremal Kähler metric in the first Chern class, answering a question of Mabuchi. By taking a product with a suitable toric Fano manifold, one can also produce a toric Fano manifold of dimension $n$ admitting no extremal Kähler metric in the first Chern class for each $n \geq 11$.

    Submitted 6 December, 2024; v1 submitted 26 November, 2024; originally announced November 2024.

    Comments: Discussion on higher-dimensional cases is included. Comments welcome!

    MSC Class: 53C55; 14L24; 14M25

  11. arXiv:2411.15124  [pdf, other

    cs.CL

    Tulu 3: Pushing Frontiers in Open Language Model Post-Training

    Authors: Nathan Lambert, Jacob Morrison, Valentina Pyatkin, Shengyi Huang, Hamish Ivison, Faeze Brahman, Lester James V. Miranda, Alisa Liu, Nouha Dziri, Shane Lyu, Yuling Gu, Saumya Malik, Victoria Graf, Jena D. Hwang, Jiangjiang Yang, Ronan Le Bras, Oyvind Tafjord, Chris Wilhelm, Luca Soldaini, Noah A. Smith, Yizhong Wang, Pradeep Dasigi, Hannaneh Hajishirzi

    Abstract: Language model post-training is applied to refine behaviors and unlock new skills across a wide range of recent language models, but open recipes for applying these techniques lag behind proprietary ones. The underlying training data and recipes for post-training are simultaneously the most important pieces of the puzzle and the portion with the least transparency. To bridge this gap, we introduce… ▽ More

    Submitted 14 April, 2025; v1 submitted 22 November, 2024; originally announced November 2024.

    Comments: Added Tulu 3 405B results and additional analyses

  12. arXiv:2411.00626  [pdf, other

    cs.CV

    ZIM: Zero-Shot Image Matting for Anything

    Authors: Beomyoung Kim, Chanyong Shin, Joonhyun Jeong, Hyungsik Jung, Se-Yun Lee, Sewhan Chun, Dong-Hyun Hwang, Joonsang Yu

    Abstract: The recent segmentation foundation model, Segment Anything Model (SAM), exhibits strong zero-shot segmentation capabilities, but it falls short in generating fine-grained precise masks. To address this limitation, we propose a novel zero-shot image matting model, called ZIM, with two key contributions: First, we develop a label converter that transforms segmentation labels into detailed matte labe… ▽ More

    Submitted 1 November, 2024; originally announced November 2024.

    Comments: preprint (21 pages, 16 figures, and 8 tables)

  13. arXiv:2410.18385  [pdf, other

    cs.AI cs.IR cs.LG

    Link, Synthesize, Retrieve: Universal Document Linking for Zero-Shot Information Retrieval

    Authors: Dae Yon Hwang, Bilal Taha, Harshit Pande, Yaroslav Nechaev

    Abstract: Despite the recent advancements in information retrieval (IR), zero-shot IR remains a significant challenge, especially when dealing with new domains, languages, and newly-released use cases that lack historical query traffic from existing users. For such cases, it is common to use query augmentations followed by fine-tuning pre-trained models on the document data paired with synthetic queries. In… ▽ More

    Submitted 24 October, 2024; v1 submitted 23 October, 2024; originally announced October 2024.

    Comments: Accepted for publication at EMNLP 2024 Main Conference

  14. arXiv:2410.14632  [pdf, other

    cs.CL

    Diverging Preferences: When do Annotators Disagree and do Models Know?

    Authors: Michael JQ Zhang, Zhilin Wang, Jena D. Hwang, Yi Dong, Olivier Delalleau, Yejin Choi, Eunsol Choi, Xiang Ren, Valentina Pyatkin

    Abstract: We examine diverging preferences in human-labeled preference datasets. We develop a taxonomy of disagreement sources spanning 10 categories across four high-level classes -- task underspecification, response style, refusals, and annotation errors. We find that the majority of disagreements are in opposition with standard reward modeling approaches, which are designed with the assumption that annot… ▽ More

    Submitted 6 November, 2024; v1 submitted 18 October, 2024; originally announced October 2024.

  15. arXiv:2410.11536  [pdf, other

    cs.CV

    Overcoming Domain Limitations in Open-vocabulary Segmentation

    Authors: Dongjun Hwang, Seong Joon Oh, Junsuk Choe

    Abstract: Open-vocabulary segmentation (OVS) has gained attention for its ability to recognize a broader range of classes. However, OVS models show significant performance drops when applied to unseen domains beyond the previous training dataset. Fine-tuning these models on new datasets can improve performance, but often leads to the catastrophic forgetting of previously learned knowledge. To address this i… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  16. arXiv:2410.10335  [pdf, ps, other

    cs.IT eess.SP

    Performance of a Threshold-based WDM and ACM for FSO Communication between Mobile Platforms in Maritime Environments

    Authors: Jae-Eun Han, Sung Sik Nam, Duck Dong Hwang, Mohamed-Slim Alouini

    Abstract: In this study, we statistically analyze the performance of a threshold-based multiple optical signal selection scheme (TMOS) for wavelength division multiplexing (WDM) and adaptive coded modulation (ACM) using free space optical (FSO) communication between mobile platforms in maritime environments with fog and 3D pointing errors. Specifically, we derive a new closed-form expression for a composite… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  17. arXiv:2410.09754  [pdf, ps, other

    cs.LG cs.AI

    SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning

    Authors: Hojoon Lee, Dongyoon Hwang, Donghu Kim, Hyunseung Kim, Jun Jet Tai, Kaushik Subramanian, Peter R. Wurman, Jaegul Choo, Peter Stone, Takuma Seno

    Abstract: Recent advances in CV and NLP have been largely driven by scaling up the number of network parameters, despite traditional theories suggesting that larger networks are prone to overfitting. These large networks avoid overfitting by integrating components that induce a simplicity bias, guiding models toward simple and generalizable solutions. However, in deep RL, designing and scaling up networks h… ▽ More

    Submitted 29 May, 2025; v1 submitted 13 October, 2024; originally announced October 2024.

    Comments: ICLR'25 (spotlight)

  18. arXiv:2409.16497  [pdf, other

    cs.AI

    Unsupervised Text Representation Learning via Instruction-Tuning for Zero-Shot Dense Retrieval

    Authors: Qiuhai Zeng, Zimeng Qiu, Dae Yon Hwang, Xin He, William M. Campbell

    Abstract: Dense retrieval systems are commonly used for information retrieval (IR). They rely on learning text representations through an encoder and usually require supervised modeling via labelled data which can be costly to obtain or simply unavailable. In this study, we introduce a novel unsupervised text representation learning technique via instruction-tuning the pre-trained encoder-decoder large lang… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: Accepted at DCAI24 workshop@CIKM2024

  19. arXiv:2408.06621  [pdf, other

    cs.LG cs.CL

    Towards Robust and Parameter-Efficient Knowledge Unlearning for LLMs

    Authors: Sungmin Cha, Sungjun Cho, Dasol Hwang, Moontae Lee

    Abstract: Large Language Models (LLMs) have demonstrated strong reasoning and memorization capabilities via pretraining on massive textual corpora. However, this poses risk of privacy and copyright violations, highlighting the need for efficient machine unlearning methods that remove sensitive data without retraining from scratch. While Gradient Ascent (GA) is commonly used to unlearn by reducing the likeli… ▽ More

    Submitted 24 April, 2025; v1 submitted 13 August, 2024; originally announced August 2024.

    Comments: ICLR 2025 camera-ready version

  20. arXiv:2407.12227  [pdf, other

    physics.ins-det astro-ph.IM hep-ex nucl-ex

    Development of MMC-based lithium molybdate cryogenic calorimeters for AMoRE-II

    Authors: A. Agrawal, V. V. Alenkov, P. Aryal, H. Bae, J. Beyer, B. Bhandari, R. S. Boiko, K. Boonin, O. Buzanov, C. R. Byeon, N. Chanthima, M. K. Cheoun, J. S. Choe, S. Choi, S. Choudhury, J. S. Chung, F. A. Danevich, M. Djamal, D. Drung, C. Enss, A. Fleischmann, A. M. Gangapshev, L. Gastaldo, Y. M. Gavrilyuk, A. M. Gezhaev , et al. (84 additional authors not shown)

    Abstract: The AMoRE collaboration searches for neutrinoless double beta decay of $^{100}$Mo using molybdate scintillating crystals via low temperature thermal calorimetric detection. The early phases of the experiment, AMoRE-pilot and AMoRE-I, have demonstrated competitive discovery potential. Presently, the AMoRE-II experiment, featuring a large detector array with about 90 kg of $^{100}$Mo isotope, is und… ▽ More

    Submitted 3 March, 2025; v1 submitted 16 July, 2024; originally announced July 2024.

    Journal ref: Eur. Phys. J. C 85, 172 (2025)

  21. arXiv:2407.07950  [pdf, other

    cs.CL cs.AI cs.HC

    Rel-A.I.: An Interaction-Centered Approach To Measuring Human-LM Reliance

    Authors: Kaitlyn Zhou, Jena D. Hwang, Xiang Ren, Nouha Dziri, Dan Jurafsky, Maarten Sap

    Abstract: The ability to communicate uncertainty, risk, and limitation is crucial for the safety of large language models. However, current evaluations of these abilities rely on simple calibration, asking whether the language generated by the model matches appropriate probabilities. Instead, evaluation of this aspect of LLM communication should focus on the behaviors of their human interlocutors: how much… ▽ More

    Submitted 3 October, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: Preprint

  22. Improved limit on neutrinoless double beta decay of $^{100}$Mo from AMoRE-I

    Authors: A. Agrawal, V. V. Alenkov, P. Aryal, J. Beyer, B. Bhandari, R. S. Boiko, K. Boonin, O. Buzanov, C. R. Byeon, N. Chanthima, M. K. Cheoun, J. S. Choe, Seonho Choi, S. Choudhury, J. S. Chung, F. A. Danevich, M. Djamal, D. Drung, C. Enss, A. Fleischmann, A. M. Gangapshev, L. Gastaldo, Y. M. Gavrilyuk, A. M. Gezhaev, O. Gileva , et al. (83 additional authors not shown)

    Abstract: AMoRE searches for the signature of neutrinoless double beta decay of $^{100}$Mo with a 100 kg sample of enriched $^{100}$Mo. Scintillating molybdate crystals coupled with a metallic magnetic calorimeter operate at milli-Kelvin temperatures to measure the energy of electrons emitted in the decay. As a demonstration of the full-scale AMoRE, we conducted AMoRE-I, a pre-experiment with 18 molybdate c… ▽ More

    Submitted 3 March, 2025; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: 7 pages, 5 figures

    Journal ref: Phys. Rev. Lett., 134, 082501 (2025)

  23. arXiv:2407.04249  [pdf, other

    cs.CV

    FeatureSORT: Essential Features for Effective Tracking

    Authors: Hamidreza Hashempoor, Rosemary Koikara, Yu Dong Hwang

    Abstract: In this work, we introduce a novel tracker designed for online multiple object tracking with a focus on being simple, while being effective. we provide multiple feature modules each of which stands for a particular appearance information. By integrating distinct appearance features, including clothing color, style, and target direction, alongside a ReID network for robust embedding extraction, our… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  24. arXiv:2406.06072  [pdf, other

    cs.CV cs.LG cs.RO

    Adapting Pretrained ViTs with Convolution Injector for Visuo-Motor Control

    Authors: Dongyoon Hwang, Byungkun Lee, Hojoon Lee, Hyunseung Kim, Jaegul Choo

    Abstract: Vision Transformers (ViT), when paired with large-scale pretraining, have shown remarkable performance across various computer vision tasks, primarily due to their weak inductive bias. However, while such weak inductive bias aids in pretraining scalability, this may hinder the effective adaptation of ViTs for visuo-motor control tasks as a result of the absence of control-centric inductive biases.… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: accepted to ICML 2024

  25. arXiv:2406.06037  [pdf, other

    cs.LG cs.AI cs.CV

    Investigating Pre-Training Objectives for Generalization in Vision-Based Reinforcement Learning

    Authors: Donghu Kim, Hojoon Lee, Kyungmin Lee, Dongyoon Hwang, Jaegul Choo

    Abstract: Recently, various pre-training methods have been introduced in vision-based Reinforcement Learning (RL). However, their generalization ability remains unclear due to evaluations being limited to in-distribution environments and non-unified experimental setups. To address this, we introduce the Atari Pre-training Benchmark (Atari-PB), which pre-trains a ResNet-50 model on 10 million transitions fro… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: accepted to ICML 2024

  26. arXiv:2406.00324  [pdf, other

    cs.LG cs.AI

    Do's and Don'ts: Learning Desirable Skills with Instruction Videos

    Authors: Hyunseung Kim, Byungkun Lee, Hojoon Lee, Dongyoon Hwang, Donghu Kim, Jaegul Choo

    Abstract: Unsupervised skill discovery is a learning paradigm that aims to acquire diverse behaviors without explicit rewards. However, it faces challenges in learning complex behaviors and often leads to learning unsafe or undesirable behaviors. For instance, in various continuous control tasks, current unsupervised skill discovery methods succeed in learning basic locomotions like standing but struggle wi… ▽ More

    Submitted 22 January, 2025; v1 submitted 1 June, 2024; originally announced June 2024.

    Comments: published at NeurIPS 2024

  27. arXiv:2405.19703  [pdf, other

    cs.LG cs.CV stat.ML

    Towards a Better Evaluation of Out-of-Domain Generalization

    Authors: Duhun Hwang, Suhyun Kang, Moonjung Eo, Jimyeong Kim, Wonjong Rhee

    Abstract: The objective of Domain Generalization (DG) is to devise algorithms and models capable of achieving high performance on previously unseen test distributions. In the pursuit of this objective, average measure has been employed as the prevalent measure for evaluating models and comparing algorithms in the existing DG studies. Despite its significance, a comprehensive exploration of the average measu… ▽ More

    Submitted 2 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  28. arXiv:2405.12807  [pdf, other

    cs.LG cs.AI cs.IT

    FAdam: Adam is a natural gradient optimizer using diagonal empirical Fisher information

    Authors: Dongseong Hwang

    Abstract: This paper establishes a mathematical foundation for the Adam optimizer, elucidating its connection to natural gradient descent through Riemannian and information geometry. We provide an accessible and detailed analysis of the diagonal empirical Fisher information matrix (FIM) in Adam, clarifying all detailed approximations and advocating for the use of log probability functions as loss, which sho… ▽ More

    Submitted 3 September, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: 21 pages, 4 figures, 6 tables

  29. arXiv:2404.10199  [pdf, other

    cs.CL cs.AI

    CULTURE-GEN: Revealing Global Cultural Perception in Language Models through Natural Language Prompting

    Authors: Huihan Li, Liwei Jiang, Jena D. Hwang, Hyunwoo Kim, Sebastin Santy, Taylor Sorensen, Bill Yuchen Lin, Nouha Dziri, Xiang Ren, Yejin Choi

    Abstract: As the utilization of large language models (LLMs) has proliferated world-wide, it is crucial for them to have adequate knowledge and fair representation for diverse global cultures. In this work, we uncover culture perceptions of three SOTA models on 110 countries and regions on 8 culture-related topics through culture-conditioned generations, and extract symbols from these generations that are a… ▽ More

    Submitted 20 August, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  30. arXiv:2404.09173  [pdf, other

    cs.LG cs.AI cs.CL

    TransformerFAM: Feedback attention is working memory

    Authors: Dongseong Hwang, Weiran Wang, Zhuoyuan Huo, Khe Chai Sim, Pedro Moreno Mengibar

    Abstract: While Transformers have revolutionized deep learning, their quadratic attention complexity hinders their ability to process infinitely long inputs. We propose Feedback Attention Memory (FAM), a novel Transformer architecture that leverages a feedback loop to enable the network to attend to its own latent representations. This design fosters the emergence of working memory within the Transformer, a… ▽ More

    Submitted 7 May, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

    Comments: 26 pages, 12 figures, 14 tables

  31. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  32. arXiv:2403.14238  [pdf, other

    cs.CL cs.AI

    Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection

    Authors: Kyungjae Lee, Dasol Hwang, Sunghyun Park, Youngsoo Jang, Moontae Lee

    Abstract: Despite the promise of RLHF in aligning LLMs with human preferences, it often leads to superficial alignment, prioritizing stylistic changes over improving downstream performance of LLMs. Underspecified preferences could obscure directions to align the models. Lacking exploration restricts identification of desirable outputs to improve the models. To overcome these challenges, we propose a novel f… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 22 pages, 5 figures, Submitted to ACL 2024

  33. arXiv:2403.12821  [pdf, other

    cs.LG cs.AI

    FlowerFormer: Empowering Neural Architecture Encoding using a Flow-aware Graph Transformer

    Authors: Dongyeong Hwang, Hyunju Kim, Sunwoo Kim, Kijung Shin

    Abstract: The success of a specific neural network architecture is closely tied to the dataset and task it tackles; there is no one-size-fits-all solution. Thus, considerable efforts have been made to quickly and accurately estimate the performances of neural architectures, without full training or evaluation, for given tasks and datasets. Neural architecture encoding has played a crucial role in the estima… ▽ More

    Submitted 21 March, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: CVPR 2024 Camera-Ready

  34. arXiv:2403.12231  [pdf, other

    cs.NI cs.DC math.CO

    Edge-Disjoint Spanning Trees on Star-Product Networks

    Authors: Kelly Isham, Laura Monroe, Kartik Lakhotia, Aleyah Dawkins, Daniel Hwang, Ales Kubicek

    Abstract: A star-product operation may be used to create large graphs from smaller factor graphs. Network topologies based on star-products demonstrate several advantages including low-diameter, high scalability, modularity and others. Many state-of-the-art diameter-2 and -3 topologies~(Slim Fly, Bundlefly, PolarStar etc.) can be represented as star products. In this paper, we explore constructions of edg… ▽ More

    Submitted 14 May, 2025; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Extended version of our paper with the same title accepted to IPDPS '25. Author order changed and a new author added

  35. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1112 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 16 December, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  36. arXiv:2402.17184  [pdf, other

    cs.CL cs.SD eess.AS

    Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models

    Authors: Rohit Prabhavalkar, Zhong Meng, Weiran Wang, Adam Stooke, Xingyu Cai, Yanzhang He, Arun Narayanan, Dongseong Hwang, Tara N. Sainath, Pedro J. Moreno

    Abstract: The accuracy of end-to-end (E2E) automatic speech recognition (ASR) models continues to improve as they are scaled to larger sizes, with some now reaching billions of parameters. Widespread deployment and adoption of these models, however, requires computationally efficient strategies for decoding. In the present work, we study one such strategy: applying multiple frame reduction layers in the enc… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted to 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)

  37. arXiv:2402.01183  [pdf, other

    cs.RO cs.AI cs.CL cs.CV

    LINGO-Space: Language-Conditioned Incremental Grounding for Space

    Authors: Dohyun Kim, Nayoung Oh, Deokmin Hwang, Daehyung Park

    Abstract: We aim to solve the problem of spatially localizing composite instructions referring to space: space grounding. Compared to current instance grounding, space grounding is challenging due to the ill-posedness of identifying locations referred to by discrete expressions and the compositional ambiguity of referring expressions. Therefore, we propose a novel probabilistic space-grounding methodology (… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted by AAAI 2024

  38. arXiv:2401.07476  [pdf, other

    nucl-ex hep-ex

    Background study of the AMoRE-pilot experiment

    Authors: A. Agrawal, V. V. Alenkov, P. Aryal, J. Beyer, B. Bhandari, R. S. Boiko, K. Boonin, O. Buzanov, C. R. Byeon, N. Chanthima, M. K. Cheoun, J. S. Choe, Seonho Choi, S. Choudhury, J. S. Chung, F. A. Danevich, M. Djamal, D. Drung, C. Enss, A. Fleischmann, A. M. Gangapshev, L. Gastaldo, Yu. M. Gavrilyuk, A. M. Gezhaev, O. Gileva , et al. (83 additional authors not shown)

    Abstract: We report a study on the background of the Advanced Molybdenum-Based Rare process Experiment (AMoRE), a search for neutrinoless double beta decay (\znbb) of $^{100}$Mo. The pilot stage of the experiment was conducted using $\sim$1.9 kg of \CAMOO~ crystals at the Yangyang Underground Laboratory, South Korea, from 2015 to 2018. We compared the measured $β/γ$ energy spectra in three experimental conf… ▽ More

    Submitted 7 April, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

  39. arXiv:2401.06730  [pdf, other

    cs.CL cs.AI cs.HC

    Relying on the Unreliable: The Impact of Language Models' Reluctance to Express Uncertainty

    Authors: Kaitlyn Zhou, Jena D. Hwang, Xiang Ren, Maarten Sap

    Abstract: As natural language becomes the default interface for human-AI interaction, there is a need for LMs to appropriately communicate uncertainties in downstream applications. In this work, we investigate how LMs incorporate confidence in responses via natural language and how downstream users behave in response to LM-articulated uncertainties. We examine publicly deployed models and find that LMs are… ▽ More

    Submitted 9 July, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

    Comments: ACL 2024 (Camera Ready)

  40. arXiv:2401.05730  [pdf, other

    cs.CV cs.AI

    Enhancing Contrastive Learning with Efficient Combinatorial Positive Pairing

    Authors: Jaeill Kim, Duhun Hwang, Eunjung Lee, Jangwon Suh, Jimyeong Kim, Wonjong Rhee

    Abstract: In the past few years, contrastive learning has played a central role for the success of visual unsupervised representation learning. Around the same time, high-performance non-contrastive learning methods have been developed as well. While most of the works utilize only two views, we carefully review the existing multi-view methods and propose a general multi-view strategy that can improve learni… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  41. arXiv:2312.10087  [pdf, ps, other

    eess.AS cs.LG cs.SD

    Revisiting the Entropy Semiring for Neural Speech Recognition

    Authors: Oscar Chang, Dongseong Hwang, Olivier Siohan

    Abstract: In streaming settings, speech recognition models have to map sub-sequences of speech to text before the full audio stream becomes available. However, since alignment information between speech and text is rarely available during training, models need to learn it in a completely self-supervised way. In practice, the exponential number of possible alignments makes this extremely challenging, with mo… ▽ More

    Submitted 18 December, 2023; v1 submitted 13 December, 2023; originally announced December 2023.

  42. arXiv:2312.07399  [pdf, other

    cs.CL cs.AI

    Large Language Models are Clinical Reasoners: Reasoning-Aware Diagnosis Framework with Prompt-Generated Rationales

    Authors: Taeyoon Kwon, Kai Tzu-iunn Ong, Dongjin Kang, Seungjun Moon, Jeong Ryong Lee, Dosik Hwang, Yongsik Sim, Beomseok Sohn, Dongha Lee, Jinyoung Yeo

    Abstract: Machine reasoning has made great progress in recent years owing to large language models (LLMs). In the clinical domain, however, most NLP-driven projects mainly focus on clinical classification or reading comprehension, and under-explore clinical reasoning for disease diagnosis due to the expensive rationale annotation with clinicians. In this work, we present a "reasoning-aware" diagnosis framew… ▽ More

    Submitted 10 May, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024

  43. arXiv:2311.16586  [pdf, other

    cs.IR

    SARDINE: A Simulator for Automated Recommendation in Dynamic and Interactive Environments

    Authors: Romain Deffayet, Thibaut Thonet, Dongyoon Hwang, Vassilissa Lehoux, Jean-Michel Renders, Maarten de Rijke

    Abstract: Simulators can provide valuable insights for researchers and practitioners who wish to improve recommender systems, because they allow one to easily tweak the experimental setup in which recommender systems operate, and as a result lower the cost of identifying general trends and uncovering novel findings about the candidate methods. A key requirement to enable this accelerated improvement cycle i… ▽ More

    Submitted 8 April, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

  44. arXiv:2311.11215  [pdf, other

    cs.CL cs.AI

    SPLAIN: Augmenting Cybersecurity Warnings with Reasons and Data

    Authors: Vera A. Kazakova, Jena D. Hwang, Bonnie J. Dorr, Yorick Wilks, J. Blake Gage, Alex Memory, Mark A. Clark

    Abstract: Effective cyber threat recognition and prevention demand comprehensible forecasting systems, as prior approaches commonly offer limited and, ultimately, unconvincing information. We introduce Simplified Plaintext Language (SPLAIN), a natural language generator that converts warning data into user-friendly cyber threat explanations. SPLAIN is designed to generate clear, actionable outputs, incorpor… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: Presented at FLAIRS-2019 as poster (see ancillary files)

    ACM Class: I.2

    Journal ref: FLAIRS-2019

  45. arXiv:2311.08469  [pdf, other

    cs.CL

    UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations

    Authors: Wenting Zhao, Justin T Chiu, Jena D. Hwang, Faeze Brahman, Jack Hessel, Sanjiban Choudhury, Yejin Choi, Xiang Lorraine Li, Alane Suhr

    Abstract: Language technologies that accurately model the dynamics of events must perform commonsense reasoning. Existing work evaluating commonsense reasoning focuses on making inferences about common, everyday situations. To instead investigate the ability to model unusual, unexpected, and unlikely situations, we explore the task of uncommonsense abductive reasoning. Given a piece of context with an unexp… ▽ More

    Submitted 1 May, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: accepted at NAACL'24

  46. arXiv:2311.00059  [pdf, other

    cs.AI cs.CL cs.CV cs.LG

    The Generative AI Paradox: "What It Can Create, It May Not Understand"

    Authors: Peter West, Ximing Lu, Nouha Dziri, Faeze Brahman, Linjie Li, Jena D. Hwang, Liwei Jiang, Jillian Fisher, Abhilasha Ravichander, Khyathi Chandu, Benjamin Newman, Pang Wei Koh, Allyson Ettinger, Yejin Choi

    Abstract: The recent wave of generative AI has sparked unprecedented global attention, with both excitement and concern over potentially superhuman levels of artificial intelligence: models now take only seconds to produce outputs that would challenge or exceed the capabilities even of expert humans. At the same time, models still show basic errors in understanding that would not be expected even in non-exp… ▽ More

    Submitted 31 October, 2023; originally announced November 2023.

  47. arXiv:2310.20178  [pdf, other

    cs.LG cs.AI

    Learning to Discover Skills through Guidance

    Authors: Hyunseung Kim, Byungkun Lee, Hojoon Lee, Dongyoon Hwang, Sejik Park, Kyushik Min, Jaegul Choo

    Abstract: In the field of unsupervised skill discovery (USD), a major challenge is limited exploration, primarily due to substantial penalties when skills deviate from their initial trajectories. To enhance exploration, recent methodologies employ auxiliary rewards to maximize the epistemic uncertainty or entropy of states. However, we have identified that the effectiveness of these rewards declines as the… ▽ More

    Submitted 1 November, 2023; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: 29 pages, 18 figures, published at NeurIPS 2023

  48. arXiv:2310.17793  [pdf, other

    cs.CL cs.AI

    "You Are An Expert Linguistic Annotator": Limits of LLMs as Analyzers of Abstract Meaning Representation

    Authors: Allyson Ettinger, Jena D. Hwang, Valentina Pyatkin, Chandra Bhagavatula, Yejin Choi

    Abstract: Large language models (LLMs) show amazing proficiency and fluency in the use of language. Does this mean that they have also acquired insightful linguistic knowledge about the language, to an extent that they can serve as an "expert linguistic annotator"? In this paper, we examine the successes and limitations of the GPT-3, ChatGPT, and GPT-4 models in analysis of sentence meaning structure, focus… ▽ More

    Submitted 11 December, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Findings (short)

  49. arXiv:2310.14356  [pdf, other

    cs.CV cs.CL cs.CY cs.HC

    Semantic and Expressive Variation in Image Captions Across Languages

    Authors: Andre Ye, Sebastin Santy, Jena D. Hwang, Amy X. Zhang, Ranjay Krishna

    Abstract: Computer vision often treats human perception as homogeneous: an implicit assumption that visual stimuli are perceived similarly by everyone. This assumption is reflected in the way researchers collect datasets and train vision models. By contrast, literature in cross-cultural psychology and linguistics has provided evidence that people from different cultural backgrounds observe vastly different… ▽ More

    Submitted 12 May, 2025; v1 submitted 22 October, 2023; originally announced October 2023.

    Comments: CVPR 2025

  50. arXiv:2309.12963  [pdf, ps, other

    eess.AS cs.SD

    Massive End-to-end Models for Short Search Queries

    Authors: Weiran Wang, Rohit Prabhavalkar, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Zhong Meng, CJ Zheng, Yanzhang He, Tara Sainath, Pedro Moreno Mengibar

    Abstract: In this work, we investigate two popular end-to-end automatic speech recognition (ASR) models, namely Connectionist Temporal Classification (CTC) and RNN-Transducer (RNN-T), for offline recognition of voice search queries, with up to 2B model parameters. The encoders of our models use the neural architecture of Google's universal speech model (USM), with additional funnel pooling layers to signifi… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.