Showing 201–250 of 589 results for author: Lin, B

  1. arXiv:2310.16654  [pdf, other]

    cs.CL

    ChatGPT is a Potential Zero-Shot Dependency Parser

    Authors: Boda Lin, Xinyi Zhou, Binghao Tang, Xiaocheng Gong, Si Li

    Abstract: Pre-trained language models have been widely used in the dependency parsing task and have achieved significant improvements in parser performance. However, it remains an understudied question whether pre-trained language models can spontaneously exhibit the ability of dependency parsing without introducing additional parser structure in the zero-shot scenario. In this paper, we propose to explore the… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 10 pages

  2. arXiv:2310.13573  [pdf, other]

    cs.CV cs.AI

    Boosting Generalization with Adaptive Style Techniques for Fingerprint Liveness Detection

    Authors: Kexin Zhu, Bo Lin, Yang Qiu, Adam Yule, Yao Tang, Jiajun Liang

    Abstract: We introduce a high-performance fingerprint liveness feature extraction technique that secured first place in LivDet 2023 Fingerprint Representation Challenge. Additionally, we developed a practical fingerprint recognition system with 94.68% accuracy, earning second place in LivDet 2023 Liveness Detection in Action. By investigating various methods, particularly style transfer, we demonstrate impr… ▽ More

    Submitted 24 October, 2023; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: 1st Place in LivDet2023 Fingerprint Representation Challenge

  3. arXiv:2310.12487  [pdf, other]

    cs.LG

    Improved Operator Learning by Orthogonal Attention

    Authors: Zipeng Xiao, Zhongkai Hao, Bokai Lin, Zhijie Deng, Hang Su

    Abstract: Neural operators, as an efficient surrogate model for learning the solutions of PDEs, have received extensive attention in the field of scientific machine learning. Among them, attention-based neural operators have become one of the mainstreams in related research. However, existing approaches overfit the limited training data due to the considerable number of parameters in the attention mechanism… ▽ More

    Submitted 26 December, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: 14 pages, 5 figures

  4. arXiv:2310.11564  [pdf, other]

    cs.CL

    Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging

    Authors: Joel Jang, Seungone Kim, Bill Yuchen Lin, Yizhong Wang, Jack Hessel, Luke Zettlemoyer, Hannaneh Hajishirzi, Yejin Choi, Prithviraj Ammanabrolu

    Abstract: While Reinforcement Learning from Human Feedback (RLHF) aligns Large Language Models (LLMs) with general, aggregate human preferences, it is suboptimal for learning diverse, individual perspectives. In this work, we study Reinforcement Learning from Personalized Human Feedback (RLPHF) problem, wherein LLMs are aligned to multiple (sometimes conflicting) preferences by modeling alignment as a Multi… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Preprint

  5. arXiv:2310.08370  [pdf, other]

    cs.CV

    UniPAD: A Universal Pre-training Paradigm for Autonomous Driving

    Authors: Honghui Yang, Sha Zhang, Di Huang, Xiaoyang Wu, Haoyi Zhu, Tong He, Shixiang Tang, Hengshuang Zhao, Qibo Qiu, Binbin Lin, Xiaofei He, Wanli Ouyang

    Abstract: In the context of autonomous driving, the significance of effective feature learning is widely acknowledged. While conventional 3D self-supervised pre-training methods have shown widespread success, most methods follow the ideas originally designed for 2D images. In this paper, we present UniPAD, a novel self-supervised learning paradigm applying 3D volumetric differentiable rendering. UniPAD impl… ▽ More

    Submitted 7 April, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: CVPR2024

  6. arXiv:2310.05402  [pdf]

    cond-mat.mtrl-sci cond-mat.str-el

    Challenges for density functional theory in simulating metal-metal singlet bonding: a case study of dimerized VO2

    Authors: Yubo Zhang, Da Ke, Junxiong Wu, Chutong Zhang, Baichen Lin, Zuhuang Chen, John P. Perdew, Jianwei Sun

    Abstract: VO2 is renowned for its electric transition from an insulating monoclinic (M1) phase characterized by V-V dimerized structures, to a metallic rutile (R) phase above 340 Kelvin. This transition is accompanied by a magnetic change: the M1 phase exhibits a non-magnetic spin-singlet state, while the R phase exhibits a state with local magnetic moments. Simultaneous simulation of the structural, electr… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 14 pages, 6 figures

  7. arXiv:2310.01886  [pdf, other]

    cs.LG cs.CL cs.CV

    BYOM: Building Your Own Multi-Task Model For Free

    Authors: Weisen Jiang, Baijiong Lin, Han Shi, Yu Zhang, Zhenguo Li, James T. Kwok

    Abstract: Recently, various merging methods have been proposed to build a multi-task model from task-specific finetuned models without retraining. However, existing methods suffer from a large performance deterioration compared to using multiple task-specific models. In this paper, we propose to inject task-specific knowledge into the merged model and design two parameter-efficient approaches (BYOM-FFT and… ▽ More

    Submitted 3 February, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Technical Report

  8. arXiv:2310.01852  [pdf, other]

    cs.CV cs.AI

    LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

    Authors: Bin Zhu, Bin Lin, Munan Ning, Yang Yan, Jiaxi Cui, HongFa Wang, Yatian Pang, Wenhao Jiang, Junwu Zhang, Zongwei Li, Wancai Zhang, Zhifeng Li, Wei Liu, Li Yuan

    Abstract: The video-language (VL) pretraining has achieved remarkable improvement in multiple downstream tasks. However, the current VL pretraining framework is hard to extend to multiple modalities (N modalities, N>=3) beyond vision and language. We thus propose LanguageBind, taking the language as the bind across different modalities because the language modality is well-explored and contains rich semanti… ▽ More

    Submitted 21 January, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Accepted by ICLR 2024

  9. arXiv:2310.00752  [pdf, other]

    cs.CL cs.AI

    TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks

    Authors: Dongfu Jiang, Yishan Li, Ge Zhang, Wenhao Huang, Bill Yuchen Lin, Wenhu Chen

    Abstract: We present TIGERScore, a Trained metric that follows Instruction Guidance to perform Explainable and Reference-free evaluation over a wide spectrum of text generation tasks. Different from other automatic evaluation methods that only provide arcane scores, TIGERScore is guided by natural language instruction to provide error analysis to pinpoint the mi… ▽ More

    Submitted 9 May, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

  10. arXiv:2309.17277  [pdf, other]

    cs.AI

    Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4

    Authors: Jiaxian Guo, Bo Yang, Paul Yoo, Bill Yuchen Lin, Yusuke Iwasawa, Yutaka Matsuo

    Abstract: Unlike perfect information games, where all elements are known to every player, imperfect information games emulate the real-world complexities of decision-making under uncertain or incomplete information. GPT-4, the recent breakthrough in large language models (LLMs) trained on massive passive data, is notable for its knowledge retrieval and reasoning abilities. This paper delves into the applica… ▽ More

    Submitted 31 August, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

  11. arXiv:2309.16855  [pdf, other]

    stat.ME math.ST

    A Variational Spike-and-Slab Approach for Group Variable Selection

    Authors: Buyu Lin, Changhao Ge, Jun S. Liu

    Abstract: We introduce a class of generic spike-and-slab priors for high-dimensional linear regression with grouped variables and present a Coordinate-ascent Variational Inference (CAVI) algorithm for obtaining an optimal variational Bayes approximation. Using parameter expansion for a specific, yet comprehensive, family of slab distributions, we obtain a further gain in computational efficiency. The method… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 64 pages, 6 figures

  12. arXiv:2309.12444  [pdf, other]

    cs.CL

    Foundation Metrics for Evaluating Effectiveness of Healthcare Conversations Powered by Generative AI

    Authors: Mahyar Abbasian, Elahe Khatibi, Iman Azimi, David Oniani, Zahra Shakeri Hossein Abad, Alexander Thieme, Ram Sriram, Zhongqi Yang, Yanshan Wang, Bryant Lin, Olivier Gevaert, Li-Jia Li, Ramesh Jain, Amir M. Rahmani

    Abstract: Generative Artificial Intelligence is set to revolutionize healthcare delivery by transforming traditional patient care into a more personalized, efficient, and proactive process. Chatbots, serving as interactive conversational models, will probably drive this patient-centered transformation in healthcare. Through the provision of various services, including diagnosis, personalized lifestyle recom… ▽ More

    Submitted 28 February, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: 14 pages, 4 figures, 2 tables, journal paper

  13. Efficient Long-Short Temporal Attention Network for Unsupervised Video Object Segmentation

    Authors: Ping Li, Yu Zhang, Li Yuan, Huaxin Xiao, Binbin Lin, Xianghua Xu

    Abstract: Unsupervised Video Object Segmentation (VOS) aims at identifying the contours of primary foreground objects in videos without any prior knowledge. However, previous methods do not fully use spatial-temporal context and fail to tackle this challenging task in real-time. This motivates us to develop an efficient Long-Short Temporal Attention network (termed LSTA) for unsupervised VOS task from a hol… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Report number: 12 tables, 11 figures, https://github.com/mlvccn/LSTA_UVOS

    Journal ref: Pattern Recognition (PR'2024)

  14. arXiv:2309.11028  [pdf, other]

    q-bio.NC cs.LG stat.ME

    The Topology and Geometry of Neural Representations

    Authors: Baihan Lin, Nikolaus Kriegeskorte

    Abstract: A central question for neuroscience is how to characterize brain representations of perceptual and cognitive content. An ideal characterization should distinguish different functional regions with robustness to noise and idiosyncrasies of individual brains that do not correspond to computational differences. Previous studies have characterized brain representations by their representational geomet… ▽ More

    Submitted 3 June, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: codes: https://github.com/doerlbh/TopologicalRSA

    Journal ref: Proceedings of the National Academy of Sciences, 121(42), e2317881121 (2024)

  15. arXiv:2309.02144  [pdf, other]

    cs.CL cs.AI cs.LG

    Making Large Language Models Better Reasoners with Alignment

    Authors: Peiyi Wang, Lei Li, Liang Chen, Feifan Song, Binghuai Lin, Yunbo Cao, Tianyu Liu, Zhifang Sui

    Abstract: Reasoning is a cognitive process of using evidence to reach a sound conclusion. The reasoning capability is essential for large language models (LLMs) to serve as the brain of the artificial general intelligence agent. Recent studies reveal that fine-tuning LLMs on data with the chain of thought (COT) reasoning process can significantly enhance their reasoning capabilities. However, we find that t… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: Large Language Models; Reasoning; Alignment

  16. arXiv:2308.13429  [pdf, other]

    cs.SE

    Investigating the Impact of Vocabulary Difficulty and Code Naturalness on Program Comprehension

    Authors: Bin Lin, Gregorio Robles

    Abstract: Context: Developers spend most of their time comprehending source code during software development. Automatically assessing how readable and understandable source code is can provide various benefits in different tasks, such as task triaging and code reviews. While several studies have proposed approaches to predict software readability and understandability, most of them only focus on local chara… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: Accepted at ICSME 2023 Registered Reports Track

  17. arXiv:2308.12029  [pdf, other]

    cs.LG cs.AI

    Dual-Balancing for Multi-Task Learning

    Authors: Baijiong Lin, Weisen Jiang, Feiyang Ye, Yu Zhang, Pengguang Chen, Ying-Cong Chen, Shu Liu, James T. Kwok

    Abstract: Multi-task learning (MTL), a learning paradigm to learn multiple related tasks simultaneously, has achieved great success in various fields. However, the task balancing problem remains a significant challenge in MTL, with the disparity in loss/gradient scales often leading to performance compromises. In this paper, we propose a Dual-Balancing Multi-Task Learning (DB-MTL) method to alleviate the task b… ▽ More

    Submitted 29 September, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: Technical Report

  18. arXiv:2308.11948  [pdf, other]

    cs.CV

    Efficient Transfer Learning in Diffusion Models via Adversarial Noise

    Authors: Xiyu Wang, Baijiong Lin, Daochang Liu, Chang Xu

    Abstract: Diffusion Probabilistic Models (DPMs) have demonstrated substantial promise in image generation tasks but heavily rely on the availability of large amounts of training data. Previous works, like GANs, have tackled the limited data problem by transferring pre-trained models learned with sufficient data. However, those methods are hard to be utilized in DPMs since the distinct differences between DP… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

  19. arXiv:2308.11733  [pdf]

    cs.DC

    Demand-driven provisioning of Kubernetes-like resources in OSG

    Authors: Igor Sfiligoi, Frank Würthwein, Jeff Dost, Brian Lin, David Schultz

    Abstract: The OSG-operated Open Science Pool is an HTCondor-based virtual cluster that aggregates resources from compute clusters provided by several organizations. Most of the resources are not owned by OSG, so demand-based dynamic provisioning is important for maximizing usage without incurring excessive waste. OSG has long relied on GlideinWMS for most of its resource provisioning needs but is limited to… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: 6 pages, 3 figures, Submitted to Proceedings of CHEP23

  20. arXiv:2308.08275  [pdf, other]

    hep-ph nucl-th

    Generalized parton distributions of gluon in proton: a light-front quantization approach

    Authors: Bolang Lin, Sreeraj Nair, Siqi Xu, Zhi Hu, Chandan Mondal, Xingbo Zhao, James P. Vary

    Abstract: We solve for the gluon generalized parton distributions (GPDs) inside the proton, focusing specifically on leading twist chiral-even GPDs. We obtain and employ the light-front wavefunctions (LFWFs) of the proton from a light-front quantized Hamiltonian with Quantum Chromodynamics input using basis light-front quantization (BLFQ). Our investigation incorporates the valence Fock sector with three co… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: 10 pages, 4 figures

  21. AutoLTS: Automating Cycling Stress Assessment via Contrastive Learning and Spatial Post-processing

    Authors: Bo Lin, Shoshanna Saxe, Timothy C. Y. Chan

    Abstract: Cycling stress assessment, which quantifies cyclists' perceived stress imposed by the built environment and motor traffic, increasingly informs cycling infrastructure planning and cycling route recommendation. However, currently calculating cycling stress is slow and data-intensive, which hinders its broader application. In this paper, we propose a deep learning framework to support accurate, fas… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

  22. arXiv:2308.01738  [pdf, other]

    cs.CV

    Enhancing Visibility in Nighttime Haze Images Using Guided APSF and Gradient Adaptive Convolution

    Authors: Yeying Jin, Beibei Lin, Wending Yan, Yuan Yuan, Wei Ye, Robby T. Tan

    Abstract: Visibility in hazy nighttime scenes is frequently reduced by multiple factors, including low light, intense glow, light scattering, and the presence of multicolored light sources. Existing nighttime dehazing methods often struggle with handling glow or low-light conditions, resulting in either excessively dark visuals or unsuppressed glow outputs. In this paper, we enhance the visibility from a si… ▽ More

    Submitted 21 January, 2024; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: Accepted to ACM'MM2023, https://github.com/jinyeying/nighttime_dehaze

    Journal ref: Published in ACM'MM2023

  23. arXiv:2308.00520  [pdf, other]

    cs.CV

    NormKD: Normalized Logits for Knowledge Distillation

    Authors: Zhihao Chi, Tu Zheng, Hengjia Li, Zheng Yang, Boxi Wu, Binbin Lin, Deng Cai

    Abstract: Logit-based knowledge distillation has received less attention in recent years since feature-based methods perform better in most cases. Nevertheless, we find it still has untapped potential when we re-investigate the temperature, which is a crucial hyper-parameter to soften the logit outputs. For most of the previous works, it was set as a fixed value for the entire distillation procedure. However, as th… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  24. arXiv:2308.00287  [pdf, other]

    cs.CV cs.LG

    A Study of Unsupervised Evaluation Metrics for Practical and Automatic Domain Adaptation

    Authors: Minghao Chen, Zepeng Gao, Shuai Zhao, Qibo Qiu, Wenxiao Wang, Binbin Lin, Xiaofei He

    Abstract: Unsupervised domain adaptation (UDA) methods facilitate the transfer of models to target domains without labels. However, these methods necessitate a labeled target validation set for hyper-parameter tuning and model selection. In this paper, we aim to find an evaluation metric capable of assessing the quality of a transferred model without access to target validation labels. We begin with the met… ▽ More

    Submitted 18 September, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

  25. arXiv:2307.15245  [pdf, other]

    cs.LG cs.AI

    A Practical Recipe for Federated Learning Under Statistical Heterogeneity Experimental Design

    Authors: Mahdi Morafah, Weijia Wang, Bill Lin

    Abstract: Federated Learning (FL) has been an area of active research in recent years. There have been numerous studies in FL to make it more successful in the presence of data heterogeneity. However, despite the existence of many publications, the state of progress in the field is unknown. Many of the works use inconsistent experimental settings and there are no comprehensive studies on the effect of FL-sp… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

  26. arXiv:2307.13269  [pdf, other]

    cs.CL cs.AI

    LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

    Authors: Chengsong Huang, Qian Liu, Bill Yuchen Lin, Tianyu Pang, Chao Du, Min Lin

    Abstract: Low-rank adaptations (LoRA) are often employed to fine-tune large language models (LLMs) for new tasks. This paper investigates LoRA composability for cross-task generalization and introduces LoraHub, a simple framework devised for the purposive assembly of LoRA modules trained on diverse given tasks, with the objective of achieving adaptable performance on unseen tasks. With just a few examples f… ▽ More

    Submitted 18 August, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: COLM 2024

  27. arXiv:2307.08536  [pdf, other]

    cs.CV

    Variational Probabilistic Fusion Network for RGB-T Semantic Segmentation

    Authors: Baihong Lin, Zengrong Lin, Yulan Guo, Yulan Zhang, Jianxiao Zou, Shicai Fan

    Abstract: RGB-T semantic segmentation has been widely adopted to handle hard scenes with poor lighting conditions by fusing different modality features of RGB and thermal images. Existing methods try to find an optimal fusion feature for segmentation, resulting in sensitivity to modality noise, class-imbalance, and modality bias. To overcome the problems, this paper proposes a novel Variational Probabilisti… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

  28. arXiv:2307.07496  [pdf, other]

    physics.optics physics.app-ph physics.ins-det

    Cryogenic Optical Packaging Using Photonic Wire Bonds

    Authors: Becky Lin, Donald Witt, Jeff F. Young, Lukas Chrostowski

    Abstract: We present the required techniques for the successful low loss packaging of integrated photonic devices capable of operating down to 970 mK utilizing photonic wire bonds. This scalable technique is shown to have an insertion loss of less than 2 dB per connection between a SMF-28 single mode fibre and a silicon photonic chip at these temperatures. This technique has shown robustness to thermal cycl… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

  29. arXiv:2306.10364  [pdf, other]

    cs.CV

    Residual Spatial Fusion Network for RGB-Thermal Semantic Segmentation

    Authors: Ping Li, Junjie Chen, Binbin Lin, Xianghua Xu

    Abstract: Semantic segmentation plays an important role in widespread applications such as autonomous driving and robotic sensing. Traditional methods mostly use RGB images which are heavily affected by lighting conditions, e.g., darkness. Recent studies show thermal images are robust to the night scenario as a compensating modality for segmentation. However, existing works either simply fuse RGB-Thermal (RG… ▽ More

    Submitted 17 June, 2023; originally announced June 2023.

  30. arXiv:2306.07610  [pdf, other]

    cs.CL

    Soft Language Clustering for Multilingual Model Pre-training

    Authors: Jiali Zeng, Yufan Jiang, Yongjing Yin, Yi Jing, Fandong Meng, Binghuai Lin, Yunbo Cao, Jie Zhou

    Abstract: Multilingual pre-trained language models have demonstrated impressive (zero-shot) cross-lingual transfer abilities, however, their performance is hindered when the target language has distant typology from source languages or when pre-training data is limited in size. In this paper, we propose XLM-P, which contextually retrieves prompts as flexible guidance for encoding instances conditionally. Ou… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

  31. arXiv:2306.06402  [pdf, other]

    cs.LG

    A Single-Loop Deep Actor-Critic Algorithm for Constrained Reinforcement Learning with Provable Convergence

    Authors: Kexuan Wang, An Liu, Baishuo Lin

    Abstract: Deep Actor-Critic algorithms, which combine Actor-Critic with deep neural network (DNN), have been among the most prevalent reinforcement learning algorithms for decision-making problems in simulated environments. However, the existing deep Actor-Critic algorithms are still not mature to solve realistic problems with non-convex stochastic constraints and high cost to interact with the environment.… ▽ More

    Submitted 18 September, 2024; v1 submitted 10 June, 2023; originally announced June 2023.

  32. arXiv:2306.03902  [pdf, other]

    cs.CL cs.AI cs.LO q-bio.NC

    Utterance Classification with Logical Neural Network: Explainable AI for Mental Disorder Diagnosis

    Authors: Yeldar Toleubay, Don Joven Agravante, Daiki Kimura, Baihan Lin, Djallel Bouneffouf, Michiaki Tatsubori

    Abstract: In response to the global challenge of mental health problems, we propose a Logical Neural Network (LNN) based Neuro-Symbolic AI method for the diagnosis of mental disorders. Due to the lack of effective therapy coverage for mental disorders, there is a need for an AI solution that can assist therapists with the diagnosis. However, current Neural Network models lack explainability and may not be… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: ACL 2023

  33. Rhythm-controllable Attention with High Robustness for Long Sentence Speech Synthesis

    Authors: Dengfeng Ke, Yayue Deng, Yukang Jia, Jinlong Xue, Qi Luo, Ya Li, Jianqing Sun, Jiaen Liang, Binghuai Lin

    Abstract: Regressive Text-to-Speech (TTS) system utilizes attention mechanism to generate alignment between text and acoustic feature sequence. Alignment determines synthesis robustness (e.g., the occurrence of skipping, repeating, and collapse) and rhythm via duration control. However, current attention algorithms used in speech synthesis cannot control rhythm using external duration information to generate… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 5 pages, 3 figures, Published in: 2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP)

  34. arXiv:2306.02561  [pdf, other]

    cs.CL cs.AI cs.LG

    LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion

    Authors: Dongfu Jiang, Xiang Ren, Bill Yuchen Lin

    Abstract: We present LLM-Blender, an ensembling framework designed to attain consistently superior performance by leveraging the diverse strengths of multiple open-source large language models (LLMs). Our framework consists of two modules: PairRanker and GenFuser, addressing the observation that optimal LLMs for different examples can significantly vary. PairRanker employs a specialized pairwise comparison… ▽ More

    Submitted 30 June, 2023; v1 submitted 4 June, 2023; originally announced June 2023.

    Comments: Accepted to ACL 2023 (main conference); Project website: https://yuchenlin.xyz/LLM-Blender/ V3 update: fix a few typos and update a few citations; V2 update: The experiments on summarization, translation, and constrained generation tasks in the prior version have been moved to the appendix

  35. arXiv:2306.00416  [pdf, other]

    cs.CV cs.AI cs.GR

    Interactive Character Control with Auto-Regressive Motion Diffusion Models

    Authors: Yi Shi, Jingbo Wang, Xuekun Jiang, Bingkun Lin, Bo Dai, Xue Bin Peng

    Abstract: Real-time character control is an essential component for interactive experiences, with a broad range of applications, including physics simulations, video games, and virtual reality. The success of diffusion models for image synthesis has led to the use of these models for motion synthesis. However, the majority of these motion diffusion models are primarily designed for offline applications, whe… ▽ More

    Submitted 15 August, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

  36. arXiv:2305.19540  [pdf, ps, other]

    cond-mat.mtrl-sci physics.app-ph

    Numerical analysis and optimization of a hybrid layer structure for triplet-triplet fusion mechanism in organic light-emitting diodes

    Authors: Jun-Yu Huang, Hsiao-Chun Hung, Kung-Chi Hsu, Chia-Hsun Chen, Pei-Hsi Lee, Hung-Yi Lin, Bo-Yen Lin, Man-kit Leung, Tien-Lung Chiu, Jiun-Haw Lee, Richard H. Friend, Yuh-Renn Wu

    Abstract: In this study, we develop a steady state and time-dependent exciton diffusion model including singlet and triplet excitons coupled with a modified Poisson and drift-diffusion solver to explain the mechanism of hyper triplet-triplet fusion (TTF) organic light-emitting diodes (OLEDs). Using this modified simulator, we demonstrate various characteristics of OLEDs, including the J-V curve, internal qu… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Journal ref: Adv. Theory Simul. 2022, 2200633

  37. arXiv:2305.19202  [pdf, ps, other]

    cs.PL

    Integrating Logic Rules with Everything Else, Seamlessly

    Authors: Yanhong A. Liu, Scott D. Stoller, Yi Tong, Bo Lin

    Abstract: This paper presents a language, Alda, that supports all of logic rules, sets, functions, updates, and objects as seamlessly integrated built-ins. The key idea is to support predicates in rules as set-valued variables that can be used and updated in any scope, and support queries using rules as either explicit or implicit automatic calls to an inference function. We have defined a formal semantic… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: To be published in Theory and Practice of Logic Programming, Special issue for selected papers from 39th International Conference on Logic Programming. arXiv admin note: substantial text overlap with arXiv:2205.15204

  38. arXiv:2305.18654  [pdf, other]

    cs.CL cs.AI cs.LG

    Faith and Fate: Limits of Transformers on Compositionality

    Authors: Nouha Dziri, Ximing Lu, Melanie Sclar, Xiang Lorraine Li, Liwei Jiang, Bill Yuchen Lin, Peter West, Chandra Bhagavatula, Ronan Le Bras, Jena D. Hwang, Soumya Sanyal, Sean Welleck, Xiang Ren, Allyson Ettinger, Zaid Harchaoui, Yejin Choi

    Abstract: Transformer large language models (LLMs) have sparked admiration for their exceptional performance on tasks that demand intricate multi-step reasoning. Yet, these models simultaneously show failures on surprisingly trivial problems. This begs the question: Are these errors incidental, or do they signal more substantial limitations? In an attempt to demystify transformer LLMs, we investigate the li… ▽ More

    Submitted 31 October, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: 10 pages + appendix (40 pages)

  39. arXiv:2305.17926  [pdf, other]

    cs.CL cs.AI cs.IR

    Large Language Models are not Fair Evaluators

    Authors: Peiyi Wang, Lei Li, Liang Chen, Zefan Cai, Dawei Zhu, Binghuai Lin, Yunbo Cao, Qi Liu, Tianyu Liu, Zhifang Sui

    Abstract: In this paper, we uncover a systematic bias in the evaluation paradigm of adopting large language models (LLMs), e.g., GPT-4, as a referee to score and compare the quality of responses generated by candidate models. We find that the quality ranking of candidate responses can be easily hacked by simply altering their order of appearance in the context. This manipulation allows us to skew the evalua… ▽ More

    Submitted 30 August, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

  40. arXiv:2305.17390  [pdf, other]

    cs.CL cs.AI cs.LG cs.MA cs.RO

    SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks

    Authors: Bill Yuchen Lin, Yicheng Fu, Karina Yang, Faeze Brahman, Shiyu Huang, Chandra Bhagavatula, Prithviraj Ammanabrolu, Yejin Choi, Xiang Ren

    Abstract: We introduce SwiftSage, a novel agent framework inspired by the dual-process theory of human cognition, designed to excel in action planning for complex interactive reasoning tasks. SwiftSage integrates the strengths of behavior cloning and prompting large language models (LLMs) to enhance task completion performance. The framework comprises two primary modules: the Swift module, representing fast… ▽ More

    Submitted 6 December, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: Accepted to NeurIPS 2023 (spotlight). Project website: https://swiftsage.github.io

  41. arXiv:2305.15835  [pdf, other]

    cs.LG cs.AI

    PDE+: Enhancing Generalization via PDE with Adaptive Distributional Diffusion

    Authors: Yige Yuan, Bingbing Xu, Bo Lin, Liang Hou, Fei Sun, Huawei Shen, Xueqi Cheng

    Abstract: The generalization of neural networks is a central challenge in machine learning, especially concerning the performance under distributions that differ from training ones. Current methods, mainly based on the data-driven paradigm such as data augmentation, adversarial training, and noise injection, may encounter limited generalization due to model non-smoothness. In this paper, we propose to inves… ▽ More

    Submitted 15 December, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted by Annual AAAI Conference on Artificial Intelligence (AAAI) 2024. Code is available at https://github.com/yuanyige/pde-add

  42. arXiv:2305.15065  [pdf, other]

    cs.CL

    Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning

    Authors: Ximing Lu, Faeze Brahman, Peter West, Jaehun Jang, Khyathi Chandu, Abhilasha Ravichander, Lianhui Qin, Prithviraj Ammanabrolu, Liwei Jiang, Sahana Ramnath, Nouha Dziri, Jillian Fisher, Bill Yuchen Lin, Skyler Hallinan, Xiang Ren, Sean Welleck, Yejin Choi

    Abstract: While extreme-scale language models have demonstrated exceptional performance on a variety of language tasks, the degree of control over these language models through pure prompting can often be limited. Directly fine-tuning such language models can be effective for tailoring them, but it can be either extremely costly (e.g., GPT-3) or not even feasible for the broader community (e.g., GPT-4). W…

    Submitted 6 December, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  43. arXiv:2305.14927  [pdf]

    physics.optics eess.SP

    Scalable wavelength-multiplexing photonic reservoir computing

    Authors: Rui-Qian Li, Yi-Wei Shen, Bao-De Lin, Jingyi Yu, Xuming He, Cheng Wang

    Abstract: Photonic reservoir computing (PRC) is a special hardware recurrent neural network, which is featured with fast training speed and low training cost. This work shows a wavelength-multiplexing PRC architecture, taking advantage of the numerous longitudinal modes in a Fabry-Perot semiconductor laser. These modes construct connected physical neurons in parallel, while an optical feedback loop provides…

    Submitted 24 May, 2023; originally announced May 2023.

  44. arXiv:2305.14760  [pdf, other]

    cs.CL

    Bi-Drop: Enhancing Fine-tuning Generalization via Synchronous sub-net Estimation and Optimization

    Authors: Shoujie Tong, Heming Xia, Damai Dai, Runxin Xu, Tianyu Liu, Binghuai Lin, Yunbo Cao, Zhifang Sui

    Abstract: Pretrained language models have achieved remarkable success in natural language understanding. However, fine-tuning pretrained models on limited training data tends to overfit and thus diminish performance. This paper presents Bi-Drop, a fine-tuning strategy that selectively updates model parameters using gradients from various sub-nets dynamically generated by dropout. The sub-net estimation of B…

    Submitted 22 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 Findings. Camera-ready version. Co-first authors with equal contributions

  45. arXiv:2305.14751  [pdf, other]

    cs.CL cs.AI

    DialogVCS: Robust Natural Language Understanding in Dialogue System Upgrade

    Authors: Zefan Cai, Xin Zheng, Tianyu Liu, Xu Wang, Haoran Meng, Jiaqi Han, Gang Yuan, Binghuai Lin, Baobao Chang, Yunbo Cao

    Abstract: In the constant updates of the product dialogue systems, we need to retrain the natural language understanding (NLU) model as new data from the real users would be merged into the existent data accumulated in the last updates. Within the newly added data, new intents would emerge and might have semantic entanglement with the existing intents, e.g. new intents that are semantically too specific or…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Work in progress. The first three authors contributed equally.

  46. arXiv:2305.14652  [pdf, other]

    cs.CL

    Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion

    Authors: Shaoxiang Wu, Damai Dai, Ziwei Qin, Tianyu Liu, Binghuai Lin, Yunbo Cao, Zhifang Sui

    Abstract: Video multimodal fusion aims to integrate multimodal signals in videos, such as visual, audio and text, to make a complementary prediction with multiple modalities contents. However, unlike other image-text multimodal tasks, video has longer multimodal sequences with more redundancy and noise in both visual and audio modalities. Prior denoising methods like forget gate are coarse in the granularit…

    Submitted 31 May, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023

  47. arXiv:2305.11291  [pdf]

    cs.HC

    A systematic review of safety-critical scenarios between automated vehicles and vulnerable road users

    Authors: Aditya Deshmukh, Zifei Wang, Aaron Gunn, Huizhong Guo, Rini Sherony, Fred Feng, Brian Lin, Shan Bao, Feng Zhou

    Abstract: Automated vehicles (AVs) are of great potential in reducing crashes on the road. However, it is still complicated to eliminate all the possible accidents, especially those with vulnerable road users (VRUs), who are at greater risk than vehicle occupants in traffic accidents. Thus, in this paper, we conducted a systematic review of safety-critical scenarios between AVs and VRUs. We identifie…

    Submitted 18 May, 2023; originally announced May 2023.

  48. arXiv:2305.10785  [pdf]

    cs.SE

    CCT5: A Code-Change-Oriented Pre-Trained Model

    Authors: Bo Lin, Shangwen Wang, Zhongxin Liu, Yepang Liu, Xin Xia, Xiaoguang Mao

    Abstract: Software is constantly changing, requiring developers to perform several derived tasks in a timely manner, such as writing a description for the intention of the code change, or identifying the defect-prone code changes. Considering that the cost of dealing with these tasks can account for a large proportion (typically around 70 percent) of the total development expenditure, automating such proces…

    Submitted 18 May, 2023; originally announced May 2023.

  49. arXiv:2305.06621  [pdf, other]

    cs.CV

    PVT-SSD: Single-Stage 3D Object Detector with Point-Voxel Transformer

    Authors: Honghui Yang, Wenxiao Wang, Minghao Chen, Binbin Lin, Tong He, Hua Chen, Xiaofei He, Wanli Ouyang

    Abstract: Recent Transformer-based 3D object detectors learn point cloud features either from point- or voxel-based representations. However, the former requires time-consuming sampling while the latter introduces quantization errors. In this paper, we present a novel Point-Voxel Transformer for single-stage 3D detection (PVT-SSD) that takes advantage of these two representations. Specifically, we first use…

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: CVPR 2023

  50. arXiv:2305.04636  [pdf, other]

    cs.CL

    Enhancing Continual Relation Extraction via Classifier Decomposition

    Authors: Heming Xia, Peiyi Wang, Tianyu Liu, Binghuai Lin, Yunbo Cao, Zhifang Sui

    Abstract: Continual relation extraction (CRE) models aim at handling emerging new relations while avoiding catastrophically forgetting old ones in the streaming data. Though improvements have been shown by previous CRE studies, most of them only adopt a vanilla strategy when models first learn representations of new relations. In this work, we point out that there exist two typical biases after training of…

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted to Findings of ACL 2023