
Showing 1–50 of 80 results for author: Long, D

Searching in archive cs.
  1. arXiv:2506.06950  [pdf, ps, other]

    cs.CL

    What Makes a Good Natural Language Prompt?

    Authors: Do Xuan Long, Duy Dinh, Ngoc-Hai Nguyen, Kenji Kawaguchi, Nancy F. Chen, Shafiq Joty, Min-Yen Kan

    Abstract: As large language models (LLMs) have progressed towards more human-like and human–AI communications have become prevalent, prompting has emerged as a decisive component. However, there is limited conceptual consensus on what exactly quantifies natural language prompts. We attempt to address this question by conducting a meta-analysis surveying more than 150 prompting-related papers from leading N…

    Submitted 7 June, 2025; originally announced June 2025.

    Comments: ACL 2025 Main Conference

  2. arXiv:2506.05176  [pdf, ps, other]

    cs.CL

    Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

    Authors: Yanzhao Zhang, Mingxin Li, Dingkun Long, Xin Zhang, Huan Lin, Baosong Yang, Pengjun Xie, An Yang, Dayiheng Liu, Junyang Lin, Fei Huang, Jingren Zhou

    Abstract: In this work, we introduce the Qwen3 Embedding series, a significant advancement over its predecessor, the GTE-Qwen series, in text embedding and reranking capabilities, built upon the Qwen3 foundation models. Leveraging the Qwen3 LLMs' robust capabilities in multilingual text understanding and generation, our innovative multi-stage training pipeline combines large-scale unsupervised pre-training…

    Submitted 10 June, 2025; v1 submitted 5 June, 2025; originally announced June 2025.

  3. arXiv:2506.04252  [pdf, ps, other]

    cs.AI cs.CL cs.LG

    A Graph-Retrieval-Augmented Generation Framework Enhances Decision-Making in the Circular Economy

    Authors: Yang Zhao, Chengxiao Dai, Dusit Niyato, Chuan Fu Tan, Keyi Xiang, Yueyang Wang, Zhiquan Yeo, Daren Tan Zong Loong, Jonathan Low Zhaozhi, Eugene H. Z. HO

    Abstract: Large language models (LLMs) hold promise for sustainable manufacturing, but often hallucinate industrial codes and emission factors, undermining regulatory and investment decisions. We introduce CircuGraphRAG, a retrieval-augmented generation (RAG) framework that grounds LLM outputs in a domain-specific knowledge graph for the circular economy. This graph connects 117,380 industrial and waste en…

    Submitted 1 June, 2025; originally announced June 2025.

  4. arXiv:2506.01265  [pdf, ps, other]

    cs.CL

    Beyond In-Context Learning: Aligning Long-form Generation of Large Language Models via Task-Inherent Attribute Guidelines

    Authors: Do Xuan Long, Duong Ngoc Yen, Do Xuan Trong, Luu Anh Tuan, Kenji Kawaguchi, Shafiq Joty, Min-Yen Kan, Nancy F. Chen

    Abstract: In-context learning (ICL) is an important yet not fully understood ability of pre-trained large language models (LLMs). It can greatly enhance task performance using a few examples, termed demonstrations, without fine-tuning. Although effective in question answering, ICL often underperforms in long-form generation tasks such as summarization. Under appropriately realistic assumptions, we empirical…

    Submitted 1 June, 2025; originally announced June 2025.

    Comments: ACL 2025 Findings

  5. arXiv:2504.17432  [pdf, other]

    cs.CV

    Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs

    Authors: Tiancheng Gu, Kaicheng Yang, Ziyong Feng, Xingjun Wang, Yanzhao Zhang, Dingkun Long, Yingda Chen, Weidong Cai, Jiankang Deng

    Abstract: The Contrastive Language-Image Pre-training (CLIP) framework has become a widely used approach for multimodal representation learning, particularly in image-text retrieval and clustering. However, its efficacy is constrained by three key limitations: (1) text token truncation, (2) isolated image-text encoding, and (3) deficient compositionality due to bag-of-words behavior. While recent Multimodal…

    Submitted 24 April, 2025; originally announced April 2025.

    Comments: 13 pages, 8 figures, Project page: https://garygutc.github.io/UniME

  6. arXiv:2504.09037  [pdf, other]

    cs.AI cs.CL

    A Survey of Frontiers in LLM Reasoning: Inference Scaling, Learning to Reason, and Agentic Systems

    Authors: Zixuan Ke, Fangkai Jiao, Yifei Ming, Xuan-Phi Nguyen, Austin Xu, Do Xuan Long, Minzhi Li, Chengwei Qin, Peifeng Wang, Silvio Savarese, Caiming Xiong, Shafiq Joty

    Abstract: Reasoning is a fundamental cognitive process that enables logical inference, problem-solving, and decision-making. With the rapid advancement of large language models (LLMs), reasoning has emerged as a key capability that distinguishes advanced AI systems from conventional models that empower chatbots. In this survey, we categorize existing methods along two orthogonal dimensions: (1) Regimes, whi…

    Submitted 11 April, 2025; originally announced April 2025.

    Comments: 72 pages, 6 figures

  7. arXiv:2503.16690  [pdf, other]

    astro-ph.IM astro-ph.EP cs.LG physics.optics

    Making the unmodulated pyramid wavefront sensor smart II. First on-sky demonstration of extreme adaptive optics with deep learning

    Authors: R. Landman, S. Y. Haffert, J. D. Long, J. R. Males, L. M. Close, W. B. Foster, K. Van Gorkom, O. Guyon, A. D. Hedglen, P. T. Johnson, M. Y. Kautz, J. K. Kueny, J. Li, J. Liberman, J. Lumbres, E. A. McEwen, A. McLeod, L. Schatz, E. Tonucci, K. Twitchell

    Abstract: Pyramid wavefront sensors (PWFSs) are the preferred choice for current and future extreme adaptive optics (XAO) systems. Almost all instruments use the PWFS in its modulated form to mitigate its limited linearity range. However, this modulation comes at the cost of a reduction in sensitivity, a blindness to petal-piston modes, and a limit to the sensor's ability to operate at high speeds. Therefor…

    Submitted 20 March, 2025; originally announced March 2025.

    Comments: Accepted for publication in A&A

  8. arXiv:2503.11899  [pdf, other]

    cs.LG eess.SP

    Spatio-temporal Fourier Transformer (StFT) for Long-term Dynamics Prediction

    Authors: Da Long, Shandian Zhe, Samuel Williams, Leonid Oliker, Zhe Bai

    Abstract: Simulating the long-term dynamics of multi-scale and multi-physics systems poses a significant challenge in understanding complex phenomena across science and engineering. The complexity arises from the intricate interactions between scales and the interplay of diverse physical processes. Neural operators have emerged as promising models for predicting such dynamics due to their flexibility and co…

    Submitted 14 March, 2025; originally announced March 2025.

    Comments: 16 pages, 10 figures

  9. arXiv:2502.13024  [pdf, other]

    cs.LG math.OC

    Fragility-aware Classification for Understanding Risk and Improving Generalization

    Authors: Chen Yang, Zheng Cui, Daniel Zhuoyu Long, Jin Qi, Ruohan Zhan

    Abstract: Classification models play a critical role in data-driven decision-making applications such as medical diagnosis, user profiling, recommendation systems, and default detection. Traditional performance metrics, such as accuracy, focus on overall error rates but fail to account for the confidence of incorrect predictions, thereby overlooking the risk of confident misjudgments. This risk is particula…

    Submitted 18 February, 2025; originally announced February 2025.

  10. arXiv:2502.12799  [pdf, other]

    cs.CL cs.CV cs.IR

    Towards Text-Image Interleaved Retrieval

    Authors: Xin Zhang, Ziqi Dai, Yongqi Li, Yanzhao Zhang, Dingkun Long, Pengjun Xie, Meishan Zhang, Jun Yu, Wenjie Li, Min Zhang

    Abstract: Current multimodal information retrieval studies mainly focus on single-image inputs, which limits real-world applications involving multiple images and text-image interleaved content. In this work, we introduce the text-image interleaved retrieval (TIIR) task, where the query and document are interleaved text-image sequences, and the model is required to understand the semantics from the interlea…

    Submitted 18 February, 2025; originally announced February 2025.

    Comments: 16 pages, 14 figures

  11. arXiv:2502.02682  [pdf, other]

    cs.LG physics.comp-ph

    Pseudo-Physics-Informed Neural Operators: Enhancing Operator Learning from Limited Data

    Authors: Keyan Chen, Yile Li, Da Long, Zhitong Xu, Wei Xing, Jacob Hochhalter, Shandian Zhe

    Abstract: Neural operators have shown great potential in surrogate modeling. However, training a well-performing neural operator typically requires a substantial amount of data, which can pose a major challenge in complex applications. In such scenarios, detailed physical knowledge can be unavailable or difficult to obtain, and collecting extensive data is often prohibitively expensive. To mitigate this cha…

    Submitted 4 February, 2025; originally announced February 2025.

  12. arXiv:2501.00977  [pdf, other]

    cs.OS cs.ET

    Host-guided data placement: whose job is it anyway?

    Authors: Devashish R. Purandare, Peter Alvaro, Avani Wildani, Darrell D. E. Long, Ethan L. Miller

    Abstract: The increasing demand for SSDs coupled with scaling difficulties have left manufacturers scrambling for newer SSD interfaces which promise better performance and durability. While these interfaces reduce the rigidity of traditional abstractions, they require application or system-level changes that can impact the stability, security, and portability of systems. To make matters worse, such changes…

    Submitted 1 January, 2025; originally announced January 2025.

    Comments: 14 pages, 10 figures, 3 tables

  13. arXiv:2412.16855  [pdf, other]

    cs.CL cs.IR

    GME: Improving Universal Multimodal Retrieval by Multimodal LLMs

    Authors: Xin Zhang, Yanzhao Zhang, Wen Xie, Mingxin Li, Ziqi Dai, Dingkun Long, Pengjun Xie, Meishan Zhang, Wenjie Li, Min Zhang

    Abstract: Universal Multimodal Retrieval (UMR) aims to enable search across various modalities using a unified model, where queries and candidates can consist of pure text, images, or a combination of both. Previous work has attempted to adopt multimodal large language models (MLLMs) to realize UMR using only text data. However, our preliminary experiments demonstrate that more diverse multimodal training d…

    Submitted 1 April, 2025; v1 submitted 21 December, 2024; originally announced December 2024.

    Comments: Accepted to CVPR 2025, models at https://huggingface.co/Alibaba-NLP/gme-Qwen2-VL-2B-Instruct

  14. arXiv:2412.09165  [pdf, other]

    cs.CL cs.AI cs.IR

    When Text Embedding Meets Large Language Model: A Comprehensive Survey

    Authors: Zhijie Nie, Zhangchi Feng, Mingxin Li, Cunwang Zhang, Yanzhao Zhang, Dingkun Long, Richong Zhang

    Abstract: Text embedding has become a foundational technology in natural language processing (NLP) during the deep learning era, driving advancements across a wide array of downstream tasks. While many natural language understanding challenges can now be modeled using generative paradigms and leverage the robust generative and comprehension capabilities of large language models (LLMs), numerous practical ap…

    Submitted 20 March, 2025; v1 submitted 12 December, 2024; originally announced December 2024.

    Comments: Version 3: We added some latest works of LLM-based Embedders and MLLM-based Embedders

  15. arXiv:2411.00492  [pdf, other]

    cs.CL

    Multi-expert Prompting Improves Reliability, Safety, and Usefulness of Large Language Models

    Authors: Do Xuan Long, Duong Ngoc Yen, Anh Tuan Luu, Kenji Kawaguchi, Min-Yen Kan, Nancy F. Chen

    Abstract: We present Multi-expert Prompting, a novel enhancement of ExpertPrompting (Xu et al., 2023), designed to improve the large language model (LLM) generation. Specifically, it guides an LLM to fulfill an input instruction by simulating multiple experts, aggregating their responses, and selecting the best among individual and aggregated responses. This process is performed in a single chain of thought…

    Submitted 1 November, 2024; originally announced November 2024.

    Comments: EMNLP 2024 Main Conference

  16. arXiv:2410.15035  [pdf, other]

    cs.CL

    Improving General Text Embedding Model: Tackling Task Conflict and Data Imbalance through Model Merging

    Authors: Mingxin Li, Zhijie Nie, Yanzhao Zhang, Dingkun Long, Richong Zhang, Pengjun Xie

    Abstract: Text embeddings are vital for tasks such as text retrieval and semantic textual similarity (STS). Recently, the advent of pretrained language models, along with unified benchmarks like the Massive Text Embedding Benchmark (MTEB), has facilitated the development of versatile general-purpose text embedding models. Advanced embedding models are typically developed using large-scale multi-task data an…

    Submitted 19 October, 2024; originally announced October 2024.

    Comments: Work in progress

  17. arXiv:2410.13794  [pdf, ps, other]

    cs.LG

    Arbitrarily-Conditioned Multi-Functional Diffusion for Multi-Physics Emulation

    Authors: Da Long, Zhitong Xu, Guang Yang, Akil Narayan, Shandian Zhe

    Abstract: Modern physics simulation often involves multiple functions of interest, and traditional numerical approaches are known to be complex and computationally costly. While machine learning-based surrogate models can offer significant cost reductions, most focus on a single task, such as forward prediction, and typically lack uncertainty quantification -- an essential component in many applications. T…

    Submitted 6 June, 2025; v1 submitted 17 October, 2024; originally announced October 2024.

  18. arXiv:2410.11165  [pdf, ps, other]

    cs.LG

    Toward Efficient Kernel-Based Solvers for Nonlinear PDEs

    Authors: Zhitong Xu, Da Long, Yiming Xu, Guang Yang, Shandian Zhe, Houman Owhadi

    Abstract: We introduce a novel kernel learning framework toward efficiently solving nonlinear partial differential equations (PDEs). In contrast to the state-of-the-art kernel solver that embeds differential operators within kernels, posing challenges with a large number of collocation points, our approach eliminates these operators from the kernel. We model the solution using a standard kernel interpolatio…

    Submitted 5 June, 2025; v1 submitted 14 October, 2024; originally announced October 2024.

    Journal ref: Forty-Second International Conference on Machine Learning (ICML2025)

  19. arXiv:2410.07558  [pdf]

    cs.RO

    Streamlined shape of cyborg cockroach promotes traversability in confined environments by gap negotiation

    Authors: Kazuki Kai, Le Duc Long, Hirotaka Sato

    Abstract: Centimeter-scale cyborg insects have a potential advantage for application in narrow environments where humans cannot operate. To realize such tasks, researchers have developed a small printed circuit board (PCB) that an insect can carry and that controls it. The electronic components usually remain bare on the board and the whole board is mounted on platform animals, resulting in uneven morphology…

    Submitted 9 October, 2024; originally announced October 2024.

  20. arXiv:2410.04182  [pdf, other]

    cs.CV

    Artistic Portrait Drawing with Vector Strokes

    Authors: Yiqi Liang, Ying Liu, Dandan Long, Ruihui Li

    Abstract: In this paper, we present a method, VectorPD, for converting a given human face image into a vector portrait sketch. VectorPD supports different levels of abstraction by simply controlling the number of strokes. Since vector graphics are composed of different shape primitives, it is challenging for rendering complex faces to accurately express facial details and structure. To address this, VectorP…

    Submitted 5 October, 2024; originally announced October 2024.

    Comments: 9 pages, 12 figures

  21. arXiv:2410.04072  [pdf, other]

    cs.CV cs.AI

    MROSS: Multi-Round Region-based Optimization for Scene Sketching

    Authors: Yiqi Liang, Ying Liu, Dandan Long, Ruihui Li

    Abstract: Scene sketching is to convert a scene into a simplified, abstract representation that captures the essential elements and composition of the original scene. It requires a semantic understanding of the scene and consideration of different regions within the scene. Since scenes often contain diverse visual information across various regions, such as foreground objects, background elements, and spati…

    Submitted 15 April, 2025; v1 submitted 5 October, 2024; originally announced October 2024.

    Comments: 6 pages, 8 figures

  22. arXiv:2408.08656  [pdf, other]

    cs.CL

    LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs

    Authors: Do Xuan Long, Hai Nguyen Ngoc, Tiviatis Sim, Hieu Dao, Shafiq Joty, Kenji Kawaguchi, Nancy F. Chen, Min-Yen Kan

    Abstract: We present the first systematic evaluation examining format bias in performance of large language models (LLMs). Our approach distinguishes between two categories of an evaluation metric under format constraints to reliably and accurately assess performance: one measures performance when format constraints are adhered to, while the other evaluates performance regardless of constraint adherence. We…

    Submitted 22 February, 2025; v1 submitted 16 August, 2024; originally announced August 2024.

    Comments: NAACL 2025 Main Conference

  23. arXiv:2408.08650  [pdf, other]

    cs.CL

    An End-to-End Model for Photo-Sharing Multi-modal Dialogue Generation

    Authors: Peiming Guo, Sinuo Liu, Yanzhao Zhang, Dingkun Long, Pengjun Xie, Meishan Zhang, Min Zhang

    Abstract: Photo-Sharing Multi-modal dialogue generation requires a dialogue agent not only to generate text responses but also to share photos at the proper moment. Using image text caption as the bridge, a pipeline model integrates an image caption model, a text generation model, and an image generation model to handle this complex multi-modal task. However, representing the images with text captions may l…

    Submitted 29 March, 2025; v1 submitted 16 August, 2024; originally announced August 2024.

    Comments: Accepted by ICME2025

  24. arXiv:2407.19669  [pdf, other]

    cs.CL cs.IR

    mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval

    Authors: Xin Zhang, Yanzhao Zhang, Dingkun Long, Wen Xie, Ziqi Dai, Jialong Tang, Huan Lin, Baosong Yang, Pengjun Xie, Fei Huang, Meishan Zhang, Wenjie Li, Min Zhang

    Abstract: We present systematic efforts in building a long-context multilingual text representation model (TRM) and reranker from scratch for text retrieval. We first introduce a text encoder (base size) enhanced with RoPE and unpadding, pre-trained in a native 8192-token context (longer than 512 of previous multilingual encoders). Then we construct a hybrid TRM and a cross-encoder reranker by contrastive lea…

    Submitted 14 October, 2024; v1 submitted 28 July, 2024; originally announced July 2024.

    Comments: Camera-ready version of EMNLP 2024: Industry Track

  25. arXiv:2404.05560  [pdf, other]

    cs.CL

    Chinese Sequence Labeling with Semi-Supervised Boundary-Aware Language Model Pre-training

    Authors: Longhui Zhang, Dingkun Long, Meishan Zhang, Yanzhao Zhang, Pengjun Xie, Min Zhang

    Abstract: Chinese sequence labeling tasks are heavily reliant on accurate word boundary demarcation. Although current pre-trained language models (PLMs) have achieved substantial gains on these tasks, they rarely explicitly incorporate boundary information into the modeling process. An exception to this is BABERT, which incorporates unsupervised statistical boundary information into Chinese BERT's pre-train…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted to COLING 2024

  26. arXiv:2402.11722  [pdf, other]

    cs.LG

    Invertible Fourier Neural Operators for Tackling Both Forward and Inverse Problems

    Authors: Da Long, Zhitong Xu, Qiwei Yuan, Yin Yang, Shandian Zhe

    Abstract: Fourier Neural Operator (FNO) is a powerful and popular operator learning method. However, FNO is mainly used in forward prediction, yet a great many applications rely on solving inverse problems. In this paper, we propose an invertible Fourier Neural Operator (iFNO) for jointly tackling the forward and inverse problems. We developed a series of invertible Fourier blocks in the latent channel spac…

    Submitted 5 May, 2025; v1 submitted 18 February, 2024; originally announced February 2024.

  27. arXiv:2401.03403  [pdf, other]

    cs.CE

    Deep peak property learning for efficient chiral molecules ECD spectra prediction

    Authors: Hao Li, Da Long, Li Yuan, Yonghong Tian, Xinchang Wang, Fanyang Mo

    Abstract: Chiral molecule assignation is crucial for asymmetric catalysis, functional materials, and the drug industry. The conventional approach requires theoretical calculations of electronic circular dichroism (ECD) spectra, which is time-consuming and costly. To speed up this process, we have incorporated deep learning techniques for the ECD prediction. We first set up a large-scale dataset of Chiral Mo…

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: 14 pages, 8 Figures, Submitted to Nature Machine Intelligence

  28. arXiv:2311.16720  [pdf, other]

    cs.IR

    A Two-Stage Adaptation of Large Language Models for Text Ranking

    Authors: Longhui Zhang, Yanzhao Zhang, Dingkun Long, Pengjun Xie, Meishan Zhang, Min Zhang

    Abstract: Text ranking is a critical task in information retrieval. Recent advances in pre-trained language models (PLMs), especially large language models (LLMs), present new opportunities for applying them to text ranking. While supervised fine-tuning (SFT) with ranking data has been widely explored to better align PLMs with text ranking goals, previous studies have focused primarily on encoder-only and e…

    Submitted 1 June, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Accepted to Findings of ACL 2024. Code and models available at https://github.com/Alibaba-NLP/RankingGPT

  29. arXiv:2311.08385  [pdf, other]

    cs.CL

    Aligning Large Language Models with Human Opinions through Persona Selection and Value–Belief–Norm Reasoning

    Authors: Do Xuan Long, Kenji Kawaguchi, Min-Yen Kan, Nancy F. Chen

    Abstract: Reasoning and predicting human opinions with large language models (LLMs) is essential yet challenging. Current methods employ role-playing with personae but face two major issues: LLMs are sensitive to even a single irrelevant persona, skewing predictions by up to 30%, and LLMs fail to reason strategically over personae. We propose Chain-of-Opinion (COO), a simple four-step solution modeling whic…

    Submitted 14 December, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: COLING 2025

  30. arXiv:2311.05472  [pdf, other]

    cs.CL cs.AI

    Text Representation Distillation via Information Bottleneck Principle

    Authors: Yanzhao Zhang, Dingkun Long, Zehan Li, Pengjun Xie

    Abstract: Pre-trained language models (PLMs) have recently shown great success in the text representation field. However, the high computational cost and high-dimensional representation of PLMs pose significant challenges for practical applications. To make models more accessible, an effective method is to distill large models into smaller representation models. In order to relieve the issue of performance degr…

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Accepted to EMNLP 2023. The code and pre-trained models are available at https://github.com/Alibaba-NLP/IBKD

  31. arXiv:2311.04465  [pdf, other]

    cs.LG cs.CE

    Solving High Frequency and Multi-Scale PDEs with Gaussian Processes

    Authors: Shikai Fang, Madison Cooley, Da Long, Shibo Li, Robert Kirby, Shandian Zhe

    Abstract: Machine learning based solvers have garnered much attention in physical simulation and scientific computing, with a prominent example, physics-informed neural networks (PINNs). However, PINNs often struggle to solve high-frequency and multi-scale PDEs, which can be due to spectral bias during neural network training. To address this problem, we resort to the Gaussian process (GP) framework. To fle…

    Submitted 18 March, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Journal ref: The Twelfth International Conference on Learning Representations (ICLR 2024)

  32. arXiv:2310.08232  [pdf, other]

    cs.CL

    Language Models are Universal Embedders

    Authors: Xin Zhang, Zehan Li, Yanzhao Zhang, Dingkun Long, Pengjun Xie, Meishan Zhang, Min Zhang

    Abstract: In the large language model (LLM) revolution, embedding is a key component of various systems, such as retrieving knowledge or memories for LLMs or building content moderation filters. As such cases span from English to other natural or programming languages, from retrieval to classification and beyond, it is advantageous to build a unified embedding model rather than dedicated ones for each scena…

    Submitted 22 May, 2025; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: XLLM Workshop, ACL 2025

  33. arXiv:2310.05387  [pdf, other]

    cs.LG stat.ML

    Equation Discovery with Bayesian Spike-and-Slab Priors and Efficient Kernels

    Authors: Da Long, Wei W. Xing, Aditi S. Krishnapriyan, Robert M. Kirby, Shandian Zhe, Michael W. Mahoney

    Abstract: Discovering governing equations from data is important to many scientific and engineering applications. Despite promising successes, existing methods are still challenged by data sparsity and noise issues, both of which are ubiquitous in practice. Moreover, state-of-the-art methods lack uncertainty quantification and/or are costly in training. To overcome these limitations, we propose a novel equa…

    Submitted 21 April, 2024; v1 submitted 8 October, 2023; originally announced October 2023.

  34. arXiv:2308.12039  [pdf, other]

    cs.IR cs.CL

    Hybrid Retrieval and Multi-stage Text Ranking Solution at TREC 2022 Deep Learning Track

    Authors: Guangwei Xu, Yangzhao Zhang, Longhui Zhang, Dingkun Long, Pengjun Xie, Ruijie Guo

    Abstract: Large-scale text retrieval technology has been widely used in various practical business scenarios. This paper presents our systems for the TREC 2022 Deep Learning Track. We explain the hybrid text retrieval and multi-stage text ranking method adopted in our solution. The retrieval stage combined the two structures of traditional sparse retrieval and neural dense retrieval. In the ranking stage, i…

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: TREC 2022 Deep Learning Track

  35. arXiv:2308.03281  [pdf, other]

    cs.CL

    Towards General Text Embeddings with Multi-stage Contrastive Learning

    Authors: Zehan Li, Xin Zhang, Yanzhao Zhang, Dingkun Long, Pengjun Xie, Meishan Zhang

    Abstract: We present GTE, a general-purpose text embedding model trained with multi-stage contrastive learning. In line with recent advancements in unifying various NLP tasks into a single format, we train a unified text embedding model by employing contrastive learning over a diverse mixture of datasets from multiple sources. By significantly increasing the number of training data during both unsupervised…

    Submitted 6 August, 2023; originally announced August 2023.

  36. arXiv:2305.13197  [pdf, other]

    cs.IR cs.CL

    Challenging Decoder helps in Masked Auto-Encoder Pre-training for Dense Passage Retrieval

    Authors: Zehan Li, Yanzhao Zhang, Dingkun Long, Pengjun Xie

    Abstract: Recently, various studies have been directed towards exploring dense passage retrieval techniques employing pre-trained language models, among which the masked auto-encoder (MAE) pre-training architecture has emerged as the most promising. The conventional MAE framework relies on leveraging the passage reconstruction of decoder to bolster the text representation ability of encoder, thereby enhanci…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Work in progress

  37. arXiv:2305.02840  [pdf]

    cs.CY

    Making Sense of Machine Learning: Integrating Youth's Conceptual, Creative, and Critical Understandings of AI

    Authors: Luis Morales-Navarro, Yasmin B. Kafai, Francisco Castro, William Payne, Kayla DesPortes, Daniella DiPaola, Randi Williams, Safinah Ali, Cynthia Breazeal, Clifford Lee, Elisabeth Soep, Duri Long, Brian Magerko, Jaemarie Solyst, Amy Ogan, Cansu Tatar, Shiyan Jiang, Jie Chao, Carolyn P. Rosé, Sepehr Vakil

    Abstract: Understanding how youth make sense of machine learning and how learning about machine learning can be supported in and out of school is more relevant than ever before as young people interact with machine learning powered applications every day, while connecting with friends, listening to music, playing games, or attending school. In this symposium, we present different perspectives on understandin…

    Submitted 4 May, 2023; originally announced May 2023.

    ACM Class: K.3.2; H.5.3

    Journal ref: Proceedings of the 17th International Conference of the Learning Sciences - ICLS 2023

  38. arXiv:2210.15231  [pdf, other]

    cs.CL cs.AI

    Unsupervised Boundary-Aware Language Model Pretraining for Chinese Sequence Labeling

    Authors: Peijie Jiang, Dingkun Long, Yanzhao Zhang, Pengjun Xie, Meishan Zhang, Min Zhang

    Abstract: Boundary information is critical for various Chinese language processing tasks, such as word segmentation, part-of-speech tagging, and named entity recognition. Previous studies usually resorted to the use of a high-quality external lexicon, where lexicon items can offer explicit boundary information. However, to ensure the quality of the lexicon, great human effort is always necessary, which has…

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: 12 pages, 2 figures, 7 tables, EMNLP 2022

  39. arXiv:2210.15133  [pdf, other]

    cs.CL cs.IR

    Retrieval Oriented Masking Pre-training Language Model for Dense Passage Retrieval

    Authors: Dingkun Long, Yanzhao Zhang, Guangwei Xu, Pengjun Xie

    Abstract: Pre-trained language models (PTMs) have been shown to yield powerful text representations for the dense passage retrieval task. Masked Language Modeling (MLM) is a major sub-task of the pre-training process. However, we found that the conventional random masking strategy tends to select a large number of tokens that have limited effect on the passage retrieval task (e.g., stop-words and punctuation). B…

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: Search LM part of the "AliceMind SLM + HLAR" method in MS MARCO Passage Ranking Leaderboard Submission

  40. arXiv:2210.08140  [pdf, other]

    stat.ML cs.LG

    A Kernel Approach for PDE Discovery and Operator Learning

    Authors: Da Long, Nicole Mrvaljevic, Shandian Zhe, Bamdad Hosseini

    Abstract: This article presents a three-step framework for learning and solving partial differential equations (PDEs) using kernel methods. Given a training set consisting of pairs of noisy PDE solutions and source/boundary terms on a mesh, kernel smoothing is utilized to denoise the data and approximate derivatives of the solution. This information is then used in a kernel regression model to learn the alg…

    Submitted 30 March, 2023; v1 submitted 14 October, 2022; originally announced October 2022.

  41. arXiv:2209.02389  [pdf, other]

    cs.RO

    Autonomous Passage Planning for a Polar Vessel

    Authors: Jonathan D. Smith, Samuel Hall, George Coombs, James Byrne, Michael A. S. Thorne, J. Alexander Brearley, Derek Long, Michael Meredith, Maria Fox

    Abstract: We introduce a method for long-distance maritime route planning in polar regions, taking into account complex changing environmental conditions. The method allows the construction of optimised routes, describing the three main stages of the process: discrete modelling of the environmental conditions using a non-uniform mesh, the construction of mesh-optimal paths, and path smoothing. In order to a… ▽ More

    Submitted 13 September, 2022; v1 submitted 17 August, 2022; originally announced September 2022.

  42. arXiv:2206.10038  [pdf, other]

    cs.AI cs.RO

    Understanding a Robot's Guiding Ethical Principles via Automatically Generated Explanations

    Authors: Benjamin Krarup, Felix Lindner, Senka Krivic, Derek Long

    Abstract: The continued development of robots has enabled their wider usage in human surroundings. Robots are more trusted to make increasingly important decisions with potentially critical outcomes. Therefore, it is essential to consider the ethical principles under which robots operate. In this paper we examine how contrastive and non-contrastive explanations can be used in understanding the ethics of rob… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

    Comments: 8 pages, 2 figures, 2022 IEEE 18th International Conference on Automation Science and Engineering

  43. arXiv:2205.10569  [pdf, other]

    cs.IR cs.CL

    HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer Reranking

    Authors: Yanzhao Zhang, Dingkun Long, Guangwei Xu, Pengjun Xie

    Abstract: Deep pre-trained language models (e.g., BERT) are effective at large-scale text retrieval tasks. Existing text retrieval systems with state-of-the-art performance usually adopt a retrieve-then-reranking architecture due to the high computational cost of pre-trained language models and the large corpus size. Under such a multi-stage architecture, previous studies mainly focused on optimizing single s… ▽ More

    Submitted 21 May, 2022; originally announced May 2022.

    Comments: Work in progress. HLAR part of the "AliceMind SLM + HLAR" method in MS MARCO Passage Ranking Submission

  44. arXiv:2205.07554  [pdf, other]

    astro-ph.IM cs.LG cs.RO eess.SY

    Towards on-sky adaptive optics control using reinforcement learning

    Authors: J. Nousiainen, C. Rajani, M. Kasper, T. Helin, S. Y. Haffert, C. Vérinaud, J. R. Males, K. Van Gorkom, L. M. Close, J. D. Long, A. D. Hedglen, O. Guyon, L. Schatz, M. Kautz, J. Lumbres, A. Rodack, J. M. Knight, K. Miller

    Abstract: The direct imaging of potentially habitable Exoplanets is one prime science case for the next generation of high contrast imaging instruments on ground-based extremely large telescopes. To reach this demanding science goal, the instruments are equipped with eXtreme Adaptive Optics (XAO) systems which will control thousands of actuators at a framerate of kilohertz to several kilohertz. Most of the… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

    Journal ref: A&A 664, A71 (2022)

  45. arXiv:2203.10244  [pdf, other]

    cs.CL

    ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning

    Authors: Ahmed Masry, Do Xuan Long, Jia Qing Tan, Shafiq Joty, Enamul Hoque

    Abstract: Charts are very popular for analyzing data. When exploring charts, people often ask a variety of complex reasoning questions that involve several logical and arithmetic operations. They also commonly refer to visual features of a chart in their questions. However, most existing datasets do not focus on such complex reasoning questions as their questions are template-based and answers come from a f… ▽ More

    Submitted 19 March, 2022; originally announced March 2022.

    Comments: Accepted by ACL 2022 Findings

  46. arXiv:2203.03367  [pdf, other]

    cs.IR cs.CL

    Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval

    Authors: Dingkun Long, Qiong Gao, Kuan Zou, Guangwei Xu, Pengjun Xie, Ruijie Guo, Jian Xu, Guanjun Jiang, Luxi Xing, Ping Yang

    Abstract: Passage retrieval is a fundamental task in information retrieval (IR) research, which has drawn much attention recently. In the English field, the availability of large-scale annotated datasets (e.g., MS MARCO) and the emergence of deep pre-trained language models (e.g., BERT) have resulted in a substantial improvement of existing passage retrieval systems. However, in the Chinese field, especially fo… ▽ More

    Submitted 24 April, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: SIGIR 2022 Resource Track

  47. arXiv:2203.00129  [pdf, other]

    eess.IV cs.CV

    BlazeNeo: Blazing fast polyp segmentation and neoplasm detection

    Authors: Nguyen Sy An, Phan Ngoc Lan, Dao Viet Hang, Dao Van Long, Tran Quang Trung, Nguyen Thi Thuy, Dinh Viet Sang

    Abstract: In recent years, computer-aided automatic polyp segmentation and neoplasm detection have been an emerging topic in medical image analysis, providing valuable support to colonoscopy procedures. Attention has been paid to improving the accuracy of polyp detection and segmentation. However, not much focus has been given to latency and throughput for performing these tasks on dedicated devices, whic… ▽ More

    Submitted 28 February, 2022; originally announced March 2022.

  48. arXiv:2202.12316  [pdf, other]

    cs.LG

    AutoIP: A United Framework to Integrate Physics into Gaussian Processes

    Authors: Da Long, Zheng Wang, Aditi Krishnapriyan, Robert Kirby, Shandian Zhe, Michael Mahoney

    Abstract: Physical modeling is critical for many modern science and engineering applications. From a data science or machine learning perspective, where more domain-agnostic, data-driven models are pervasive, physical knowledge -- often expressed as differential equations -- is valuable in that it is complementary to data, and it can potentially help overcome issues such as data sparsity, noise, and inaccur… ▽ More

    Submitted 20 July, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

  49. arXiv:2111.11660  [pdf, other]

    cs.CV physics.comp-ph

    Non-invasive hemodynamic analysis for aortic regurgitation using computational fluid dynamics and deep learning

    Authors: Derek Long, Cameron McMurdo, Edward Ferdian, Charlene A. Mauger, David Marlevi, Alistair A. Young, Martyn P. Nash

    Abstract: Changes in cardiovascular hemodynamics are closely related to the development of aortic regurgitation, a type of valvular heart disease. Metrics derived from blood flows are used to indicate aortic regurgitation onset and evaluate its severity. These metrics can be non-invasively obtained using four-dimensional (4D) flow magnetic resonance imaging (MRI), where accuracy is primarily dependent on sp… ▽ More

    Submitted 5 April, 2022; v1 submitted 23 November, 2021; originally announced November 2021.

  50. arXiv:2111.00293  [pdf, other]

    cs.RO cs.AI

    Long-Range Route-planning for Autonomous Vehicles in the Polar Oceans

    Authors: Maria Fox, Michael Meredith, J. Alexander Brearley, Dan Jones, Derek Long

    Abstract: There is an increasing demand for piloted autonomous underwater vehicles (AUVs) to operate in polar ice conditions. At present, AUVs are deployed from ships and directly human-piloted in these regions, entailing a high carbon cost and limiting the scope of operations. A key requirement for long-term autonomous missions is a long-range route planning capability that is aware of the changing ice con… ▽ More

    Submitted 20 November, 2021; v1 submitted 30 October, 2021; originally announced November 2021.

    Comments: Submitted to the AMS Journal of Atmospheric and Oceanic Technology