
Showing 1–11 of 11 results for author: Ru, J

Searching in archive cs.
  1. arXiv:2506.12609  [pdf, ps, other]

    cs.CV

    Not All Tokens and Heads Are Equally Important: Dual-Level Attention Intervention for Hallucination Mitigation

    Authors: Lexiang Tang, Xianwei Zhuang, Bang Yang, Zhiyuan Hu, Hongxiang Li, Lu Ma, Jinghan Ru, Yuexian Zou

    Abstract: Large vision-language models (LVLMs) have shown remarkable capabilities across a wide range of multimodal tasks. However, they remain prone to visual hallucination (VH), often producing confident but incorrect descriptions of visual content. We present VisFlow, an efficient and training-free framework designed to mitigate VH by directly manipulating attention patterns during inference. Through sys…

    Submitted 14 June, 2025; originally announced June 2025.

  2. arXiv:2506.09054  [pdf, ps, other]

    physics.ed-ph cs.HC

    Particle Builder -- Learn about the Standard Model while playing against an AI

    Authors: Mohammad Attar, Andrew Carse, Yeming Chen, Thomas Green, Jeong-Yeon Ha, Yanbai Jin, Amy McWilliams, Theirry Panggabean, Zhengyu Peng, Lujin Sun, Jing Ru, Jiacheng She, Jialin Wang, Zilun Wei, Jiayuan Zhu, Lachlan McGinness

    Abstract: Particle Builder Online is a web-based education game designed for high school physics students. Students can play against an AI opponent or peers to familiarise themselves with the Standard Model of Particle Physics. The game is aimed at a high school level and tailored to the International Baccalaureate and the Australian Curriculum. Students from four schools in Canberra took pre/post-tests and…

    Submitted 27 May, 2025; originally announced June 2025.

    Comments: This demo has been accepted for presentation at the AIED 2025 Interactive Events Track

  3. arXiv:2504.02949  [pdf, other]

    cs.CV cs.AI

    VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning

    Authors: Xianwei Zhuang, Yuxin Xie, Yufan Deng, Dongchao Yang, Liming Liang, Jinghan Ru, Yuguo Yin, Yuexian Zou

    Abstract: In this work, we present VARGPT-v1.1, an advanced unified visual autoregressive model that builds upon our previous framework VARGPT. The model preserves the dual paradigm of next-token prediction for visual understanding and next-scale generation for image synthesis. Specifically, VARGPT-v1.1 integrates: (1) a novel training strategy combining iterative visual instruction tuning with reinforcemen…

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: Code is available at: https://github.com/VARGPT-family/VARGPT-v1.1. arXiv admin note: text overlap with arXiv:2501.12327

  4. arXiv:2502.14627  [pdf, ps, other]

    cs.SD cs.AI eess.AS

    ATRI: Mitigating Multilingual Audio Text Retrieval Inconsistencies by Reducing Data Distribution Errors

    Authors: Yuguo Yin, Yuxin Xie, Wenyuan Yang, Dongchao Yang, Jinghan Ru, Xianwei Zhuang, Liming Liang, Yuexian Zou

    Abstract: Multilingual audio-text retrieval (ML-ATR) is a challenging task that aims to retrieve audio clips or multilingual texts from databases. However, existing ML-ATR schemes suffer from inconsistencies for instance similarity matching across languages. We theoretically analyze the inconsistency in terms of both multilingual modal alignment direction error and weight error, and propose the theoretical…

    Submitted 4 June, 2025; v1 submitted 20 February, 2025; originally announced February 2025.

  5. arXiv:2502.14597  [pdf, other]

    cs.LG cs.NE

    Multi-Class Imbalanced Learning with Support Vector Machines via Differential Evolution

    Authors: Zhong-Liang Zhang, Jie Yang, Jian-Ming Ru, Xiao-Xi Zhao, Xing-Gang Luo

    Abstract: Support vector machine (SVM) is a powerful machine learning algorithm to handle classification tasks. However, the classical SVM is developed for binary problems with the assumption of balanced datasets. Obviously, the multi-class imbalanced classification problems are more complex. In this paper, we propose an improved SVM via Differential Evolution (i-SVM-DE) method to deal with it. An improved…

    Submitted 20 February, 2025; originally announced February 2025.

  6. arXiv:2502.06604  [pdf, other]

    cs.CL

    Do we really have to filter out random noise in pre-training data for language models?

    Authors: Jinghan Ru, Yuxin Xie, Xianwei Zhuang, Yuguo Yin, Zhihui Guo, Zhiming Liu, Qianli Ren, Yuexian Zou

    Abstract: Web-scale pre-training datasets are the cornerstone of LLMs' success. However, text data curated from the Internet inevitably contains random noise caused by decoding errors or unregulated web content. In contrast to previous works that focus on low quality or synthetic data, our study provides the first systematic investigation of such random noise through a cohesive "What-Why-How" fram…

    Submitted 15 May, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

  7. arXiv:2501.12327  [pdf, other]

    cs.CV

    VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model

    Authors: Xianwei Zhuang, Yuxin Xie, Yufan Deng, Liming Liang, Jinghan Ru, Yuguo Yin, Yuexian Zou

    Abstract: We present VARGPT, a novel multimodal large language model (MLLM) that unifies visual understanding and generation within a single autoregressive framework. VARGPT employs a next-token prediction paradigm for visual understanding and a next-scale prediction paradigm for visual autoregressive generation. VARGPT innovatively extends the LLaVA architecture, achieving efficient scale-wise autoregressi…

    Submitted 21 January, 2025; originally announced January 2025.

  8. arXiv:2303.04393  [pdf, other]

    cs.CV

    Imbalanced Open Set Domain Adaptation via Moving-threshold Estimation and Gradual Alignment

    Authors: Jinghan Ru, Jun Tian, Zhekai Du, Chengwei Xiao, Jingjing Li, Heng Tao Shen

    Abstract: Multimedia applications are often associated with cross-domain knowledge transfer, where Unsupervised Domain Adaptation (UDA) can be used to reduce the domain shifts. Open Set Domain Adaptation (OSDA) aims to transfer knowledge from a well-labeled source domain to an unlabeled target domain under the assumption that the target domain contains unknown classes. Existing OSDA methods consistently lay…

    Submitted 8 March, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: 11 pages, 5 figures, 7 tables

  9. On the expected number of perfect matchings in cubic planar graphs

    Authors: Marc Noy, Clément Requilé, Juanjo Rué

    Abstract: A well-known conjecture by Lovász and Plummer from the 1970s asserted that a bridgeless cubic graph has exponentially many perfect matchings. It was solved in the affirmative by Esperet et al. (Adv. Math. 2011). On the other hand, Chudnovsky and Seymour (Combinatorica 2012) proved the conjecture in the special case of cubic planar graphs. In our work we consider random bridgeless cubic planar grap…

    Submitted 1 March, 2021; v1 submitted 28 May, 2020; originally announced May 2020.

    Comments: 19 pages, 4 figures

    Journal ref: Publicacions Matemàtiques, 2022, Vol. 66, Núm. 1, p. 325-353

  10. arXiv:1104.2486  [pdf, ps, other]

    cs.DS math.CO

    Dynamic Programming for Graphs on Surfaces

    Authors: Juanjo Rué, Ignasi Sau, Dimitrios M. Thilikos

    Abstract: We provide a framework for the design and analysis of dynamic programming algorithms for surface-embedded graphs on n vertices and branchwidth at most k. Our technique applies to general families of problems where standard dynamic programming runs in 2^{O(k log k)} n steps. Our approach combines tools from topological graph theory and analytic combinatorics. In particular, we introduce a new type…

    Submitted 25 April, 2011; v1 submitted 13 April, 2011; originally announced April 2011.

    Comments: 28 pages, 3 figures

    MSC Class: 05C85

  11. arXiv:1104.2477  [pdf, other]

    math.CO cs.DM

    Asymptotic Enumeration of Non-crossing Partitions on Surfaces

    Authors: Juanjo Rué, Ignasi Sau, Dimitrios M. Thilikos

    Abstract: We generalize the notion of non-crossing partition on a disk to general surfaces with boundary. For this, we consider a surface $Σ$ and introduce the number $C_Σ(n)$ of non-crossing partitions of a set of $n$ points lying on the boundary of $Σ$. Our proofs use bijective techniques arising from map enumeration, together with the symbolic method and singularity analysis on generating functions. An out…

    Submitted 14 April, 2011; v1 submitted 13 April, 2011; originally announced April 2011.

    Comments: 17 pages, 9 figures

    MSC Class: 05A16