Skip to main content

Showing 1–23 of 23 results for author: Liao, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.02751  [pdf, ps, other

    cs.CV

    Partial Weakly-Supervised Oriented Object Detection

    Authors: Mingxin Liu, Peiyuan Zhang, Yuan Liu, Wei Zhang, Yue Zhou, Ning Liao, Ziyang Gong, Junwei Luo, Zhirui Wang, Yi Yu, Xue Yang

    Abstract: The growing demand for oriented object detection (OOD) across various domains has driven significant research in this area. However, the high cost of dataset annotation remains a major concern. Current mainstream OOD algorithms can be mainly categorized into three types: (1) fully supervised methods using complete oriented bounding box (OBB) annotations, (2) semi-supervised methods using partial O… ▽ More

    Submitted 3 July, 2025; originally announced July 2025.

    Comments: 10 pages, 5 figures, 4 tables, source code: https://github.com/VisionXLab/PWOOD

  2. arXiv:2505.21849  [pdf, other

    cs.IR cs.AI

    Xinyu AI Search: Enhanced Relevance and Comprehensive Results with Rich Answer Presentations

    Authors: Bo Tang, Junyi Zhu, Chenyang Xi, Yunhang Ge, Jiahao Wu, Yuchen Feng, Yijun Niu, Wenqiang Wei, Yu Yu, Chunyu Li, Zehao Lin, Hao Wu, Ning Liao, Yebin Yang, Jiajia Wang, Zhiyu Li, Feiyu Xiong, Jingrun Chen

    Abstract: Traditional search engines struggle to synthesize fragmented information for complex queries, while generative AI search engines face challenges in relevance, comprehensiveness, and presentation. To address these limitations, we introduce Xinyu AI Search, a novel system that incorporates a query-decomposition graph to dynamically break down complex queries into sub-queries, enabling stepwise retri… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  3. arXiv:2505.17011  [pdf, ps, other

    cs.CV

    Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space

    Authors: Yan Li, Changyao Tian, Renqiu Xia, Ning Liao, Weiwei Guo, Junchi Yan, Hongsheng Li, Jifeng Dai, Hao Li, Xue Yang

    Abstract: We propose AdapTok, an adaptive temporal causal video tokenizer that can flexibly allocate tokens for different frames based on video content. AdapTok is equipped with a block-wise masking strategy that randomly drops tail tokens of each block during training, and a block causal scorer to predict the reconstruction quality of video frames using different numbers of tokens. During inference, an ada… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: Code: https://github.com/VisionXLab/AdapTok

  4. arXiv:2504.15302  [pdf, other

    cs.DC cs.OS

    RAGDoll: Efficient Offloading-based Online RAG System on a Single GPU

    Authors: Weiping Yu, Ningyi Liao, Siqiang Luo, Junfeng Liu

    Abstract: Retrieval-Augmented Generation (RAG) enhances large language model (LLM) generation quality by incorporating relevant external knowledge. However, deploying RAG on consumer-grade platforms is challenging due to limited memory and the increasing scale of both models and knowledge bases. In this work, we introduce RAGDoll, a resource-efficient, self-adaptive RAG serving system integrated with LLMs,… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

  5. arXiv:2501.07890  [pdf, other

    cs.CL cs.AI

    GRAPHMOE: Amplifying Cognitive Depth of Mixture-of-Experts Network via Introducing Self-Rethinking Mechanism

    Authors: Chen Tang, Bo Lv, Zifan Zheng, Bohao Yang, Kun Zhao, Ning Liao, Xiaoxing Wang, Feiyu Xiong, Zhiyu Li, Nayu Liu, Jingchi Jiang

    Abstract: Traditional Mixture-of-Experts (MoE) networks benefit from utilizing multiple smaller expert models as opposed to a single large network. However, these experts typically operate independently, leaving a question open about whether interconnecting these models could enhance the performance of MoE networks. In response, we introduce GRAPHMOE, a novel method aimed at augmenting the cognitive depth o… ▽ More

    Submitted 11 February, 2025; v1 submitted 14 January, 2025; originally announced January 2025.

    Comments: 10 pages

  6. arXiv:2412.04738  [pdf, other

    cs.LG

    DHIL-GT: Scalable Graph Transformer with Decoupled Hierarchy Labeling

    Authors: Ningyi Liao, Zihao Yu, Siqiang Luo

    Abstract: Graph Transformer (GT) has recently emerged as a promising neural network architecture for learning graph-structured data. However, its global attention mechanism with quadratic complexity concerning the graph scale prevents wider application to large graphs. While current methods attempt to enhance GT scalability by altering model architecture or encoding hierarchical graph data, our analysis rev… ▽ More

    Submitted 5 December, 2024; originally announced December 2024.

  7. arXiv:2411.02057  [pdf, other

    cs.CV

    Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation

    Authors: Yan Li, Weiwei Guo, Xue Yang, Ning Liao, Shaofeng Zhang, Yi Yu, Wenxian Yu, Junchi Yan

    Abstract: In recent years, aerial object detection has been increasingly pivotal in various earth observation applications. However, current algorithms are limited to detecting a set of pre-defined object categories, demanding sufficient annotated training samples, and fail to detect novel object categories. In this paper, we put forth a novel formulation of the aerial object detection problem, namely open-… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

  8. arXiv:2409.16084  [pdf, other

    cs.CV

    MM-CamObj: A Comprehensive Multimodal Dataset for Camouflaged Object Scenarios

    Authors: Jiacheng Ruan, Wenzhen Yuan, Zehao Lin, Ning Liao, Zhiyu Li, Feiyu Xiong, Ting Liu, Yuzhuo Fu

    Abstract: Large visual-language models (LVLMs) have achieved great success in multiple applications. However, they still encounter challenges in complex scenes, especially those involving camouflaged objects. This is primarily due to the lack of samples related to camouflaged scenes in the training dataset. To mitigate this issue, we construct the MM-CamObj dataset for the first time, comprising two subsets… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: 9 pages, 5 figures. Work in progress

  9. arXiv:2406.09675  [pdf, other

    cs.LG cs.AI

    Benchmarking Spectral Graph Neural Networks: A Comprehensive Study on Effectiveness and Efficiency

    Authors: Ningyi Liao, Haoyu Liu, Zulun Zhu, Siqiang Luo, Laks V. S. Lakshmanan

    Abstract: With the recent advancements in graph neural networks (GNNs), spectral GNNs have received increasing popularity by virtue of their specialty in capturing graph signals in the frequency domain, demonstrating promising capability in specific tasks. However, few systematic studies have been conducted on assessing their spectral characteristics. This emerging family of models also varies in terms of d… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  10. arXiv:2403.13268  [pdf, other

    cs.LG cs.DB

    Unifews: Unified Entry-Wise Sparsification for Efficient Graph Neural Network

    Authors: Ningyi Liao, Zihao Yu, Siqiang Luo

    Abstract: Graph Neural Networks (GNNs) have shown promising performance in various graph learning tasks, but at the cost of resource-intensive computations. The primary overhead of GNN update stems from graph propagation and weight transformation, both involving operations on graph-scale matrices. Previous studies attempt to reduce the computational budget by leveraging graph-level or network-level sparsifi… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  11. arXiv:2311.11646  [pdf, other

    cs.CV

    Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning

    Authors: Yan Li, Weiwei Guo, Xue Yang, Ning Liao, Dunyun He, Jiaqi Zhou, Wenxian Yu

    Abstract: An increasingly massive number of remote-sensing images spurs the development of extensible object detectors that can detect objects beyond training categories without costly collecting new labeled data. In this paper, we aim to develop open-vocabulary object detection (OVD) technique in aerial images that scales up object vocabulary size beyond training data. The performance of OVD greatly relies… ▽ More

    Submitted 28 October, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted by ECCV2024

  12. arXiv:2310.06594  [pdf, other

    cs.CV

    On the Evaluation and Refinement of Vision-Language Instruction Tuning Datasets

    Authors: Ning Liao, Shaofeng Zhang, Renqiu Xia, Min Cao, Yu Qiao, Junchi Yan

    Abstract: There is an emerging line of research on multimodal instruction tuning, and a line of benchmarks has been proposed for evaluating these models recently. Instead of evaluating the models directly, in this paper, we try to evaluate the Vision-Language Instruction-Tuning (VLIT) datasets. Also, we seek the way of building a dataset for developing an all-powerful VLIT model, which we believe could also… ▽ More

    Submitted 29 December, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

  13. arXiv:2305.09958  [pdf, other

    cs.LG cs.SI

    SIGMA: An Efficient Heterophilous Graph Neural Network with Fast Global Aggregation

    Authors: Haoyu Liu, Ningyi Liao, Siqiang Luo

    Abstract: Graph neural networks (GNNs) realize great success in graph learning but suffer from performance loss when meeting heterophily, i.e. neighboring nodes are dissimilar, due to their local and uniform aggregation. Existing attempts of heterophilous GNNs incorporate long-range or global aggregations to distinguish nodes in the graph. However, these aggregations usually require iteratively maintaining… ▽ More

    Submitted 9 April, 2025; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: Acceptted to ICDE 2025

  14. arXiv:2303.05122  [pdf, other

    cs.CV

    M-Tuning: Prompt Tuning with Mitigated Label Bias in Open-Set Scenarios

    Authors: Ning Liao, Xiaopeng Zhang, Min Cao, Junchi Yan

    Abstract: In realistic open-set scenarios where labels of a part of testing data are totally unknown, when vision-language (VL) prompt learning methods encounter inputs related to unknown classes (i.e., not seen during training), they always predict them as one of the training classes. The exhibited label bias causes difficulty in open set recognition (OSR), in which an image should be correctly predicted a… ▽ More

    Submitted 27 December, 2024; v1 submitted 9 March, 2023; originally announced March 2023.

    Comments: Accepted by IEEE TCSVT

  15. arXiv:2303.04998  [pdf, other

    cs.CV

    Rethinking Visual Prompt Learning as Masked Visual Token Modeling

    Authors: Ning Liao, Bowen Shi, Xiaopeng Zhang, Min Cao, Junchi Yan, Qi Tian

    Abstract: Prompt learning has achieved great success in efficiently exploiting large-scale pre-trained models in natural language processing (NLP). It reformulates the downstream tasks as the generative pre-training ones to achieve consistency, thus improving the performance stably. However, when transferring it to the vision area, current visual prompt learning methods are almost designed on discriminative… ▽ More

    Submitted 15 December, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

  16. arXiv:2211.02806  [pdf

    cs.AI eess.SY

    Modified EDAS Method Based on Cumulative Prospect Theory for Multiple Attributes Group Decision Making with Interval-valued Intuitionistic Fuzzy Information

    Authors: Jing Wang, Qiang Cai, Guiwu Wei, Ningna Liao

    Abstract: The Interval-valued intuitionistic fuzzy sets (IVIFSs) based on the intuitionistic fuzzy sets combines the classical decision method is in its research and application is attracting attention. After comparative analysis, there are multiple classical methods with IVIFSs information have been applied into many practical issues. In this paper, we extended the classical EDAS method based on cumulative… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: 48 pages

    MSC Class: 91B06 ACM Class: F.2.2

  17. SCARA: Scalable Graph Neural Networks with Feature-Oriented Optimization

    Authors: Ningyi Liao, Dingheng Mo, Siqiang Luo, Xiang Li, Pengcheng Yin

    Abstract: Recent advances in data processing have stimulated the demand for learning graphs of very large scales. Graph Neural Networks (GNNs), being an emerging and powerful approach in solving graph learning tasks, are known to be difficult to scale up. Most scalable models apply node-based techniques in simplifying the expensive graph message-passing propagation procedure of GNN. However, we find such ac… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Journal ref: Proceedings of the VLDB Endowment 15 (2022) 3240-3248

  18. arXiv:2204.01057  [pdf, other

    math.CO cs.DM cs.LG

    A Survey on Machine Learning Solutions for Graph Pattern Extraction

    Authors: Kai Siong Yow, Ningyi Liao, Siqiang Luo, Reynold Cheng, Chenhao Ma, Xiaolin Han

    Abstract: A subgraph is constructed by using a subset of vertices and edges of a given graph. There exist many graph properties that are hereditary for subgraphs. Hence, researchers from different communities have paid a great deal of attention in studying numerous subgraph problems, on top of the ordinary graph problems. Many algorithms are proposed in studying subgraph problems, where one common approach… ▽ More

    Submitted 2 June, 2023; v1 submitted 3 April, 2022; originally announced April 2022.

    Comments: v1: 41 pages; v2: 40 pages ; v3: This version focuses on just subgraph problems (discussions on other classic graph problems can be found in the earlier versions)

    MSC Class: 05C90; 68M07; 68R10

  19. Achieving Adversarial Robustness via Sparsity

    Authors: Shufan Wang, Ningyi Liao, Liyao Xiang, Nanyang Ye, Quanshi Zhang

    Abstract: Network pruning has been known to produce compact models without much accuracy degradation. However, how the pruning process affects a network's robustness and the working mechanism behind remain unresolved. In this work, we theoretically prove that the sparsity of network weights is closely associated with model robustness. Through experiments on a variety of adversarial pruning methods, we find… ▽ More

    Submitted 11 September, 2020; originally announced September 2020.

  20. Friend, Collaborator, Student, Manager: How Design of an AI-Driven Game Level Editor Affects Creators

    Authors: Matthew Guzdial, Nicholas Liao, Jonathan Chen, Shao-Yu Chen, Shukan Shah, Vishwa Shah, Joshua Reno, Gillian Smith, Mark Riedl

    Abstract: Machine learning advances have afforded an increase in algorithms capable of creating art, music, stories, games, and more. However, it is not yet well-understood how machine learning algorithms might best collaborate with people to support creative expression. To investigate how practicing designers perceive the role of AI in the creative process, we developed a game level design tool for Super M… ▽ More

    Submitted 18 January, 2019; originally announced January 2019.

    Comments: 13 pages, 3 figures, CHI Conference on Human Factors in Computing Systems

  21. arXiv:1809.09420  [pdf, other

    cs.AI cs.LG

    Co-Creative Level Design via Machine Learning

    Authors: Matthew Guzdial, Nicholas Liao, Mark Riedl

    Abstract: Procedural Level Generation via Machine Learning (PLGML), the study of generating game levels with machine learning, has received a large amount of recent academic attention. For certain measures these approaches have shown success at replicating the quality of existing game levels. However, it is unclear the extent to which they might benefit human designers. In this paper we present a framework… ▽ More

    Submitted 25 September, 2018; originally announced September 2018.

    Comments: 7 pages, 2 figures, Fifth Experimental AI in Games Workshop

  22. arXiv:1809.06201  [pdf, other

    cs.CV cs.AI cs.LG

    Player Experience Extraction from Gameplay Video

    Authors: Zijin Luo, Matthew Guzdial, Nicholas Liao, Mark Riedl

    Abstract: The ability to extract the sequence of game events for a given player's play-through has traditionally required access to the game's engine or source code. This serves as a barrier to researchers, developers, and hobbyists who might otherwise benefit from these game logs. In this paper we present two approaches to derive game logs from game video via convolutional neural networks and transfer lear… ▽ More

    Submitted 6 September, 2018; originally announced September 2018.

    Comments: 8 pages, 6 figures, AIIDE 2018

  23. arXiv:1805.03720  [pdf, other

    cs.AI

    Creative Invention Benchmark

    Authors: Matthew Guzdial, Nicholas Liao, Vishwa Shah, Mark O. Riedl

    Abstract: In this paper we present the Creative Invention Benchmark (CrIB), a 2000-problem benchmark for evaluating a particular facet of computational creativity. Specifically, we address combinational p-creativity, the creativity at play when someone combines existing knowledge to achieve a solution novel to that individual. We present generation strategies for the five problem categories of the benchmark… ▽ More

    Submitted 9 May, 2018; originally announced May 2018.

    Comments: 8 pages, 4 figures, International Conference on Computational Creativity