Skip to main content

Showing 1–50 of 144 results for author: Niu, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.11012  [pdf, other

    cs.AI cs.CL

    A Survey of Task-Oriented Knowledge Graph Reasoning: Status, Applications, and Prospects

    Authors: Guanglin Niu, Bo Li, Yangguang Lin

    Abstract: Knowledge graphs (KGs) have emerged as a powerful paradigm for structuring and leveraging diverse real-world knowledge, which serve as a fundamental technology for enabling cognitive intelligence systems with advanced understanding and reasoning capabilities. Knowledge graph reasoning (KGR) aims to infer new knowledge based on existing facts in KGs, playing a crucial role in applications such as p… ▽ More

    Submitted 27 April, 2025; originally announced June 2025.

    Comments: 45 pages, 17 figures, 12 tables

    ACM Class: I.2.7

  2. arXiv:2505.18909  [pdf, ps, other

    stat.ML cs.LG

    On the Role of Label Noise in the Feature Learning Process

    Authors: Andi Han, Wei Huang, Zhanpeng Zhou, Gang Niu, Wuyang Chen, Junchi Yan, Akiko Takeda, Taiji Suzuki

    Abstract: Deep learning with noisy labels presents significant challenges. In this work, we theoretically characterize the role of label noise from a feature learning perspective. Specifically, we consider a signal-noise data distribution, where each sample comprises a label-dependent signal and label-independent noise, and rigorously analyze the training dynamics of a two-layer convolutional neural network… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

    Comments: Accepted to ICML 2025

  3. arXiv:2505.18568  [pdf, ps, other

    cs.LG cs.AI cs.CV

    Learning without Isolation: Pathway Protection for Continual Learning

    Authors: Zhikang Chen, Abudukelimu Wuerkaixi, Sen Cui, Haoxuan Li, Ding Li, Jingfeng Zhang, Bo Han, Gang Niu, Houfang Liu, Yi Yang, Sifan Yang, Changshui Zhang, Tianling Ren

    Abstract: Deep networks are prone to catastrophic forgetting during sequential task learning, i.e., losing the knowledge about old tasks upon learning new tasks. To this end, continual learning(CL) has emerged, whose existing methods focus mostly on regulating or protecting the parameters associated with the previous tasks. However, parameter protection is often impractical, since the size of parameters for… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

    Comments: 23 pages

  4. arXiv:2504.11798  [pdf, other

    cs.CV

    Neighbor-Based Feature and Index Enhancement for Person Re-Identification

    Authors: Chao Yuan, Tianyi Zhang, Guanglin Niu

    Abstract: Person re-identification (Re-ID) aims to match the same pedestrian in a large gallery with different cameras and views. Enhancing the robustness of the extracted feature representations is a main challenge in Re-ID. Existing methods usually improve feature representation by improving model architecture, but most methods ignore the potential contextual information, which limits the effectiveness of… ▽ More

    Submitted 16 April, 2025; originally announced April 2025.

    Comments: Comment: This paper has been accepted for publication in the 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

  5. arXiv:2503.04151  [pdf, other

    cs.CV cs.AI cs.LG

    Robust Multi-View Learning via Representation Fusion of Sample-Level Attention and Alignment of Simulated Perturbation

    Authors: Jie Xu, Na Zhao, Gang Niu, Masashi Sugiyama, Xiaofeng Zhu

    Abstract: Recently, multi-view learning (MVL) has garnered significant attention due to its ability to fuse discriminative information from multiple views. However, real-world multi-view datasets are often heterogeneous and imperfect, which usually makes MVL methods designed for specific combinations of views lack application potential and limits their effectiveness. To address this issue, we propose a nove… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

  6. arXiv:2503.00938  [pdf, other

    cs.CV

    From Poses to Identity: Training-Free Person Re-Identification via Feature Centralization

    Authors: Chao Yuan, Guiwei Zhang, Changxiao Ma, Tianyi Zhang, Guanglin Niu

    Abstract: Person re-identification (ReID) aims to extract accurate identity representation features. However, during feature extraction, individual samples are inevitably affected by noise (background, occlusions, and model limitations). Considering that features from the same identity follow a normal distribution around identity centers after training, we propose a Training-Free Feature Centralization ReID… ▽ More

    Submitted 11 March, 2025; v1 submitted 2 March, 2025; originally announced March 2025.

  7. arXiv:2502.15803  [pdf, other

    cs.LG cs.CL

    Megrez-Omni Technical Report

    Authors: Boxun Li, Yadong Li, Zhiyuan Li, Congyi Liu, Weilin Liu, Guowei Niu, Zheyue Tan, Haiyang Xu, Zhuyu Yao, Tao Yuan, Dong Zhou, Yueqing Zhuang, Shengen Yan, Guohao Dai, Yu Wang

    Abstract: In this work, we present the Megrez models, comprising a language model (Megrez-3B-Instruct) and a multimodal model (Megrez-3B-Omni). These models are designed to deliver fast inference, compactness, and robust edge-side intelligence through a software-hardware co-design approach. Megrez-3B-Instruct offers several advantages, including high accuracy, high speed, ease of use, and a wide range of ap… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

  8. arXiv:2502.14205  [pdf, other

    cs.LG cs.AI

    Accurate Forgetting for Heterogeneous Federated Continual Learning

    Authors: Abudukelimu Wuerkaixi, Sen Cui, Jingfeng Zhang, Kunda Yan, Bo Han, Gang Niu, Lei Fang, Changshui Zhang, Masashi Sugiyama

    Abstract: Recent years have witnessed a burgeoning interest in federated learning (FL). However, the contexts in which clients engage in sequential learning remain under-explored. Bridging FL and continual learning (CL) gives rise to a challenging practical problem: federated continual learning (FCL). Existing research in FCL primarily focuses on mitigating the catastrophic forgetting issue of continual lea… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

    Comments: published in ICLR 2024

  9. arXiv:2502.10184  [pdf, other

    cs.LG

    Realistic Evaluation of Deep Partial-Label Learning Algorithms

    Authors: Wei Wang, Dong-Dong Wu, Jindong Wang, Gang Niu, Min-Ling Zhang, Masashi Sugiyama

    Abstract: Partial-label learning (PLL) is a weakly supervised learning problem in which each example is associated with multiple candidate labels and only one is the true label. In recent years, many deep PLL algorithms have been developed to improve model performance. However, we find that some early developed algorithms are often underestimated and can outperform many later algorithms with complicated des… ▽ More

    Submitted 14 February, 2025; originally announced February 2025.

    Comments: ICLR 2025 Spotlight

  10. arXiv:2501.15393  [pdf, other

    cs.AI cs.CL

    Diffusion-based Hierarchical Negative Sampling for Multimodal Knowledge Graph Completion

    Authors: Guanglin Niu, Xiaowei Zhang

    Abstract: Multimodal Knowledge Graph Completion (MMKGC) aims to address the critical issue of missing knowledge in multimodal knowledge graphs (MMKGs) for their better applications. However, both the previous MMGKC and negative sampling (NS) approaches ignore the employment of multimodal information to generate diverse and high-quality negative triples from various semantic levels and hardness levels, there… ▽ More

    Submitted 25 January, 2025; originally announced January 2025.

    Comments: The version of a full paper accepted to DASFAA 2025

    ACM Class: I.2.7

  11. arXiv:2501.05851  [pdf, other

    cs.CV

    Identity-aware Feature Decoupling Learning for Clothing-change Person Re-identification

    Authors: Haoxuan Xu, Bo Li, Guanglin Niu

    Abstract: Clothing-change person re-identification (CC Re-ID) has attracted increasing attention in recent years due to its application prospect. Most existing works struggle to adequately extract the ID-related information from the original RGB images. In this paper, we propose an Identity-aware Feature Decoupling (IFD) learning framework to mine identity-related features. Particularly, IFD exploits a dual… ▽ More

    Submitted 10 January, 2025; originally announced January 2025.

    Comments: Accepted by ICASSP2025

  12. arXiv:2412.00452  [pdf, other

    cs.LG cs.CV

    Learning Locally, Revising Globally: Global Reviser for Federated Learning with Noisy Labels

    Authors: Yuxin Tian, Mouxing Yang, Yuhao Zhou, Jian Wang, Qing Ye, Tongliang Liu, Gang Niu, Jiancheng Lv

    Abstract: The success of most federated learning (FL) methods heavily depends on label quality, which is often inaccessible in real-world scenarios, such as medicine, leading to the federated label-noise (F-LN) problem. In this study, we observe that the global model of FL memorizes the noisy labels slowly. Based on the observations, we propose a novel approach dubbed Global Reviser for Federated Learning w… ▽ More

    Submitted 30 November, 2024; originally announced December 2024.

    Comments: 19 pages

  13. arXiv:2411.15785  [pdf, other

    cs.CL

    A Method for Building Large Language Models with Predefined KV Cache Capacity

    Authors: Zhonghua Yi, Ge Niu, Lei Wang, Wei Tang, Liqiu Zhang

    Abstract: This paper introduces a novel approach, the Bounded-Cache Transformer (BCT), for building large language models with a predefined Key-Value (KV) cache capacity. The BCT addresses the excessive memory consumption issue in traditional KV caches by implementing a bounded-length KV cache, which is particularly suitable for the attention layers in Transformer decode-only architectures. By dynamically u… ▽ More

    Submitted 26 November, 2024; v1 submitted 24 November, 2024; originally announced November 2024.

  14. arXiv:2411.02310  [pdf, other

    cs.CL

    MdEval: Massively Multilingual Code Debugging

    Authors: Shukai Liu, Linzheng Chai, Jian Yang, Jiajun Shi, He Zhu, Liran Wang, Ke Jin, Wei Zhang, Hualei Zhu, Shuyue Guo, Tao Sun, Jiaheng Liu, Yunlong Duan, Yu Hao, Liqun Yang, Guanglin Niu, Ge Zhang, Zhoujun Li

    Abstract: Code large language models (LLMs) have made significant progress in code debugging by directly generating the correct code based on the buggy code snippet. Programming benchmarks, typically consisting of buggy code snippet and their associated test cases, are used to assess the debugging capabilities of LLMs. However, many existing benchmarks primarily focus on Python and are often limited in term… ▽ More

    Submitted 24 February, 2025; v1 submitted 4 November, 2024; originally announced November 2024.

    Comments: 15 pages

  15. arXiv:2410.14733  [pdf

    cs.LG cs.AI cs.CL

    Knowledge Graph Embeddings: A Comprehensive Survey on Capturing Relation Properties

    Authors: Guanglin Niu

    Abstract: Knowledge Graph Embedding (KGE) techniques play a pivotal role in transforming symbolic Knowledge Graphs (KGs) into numerical representations, thereby enhancing various deep learning models for knowledge-augmented applications. Unlike entities, relations in KGs are the carriers of semantic meaning, and their accurate modeling is crucial for the performance of KGE models. Firstly, we address the co… ▽ More

    Submitted 20 March, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: 22 pages, 8 figures, 3 tables, this paper is a modified English version of our article already published in Computer Science journal (in Chinese), released to facilitate communication among international researchers in the relevant fields

    ACM Class: I.2.7

  16. arXiv:2410.13567  [pdf, other

    cs.CV cs.AI

    CCUP: A Controllable Synthetic Data Generation Pipeline for Pretraining Cloth-Changing Person Re-Identification Models

    Authors: Yujian Zhao, Chengru Wu, Yinong Xu, Xuanzheng Du, Ruiyu Li, Guanglin Niu

    Abstract: Cloth-changing person re-identification (CC-ReID), also known as Long-Term Person Re-Identification (LT-ReID) is a critical and challenging research topic in computer vision that has recently garnered significant attention. However, due to the high cost of constructing CC-ReID data, the existing data-driven models are hard to train efficiently on limited data, causing overfitting issue. To address… ▽ More

    Submitted 30 March, 2025; v1 submitted 17 October, 2024; originally announced October 2024.

    Comments: Accepted by ICME 2025

  17. arXiv:2410.04488  [pdf, other

    cs.AI cs.CL

    A Pluggable Common Sense-Enhanced Framework for Knowledge Graph Completion

    Authors: Guanglin Niu, Bo Li, Siling Feng

    Abstract: Knowledge graph completion (KGC) tasks aim to infer missing facts in a knowledge graph (KG) for many knowledge-intensive applications. However, existing embedding-based KGC approaches primarily rely on factual triples, potentially leading to outcomes inconsistent with common sense. Besides, generating explicit common sense is often impractical or costly for a KG. To address these challenges, we pr… ▽ More

    Submitted 6 October, 2024; originally announced October 2024.

    Comments: 18 pages, 7 figures, 9 tables

    ACM Class: I.2; I.2.4; I.2.7

  18. arXiv:2410.03124  [pdf, other

    cs.CL cs.LG

    In-context Demonstration Matters: On Prompt Optimization for Pseudo-Supervision Refinement

    Authors: Zhen-Yu Zhang, Jiandong Zhang, Huaxiu Yao, Gang Niu, Masashi Sugiyama

    Abstract: Large language models (LLMs) have achieved great success across diverse tasks, and fine-tuning is sometimes needed to further enhance generation quality. Most existing methods rely on human supervision or parameter retraining, both of which are costly in terms of data collection and computational resources. To handle these challenges, a direct solution is to generate ``high-confidence'' data from… ▽ More

    Submitted 26 May, 2025; v1 submitted 3 October, 2024; originally announced October 2024.

  19. arXiv:2409.01944  [pdf, other

    cs.CL

    FuzzCoder: Byte-level Fuzzing Test via Large Language Model

    Authors: Liqun Yang, Jian Yang, Chaoren Wei, Guanglin Niu, Ge Zhang, Yunli Wang, Linzheng ChaI, Wanxu Xia, Hongcheng Guo, Shun Zhang, Jiaheng Liu, Yuwei Yin, Junran Peng, Jiaxin Ma, Liang Sun, Zhoujun Li

    Abstract: Fuzzing is an important dynamic program analysis technique designed for finding vulnerabilities in complex software. Fuzzing involves presenting a target program with crafted malicious input to cause crashes, buffer overflows, memory errors, and exceptions. Crafting malicious inputs in an efficient manner is a difficult open problem and the best approaches often apply uniform random mutations to p… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

    Comments: 11 pages

  20. arXiv:2408.09174  [pdf, other

    cs.CL

    TableBench: A Comprehensive and Complex Benchmark for Table Question Answering

    Authors: Xianjie Wu, Jian Yang, Linzheng Chai, Ge Zhang, Jiaheng Liu, Xinrun Du, Di Liang, Daixin Shu, Xianfu Cheng, Tianzhen Sun, Guanglin Niu, Tongliang Li, Zhoujun Li

    Abstract: Recent advancements in Large Language Models (LLMs) have markedly enhanced the interpretation and processing of tabular data, introducing previously unimaginable capabilities. Despite these achievements, LLMs still encounter significant challenges when applied in industrial scenarios, particularly due to the increased complexity of reasoning required with real-world tabular data, underscoring a no… ▽ More

    Submitted 18 March, 2025; v1 submitted 17 August, 2024; originally announced August 2024.

    Comments: 12 pages

  21. arXiv:2407.18624  [pdf, other

    cs.LG

    Dual-Decoupling Learning and Metric-Adaptive Thresholding for Semi-Supervised Multi-Label Learning

    Authors: Jia-Hao Xiao, Ming-Kun Xie, Heng-Bo Fan, Gang Niu, Masashi Sugiyama, Sheng-Jun Huang

    Abstract: Semi-supervised multi-label learning (SSMLL) is a powerful framework for leveraging unlabeled data to reduce the expensive cost of collecting precise multi-label annotations. Unlike semi-supervised learning, one cannot select the most probable label as the pseudo-label in SSMLL due to multiple semantics contained in an instance. To solve this problem, the mainstream method developed an effective t… ▽ More

    Submitted 26 December, 2024; v1 submitted 26 July, 2024; originally announced July 2024.

    Comments: Published in ECCV 2024

  22. arXiv:2406.08288  [pdf, other

    cs.LG

    Decoupling the Class Label and the Target Concept in Machine Unlearning

    Authors: Jianing Zhu, Bo Han, Jiangchao Yao, Jianliang Xu, Gang Niu, Masashi Sugiyama

    Abstract: Machine unlearning as an emerging research topic for data regulations, aims to adjust a trained model to approximate a retrained one that excludes a portion of training data. Previous studies showed that class-wise unlearning is successful in forgetting the knowledge of a target class, through gradient ascent on the forgetting data or fine-tuning with the remaining data. However, while these metho… ▽ More

    Submitted 16 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  23. arXiv:2405.18890  [pdf, other

    cs.LG cs.DC

    Locally Estimated Global Perturbations are Better than Local Perturbations for Federated Sharpness-aware Minimization

    Authors: Ziqing Fan, Shengchao Hu, Jiangchao Yao, Gang Niu, Ya Zhang, Masashi Sugiyama, Yanfeng Wang

    Abstract: In federated learning (FL), the multi-step update and data heterogeneity among clients often lead to a loss landscape with sharper minima, degenerating the performance of the resulted global model. Prevalent federated approaches incorporate sharpness-aware minimization (SAM) into local training to mitigate this problem. However, the local loss landscapes may not accurately reflect the flatness of… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  24. arXiv:2405.09892  [pdf, other

    cs.LG cs.DC

    Balancing Similarity and Complementarity for Federated Learning

    Authors: Kunda Yan, Sen Cui, Abudukelimu Wuerkaixi, Jingfeng Zhang, Bo Han, Gang Niu, Masashi Sugiyama, Changshui Zhang

    Abstract: In mobile and IoT systems, Federated Learning (FL) is increasingly important for effectively using data while maintaining user privacy. One key challenge in FL is managing statistical heterogeneity, such as non-i.i.d. data, arising from numerous clients and diverse data sources. This requires strategic cooperation, often with clients having similar characteristics. However, we are interested in a… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  25. arXiv:2404.06287  [pdf, other

    cs.CV cs.LG

    Counterfactual Reasoning for Multi-Label Image Classification via Patching-Based Training

    Authors: Ming-Kun Xie, Jia-Hao Xiao, Pei Peng, Gang Niu, Masashi Sugiyama, Sheng-Jun Huang

    Abstract: The key to multi-label image classification (MLC) is to improve model performance by leveraging label correlations. Unfortunately, it has been shown that overemphasizing co-occurrence relationships can cause the overfitting issue of the model, ultimately leading to performance degradation. In this paper, we provide a causal inference framework to show that the correlative features caused by the ta… ▽ More

    Submitted 12 June, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  26. arXiv:2402.06918  [pdf, other

    cs.LG cs.AI cs.CL

    Generating Chain-of-Thoughts with a Pairwise-Comparison Approach to Searching for the Most Promising Intermediate Thought

    Authors: Zhen-Yu Zhang, Siwei Han, Huaxiu Yao, Gang Niu, Masashi Sugiyama

    Abstract: To improve the ability of the large language model (LLMs) to tackle complex reasoning problems, chain-of-thoughts (CoT) methods were proposed to guide LLMs to reason step-by-step, enabling problem solving from simple to complex. State-of-the-art methods for generating such a chain involve interactive collaboration, where the learner generates candidate intermediate thoughts, evaluated by the LLM,… ▽ More

    Submitted 26 June, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  27. arXiv:2401.06826  [pdf, other

    cs.LG cs.AI cs.CV

    Direct Distillation between Different Domains

    Authors: Jialiang Tang, Shuo Chen, Gang Niu, Hongyuan Zhu, Joey Tianyi Zhou, Chen Gong, Masashi Sugiyama

    Abstract: Knowledge Distillation (KD) aims to learn a compact student network using knowledge from a large pre-trained teacher network, where both networks are trained on data from the same distribution. However, in practical applications, the student network may be required to perform in a new scenario (i.e., the target domain), which usually exhibits significant differences from the known scenario of the… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  28. arXiv:2311.15502  [pdf, other

    cs.LG

    Learning with Complementary Labels Revisited: The Selected-Completely-at-Random Setting Is More Practical

    Authors: Wei Wang, Takashi Ishida, Yu-Jie Zhang, Gang Niu, Masashi Sugiyama

    Abstract: Complementary-label learning is a weakly supervised learning problem in which each training example is associated with one or multiple complementary labels indicating the classes to which it does not belong. Existing consistent approaches have relied on the uniform distribution assumption to model the generation of complementary labels, or on an ordinary-label training set to estimate the transiti… ▽ More

    Submitted 11 October, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

    Comments: ICML 2024

  29. arXiv:2310.13923  [pdf, other

    cs.LG

    Diversified Outlier Exposure for Out-of-Distribution Detection via Informative Extrapolation

    Authors: Jianing Zhu, Geng Yu, Jiangchao Yao, Tongliang Liu, Gang Niu, Masashi Sugiyama, Bo Han

    Abstract: Out-of-distribution (OOD) detection is important for deploying reliable machine learning models on real-world applications. Recent advances in outlier exposure have shown promising results on OOD detection via fine-tuning model with informatively sampled auxiliary outliers. However, previous methods assume that the collected outliers can be sufficiently large and representative to cover the bounda… ▽ More

    Submitted 26 October, 2023; v1 submitted 21 October, 2023; originally announced October 2023.

    Comments: accepted by NeurIPS 2023

  30. arXiv:2310.07351  [pdf, other

    cs.LG

    Atom-Motif Contrastive Transformer for Molecular Property Prediction

    Authors: Wentao Yu, Shuo Chen, Chen Gong, Gang Niu, Masashi Sugiyama

    Abstract: Recently, Graph Transformer (GT) models have been widely used in the task of Molecular Property Prediction (MPP) due to their high reliability in characterizing the latent relationship among graph nodes (i.e., the atoms in a molecule). However, most existing GT-based methods usually explore the basic interactions between pairwise atoms, and thus they fail to consider the important interactions amo… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: submit to AAAI-24

  31. arXiv:2310.05632  [pdf, other

    cs.LG

    Binary Classification with Confidence Difference

    Authors: Wei Wang, Lei Feng, Yuchen Jiang, Gang Niu, Min-Ling Zhang, Masashi Sugiyama

    Abstract: Recently, learning with soft labels has been shown to achieve better performance than learning with hard labels in terms of model generalization, calibration, and robustness. However, collecting pointwise labeling confidence for all training examples can be challenging and time-consuming in real-world scenarios. This paper delves into a novel weakly supervised binary classification problem called… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  32. Multi-Label Knowledge Distillation

    Authors: Penghui Yang, Ming-Kun Xie, Chen-Chen Zong, Lei Feng, Gang Niu, Masashi Sugiyama, Sheng-Jun Huang

    Abstract: Existing knowledge distillation methods typically work by imparting the knowledge of output logits or intermediate feature maps from the teacher network to the student network, which is very successful in multi-class single-label learning. However, these methods can hardly be extended to the multi-label learning scenario, where each instance is associated with multiple semantic labels, because the… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV 2023. The first two authors contributed equally to this work

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 17271-17280

  33. arXiv:2307.11469  [pdf, other

    cs.CV cs.AI

    Distribution Shift Matters for Knowledge Distillation with Webly Collected Images

    Authors: Jialiang Tang, Shuo Chen, Gang Niu, Masashi Sugiyama, Chen Gong

    Abstract: Knowledge distillation aims to learn a lightweight student network from a pre-trained teacher network. In practice, existing knowledge distillation methods are usually infeasible when the original training data is unavailable due to some privacy issues and data management considerations. Therefore, data-free knowledge distillation approaches proposed to collect training instances from the Internet… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

  34. arXiv:2307.05948  [pdf, other

    cs.LG

    Diversity-enhancing Generative Network for Few-shot Hypothesis Adaptation

    Authors: Ruijiang Dong, Feng Liu, Haoang Chi, Tongliang Liu, Mingming Gong, Gang Niu, Masashi Sugiyama, Bo Han

    Abstract: Generating unlabeled data has been recently shown to help address the few-shot hypothesis adaptation (FHA) problem, where we aim to train a classifier for the target domain with a few labeled target-domain data and a well-trained source-domain classifier (i.e., a source hypothesis), for the additional information of the highly-compatible unlabeled data. However, the generated data of the existing… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

  35. arXiv:2306.11343  [pdf, ps, other

    cs.LG

    A Universal Unbiased Method for Classification from Aggregate Observations

    Authors: Zixi Wei, Lei Feng, Bo Han, Tongliang Liu, Gang Niu, Xiaofeng Zhu, Heng Tao Shen

    Abstract: In conventional supervised classification, true labels are required for individual instances. However, it could be prohibitive to collect the true labels for individual instances, due to privacy concerns or unaffordable annotation costs. This motivates the study on classification from aggregate observations (CFAO), where the supervision is provided to groups of instances, instead of individual ins… ▽ More

    Submitted 27 June, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

  36. arXiv:2306.07036  [pdf, other

    cs.LG

    Making Binary Classification from Multiple Unlabeled Datasets Almost Free of Supervision

    Authors: Yuhao Wu, Xiaobo Xia, Jun Yu, Bo Han, Gang Niu, Masashi Sugiyama, Tongliang Liu

    Abstract: Training a classifier exploiting a huge amount of supervised data is expensive or even prohibited in a situation, where the labeling cost is high. The remarkable progress in working with weaker forms of supervision is binary classification from multiple unlabeled datasets which requires the knowledge of exact class priors for all unlabeled datasets. However, the availability of class priors is res… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: 38 pages, 5 figures, 10 tables

  37. arXiv:2305.14690  [pdf, other

    cs.LG

    Generalizing Importance Weighting to A Universal Solver for Distribution Shift Problems

    Authors: Tongtong Fang, Nan Lu, Gang Niu, Masashi Sugiyama

    Abstract: Distribution shift (DS) may have two levels: the distribution itself changes, and the support (i.e., the set where the probability density is non-zero) also changes. When considering the support change between the training and test distributions, there can be four cases: (i) they exactly match; (ii) the training support is wider (and thus covers the test support); (iii) the test support is wider;… ▽ More

    Submitted 1 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 camera-ready version (this paper was selected for spotlight presentation)

  38. arXiv:2305.08344  [pdf, other

    cs.LG

    Enhancing Label Sharing Efficiency in Complementary-Label Learning with Label Augmentation

    Authors: Wei-I Lin, Gang Niu, Hsuan-Tien Lin, Masashi Sugiyama

    Abstract: Complementary-label Learning (CLL) is a form of weakly supervised learning that trains an ordinary classifier using only complementary labels, which are the classes that certain instances do not belong to. While existing CLL studies typically use novel loss functions or training techniques to solve this problem, few studies focus on how complementary labels collectively provide information to trai… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  39. arXiv:2305.06080  [pdf, other

    cs.CV cs.LG

    Towards Effective Visual Representations for Partial-Label Learning

    Authors: Shiyu Xia, Jiaqi Lv, Ning Xu, Gang Niu, Xin Geng

    Abstract: Under partial-label learning (PLL) where, for each training instance, only a set of ambiguous candidate labels containing the unknown true label is accessible, contrastive learning has recently boosted the performance of PLL on vision tasks, attributed to representations learned by contrasting the same/different classes of entities. Without access to true labels, positive points are predicted usin… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

  40. arXiv:2305.02795  [pdf, other

    cs.LG

    Class-Distribution-Aware Pseudo Labeling for Semi-Supervised Multi-Label Learning

    Authors: Ming-Kun Xie, Jia-Hao Xiao, Hao-Zhe Liu, Gang Niu, Masashi Sugiyama, Sheng-Jun Huang

    Abstract: Pseudo-labeling has emerged as a popular and effective approach for utilizing unlabeled data. However, in the context of semi-supervised multi-label learning (SSMLL), conventional pseudo-labeling methods encounter difficulties when dealing with instances associated with multiple labels and an unknown label count. These limitations often result in the introduction of false positive labels or the ne… ▽ More

    Submitted 20 May, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

  41. arXiv:2305.00399  [pdf, other

    cs.CR

    Assessing Vulnerabilities of Adversarial Learning Algorithm through Poisoning Attacks

    Authors: Jingfeng Zhang, Bo Song, Bo Han, Lei Liu, Gang Niu, Masashi Sugiyama

    Abstract: Adversarial training (AT) is a robust learning algorithm that can defend against adversarial attacks in the inference phase and mitigate the side effects of corrupted data in the training phase. As such, it has become an indispensable component of many artificial intelligence (AI) systems. However, in high-stake AI applications, it is crucial to understand AT's vulnerabilities to ensure reliable d… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

  42. arXiv:2303.17245  [pdf, other

    cs.LG cs.CV

    Investigating and Mitigating the Side Effects of Noisy Views for Self-Supervised Clustering Algorithms in Practical Multi-View Scenarios

    Authors: Jie Xu, Yazhou Ren, Xiaolong Wang, Lei Feng, Zheng Zhang, Gang Niu, Xiaofeng Zhu

    Abstract: Multi-view clustering (MVC) aims at exploring category structures among multi-view data in self-supervised manners. Multiple views provide more information than single views and thus existing MVC methods can achieve satisfactory performance. However, their performance might seriously degenerate when the views are noisy in practical multi-view scenarios. In this paper, we formally investigate the d… ▽ More

    Submitted 25 March, 2024; v1 submitted 30 March, 2023; originally announced March 2023.

  43. arXiv:2303.12291  [pdf, other

    cs.LG

    Fairness Improves Learning from Noisily Labeled Long-Tailed Data

    Authors: Jiaheng Wei, Zhaowei Zhu, Gang Niu, Tongliang Liu, Sijia Liu, Masashi Sugiyama, Yang Liu

    Abstract: Both long-tailed and noisily labeled data frequently appear in real-world applications and impose significant challenges for learning. Most prior works treat either problem in an isolated way and do not explicitly consider the coupling effects of the two. Our empirical observation reveals that such solutions fail to consistently improve the learning when the dataset is long-tailed with label noise… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: Paper under review

  44. arXiv:2212.04055  [pdf, other

    cs.LG cs.AI

    Mitigating Memorization of Noisy Labels by Clipping the Model Prediction

    Authors: Hongxin Wei, Huiping Zhuang, Renchunzi Xie, Lei Feng, Gang Niu, Bo An, Yixuan Li

    Abstract: In the presence of noisy labels, designing robust loss functions is critical for securing the generalization performance of deep neural networks. Cross Entropy (CE) loss has been shown to be not robust to noisy labels due to its unboundedness. To alleviate this issue, existing works typically design specialized robust losses with the symmetric condition, which usually lead to the underfitting issu… ▽ More

    Submitted 13 June, 2023; v1 submitted 7 December, 2022; originally announced December 2022.

    Comments: Accepted by ICML 2023

  45. arXiv:2211.16865  [pdf, other

    cs.AI cs.CL

    Logic and Commonsense-Guided Temporal Knowledge Graph Completion

    Authors: Guanglin Niu, Bo Li

    Abstract: A temporal knowledge graph (TKG) stores the events derived from the data involving time. Predicting events is extremely challenging due to the time-sensitive property of events. Besides, the previous TKG completion (TKGC) approaches cannot represent both the timeliness and the causality properties of events, simultaneously. To address these challenges, we propose a Logic and Commonsense-Guided Emb… ▽ More

    Submitted 15 May, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

    Comments: The full version of a long paper accepted to AAAI 2023

  46. arXiv:2211.00269  [pdf, other

    cs.LG cs.CR

    Adversarial Training with Complementary Labels: On the Benefit of Gradually Informative Attacks

    Authors: Jianan Zhou, Jianing Zhu, Jingfeng Zhang, Tongliang Liu, Gang Niu, Bo Han, Masashi Sugiyama

    Abstract: Adversarial training (AT) with imperfect supervision is significant but receives limited attention. To push AT towards more practical scenarios, we explore a brand new yet challenging setting, i.e., AT with complementary labels (CLs), which specify a class that a data sample does not belong to. However, the direct combination of AT with existing methods for CLs results in consistent failure, but n… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022

  47. arXiv:2210.02042  [pdf, other

    cs.LG cs.AI cs.DC

    FedMT: Federated Learning with Mixed-type Labels

    Authors: Qiong Zhang, Jing Peng, Xin Zhang, Aline Talhouk, Gang Niu, Xiaoxiao Li

    Abstract: In federated learning (FL), classifiers (e.g., deep networks) are trained on datasets from multiple data centers without exchanging data across them, which improves the sample efficiency. However, the conventional FL setting assumes the same labeling criterion in all data centers involved, thus limiting its practical utility. This limitation becomes particularly notable in domains like disease dia… ▽ More

    Submitted 15 February, 2024; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: 23 pages

  48. arXiv:2206.07314  [pdf, other

    cs.LG cs.CR

    Fast and Reliable Evaluation of Adversarial Robustness with Minimum-Margin Attack

    Authors: Ruize Gao, Jiongxiao Wang, Kaiwen Zhou, Feng Liu, Binghui Xie, Gang Niu, Bo Han, James Cheng

    Abstract: The AutoAttack (AA) has been the most reliable method to evaluate adversarial robustness when considerable computational resources are available. However, the high computational cost (e.g., 100 times more than that of the project gradient descent attack) makes AA infeasible for practitioners with limited computational resources, and also hinders applications of AA in the adversarial training (AT).… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

  49. arXiv:2206.02791  [pdf, other

    cs.LG

    Instance-Dependent Label-Noise Learning with Manifold-Regularized Transition Matrix Estimation

    Authors: De Cheng, Tongliang Liu, Yixiong Ning, Nannan Wang, Bo Han, Gang Niu, Xinbo Gao, Masashi Sugiyama

    Abstract: In label-noise learning, estimating the transition matrix has attracted more and more attention as the matrix plays an important role in building statistically consistent classifiers. However, it is very challenging to estimate the transition matrix T(x), where x denotes the instance, because it is unidentifiable under the instance-dependent noise(IDN). To address this problem, we have noticed tha… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: accepted by CVPR2022

  50. arXiv:2204.03304  [pdf, other

    cs.LG

    Federated Learning from Only Unlabeled Data with Class-Conditional-Sharing Clients

    Authors: Nan Lu, Zhao Wang, Xiaoxiao Li, Gang Niu, Qi Dou, Masashi Sugiyama

    Abstract: Supervised federated learning (FL) enables multiple clients to share the trained model without sharing their labeled data. However, potential clients might even be reluctant to label their own data, which could limit the applicability of FL in practice. In this paper, we show the possibility of unsupervised FL whose model is still a classifier for predicting class labels, if the class-prior probab… ▽ More

    Submitted 11 May, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: ICLR 2022 camera-ready version