Skip to main content

Showing 1–34 of 34 results for author: Da Xu, R Y

.
  1. arXiv:2411.15694  [pdf, ps, other

    cs.CL

    Deep Sparse Latent Feature Models for Knowledge Graph Completion

    Authors: Haotian Li, Rui Zhang, Lingzhi Wang, Bin Yu, Youwei Wang, Yuliang Wei, Kai Wang, Richard Yi Da Xu, Bailing Wang

    Abstract: Recent advances in knowledge graph completion (KGC) have emphasized text-based approaches to navigate the inherent complexities of large-scale knowledge graphs (KGs). While these methods have achieved notable progress, they frequently struggle to fully incorporate the global structural properties of the graph. Stochastic blockmodels (SBMs), especially the latent feature relational model (LFRM), of… ▽ More

    Submitted 12 June, 2025; v1 submitted 23 November, 2024; originally announced November 2024.

  2. arXiv:2408.08703  [pdf, other

    cs.CV

    TsCA: On the Semantic Consistency Alignment via Conditional Transport for Compositional Zero-Shot Learning

    Authors: Miaoge Li, Jingcai Guo, Richard Yi Da Xu, Dongsheng Wang, Xiaofeng Cao, Zhijie Rao, Song Guo

    Abstract: Compositional Zero-Shot Learning (CZSL) aims to recognize novel state-object compositions by leveraging the shared knowledge of their primitive components. Despite considerable progress, effectively calibrating the bias between semantically similar multimodal representations, as well as generalizing pre-trained knowledge to novel compositional contexts, remains an enduring challenge. In this paper… ▽ More

    Submitted 25 January, 2025; v1 submitted 16 August, 2024; originally announced August 2024.

    Comments: 9 pages, 5 figures

  3. arXiv:2406.01885  [pdf, other

    math.OC

    Nonlinear Eigen-approach ADMM for Sparse Optimization on Stiefel Manifold

    Authors: Jiawei Wang, Rencang Li, Richard Yi Da Xu

    Abstract: With the growing interest and applications in machine learning and data science, finding an efficient method to sparse analysis the high-dimensional data and optimizing a dimension reduction model to extract lower dimensional features has becoming more and more important. Orthogonal constraints (Stiefel manifold) is a commonly met constraint in these applications, and the sparsity is usually enfor… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  4. arXiv:2309.14770  [pdf, other

    cs.CL

    KERMIT: Knowledge Graph Completion of Enhanced Relation Modeling with Inverse Transformation

    Authors: Haotian Li, Bin Yu, Yuliang Wei, Kai Wang, Richard Yi Da Xu, Bailing Wang

    Abstract: Knowledge graph completion (KGC) revolves around populating missing triples in a knowledge graph using available information. Text-based methods, which depend on textual descriptions of triples, often encounter difficulties when these descriptions lack sufficient information for accurate prediction-an issue inherent to the datasets and not easily resolved through modeling alone. To address this an… ▽ More

    Submitted 6 April, 2025; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: Accepted to Knowledge-Based Systems

  5. arXiv:2301.07272  [pdf, other

    cs.LG eess.SP

    A variational autoencoder-based nonnegative matrix factorisation model for deep dictionary learning

    Authors: Hong-Bo Xie, Caoyuan Li, Shuliang Wang, Richard Yi Da Xu, Kerrie Mengersen

    Abstract: Construction of dictionaries using nonnegative matrix factorisation (NMF) has extensive applications in signal processing and machine learning. With the advances in deep learning, training compact and robust dictionaries using deep neural networks, i.e., dictionaries of deep features, has been proposed. In this study, we propose a probabilistic generative model which employs a variational autoenco… ▽ More

    Submitted 17 January, 2023; originally announced January 2023.

    Comments: 7 pages, 2 figures

  6. arXiv:2301.06657   

    cs.CV

    Free Lunch for Generating Effective Outlier Supervision

    Authors: Sen Pei, Jiaxi Sun, Richard Yi Da Xu, Bin Fan, Shiming Xiang, Gaofeng Meng

    Abstract: When deployed in practical applications, computer vision systems will encounter numerous unexpected images (\emph{i.e.}, out-of-distribution data). Due to the potentially raised safety risks, these aforementioned unseen data should be carefully identified and handled. Generally, existing approaches in dealing with out-of-distribution (OOD) detection mainly focus on the statistical difference betwe… ▽ More

    Submitted 17 January, 2024; v1 submitted 16 January, 2023; originally announced January 2023.

    Comments: We have rewritten this paper, and published as "Image Background Serves as Good Proxy for Out-of-distribution Data" arXiv:2307.00519

  7. arXiv:2207.12194  [pdf, other

    cs.CV

    Domain Decorrelation with Potential Energy Ranking

    Authors: Sen Pei, Jiaxi Sun, Richard Yi Da Xu, Shiming Xiang, Gaofeng Meng

    Abstract: Machine learning systems, especially the methods based on deep learning, enjoy great success in modern computer vision tasks under experimental settings. Generally, these classic deep learning methods are built on the \emph{i.i.d.} assumption, supposing the training and test data are drawn from a similar distribution independently and identically. However, the aforementioned \emph{i.i.d.} assumpti… ▽ More

    Submitted 16 December, 2022; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: 2022 ECCV jury award, accepted by AAAI 2023

    Journal ref: AAAI 2023 Oral

  8. arXiv:2202.01958  [pdf, other

    cs.LG

    Demystify Optimization and Generalization of Over-parameterized PAC-Bayesian Learning

    Authors: Wei Huang, Chunrui Liu, Yilan Chen, Tianyu Liu, Richard Yi Da Xu

    Abstract: PAC-Bayesian is an analysis framework where the training error can be expressed as the weighted average of the hypotheses in the posterior distribution whilst incorporating the prior knowledge. In addition to being a pure generalization bound analysis tool, PAC-Bayesian bound can also be incorporated into an objective function to train a probabilistic neural network, making them a powerful and rel… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

    Comments: 19pages, 5 figures

  9. arXiv:2109.09038  [pdf, other

    cs.LG cs.MA stat.ML

    Regularize! Don't Mix: Multi-Agent Reinforcement Learning without Explicit Centralized Structures

    Authors: Chapman Siu, Jason Traish, Richard Yi Da Xu

    Abstract: We propose using regularization for Multi-Agent Reinforcement Learning rather than learning explicit cooperative structures called {\em Multi-Agent Regularized Q-learning} (MARQ). Many MARL approaches leverage centralized structures in order to exploit global state information or removing communication constraints when the agents act in a decentralized manner. Instead of learning redundant structu… ▽ More

    Submitted 18 September, 2021; originally announced September 2021.

  10. arXiv:2109.09037  [pdf, other

    cs.LG stat.ML

    Dual Behavior Regularized Reinforcement Learning

    Authors: Chapman Siu, Jason Traish, Richard Yi Da Xu

    Abstract: Reinforcement learning has been shown to perform a range of complex tasks through interaction with an environment or collected leveraging experience. However, many of these approaches presume optimal or near optimal experiences or the presence of a consistent environment. In this work we propose dual, advantage-based behavior policy based on counterfactual regret minimization. We demonstrate the f… ▽ More

    Submitted 18 September, 2021; originally announced September 2021.

  11. arXiv:2109.09034  [pdf, other

    cs.LG cs.MA stat.ML

    Greedy UnMixing for Q-Learning in Multi-Agent Reinforcement Learning

    Authors: Chapman Siu, Jason Traish, Richard Yi Da Xu

    Abstract: This paper introduces Greedy UnMix (GUM) for cooperative multi-agent reinforcement learning (MARL). Greedy UnMix aims to avoid scenarios where MARL methods fail due to overestimation of values as part of the large joint state-action space. It aims to address this through a conservative Q-learning approach through restricting the state-marginal in the dataset to avoid unobserved joint state action… ▽ More

    Submitted 18 September, 2021; originally announced September 2021.

  12. arXiv:2108.02353  [pdf, other

    cs.CV

    Alleviating Mode Collapse in GAN via Diversity Penalty Module

    Authors: Sen Pei, Richard Yi Da Xu, Shiming Xiang, Gaofeng Meng

    Abstract: The vanilla GAN (Goodfellow et al. 2014) suffers from mode collapse deeply, which usually manifests as that the images generated by generators tend to have a high similarity amongst them, even though their corresponding latent vectors have been very different. In this paper, we introduce a pluggable diversity penalty module (DPM) to alleviate mode collapse of GANs. It reduces the similarity of ima… ▽ More

    Submitted 13 September, 2021; v1 submitted 4 August, 2021; originally announced August 2021.

  13. arXiv:2103.03113  [pdf, other

    cs.LG cs.AI

    Towards Deepening Graph Neural Networks: A GNTK-based Optimization Perspective

    Authors: Wei Huang, Yayong Li, Weitao Du, Jie Yin, Richard Yi Da Xu, Ling Chen, Miao Zhang

    Abstract: Graph convolutional networks (GCNs) and their variants have achieved great success in dealing with graph-structured data. Nevertheless, it is well known that deep GCNs suffer from the over-smoothing problem, where node representations tend to be indistinguishable as more layers are stacked up. The theoretical research to date on deep GCNs has focused primarily on expressive power rather than train… ▽ More

    Submitted 21 April, 2022; v1 submitted 3 March, 2021; originally announced March 2021.

    Comments: 26 pages

    Journal ref: ICLR 2022

  14. arXiv:2101.04544  [pdf, other

    cs.CV

    Resolution-invariant Person ReID Based on Feature Transformation and Self-weighted Attention

    Authors: Ziyue Zhang, Shuai Jiang, Congzhentao Huang, Richard Yi Da Xu

    Abstract: Person Re-identification (ReID) is a critical computer vision task which aims to match the same person in images or video sequences. Most current works focus on settings where the resolution of images is kept the same. However, the resolution is a crucial factor in person ReID, especially when the cameras are at different distances from the person or the camera's models are different from each oth… ▽ More

    Submitted 17 January, 2021; v1 submitted 12 January, 2021; originally announced January 2021.

  15. arXiv:2101.01532  [pdf

    stat.AP physics.bio-ph physics.soc-ph q-bio.PE

    Bayesian data assimilation for estimating epidemic evolution: a COVID-19 study

    Authors: Xian Yang, Shuo Wang, Yuting Xing, Ling Li, Richard Yi Da Xu, Karl J. Friston, Yike Guo

    Abstract: The evolution of epidemiological parameters, such as instantaneous reproduction number Rt, is important for understanding the transmission dynamics of infectious diseases. Current estimates of time-varying epidemiological parameters often face problems such as lagging observations, averaging inference, and improper quantification of uncertainties. To address these problems, we propose a Bayesian d… ▽ More

    Submitted 24 October, 2021; v1 submitted 22 December, 2020; originally announced January 2021.

    Comments: Xian Yang, Shuo Wang and Yuting Xing contribute equally

  16. arXiv:2011.12547  [pdf, other

    cs.LG stat.ML

    Implicit bias of deep linear networks in the large learning rate phase

    Authors: Wei Huang, Weitao Du, Richard Yi Da Xu, Chunrui Liu

    Abstract: Most theoretical studies explaining the regularization effect in deep learning have only focused on gradient descent with a sufficient small learning rate or even gradient flow (infinitesimal learning rate). Such researches, however, have neglected a reasonably large learning rate applied in most practical applications. In this work, we characterize the implicit bias effect of deep linear networks… ▽ More

    Submitted 16 December, 2020; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: 19 pages, 7 figures

  17. arXiv:2007.07452  [pdf, other

    cs.CV

    RGB-IR Cross-modality Person ReID based on Teacher-Student GAN Model

    Authors: Ziyue Zhang, Shuai Jiang, Congzhentao Huang, Yang Li, Richard Yi Da Xu

    Abstract: RGB-Infrared (RGB-IR) person re-identification (ReID) is a technology where the system can automatically identify the same person appearing at different parts of a video when light is unavailable. The critical challenge of this task is the cross-modality gap of features under different modalities. To solve this challenge, we proposed a Teacher-Student GAN model (TS-GAN) to adopt different domains… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

    Comments: 8 pages including 1 page reference

  18. arXiv:2004.05867  [pdf, other

    cs.LG stat.ML

    On the Neural Tangent Kernel of Deep Networks with Orthogonal Initialization

    Authors: Wei Huang, Weitao Du, Richard Yi Da Xu

    Abstract: The prevailing thinking is that orthogonal weights are crucial to enforcing dynamical isometry and speeding up training. The increase in learning speed that results from orthogonal initialization in linear networks has been well-proven. However, while the same is believed to also hold for nonlinear networks when the dynamical isometry condition is satisfied, the training dynamics behind this conte… ▽ More

    Submitted 21 July, 2021; v1 submitted 13 April, 2020; originally announced April 2020.

    Comments: revised theorems and completed proofs

    Journal ref: IJCAI 2021

  19. arXiv:1912.09593  [pdf, other

    cs.LG cs.IR stat.ML

    Gaussian Process Latent Variable Model Factorization for Context-aware Recommender Systems

    Authors: Wei Huang, Richard Yi Da Xu

    Abstract: Context-aware recommender systems (CARS) have gained increasing attention due to their ability to utilize contextual information. Compared to traditional recommender systems, CARS are, in general, able to generate more accurate recommendations. Latent factors approach accounts for a large proportion of CARS. Recently, a non-linear Gaussian Process (GP) based factorization method was proven to outp… ▽ More

    Submitted 19 December, 2019; originally announced December 2019.

    Comments: 8 pages, 5 figures

  20. arXiv:1912.09132  [pdf, other

    cs.LG stat.ML

    Mean field theory for deep dropout networks: digging up gradient backpropagation deeply

    Authors: Wei Huang, Richard Yi Da Xu, Weitao Du, Yutian Zeng, Yunce Zhao

    Abstract: In recent years, the mean field theory has been applied to the study of neural networks and has achieved a great deal of success. The theory has been applied to various neural network structures, including CNNs, RNNs, Residual networks, and Batch normalization. Inevitably, recent work has also covered the use of dropout. The mean field theory shows that the existence of depth scales that limit the… ▽ More

    Submitted 13 April, 2020; v1 submitted 19 December, 2019; originally announced December 2019.

    Comments: 20 pages, 7 figures

    Journal ref: 24th European Conference on Artificial Intelligence - ECAI 2020

  21. arXiv:1807.05515  [pdf, other

    cs.LG stat.ML

    Magnitude Bounded Matrix Factorisation for Recommender Systems

    Authors: Shuai Jiang, Kan Li, Richard Yi Da Xu

    Abstract: Low rank matrix factorisation is often used in recommender systems as a way of extracting latent features. When dealing with large and sparse datasets, traditional recommendation algorithms face the problem of acquiring large, unrestrained, fluctuating values over predictions especially for users/items with very few corresponding observations. Although the problem has been somewhat solved by impos… ▽ More

    Submitted 15 July, 2018; originally announced July 2018.

    Comments: 11 pages, 6 figures, TNNLS

  22. arXiv:1806.04308  [pdf, ps, other

    stat.ML cs.LG

    Diverse Online Feature Selection

    Authors: Chapman Siu, Richard Yi Da Xu

    Abstract: Online feature selection has been an active research area in recent years. We propose a novel diverse online feature selection method based on Determinantal Point Processes (DPP). Our model aims to provide diverse features which can be composed in either a supervised or unsupervised framework. The framework aims to promote diversity based on the kernel produced on a feature level, through at most… ▽ More

    Submitted 24 April, 2019; v1 submitted 11 June, 2018; originally announced June 2018.

  23. arXiv:1707.05420  [pdf, other

    cs.LG stat.ML

    Cooperative Hierarchical Dirichlet Processes: Superposition vs. Maximization

    Authors: Junyu Xuan, Jie Lu, Guangquan Zhang, Richard Yi Da Xu

    Abstract: The cooperative hierarchical structure is a common and significant data structure observed in, or adopted by, many research areas, such as: text mining (author-paper-word) and multi-label classification (label-instance-feature). Renowned Bayesian approaches for cooperative hierarchical structure modeling are mostly based on topic models. However, these approaches suffer from a serious issue in tha… ▽ More

    Submitted 17 July, 2017; originally announced July 2017.

  24. arXiv:1606.08105  [pdf, ps, other

    stat.ML

    The Dependent Random Measures with Independent Increments in Mixture Models

    Authors: Cheng Luo, Richard Yi Da Xu, Yang Xiang

    Abstract: When observations are organized into groups where commonalties exist amongst them, the dependent random measures can be an ideal choice for modeling. One of the propositions of the dependent random measures is that the atoms of the posterior distribution are shared amongst groups, and hence groups can borrow information from each other. When normalized dependent random measures prior with independ… ▽ More

    Submitted 26 June, 2016; originally announced June 2016.

  25. arXiv:1604.04741  [pdf, other

    stat.ML

    Smoothed Hierarchical Dirichlet Process: A Non-Parametric Approach to Constraint Measures

    Authors: Cheng Luo, Yang Xiang, Richard Yi Da Xu

    Abstract: Time-varying mixture densities occur in many scenarios, for example, the distributions of keywords that appear in publications may evolve from year to year, video frame features associated with multiple targets may evolve in a sequence. Any models that realistically cater to this phenomenon must exhibit two important properties: the underlying mixture densities must have an unknown number of mixtu… ▽ More

    Submitted 16 April, 2016; originally announced April 2016.

  26. arXiv:1602.03048  [pdf, other

    stat.ML

    Bayesian nonparametric image segmentation using a generalized Swendsen-Wang algorithm

    Authors: Richard Yi Da Xu, Francois Caron, Arnaud Doucet

    Abstract: Unsupervised image segmentation aims at clustering the set of pixels of an image into spatially homogeneous regions. We introduce here a class of Bayesian nonparametric models to address this problem. These models are based on a combination of a Potts-like spatial smoothness component and a prior on partitions which is used to control both the number and size of clusters. This class of models is f… ▽ More

    Submitted 9 February, 2016; originally announced February 2016.

  27. arXiv:1507.03176  [pdf, other

    stat.ML

    Dependent Indian Buffet Process-based Sparse Nonparametric Nonnegative Matrix Factorization

    Authors: Junyu Xuan, Jie Lu, Guangquan Zhang, Richard Yi Da Xu, Xiangfeng Luo

    Abstract: Nonnegative Matrix Factorization (NMF) aims to factorize a matrix into two optimized nonnegative matrices appropriate for the intended applications. The method has been widely used for unsupervised learning tasks, including recommender systems (rating matrix of users by items) and document clustering (weighting matrix of papers by keywords). However, traditional NMF methods typically assume the nu… ▽ More

    Submitted 11 July, 2015; originally announced July 2015.

    Comments: 14 pages, 10 figures

  28. arXiv:1503.08542  [pdf, other

    stat.ML cs.CL cs.IR cs.LG

    Nonparametric Relational Topic Models through Dependent Gamma Processes

    Authors: Junyu Xuan, Jie Lu, Guangquan Zhang, Richard Yi Da Xu, Xiangfeng Luo

    Abstract: Traditional Relational Topic Models provide a way to discover the hidden topics from a document network. Many theoretical and practical tasks, such as dimensional reduction, document clustering, link prediction, benefit from this revealed knowledge. However, existing relational topic models are based on an assumption that the number of hidden topics is known in advance, and this is impractical in… ▽ More

    Submitted 30 March, 2015; originally announced March 2015.

  29. arXiv:1503.08535  [pdf, ps, other

    stat.ML cs.IR cs.LG

    Infinite Author Topic Model based on Mixed Gamma-Negative Binomial Process

    Authors: Junyu Xuan, Jie Lu, Guangquan Zhang, Richard Yi Da Xu, Xiangfeng Luo

    Abstract: Incorporating the side information of text corpus, i.e., authors, time stamps, and emotional tags, into the traditional text mining models has gained significant interests in the area of information retrieval, statistical natural language processing, and machine learning. One branch of these works is the so-called Author Topic Model (ATM), which incorporates the authors's interests as side informa… ▽ More

    Submitted 30 March, 2015; originally announced March 2015.

    Comments: 10 pages, 5 figures, submitted to KDD conference

  30. arXiv:1503.02761  [pdf, other

    stat.ML cs.LG

    An Adaptive Online HDP-HMM for Segmentation and Classification of Sequential Data

    Authors: Ava Bargi, Richard Yi Da Xu, Massimo Piccardi

    Abstract: In the recent years, the desire and need to understand sequential data has been increasing, with particular interest in sequential contexts such as patient monitoring, understanding daily activities, video surveillance, stock market and the like. Along with the constant flow of data, it is critical to classify and segment the observations on-the-fly, without being limited to a rigid number of clas… ▽ More

    Submitted 12 March, 2015; v1 submitted 9 March, 2015; originally announced March 2015.

    Comments: 23 pages, 9 figures and 4 tables

  31. arXiv:1310.1545  [pdf, ps, other

    cs.LG cs.SI stat.ML

    Learning Hidden Structures with Relational Models by Adequately Involving Rich Information in A Network

    Authors: Xuhui Fan, Richard Yi Da Xu, Longbing Cao, Yin Song

    Abstract: Effectively modelling hidden structures in a network is very practical but theoretically challenging. Existing relational models only involve very limited information, namely the binary directional link data, embedded in a network to learn hidden networking structures. There is other rich and meaningful information (e.g., various attributes of entities and more granular information than binary ele… ▽ More

    Submitted 6 October, 2013; originally announced October 2013.

  32. arXiv:1307.0578  [pdf, other

    stat.ML cs.LG

    A non-parametric conditional factor regression model for high-dimensional input and response

    Authors: Ava Bargi, Richard Yi Da Xu, Massimo Piccardi

    Abstract: In this paper, we propose a non-parametric conditional factor regression (NCFR)model for domains with high-dimensional input and response. NCFR enhances linear regression in two ways: a) introducing low-dimensional latent factors leading to dimensionality reduction and b) integrating an Indian Buffet Process as a prior for the latent factors to derive unlimited sparse dimensions. Experimental resu… ▽ More

    Submitted 1 July, 2013; originally announced July 2013.

    Comments: 9 pages, 3 figures, NIPS submission

  33. arXiv:1306.2999  [pdf, ps, other

    cs.SI cs.LG stat.ML

    Dynamic Infinite Mixed-Membership Stochastic Blockmodel

    Authors: Xuhui Fan, Longbing Cao, Richard Yi Da Xu

    Abstract: Directional and pairwise measurements are often used to model inter-relationships in a social network setting. The Mixed-Membership Stochastic Blockmodel (MMSB) was a seminal work in this area, and many of its capabilities were extended since then. In this paper, we propose the \emph{Dynamic Infinite Mixed-Membership stochastic blockModel (DIM3)}, a generalised framework that extends the existing… ▽ More

    Submitted 12 June, 2013; originally announced June 2013.

  34. arXiv:1306.2733  [pdf, ps, other

    cs.LG stat.ML

    Copula Mixed-Membership Stochastic Blockmodel for Intra-Subgroup Correlations

    Authors: Xuhui Fan, Longbing Cao, Richard Yi Da Xu

    Abstract: The \emph{Mixed-Membership Stochastic Blockmodel (MMSB)} is a popular framework for modeling social network relationships. It can fully exploit each individual node's participation (or membership) in a social structure. Despite its powerful representations, this model makes an assumption that the distributions of relational membership indicators between two nodes are independent. Under many social… ▽ More

    Submitted 6 October, 2013; v1 submitted 12 June, 2013; originally announced June 2013.