Showing 1–22 of 22 results for author: Geng, X

  1. arXiv:2410.21917  [pdf, other]

    stat.ML cs.LG

    Identifiability Analysis of Linear ODE Systems with Hidden Confounders

    Authors: Yuanyuan Wang, Biwei Huang, Wei Huang, Xi Geng, Mingming Gong

    Abstract: The identifiability analysis of linear Ordinary Differential Equation (ODE) systems is a necessary prerequisite for making reliable causal inferences about these systems. While identifiability has been well studied in scenarios where the system is fully observable, the conditions for identifiability remain unexplored when latent variables interact with the system. This paper aims to address this g…

    Submitted 30 October, 2024; v1 submitted 29 October, 2024; originally announced October 2024.

    Comments: 38th Conference on Neural Information Processing Systems (NeurIPS 2024)

  2. arXiv:2311.12293  [pdf]

    stat.ME stat.AP

    Sample size calculation based on the difference in restricted mean time lost for clinical trials with competing risks

    Authors: Xiang Geng, Zhaojin Li, Chengfeng Zhang, Yanjie Wang, Haoning Shen, Zhiheng Huang, Yawen Hou, Zheng Chen

    Abstract: Computation of sample size is important when designing clinical trials. The presence of competing risks makes the design of clinical trials with time-to-event endpoints cumbersome. A model based on the subdistribution hazard ratio (SHR) is commonly used for trials under competing risks. However, this approach has some limitations related to model assumptions and clinical interpretation. Considerin…

    Submitted 20 November, 2023; originally announced November 2023.

  3. arXiv:2310.19491  [pdf, ps, other]

    math.ST cs.LG stat.ML

    Generator Identification for Linear SDEs with Additive and Multiplicative Noise

    Authors: Yuanyuan Wang, Xi Geng, Wei Huang, Biwei Huang, Mingming Gong

    Abstract: In this paper, we present conditions for identifying the generator of a linear stochastic differential equation (SDE) from the distribution of its solution process with a given fixed initial state. These identifiability conditions are crucial in causal inference using linear SDEs as they enable the identification of the post-intervention distributions from its observational distribution. Specifica…

    Submitted 21 January, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

  4. arXiv:2210.05955  [pdf, other]

    stat.ML cs.LG

    Identifiability and Asymptotics in Learning Homogeneous Linear ODE Systems from Discrete Observations

    Authors: Yuanyuan Wang, Wei Huang, Mingming Gong, Xi Geng, Tongliang Liu, Kun Zhang, Dacheng Tao

    Abstract: Ordinary Differential Equations (ODEs) have recently gained a lot of attention in machine learning. However, the theoretical aspects, e.g., identifiability and asymptotic properties of statistical estimation are still obscure. This paper derives a sufficient condition for the identifiability of homogeneous linear ODE systems from a sequence of equally-spaced error-free observations sampled from a…

    Submitted 2 June, 2024; v1 submitted 12 October, 2022; originally announced October 2022.

    Journal ref: Journal of Machine Learning Research 25 (2024) 1-50

  5. arXiv:2009.08607  [pdf, ps, other]

    cs.LG stat.ML

    Compact Learning for Multi-Label Classification

    Authors: Jiaqi Lv, Tianran Wu, Chenglun Peng, Yunpeng Liu, Ning Xu, Xin Geng

    Abstract: Multi-label classification (MLC) studies the problem where each instance is associated with multiple relevant labels, which leads to the exponential growth of output space. MLC encourages a popular framework named label compression (LC) for capturing label dependency with dimension reduction. Nevertheless, most existing LC methods failed to consider the influence of the feature space or misguided…

    Submitted 17 September, 2020; originally announced September 2020.

  6. arXiv:2007.08929  [pdf, other]

    cs.LG stat.ML

    Provably Consistent Partial-Label Learning

    Authors: Lei Feng, Jiaqi Lv, Bo Han, Miao Xu, Gang Niu, Xin Geng, Bo An, Masashi Sugiyama

    Abstract: Partial-label learning (PLL) is a multi-class classification problem, where each training example is associated with a set of candidate labels. Even though many practical PLL methods have been proposed in the last two decades, there lacks a theoretical understanding of the consistency of those methods-none of the PLL methods hitherto possesses a generation process of candidate label sets, and then…

    Submitted 23 October, 2020; v1 submitted 17 July, 2020; originally announced July 2020.

    Comments: NeurIPS 2020 camera-ready version

  7. arXiv:2006.07178  [pdf, other]

    cs.LG stat.ML

    Meta-Reinforcement Learning Robust to Distributional Shift via Model Identification and Experience Relabeling

    Authors: Russell Mendonca, Xinyang Geng, Chelsea Finn, Sergey Levine

    Abstract: Reinforcement learning algorithms can acquire policies for complex tasks autonomously. However, the number of samples required to learn a diverse set of skills can be prohibitively large. While meta-reinforcement learning methods have enabled agents to leverage prior experience to adapt quickly to new tasks, their performance depends crucially on how close the new task is to the previously experie…

    Submitted 15 June, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

  8. arXiv:2004.14164  [pdf, other]

    cs.CL cs.LG stat.ML

    MICK: A Meta-Learning Framework for Few-shot Relation Classification with Small Training Data

    Authors: Xiaoqing Geng, Xiwen Chen, Kenny Q. Zhu, Libin Shen, Yinggong Zhao

    Abstract: Few-shot relation classification seeks to classify incoming query instances after meeting only few support instances. This ability is gained by training with large amount of in-domain annotated data. In this paper, we tackle an even harder problem by further limiting the amount of data available at training time. We propose a few-shot learning framework for relation classification, which is partic…

    Submitted 14 December, 2020; v1 submitted 26 April, 2020; originally announced April 2020.

    Journal ref: CIKM 2020: The 29th ACM International Conference on Information and Knowledge Management

  9. arXiv:2004.08861  [pdf, other]

    cs.LG cs.NE stat.ML

    Role-Wise Data Augmentation for Knowledge Distillation

    Authors: Jie Fu, Xue Geng, Zhijian Duan, Bohan Zhuang, Xingdi Yuan, Adam Trischler, Jie Lin, Chris Pal, Hao Dong

    Abstract: Knowledge Distillation (KD) is a common method for transferring the "knowledge" learned by one machine learning model (the teacher) into another model (the student), where typically, the teacher has a greater capacity (e.g., more parameters or higher bit-widths). To our knowledge, existing methods overlook the fact that although the student absorbs extra knowledge from the teac…

    Submitted 19 April, 2020; originally announced April 2020.

  10. arXiv:2002.11089  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement

    Authors: Benjamin Eysenbach, Xinyang Geng, Sergey Levine, Ruslan Salakhutdinov

    Abstract: Multi-task reinforcement learning (RL) aims to simultaneously learn policies for solving many tasks. Several prior works have found that relabeling past experience with different reward functions can improve sample efficiency. Relabeling methods typically ask: if, in hindsight, we assume that our experience was optimal for some task, for what task was it optimal? In this paper, we show that hindsi…

    Submitted 25 February, 2020; originally announced February 2020.

  11. arXiv:2002.08053  [pdf, other]

    cs.LG stat.ML

    Progressive Identification of True Labels for Partial-Label Learning

    Authors: Jiaqi Lv, Miao Xu, Lei Feng, Gang Niu, Xin Geng, Masashi Sugiyama

    Abstract: Partial-label learning (PLL) is a typical weakly supervised learning problem, where each training instance is equipped with a set of candidate labels among which only one is the true label. Most existing methods elaborately designed learning objectives as constrained optimizations that must be solved in specific manners, making their computational complexity a bottleneck for scaling up to big data…

    Submitted 5 September, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: In Proceedings of the 37th International Conference on Machine Learning (ICML 2020)

  12. arXiv:1911.11508  [pdf]

    physics.ao-ph stat.AP

    Dynamic Complex Network Analysis of PM2.5 Concentrations in the UK using Hierarchical Directed Graphs

    Authors: Parya Broomandi, Xueyu Geng, Weisi Guo, Jong Kim, Alessio Pagani, David Topping

    Abstract: Worldwide exposure to fine atmospheric particles can exacerbate the risk of a wide range of heart and respiratory diseases, due to their ability to penetrate deep into the lungs and blood streams. Epidemiological studies in Europe and elsewhere have established the evidence base pointing to the important role of PM2.5 in causing over 4 million deaths per year. Traditional approaches to model atmos…

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: under review

  13. arXiv:1907.12416  [pdf, ps, other]

    cs.LG stat.ML

    Quadruply Stochastic Gradients for Large Scale Nonlinear Semi-Supervised AUC Optimization

    Authors: Wanli Shi, Bin Gu, Xiang Li, Xiang Geng, Heng Huang

    Abstract: Semi-supervised learning is pervasive in real-world applications, where only a few labeled data are available and large amounts of instances remain unlabeled. Since AUC is an important model evaluation metric in classification, directly optimizing AUC in semi-supervised learning scenario has drawn much attention in the machine learning community. Recently, it has been shown that one could find an…

    Submitted 29 July, 2019; originally announced July 2019.

  14. arXiv:1907.11584  [pdf, ps, other]

    cs.LG stat.ML

    Scalable Semi-Supervised SVM via Triply Stochastic Gradients

    Authors: Xiang Geng, Bin Gu, Xiang Li, Wanli Shi, Guansheng Zheng, Heng Huang

    Abstract: Semi-supervised learning (SSL) plays an increasingly important role in the big data era because a large number of unlabeled samples can be used effectively to improve the performance of the classifier. Semi-supervised support vector machine (S$^3$VM) is one of the most appealing methods for SSL, but scaling up S$^3$VM for kernel learning is still an open problem. Recently, a doubly stochastic grad…

    Submitted 26 July, 2019; originally announced July 2019.

  15. arXiv:1907.08225  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Dynamical Distance Learning for Semi-Supervised and Unsupervised Skill Discovery

    Authors: Kristian Hartikainen, Xinyang Geng, Tuomas Haarnoja, Sergey Levine

    Abstract: Reinforcement learning requires manual specification of a reward function to learn a task. While in principle this reward function only needs to specify the task goal, in practice reinforcement learning can be very time-consuming or even infeasible unless the reward function is shaped so as to provide a smooth gradient towards a successful outcome. This shaping is difficult to specify by hand, par…

    Submitted 14 February, 2020; v1 submitted 18 July, 2019; originally announced July 2019.

    Comments: 11+6 pages, 6+2 figures, last two authors (Tuomas Haarnoja, Sergey Levine) advised equally

  16. arXiv:1905.11395  [pdf, other]

    cs.LG stat.ML

    Multi-Modal Graph Interaction for Multi-Graph Convolution Network in Urban Spatiotemporal Forecasting

    Authors: Xu Geng, Xiyu Wu, Lingyu Zhang, Qiang Yang, Yan Liu, Jieping Ye

    Abstract: Graph convolution network based approaches have been recently used to model region-wise relationships in region-level prediction problems in urban computing. Each relationship represents a kind of spatial dependency, like region-wise distance or functional similarity. To incorporate multiple relationships into spatial feature extraction, we define the problem as a multi-modal machine learning prob…

    Submitted 27 May, 2019; originally announced May 2019.

  17. arXiv:1901.02064  [pdf, other]

    cs.LG stat.ML

    Dataflow-based Joint Quantization of Weights and Activations for Deep Neural Networks

    Authors: Xue Geng, Jie Fu, Bin Zhao, Jie Lin, Mohamed M. Sabry Aly, Christopher Pal, Vijay Chandrasekhar

    Abstract: This paper addresses a challenging problem - how to reduce energy consumption without incurring performance drop when deploying deep neural networks (DNNs) at the inference stage. In order to alleviate the computation and storage burdens, we propose a novel dataflow-based joint quantization approach with the hypothesis that a fewer number of quantization operations would incur less information los…

    Submitted 4 January, 2019; originally announced January 2019.

    Journal ref: Data Compression Conference 2019

  18. arXiv:1710.04824  [pdf, other]

    stat.ME

    The basic equation for target detection in remote sensing

    Authors: Xiurui Geng, Luyan Ji, Yongchao Zhao

    Abstract: Our research has revealed a hidden relationship among several basic components, which leads to the best target detection result. Further, we have proved that the matched filter (MF) is always superior to the constrained energy minimization (CEM) operator, both of which were originally of parallel importance in the field of target detection for remotely sensed image.

    Submitted 13 October, 2017; originally announced October 2017.

    Comments: 8 pages, 8 figures

  19. arXiv:1705.08409  [pdf, other]

    cs.LG stat.ML

    Ridesourcing Car Detection by Transfer Learning

    Authors: Leye Wang, Xu Geng, Jintao Ke, Chen Peng, Xiaojuan Ma, Daqing Zhang, Qiang Yang

    Abstract: Ridesourcing platforms like Uber and Didi are getting more and more popular around the world. However, unauthorized ridesourcing activities taking advantages of the sharing economy can greatly impair the healthy development of this emerging industry. As the first step to regulate on-demand ride services and eliminate black market, we design a method to detect ridesourcing cars from a pool of cars…

    Submitted 23 May, 2017; originally announced May 2017.

  20. arXiv:1612.07801  [pdf]

    stat.AP cs.CV

    Probabilistic graphical model based approach for water mapping using GaoFen-2 (GF-2) high resolution imagery and Landsat 8 time series

    Authors: Luyan Ji, Jie Wang, Xiurui Geng, Peng Gong

    Abstract: The objective of this paper is to evaluate the potential of Gaofen-2 (GF-2) high resolution multispectral sensor (MS) and panchromatic (PAN) imagery on water mapping. Difficulties of water mapping on high resolution data includes: 1) misclassification between water and shadows or other low-reflectance ground objects, which is mostly caused by the spectral similarity within the given band range; 2)…

    Submitted 21 December, 2016; originally announced December 2016.

    Comments: 17 pages, 9 figures, 6 tables

  21. arXiv:1612.00549  [pdf, other]

    stat.ME

    MF is always superior to CEM

    Authors: Xiurui Geng, Luyan Ji, Weitun Yang, Fuxiang Wang, Yongchao Zhao

    Abstract: The constrained energy minimization (CEM) and matched filter (MF) are two most frequently used target detection algorithms in the remotely sensed community. In this paper, we first introduce an augmented CEM (ACEM) by adding an all-one band. According to a recently published conclusion that CEM can always achieve a better performance by adding any linearly independent bands, ACEM is better than CE…

    Submitted 1 December, 2016; originally announced December 2016.

    Comments: 4 pages

  22. arXiv:1610.05956  [pdf]

    stat.ML

    Clustering by connection center evolution

    Authors: Xiurui Geng, Hairong Tang

    Abstract: The determination of cluster centers generally depends on the scale that we use to analyze the data to be clustered. Inappropriate scale usually leads to unreasonable cluster centers and thus unreasonable results. In this study, we first consider the similarity of elements in the data as the connectivity of nodes in an undirected graph, then present the concept of a connection center and regard it…

    Submitted 19 October, 2016; originally announced October 2016.