Skip to main content

Showing 1–29 of 29 results for author: Jin, X

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.03042  [pdf, ps, other

    stat.ME stat.AP

    Incorporating Correlated Nugget Effects in Multivariate Spatial Models: An Application to Argo Ocean Data

    Authors: Damilya Saduakhas, David Bolin, Xiaotian Jin, Alexandre B. Simas, Jonas Wallin

    Abstract: Accurate analysis of global oceanographic data, such as temperature and salinity profiles from the Argo program, requires geostatistical models capable of capturing complex spatial dependencies. This study introduces Gaussian and non-Gaussian hierarchical multivariate Matérn-SPDE models with correlated nugget effects to account for small-scale variability and measurement error correlations. Using… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  2. arXiv:2503.02266  [pdf, other

    stat.ME stat.CO

    Generalized Tree-Informed Mixed Model Regression

    Authors: Jeremiah Allis, Xin Jin, Riddhi Ghosh

    Abstract: The standard regression tree method applied to observations within clusters poses both methodological and implementation challenges. Effectively leveraging these data requires methods that account for both individual-level and sample-level effects. We propose Generalized Tree-Informed Mixed Model (GTIMM), which replaces the linear fixed effect in a generalized linear mixed model (GLMM) with the ou… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

    Comments: 19 pages, 6 figures, 4 tables

  3. arXiv:2410.10241  [pdf, other

    cs.LG cs.AI stat.ML

    Revisiting and Benchmarking Graph Autoencoders: A Contrastive Learning Perspective

    Authors: Jintang Li, Ruofan Wu, Yuchang Zhu, Huizhe Zhang, Xinzhou Jin, Guibin Zhang, Zulun Zhu, Zibin Zheng, Liang Chen

    Abstract: Graph autoencoders (GAEs) are self-supervised learning models that can learn meaningful representations of graph-structured data by reconstructing the input graph from a low-dimensional latent space. Over the past few years, GAEs have gained significant attention in academia and industry. In particular, the recent advent of GAEs with masked autoencoding schemes marks a significant advancement in g… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: Preprint, under review

  4. arXiv:2406.14026  [pdf, other

    cs.LG cs.CL stat.ML

    Demystifying Language Model Forgetting with Low-rank Example Associations

    Authors: Xisen Jin, Xiang Ren

    Abstract: Large Language models (LLMs) suffer from forgetting of upstream knowledge when fine-tuned. Despite efforts on mitigating forgetting, few have investigated how forgotten upstream examples are dependent on newly learned tasks. Insights on such dependencies enable efficient and targeted mitigation of forgetting. In this paper, we empirically analyze forgetting that occurs in $N$ upstream examples of… ▽ More

    Submitted 18 May, 2025; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: 9 pages; preprint

  5. arXiv:2402.11133  [pdf, other

    stat.ME

    Two-Sample Hypothesis Testing for Large Random Graphs of Unequal Size

    Authors: Xin Jin, Kit Chan, Ian Barnett, Riddhi Pratim Ghosh

    Abstract: Two-sample hypothesis testing for large graphs is popular in cognitive science, probabilistic machine learning and artificial intelligence. While numerous methods have been proposed in the literature to address this problem, less attention has been devoted to scenarios involving graphs of unequal size or situations where there are only one or a few samples of graphs. In this article, we propose a… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  6. arXiv:2402.01865  [pdf, other

    cs.LG cs.CL stat.ML

    What Will My Model Forget? Forecasting Forgotten Examples in Language Model Refinement

    Authors: Xisen Jin, Xiang Ren

    Abstract: Language models deployed in the wild make errors. However, simply updating the model with the corrected error instances causes catastrophic forgetting -- the updated model makes errors on instances learned during the instruction tuning or upstream training phase. Randomly replaying upstream data yields unsatisfactory performance and often comes with high variance and poor controllability. To this… ▽ More

    Submitted 9 December, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: ICML 2024 (Spotlight)

  7. The Building Data Genome Directory -- An open, comprehensive data sharing platform for building performance research

    Authors: Xiaoyu Jin, Chun Fu, Hussain Kazmi, Atilla Balint, Ada Canaydin, Matias Quintana, Filip Biljecki, Fu Xiao, Clayton Miller

    Abstract: The building sector plays a crucial role in the worldwide decarbonization effort, accounting for significant portions of energy consumption and environmental effects. However, the scarcity of open data sources is a continuous challenge for built environment researchers and practitioners. Although several efforts have been made to consolidate existing open datasets, no database currently offers a c… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Journal ref: J Phys Conf Ser. 2023;2600: 032003

  8. arXiv:2105.03418  [pdf, other

    hep-lat cond-mat.stat-mech cs.LG stat.ML

    Deep Learning Hamiltonian Monte Carlo

    Authors: Sam Foreman, Xiao-Yong Jin, James C. Osborn

    Abstract: We generalize the Hamiltonian Monte Carlo algorithm with a stack of neural network layers and evaluate its ability to sample from different topologies in a two dimensional lattice gauge theory. We demonstrate that our model is able to successfully mix between modes of different topologies, significantly reducing the computational cost required to generated independent gauge field configurations. O… ▽ More

    Submitted 7 May, 2021; originally announced May 2021.

    Comments: 8 pages, 7 figures, Published as a workshop paper at ICLR 2021 SimDL Workshop

  9. arXiv:2102.06828  [pdf, other

    cs.LG stat.ML

    Domain Adaptation for Time Series Forecasting via Attention Sharing

    Authors: Xiaoyong Jin, Youngsuk Park, Danielle C. Maddix, Hao Wang, Yuyang Wang

    Abstract: Recently, deep neural networks have gained increasing popularity in the field of time series forecasting. A primary reason for their success is their ability to effectively capture complex temporal dynamics across multiple related time series. The advantages of these deep forecasters only start to emerge in the presence of a sufficient amount of data. This poses a challenge for typical forecasting… ▽ More

    Submitted 21 June, 2022; v1 submitted 12 February, 2021; originally announced February 2021.

    Comments: ICML 2022

  10. arXiv:2010.13006  [pdf, other

    cs.LG stat.ML

    Inter-Series Attention Model for COVID-19 Forecasting

    Authors: Xiaoyong Jin, Yu-Xiang Wang, Xifeng Yan

    Abstract: COVID-19 pandemic has an unprecedented impact all over the world since early 2020. During this public health crisis, reliable forecasting of the disease becomes critical for resource allocation and administrative planning. The results from compartmental models such as SIR and SEIR are popularly referred by CDC and news media. With more and more COVID-19 data becoming available, we examine the foll… ▽ More

    Submitted 5 April, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

    Comments: Accepted by SDM 2021

  11. arXiv:2010.12864  [pdf, other

    cs.CL stat.ML

    On Transferability of Bias Mitigation Effects in Language Model Fine-Tuning

    Authors: Xisen Jin, Francesco Barbieri, Brendan Kennedy, Aida Mostafazadeh Davani, Leonardo Neves, Xiang Ren

    Abstract: Fine-tuned language models have been shown to exhibit biases against protected groups in a host of modeling tasks such as text classification and coreference resolution. Previous works focus on detecting these biases, reducing bias in data representations, and using auxiliary training objectives to mitigate bias during fine-tuning. Although these techniques achieve bias reduction for the task and… ▽ More

    Submitted 11 April, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

    Comments: 14 pages; Accepted at NAACL 2021

  12. arXiv:2009.13003  [pdf, other

    cs.LG stat.ML

    On Efficient Constructions of Checkpoints

    Authors: Yu Chen, Zhenming Liu, Bin Ren, Xin Jin

    Abstract: Efficient construction of checkpoints/snapshots is a critical tool for training and diagnosing deep learning models. In this paper, we propose a lossy compression scheme for checkpoint constructions (called LC-Checkpoint). LC-Checkpoint simultaneously maximizes the compression rate and optimizes the recovery speed, under the assumption that SGD is used to train the model. LC-Checkpointuses quantiz… ▽ More

    Submitted 27 September, 2020; originally announced September 2020.

    Journal ref: International Conference on Machine Learning, 2020

  13. arXiv:2007.06081  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    VAFL: a Method of Vertical Asynchronous Federated Learning

    Authors: Tianyi Chen, Xiao Jin, Yuejiao Sun, Wotao Yin

    Abstract: Horizontal Federated learning (FL) handles multi-client data that share the same set of features, and vertical FL trains a better predictor that combine all the features from different clients. This paper targets solving vertical FL in an asynchronous fashion, and develops a simple FL method. The new method allows each client to run stochastic gradient algorithms without coordination with other cl… ▽ More

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: FL-ICML'20: Proc. of ICML Workshop on Federated Learning for User Privacy and Data Confidentiality, July 2020

    Journal ref: Proc. of ICML Workshop on Federated Learning for User Privacy and Data Confidentiality, July 2020

  14. arXiv:2006.15294  [pdf, other

    stat.ML cs.LG

    Gradient-based Editing of Memory Examples for Online Task-free Continual Learning

    Authors: Xisen Jin, Arka Sadhu, Junyi Du, Xiang Ren

    Abstract: We explore task-free continual learning (CL), in which a model is trained to avoid catastrophic forgetting in the absence of explicit task boundaries or identities. Among many efforts on task-free CL, a notable family of approaches are memory-based that store and replay a subset of training examples. However, the utility of stored seen examples may diminish over time since CL models are continuall… ▽ More

    Submitted 7 December, 2021; v1 submitted 27 June, 2020; originally announced June 2020.

    Comments: 10 pages. Accepted at NeurIPS 2021

  15. arXiv:1911.06194  [pdf, other

    cs.CL cs.LG stat.ML

    Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models

    Authors: Xisen Jin, Zhongyu Wei, Junyi Du, Xiangyang Xue, Xiang Ren

    Abstract: The impressive performance of neural networks on natural language processing tasks attributes to their ability to model complicated word and phrase compositions. To explain how the model handles semantic compositions, we study hierarchical explanation of neural network predictions. We identify non-additivity and context independent importance attributions within hierarchies as two desirable proper… ▽ More

    Submitted 15 June, 2020; v1 submitted 7 November, 2019; originally announced November 2019.

    Comments: ICLR 2020

  16. arXiv:1911.05942  [pdf, other

    cs.CV cs.LG stat.ML

    Progressive Feature Polishing Network for Salient Object Detection

    Authors: Bo Wang, Quan Chen, Min Zhou, Zhiqiang Zhang, Xiaogang Jin, Kun Gai

    Abstract: Feature matters for salient object detection. Existing methods mainly focus on designing a sophisticated structure to incorporate multi-level features and filter out cluttered features. We present Progressive Feature Polishing Network (PFPN), a simple yet effective framework to progressively polish the multi-level features to be more accurate and representative. By employing multiple Feature Polis… ▽ More

    Submitted 14 November, 2019; originally announced November 2019.

    Comments: Accepted by AAAI 2020

  17. arXiv:1910.09620  [pdf, other

    cs.LG stat.ML

    You May Not Need Order in Time Series Forecasting

    Authors: Yunkai Zhang, Qiao Jiang, Shurui Li, Xiaoyong Jin, Xueying Ma, Xifeng Yan

    Abstract: Time series forecasting with limited data is a challenging yet critical task. While transformers have achieved outstanding performances in time series forecasting, they often require many training samples due to the large number of trainable parameters. In this paper, we propose a training technique for transformers that prepares the training windows through random sampling. As input time steps ne… ▽ More

    Submitted 21 October, 2019; originally announced October 2019.

  18. arXiv:1907.00235  [pdf, other

    cs.LG stat.ML

    Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting

    Authors: Shiyang Li, Xiaoyong Jin, Yao Xuan, Xiyou Zhou, Wenhu Chen, Yu-Xiang Wang, Xifeng Yan

    Abstract: Time series forecasting is an important problem across many domains, including predictions of solar plant energy output, electricity consumption, and traffic jam situation. In this paper, we propose to tackle such forecasting problem with Transformer [1]. Although impressed by its performance in our preliminary study, we found its two major weaknesses: (1) locality-agnostics: the point-wise dot-pr… ▽ More

    Submitted 3 January, 2020; v1 submitted 29 June, 2019; originally announced July 2019.

    Comments: To appear in the proceeding of NeurIPS 2019

  19. arXiv:1905.10756  [pdf, other

    cs.LG cs.CV stat.ML

    Selective Transfer with Reinforced Transfer Network for Partial Domain Adaptation

    Authors: Zhihong Chen, Chao Chen, Zhaowei Cheng, Boyuan Jiang, Ke Fang, Xinyu Jin

    Abstract: One crucial aspect of partial domain adaptation (PDA) is how to select the relevant source samples in the shared classes for knowledge transfer. Previous PDA methods tackle this problem by re-weighting the source samples based on their high-level information (deep features). However, since the domain shift between source and target domains, only using the deep features for sample selection is defe… ▽ More

    Submitted 27 February, 2020; v1 submitted 26 May, 2019; originally announced May 2019.

  20. arXiv:1904.05530  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Recurrent Event Network: Autoregressive Structure Inference over Temporal Knowledge Graphs

    Authors: Woojeong Jin, Meng Qu, Xisen Jin, Xiang Ren

    Abstract: Knowledge graph reasoning is a critical task in natural language processing. The task becomes more challenging on temporal knowledge graphs, where each fact is associated with a timestamp. Most existing methods focus on reasoning at past timestamps and they are not able to predict facts happening in the future. This paper proposes Recurrent Event Network (RE-NET), a novel autoregressive architectu… ▽ More

    Submitted 6 October, 2020; v1 submitted 11 April, 2019; originally announced April 2019.

    Comments: 15 pages, 8 figures, accepted at as full paper in EMNLP 2020

  21. arXiv:1902.08336  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    On the Sensitivity of Adversarial Robustness to Input Data Distributions

    Authors: Gavin Weiguang Ding, Kry Yik Chau Lui, Xiaomeng Jin, Luyu Wang, Ruitong Huang

    Abstract: Neural networks are vulnerable to small adversarial perturbations. Existing literature largely focused on understanding and mitigating the vulnerability of learned models. In this paper, we demonstrate an intriguing phenomenon about the most popular robust training method in the literature, adversarial training: Adversarial robustness, unlike clean accuracy, is sensitive to the input data distribu… ▽ More

    Submitted 21 February, 2019; originally announced February 2019.

    Comments: ICLR 2019, Seventh International Conference on Learning Representations

  22. arXiv:1902.07623  [pdf, ps, other

    cs.LG cs.CR cs.CV stat.ML

    advertorch v0.1: An Adversarial Robustness Toolbox based on PyTorch

    Authors: Gavin Weiguang Ding, Luyu Wang, Xiaomeng Jin

    Abstract: advertorch is a toolbox for adversarial robustness research. It contains various implementations for attacks, defenses and robust training methods. advertorch is built on PyTorch (Paszke et al., 2017), and leverages the advantages of the dynamic computational graph to provide concise and efficient reference implementations. The code is licensed under the LGPL license and is open sourced at https:/… ▽ More

    Submitted 20 February, 2019; originally announced February 2019.

  23. arXiv:1901.01007  [pdf, other

    cs.LG cs.AR cs.DC stat.ML

    FPDeep: Scalable Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters

    Authors: Tong Geng, Tianqi Wang, Ang Li, Xi Jin, Martin Herbordt

    Abstract: Deep Neural Networks (DNNs) have revolutionized numerous applications, but the demand for ever more performance remains unabated. Scaling DNN computations to larger clusters is generally done by distributing tasks in batch mode using methods such as distributed synchronous SGD. Among the issues with this approach is that to make the distributed cluster work with high utilization, the workload dist… ▽ More

    Submitted 21 June, 2020; v1 submitted 4 January, 2019; originally announced January 2019.

    Comments: Accepted by IEEE TRANSACTIONS ON COMPUTERS (TC)

  24. Parameter Transfer Extreme Learning Machine based on Projective Model

    Authors: Chao Chen, Boyuan Jiang, Xinyu Jin

    Abstract: Recent years, transfer learning has attracted much attention in the community of machine learning. In this paper, we mainly focus on the tasks of parameter transfer under the framework of extreme learning machine (ELM). Unlike the existing parameter transfer approaches, which incorporate the source model information into the target by regularizing the di erence between the source and target domain… ▽ More

    Submitted 14 September, 2018; v1 submitted 4 September, 2018; originally announced September 2018.

    Comments: This paper was accepted as an oral paper by IJCNN 2018

    Journal ref: 2018 International Joint Conference on Neural Networks (IJCNN)

  25. arXiv:1808.09347  [pdf, other

    cs.LG cs.CV stat.ML

    Joint Domain Alignment and Discriminative Feature Learning for Unsupervised Deep Domain Adaptation

    Authors: Chao Chen, Zhihong Chen, Boyuan Jiang, Xinyu Jin

    Abstract: Recently, considerable effort has been devoted to deep domain adaptation in computer vision and machine learning communities. However, most of existing work only concentrates on learning shared feature representation by minimizing the distribution discrepancy across different domains. Due to the fact that all the domain alignment approaches can only reduce, but not remove the domain shift. Target… ▽ More

    Submitted 3 November, 2018; v1 submitted 28 August, 2018; originally announced August 2018.

    Comments: This paper has been accepted by AAAI-2019

  26. arXiv:1807.09571  [pdf, ps, other

    stat.ML cs.IT cs.LG

    Deep Learning Detection Networks in MIMO Decode-Forward Relay Channels

    Authors: Xianglan Jin, Hyoung-Nam Kim

    Abstract: In this paper, we consider signal detection algorithms in a multiple-input multiple-output (MIMO) decode-forward (DF) relay channel with one source, one relay, and one destination. The existing suboptimal near maximum likelihood (NML) detector and the NML with two-level pair-wise error probability (NMLw2PEP) detector achieve excellent performance with instantaneous channel state information (CSI)… ▽ More

    Submitted 11 July, 2018; originally announced July 2018.

    Comments: 12 pages, 9 figures

  27. arXiv:1805.07297  [pdf

    cs.LG math.NA stat.ML

    General solutions for nonlinear differential equations: a rule-based self-learning approach using deep reinforcement learning

    Authors: Shiyin Wei, Xiaowei Jin, Hui Li

    Abstract: A universal rule-based self-learning approach using deep reinforcement learning (DRL) is proposed for the first time to solve nonlinear ordinary differential equations and partial differential equations. The solver consists of a deep neural network-structured actor that outputs candidate solutions, and a critic derived only from physical rules (governing equations and boundary and initial conditio… ▽ More

    Submitted 29 May, 2019; v1 submitted 13 May, 2018; originally announced May 2018.

  28. arXiv:1710.09979  [pdf, other

    cs.LG cs.CV stat.ML

    Stochastic Conjugate Gradient Algorithm with Variance Reduction

    Authors: Xiao-Bo Jin, Xu-Yao Zhang, Kaizhu Huang, Guang-Gang Geng

    Abstract: Conjugate gradient (CG) methods are a class of important methods for solving linear equations and nonlinear optimization problems. In this paper, we propose a new stochastic CG algorithm with variance reduction and we prove its linear convergence with the Fletcher and Reeves method for strongly convex and smooth functions. We experimentally demonstrate that the CG with variance reduction algorithm… ▽ More

    Submitted 16 October, 2018; v1 submitted 26 October, 2017; originally announced October 2017.

    Comments: 10 pages, 4 figures, appeared in IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, CGVR algorithm is available on github: https://github.com/xbjin/cgvr

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems,2018

  29. arXiv:1303.2417  [pdf, ps, other

    cs.LG stat.ML

    Linear NDCG and Pair-wise Loss

    Authors: Xiao-Bo Jin, Guang-Gang Geng

    Abstract: Linear NDCG is used for measuring the performance of the Web content quality assessment in ECML/PKDD Discovery Challenge 2010. In this paper, we will prove that the DCG error equals a new pair-wise loss.

    Submitted 10 March, 2013; originally announced March 2013.

    Comments: 5 pages, 3 figures