
Showing 1–44 of 44 results for author: Zhu, D

Searching in archive stat.
  1. arXiv:2506.12542  [pdf, ps, other]

    cs.LG cs.AI cs.CV stat.ML

    PLD: A Choice-Theoretic List-Wise Knowledge Distillation

    Authors: Ejafa Bassam, Dawei Zhu, Kaigui Bian

    Abstract: Knowledge distillation is a model compression technique in which a compact "student" network is trained to replicate the predictive behavior of a larger "teacher" network. In logit-based knowledge distillation it has become the de facto approach to augment cross-entropy with a distillation term. Typically this term is either a KL divergence-matching marginal probabilities or a correlation-based lo…

    Submitted 17 June, 2025; v1 submitted 14 June, 2025; originally announced June 2025.

  2. arXiv:2504.14772  [pdf, other]

    cs.CL cs.LG stat.ML

    Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions

    Authors: Luyang Fang, Xiaowei Yu, Jiazhang Cai, Yongkai Chen, Shushan Wu, Zhengliang Liu, Zhenyuan Yang, Haoran Lu, Xilin Gong, Yufang Liu, Terry Ma, Wei Ruan, Ali Abbasi, Jing Zhang, Tao Wang, Ehsan Latif, Wei Liu, Wei Zhang, Soheil Kolouri, Xiaoming Zhai, Dajiang Zhu, Wenxuan Zhong, Tianming Liu, Ping Ma

    Abstract: The exponential growth of Large Language Models (LLMs) continues to highlight the need for efficient strategies to meet ever-expanding computational and data demands. This survey provides a comprehensive analysis of two complementary paradigms: Knowledge Distillation (KD) and Dataset Distillation (DD), both aimed at compressing LLMs while preserving their advanced reasoning capabilities and lingui…

    Submitted 20 April, 2025; originally announced April 2025.

  3. arXiv:2406.06829  [pdf, other]

    cs.LG stat.ML

    Personalized Binomial DAGs Learning with Network Structured Covariates

    Authors: Boxin Zhao, Weishi Wang, Dingyuan Zhu, Ziqi Liu, Dong Wang, Zhiqiang Zhang, Jun Zhou, Mladen Kolar

    Abstract: The causal dependence in data is often characterized by Directed Acyclic Graphical (DAG) models, widely used in many areas. Causal discovery aims to recover the DAG structure using observational data. This paper focuses on causal discovery with multi-variate count data. We are motivated by real-world web visit data, recording individual user visits to multiple websites. Building a causal diagram c…

    Submitted 10 June, 2024; originally announced June 2024.

  4. arXiv:2403.12456  [pdf, ps, other]

    econ.EM stat.ME

    Inflation Target at Risk: A Time-varying Parameter Distributional Regression

    Authors: Yunyun Wang, Tatsushi Oka, Dan Zhu

    Abstract: Macro variables frequently display time-varying distributions, driven by the dynamic and evolving characteristics of economic, social, and environmental factors that consistently reshape the fundamental patterns and relationships governing these variables. To better understand the distributional dynamics beyond the central tendency, this paper introduces a novel semi-parametric approach for constr…

    Submitted 19 March, 2024; originally announced March 2024.

  5. arXiv:2401.09874  [pdf, ps, other]

    stat.AP econ.EM

    A Quantile Nelson-Siegel model

    Authors: Matteo Iacopini, Aubrey Poon, Luca Rossini, Dan Zhu

    Abstract: We propose a novel framework for modeling the yield curve from a quantile perspective. Building on the dynamic Nelson-Siegel model of Diebold et al. (2006), we extend its traditional mean-based approach to a quantile regression setting, enabling the estimation of yield curve factors - level, slope, and curvature - at specific quantiles of the conditional distribution. A key advantage of our framew…

    Submitted 8 July, 2025; v1 submitted 18 January, 2024; originally announced January 2024.

  6. arXiv:2311.11913  [pdf, other]

    cs.LG q-fin.CP stat.ML

    Deep Calibration of Market Simulations using Neural Density Estimators and Embedding Networks

    Authors: Namid R. Stillman, Rory Baggott, Justin Lyon, Jianfei Zhang, Dingqiu Zhu, Tao Chen, Perukrishnen Vytelingum

    Abstract: The ability to construct a realistic simulator of financial exchanges, including reproducing the dynamics of the limit order book, can give insight into many counterfactual scenarios, such as a flash crash, a margin call, or changes in macroeconomic outlook. In recent years, agent-based models have been developed that reproduce many features of an exchange, as summarised by a set of stylised facts…

    Submitted 27 November, 2023; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: 4th ACM International Conference on AI in Finance (ICAIF 2023)

  7. Discordance Minimization-based Imputation Algorithms for Missing Values in Rating Data

    Authors: Young Woong Park, Jinhak Kim, Dan Zhu

    Abstract: Ratings are frequently used to evaluate and compare subjects in various applications, from education to healthcare, because ratings provide succinct yet credible measures for comparing subjects. However, when multiple rating lists are combined or considered together, subjects often have missing ratings, because most rating lists do not rate every subject in the combined list. In this study, we pro…

    Submitted 7 November, 2023; originally announced November 2023.

  8. arXiv:2310.03234  [pdf, other]

    math.OC cs.AI cs.LG stat.ML

    Non-Smooth Weakly-Convex Finite-sum Coupled Compositional Optimization

    Authors: Quanqi Hu, Dixian Zhu, Tianbao Yang

    Abstract: This paper investigates new families of compositional optimization problems, called non-smooth weakly-convex finite-sum coupled compositional optimization (NSWC FCCO). There has been a growing interest in FCCO due to its wide-ranging applications in machin…

    Submitted 24 September, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

  9. arXiv:2307.01389  [pdf, other]

    cs.LG stat.ME

    Identification of Causal Relationship between Amyloid-beta Accumulation and Alzheimer's Disease Progression via Counterfactual Inference

    Authors: Haixing Dai, Mengxuan Hu, Qing Li, Lu Zhang, Lin Zhao, Dajiang Zhu, Ibai Diez, Jorge Sepulcre, Fan Zhang, Xingyu Gao, Manhua Liu, Quanzheng Li, Sheng Li, Tianming Liu, Xiang Li

    Abstract: Alzheimer's disease (AD) is a neurodegenerative disorder that begins with amyloidosis, followed by neuronal loss and deterioration in structure, function, and cognition. The accumulation of amyloid-beta in the brain, measured through 18F-florbetapir (AV45) positron emission tomography (PET) imaging, has been widely used for early diagnosis of AD. However, the relationship between amyloid-bet…

    Submitted 3 July, 2023; originally announced July 2023.

  10. arXiv:2306.03065  [pdf, other]

    cs.LG cs.AI math.OC stat.ML

    LibAUC: A Deep Learning Library for X-Risk Optimization

    Authors: Zhuoning Yuan, Dixian Zhu, Zi-Hao Qiu, Gang Li, Xuanhui Wang, Tianbao Yang

    Abstract: This paper introduces the award-winning deep learning (DL) library called LibAUC for implementing state-of-the-art algorithms towards optimizing a family of risk functions named X-risks. X-risks refer to a family of compositional functions in which the loss function of each data point is defined in a way that contrasts the data point with a large number of others. They have broad applications in A…

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted by KDD2023

  11. arXiv:2303.04994  [pdf, ps, other]

    econ.EM stat.AP stat.ME

    Distributional Vector Autoregression: Eliciting Macro and Financial Dependence

    Authors: Yunyun Wang, Tatsushi Oka, Dan Zhu

    Abstract: Vector autoregression is an essential tool in empirical macroeconomics and finance for understanding the dynamic interdependencies among multivariate time series. In this study, we expand the scope of vector autoregression by incorporating a multivariate distributional regression framework and introducing a distributional impulse response function, providing a comprehensive view of dynamic heterog…

    Submitted 8 March, 2023; originally announced March 2023.

  12. arXiv:2302.13929  [pdf, other]

    cs.LG stat.ML

    Efficient Informed Proposals for Discrete Distributions via Newton's Series Approximation

    Authors: Yue Xiang, Dongyao Zhu, Bowen Lei, Dongkuan Xu, Ruqi Zhang

    Abstract: Gradients have been exploited in proposal distributions to accelerate the convergence of Markov chain Monte Carlo algorithms on discrete distributions. However, these methods require a natural differentiable extension of the target discrete distribution, which often does not exist or does not provide effective gradient guidance. In this paper, we develop a gradient-like proposal for any discrete d…

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: Published at AISTATS 2023

  13. arXiv:2302.03172  [pdf, ps, other]

    econ.EM stat.CO stat.ME

    High-Dimensional Conditionally Gaussian State Space Models with Missing Data

    Authors: Joshua C. C. Chan, Aubrey Poon, Dan Zhu

    Abstract: We develop an efficient sampling approach for handling complex missing data patterns and a large number of missing observations in conditionally Gaussian state space models. Two important examples are dynamic factor models with unbalanced datasets and large Bayesian VARs with variables in multiple frequencies. A key insight underlying the proposed approach is that the joint distribution of the mis…

    Submitted 6 February, 2023; originally announced February 2023.

  14. arXiv:2209.01910  [pdf, ps, other]

    econ.EM stat.ME

    Bayesian Mixed-Frequency Quantile Vector Autoregression: Eliciting tail risks of Monthly US GDP

    Authors: Matteo Iacopini, Aubrey Poon, Luca Rossini, Dan Zhu

    Abstract: Timely characterizations of risks in economic and financial systems play an essential role in both economic policy and private sector decisions. However, the informational content of low-frequency variables and the results from conditional mean models provide only limited evidence to investigate this problem. We propose a novel mixed-frequency quantile vector autoregression (MF-QVAR) model to addr…

    Submitted 5 September, 2022; originally announced September 2022.

  15. arXiv:2203.12228  [pdf, ps, other]

    stat.ME econ.EM q-fin.RM stat.AP

    Bivariate Distribution Regression with Application to Insurance Data

    Authors: Yunyun Wang, Tatsushi Oka, Dan Zhu

    Abstract: Understanding variable dependence, particularly eliciting their statistical properties given a set of covariates, provides the mathematical foundation in practical operations management such as risk analysis and decision-making given observed circumstances. This article presents an estimation method for modeling the conditional joint distribution of bivariate outcomes based on the distribution reg…

    Submitted 3 September, 2023; v1 submitted 23 March, 2022; originally announced March 2022.

  16. arXiv:2203.00176  [pdf, other]

    cs.LG math.OC stat.ML

    When AUC meets DRO: Optimizing Partial AUC for Deep Learning with Non-Convex Convergence Guarantee

    Authors: Dixian Zhu, Gang Li, Bokun Wang, Xiaodong Wu, Tianbao Yang

    Abstract: In this paper, we propose systematic and efficient gradient-based methods for both one-way and two-way partial AUC (pAUC) maximization that are applicable to deep learning. We propose new formulations of pAUC surrogate objectives by using the distributionally robust optimization (DRO) to define the loss for each individual positive data. We consider two formulations of DRO, one of which is based o…

    Submitted 17 September, 2023; v1 submitted 28 February, 2022; originally announced March 2022.

    Comments: 25 pages

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, 2022

  17. arXiv:2112.11315  [pdf, ps, other]

    econ.EM stat.CO

    Efficient Estimation of State-Space Mixed-Frequency VARs: A Precision-Based Approach

    Authors: Joshua C. C. Chan, Aubrey Poon, Dan Zhu

    Abstract: State-space mixed-frequency vector autoregressions are now widely used for nowcasting. Despite their popularity, estimating such models can be computationally intensive, especially for large systems with stochastic volatility. To tackle the computational challenges, we propose two novel precision-based samplers to draw the missing observations of the low-frequency variables in these models, buildi…

    Submitted 21 December, 2021; originally announced December 2021.

  18. arXiv:2111.07052   

    physics.ao-ph stat.AP

    Distribution and Determinants of Correlation between PM2.5 and O3 in China Mainland: Dynamitic simil-Hu Lines

    Authors: Chenru Chen, Miaoqing Xu, Shuyi Liu, Dehai Zhu, Jianyu Yang, Bingbo Gao, Ziyue Chen

    Abstract: In recent years, China has made great efforts to control air pollution. During the governance process, it was found that fine particulate matter (PM2.5) and ozone (O3) follow the same trend in some areas and opposite trends in others, which makes it difficult to plan control measures. Therefore, this study adopted multi-year and large-scale air quality data to explore the distribu…

    Submitted 30 September, 2022; v1 submitted 13 November, 2021; originally announced November 2021.

    Comments: Our research group have decided to withdraw this preprint

  19. arXiv:2101.09763  [pdf, other]

    cs.LG cs.CL stat.ML

    Analysing the Noise Model Error for Realistic Noisy Label Data

    Authors: Michael A. Hedderich, Dawei Zhu, Dietrich Klakow

    Abstract: Distant and weak supervision make it possible to obtain large amounts of labeled training data quickly and cheaply, but these automatic annotations tend to contain a large number of errors. A popular technique to overcome the negative effects of these noisy labels is noise modelling where the underlying noise process is modelled. In this work, we study the quality of these estimated noise models from the theo…

    Submitted 1 March, 2021; v1 submitted 24 January, 2021; originally announced January 2021.

    Comments: Accepted at AAAI 2021, additional material at https://github.com/uds-lsv/noise-estimation

  20. arXiv:2008.06051  [pdf, ps, other]

    q-bio.PE econ.GN physics.soc-ph stat.AP

    A Spatial Stochastic SIR Model for Transmission Networks with Application to COVID-19 Epidemic in China

    Authors: Tatsushi Oka, Wei Wei, Dan Zhu

    Abstract: Governments around the world have implemented preventive measures against the spread of the coronavirus disease (COVID-19). In this study, we consider a multivariate discrete-time Markov model to analyze the propagation of COVID-19 across 33 provincial regions in China. This approach enables us to evaluate the effect of mobility restriction policies on the spread of the disease. We use data on dai…

    Submitted 16 August, 2020; v1 submitted 13 August, 2020; originally announced August 2020.

    Comments: Typos were fixed

  21. Explainable Recommendation via Interpretable Feature Mapping and Evaluation of Explainability

    Authors: Deng Pan, Xiangrui Li, Xin Li, Dongxiao Zhu

    Abstract: Latent factor collaborative filtering (CF) has been a widely used technique for recommender systems by learning the semantic representations of users and items. Recently, explainable recommendation has attracted much attention from the research community. However, a trade-off exists between explainability and performance of the recommendation where metadata is often needed to alleviate the dilemma. We pr…

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence (IJCAI)

    Journal ref: IJCAI 2020, pages 2690-2696

  22. arXiv:2003.02309  [pdf, other]

    cs.LG stat.ML

    On the Learning Property of Logistic and Softmax Losses for Deep Neural Networks

    Authors: Xiangrui Li, Xin Li, Deng Pan, Dongxiao Zhu

    Abstract: Deep convolutional neural networks (CNNs) trained with logistic and softmax losses have made significant advancement in visual recognition tasks in computer vision. When training data exhibit class imbalances, the class-wise reweighted versions of logistic and softmax losses are often used to boost performance of the unweighted versions. In this paper, motivated to explain the reweighting mechanism,…

    Submitted 4 March, 2020; originally announced March 2020.

    Comments: AAAI2020. Previously this appeared as arXiv:1906.04026v2, which was submitted as a replacement by accident

  23. arXiv:2002.09917  [pdf, other]

    cs.LG stat.ML

    Improve SGD Training via Aligning Mini-batches

    Authors: Xiangrui Li, Deng Pan, Xin Li, Dongxiao Zhu

    Abstract: Deep neural networks (DNNs) for supervised learning can be viewed as a pipeline of a feature extractor (i.e. last hidden layer) and a linear classifier (i.e. output layer) that is trained jointly with stochastic gradient descent (SGD). In each iteration of SGD, a mini-batch from the training data is sampled and the true gradient of the loss function is estimated as the noisy gradient calculated on…

    Submitted 26 February, 2020; v1 submitted 23 February, 2020; originally announced February 2020.

  24. arXiv:2002.00542  [pdf, other]

    stat.AP

    Predictive Risk Analysis in Collective Risk Model: Choices between Historical Frequency and Aggregate Severity

    Authors: Rosy Oh, Youngju Lee, Dan Zhu, Jae Youn Ahn

    Abstract: A typical risk classification procedure in insurance consists of a priori risk classification determined by observable risk characteristics, and a posteriori risk classification where the premium is adjusted to reflect the policyholder's claim history. While using the full claim history data is optimal in a posteriori risk classification procedure, i.e. giving premium estimators with the minimal…

    Submitted 2 February, 2020; originally announced February 2020.

  25. arXiv:1911.08459  [pdf, other]

    stat.ML cs.LG

    Deep Unsupervised Clustering with Clustered Generator Model

    Authors: Dandan Zhu, Tian Han, Linqi Zhou, Xiaokang Yang, Ying Nian Wu

    Abstract: This paper addresses the problem of unsupervised clustering which remains one of the most fundamental challenges in machine learning and artificial intelligence. We propose the clustered generator model for clustering which contains both continuous and discrete latent variables. Discrete latent variables model the cluster label while the continuous ones model variations within each cluster. The le…

    Submitted 19 November, 2019; originally announced November 2019.

  26. Pre-train and Learn: Preserve Global Information for Graph Neural Networks

    Authors: Danhao Zhu, Xin-yu Dai, Jiajun Chen

    Abstract: Graph neural networks (GNNs) have shown great power in learning on attributed graphs. However, it is still a challenge for GNNs to utilize information far away from the source node. Moreover, general GNNs require graph attributes as input, so they cannot be applied to plain graphs. In the paper, we propose new models named G-GNNs (Global information for GNNs) to address the above limitations. First,…

    Submitted 23 December, 2021; v1 submitted 27 October, 2019; originally announced October 2019.

    Journal ref: JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY 36(6): 1420-1430 Nov. 2021

  27. arXiv:1908.09174   

    cs.LG cs.CL stat.ML

    Representation Learning with Autoencoders for Electronic Health Records: A Comparative Study

    Authors: Najibesadat Sadati, Milad Zafar Nezhad, Ratna Babu Chinnam, Dongxiao Zhu

    Abstract: Increasing volume of Electronic Health Records (EHR) in recent years provides great opportunities for data scientists to collaborate on different aspects of healthcare research by applying advanced analytics to these EHR clinical data. A key requirement however is obtaining meaningful insights from high dimensional, sparse and complex clinical data. Data science approaches typically address this c…

    Submitted 19 September, 2019; v1 submitted 24 August, 2019; originally announced August 2019.

    Comments: Reason: This submission is the extension of our other research which has already submitted in arXiv (arXiv:1801.02961), therefore we decided update that version and withdraw this submission

  28. arXiv:1907.12508  [pdf, other]

    cs.LG stat.ML

    Tackling Ordinal Regression Problem for Heterogeneous Data: Sparse and Deep Multi-Task Learning Approaches

    Authors: Lu Wang, Dongxiao Zhu

    Abstract: Many real-world datasets are labeled with natural orders, i.e., ordinal labels. Ordinal regression is a method to predict ordinal labels that finds a wide range of applications in data-rich domains, such as natural, health and social sciences. Most existing ordinal regression approaches work well for independent and identically distributed (IID) instances via formulating a single ordinal regressio…

    Submitted 26 April, 2020; v1 submitted 29 July, 2019; originally announced July 2019.

    Comments: 21 pages, 3 figures

  29. arXiv:1906.04026  [pdf, other]

    cs.LG stat.ML

    CRCEN: A Generalized Cost-sensitive Neural Network Approach for Imbalanced Classification

    Authors: Xiangrui Li, Dongxiao Zhu

    Abstract: Classification on imbalanced datasets is a challenging task in real-world applications. Training conventional classification algorithms directly by minimizing classification error in this scenario can compromise model performance for minority class while optimizing performance for majority class. Traditional approaches to the imbalance problem include re-sampling and cost-sensitive methods. In thi…

    Submitted 3 March, 2020; v1 submitted 10 June, 2019; originally announced June 2019.

  30. arXiv:1906.03691  [pdf, other]

    eess.IV cs.LG stat.ML

    Interpreting Age Effects of Human Fetal Brain from Spontaneous fMRI using Deep 3D Convolutional Neural Networks

    Authors: Xiangrui Li, Jasmine Hect, Moriah Thomason, Dongxiao Zhu

    Abstract: Understanding human fetal neurodevelopment is of great clinical importance as abnormal development is linked to adverse neuropsychiatric outcomes after birth. Recent advances in functional Magnetic Resonance Imaging (fMRI) have provided new insight into development of the human brain before birth, but these studies have predominately focused on brain functional connectivity (i.e. Fisher z-score),…

    Submitted 9 June, 2019; originally announced June 2019.

    Comments: 9 pages

  31. arXiv:1902.07903  [pdf, other]

    cs.NI cs.LG stat.ML

    Learning Deterministic Policy with Target for Power Control in Wireless Networks

    Authors: Yujiao Lu, Hancheng Lu, Liangliang Cao, Feng Wu, Daren Zhu

    Abstract: Inter-Cell Interference Coordination (ICIC) is a promising way to improve energy efficiency in wireless networks, especially where small base stations are densely deployed. However, traditional optimization based ICIC schemes suffer from severe performance degradation with complex interference pattern. To address this issue, we propose a Deep Reinforcement Learning with Deterministic Policy and Ta…

    Submitted 21 February, 2019; originally announced February 2019.

    Comments: 7 pages, 7 figures, GlobeCom2018

  32. arXiv:1901.00746  [pdf]

    cs.CY cs.LG stat.ML

    Multi-task Prediction of Patient Workload

    Authors: Mohammad Hessam Olya, Dongxiao Zhu, Kai Yang

    Abstract: Developing reliable workload predictive models can affect many aspects of the clinical decision-making procedure. The primary challenge in healthcare systems is handling the demand uncertainty over time. This issue becomes more critical for the healthcare facilities that provide service for chronic disease treatment because of the need for continuous treatments over time. Although some researc…

    Submitted 27 December, 2018; originally announced January 2019.

  33. arXiv:1808.09802  [pdf, other]

    stat.ML cs.LG

    Modelling Irregular Spatial Patterns using Graph Convolutional Neural Networks

    Authors: Di Zhu, Yu Liu

    Abstract: The understanding of geographical reality is a process of data representation and pattern discovery. Former studies mainly adopted continuous-field models to represent spatial variables and to investigate the underlying spatial continuity/heterogeneity in the regular spatial domain. In this article, we introduce a more generalized model based on graph convolutional neural networks (GCNs) that can…

    Submitted 15 August, 2018; originally announced August 2018.

    Comments: 10 pages, 8 figures, preprint for arxiv

  34. arXiv:1804.10690  [pdf, other]

    cs.LG stat.ML

    Negative Log Likelihood Ratio Loss for Deep Neural Network Classification

    Authors: Donglai Zhu, Hengshuai Yao, Bei Jiang, Peng Yu

    Abstract: In deep neural networks, the cross-entropy loss function is commonly used for classification. Minimizing cross-entropy is equivalent to maximizing likelihood under assumptions of uniform feature and class distributions. It belongs to generative training criteria which does not directly discriminate correct class from competing classes. We propose a discriminative loss function with negative log lik…

    Submitted 27 April, 2018; originally announced April 2018.

  35. arXiv:1804.03280  [pdf, other]

    cs.LG cs.CY stat.ML

    A Deep Active Survival Analysis Approach for Precision Treatment Recommendations: Application of Prostate Cancer

    Authors: Milad Zafar Nezhad, Najibesadat Sadati, Kai Yang, Dongxiao Zhu

    Abstract: Survival analysis has been developed and applied in a number of areas including manufacturing, finance, economics and healthcare. In the healthcare domain, clinical data are usually high-dimensional, sparse and complex, and sometimes there are few time-to-event (labeled) instances. Therefore building an accurate survival model from electronic health records is challenging. With this moti…

    Submitted 9 April, 2018; originally announced April 2018.

    Journal ref: Expert Systems with Applications Volume 115, January 2019, Pages 16-26

  36. arXiv:1801.02961  [pdf, other]

    cs.LG stat.ML

    Representation Learning with Autoencoders for Electronic Health Records: A Comparative Study

    Authors: Najibesadat Sadati, Milad Zafar Nezhad, Ratna Babu Chinnam, Dongxiao Zhu

    Abstract: Increasing volume of Electronic Health Records (EHR) in recent years provides great opportunities for data scientists to collaborate on different aspects of healthcare research by applying advanced analytics to these EHR clinical data. A key requirement however is obtaining meaningful insights from high dimensional, sparse and complex clinical data. Data science approaches typically address this c…

    Submitted 29 September, 2019; v1 submitted 6 January, 2018; originally announced January 2018.

  37. SUBIC: A Supervised Bi-Clustering Approach for Precision Medicine

    Authors: Milad Zafar Nezhad, Dongxiao Zhu, Najibesadat Sadati, Kai Yang, Phillip Levy

    Abstract: Traditional medicine typically applies one-size-fits-all treatment for the entire patient population whereas precision medicine develops tailored treatment schemes for different patient subgroups. The fact that some factors may be more significant for a specific patient subgroup motivates clinicians and medical researchers to develop new approaches to subgroup detection and analysis, which is an e…

    Submitted 26 September, 2017; originally announced September 2017.

  38. arXiv:1709.06134  [pdf, other]

    q-bio.NC cs.IT stat.AP

    Discrete Dynamic Causal Modeling and Its Relationship with Directed Information

    Authors: Zhe Wang, Yu Zheng, David C. Zhu, Jian Ren, Tongtong Li

    Abstract: This paper explores the discrete Dynamic Causal Modeling (DDCM) and its relationship with Directed Information (DI). We prove the conditional equivalence between DDCM and DI in characterizing the causal relationship between two brain regions. The theoretical results are demonstrated using fMRI data obtained under both resting state and stimulus based state. Our numerical analysis is consistent wit…

    Submitted 18 September, 2017; originally announced September 2017.

  39. arXiv:1708.02365  [pdf, ps, other]

    econ.GN stat.CO stat.ME

    Indirect Inference with a Non-Smooth Criterion Function

    Authors: David T. Frazier, Tatsushi Oka, Dan Zhu

    Abstract: Indirect inference requires simulating realisations of endogenous variables from the model under study. When the endogenous variables are discontinuous functions of the model parameters, the resulting indirect inference criterion function is discontinuous and does not permit the use of derivative-based optimisation routines. Using a change of variables technique, we propose a novel simulation algo…

    Submitted 9 July, 2019; v1 submitted 8 August, 2017; originally announced August 2017.

    Comments: This paper is a revision of arXiv:1708.02365 and supersedes the earlier arXiv paper "Derivative-Based Optimization with a Non-Smooth Simulated Criterion"

  40. arXiv:1705.10312  [pdf]

    cs.LG cs.CE stat.AP

    Classification of Major Depressive Disorder via Multi-Site Weighted LASSO Model

    Authors: Dajiang Zhu, Brandalyn C. Riedel, Neda Jahanshad, Nynke A. Groenewold, Dan J. Stein, Ian H. Gotlib, Matthew D. Sacchet, Danai Dima, James H. Cole, Cynthia H. Y. Fu, Henrik Walter, Ilya M. Veer, Thomas Frodl, Lianne Schmaal, Dick J. Veltman, Paul M. Thompson

    Abstract: Large-scale collaborative analysis of brain imaging data, in psychiatry and neurology, offers a new source of statistical power to discover features that boost accuracy in disease classification, differential diagnosis, and outcome prediction. However, due to data privacy regulations or limited accessibility to large datasets across the world, it is challenging to efficiently integrate distribut…

    Submitted 3 June, 2017; v1 submitted 26 May, 2017; originally announced May 2017.

    Comments: Accepted by MICCAI 2017

  41. arXiv:1704.08383  [pdf, other]

    cs.LG stat.ML

    Large-scale Feature Selection of Risk Genetic Factors for Alzheimer's Disease via Distributed Group Lasso Regression

    Authors: Qingyang Li, Dajiang Zhu, Jie Zhang, Derrek Paul Hibar, Neda Jahanshad, Yalin Wang, Jieping Ye, Paul M. Thompson, Jie Wang

    Abstract: Genome-wide association studies (GWAS) have achieved great success in the genetic study of Alzheimer's disease (AD). Collaborative imaging genetics studies across different research institutions show the effectiveness of detecting genetic risk factors. However, the high dimensionality of GWAS data poses significant challenges in detecting risk SNPs for AD. Selecting relevant features is crucial in…

    Submitted 26 April, 2017; originally announced April 2017.

  42. SAFS: A Deep Feature Selection Approach for Precision Medicine

    Authors: Milad Zafar Nezhad, Dongxiao Zhu, Xiangrui Li, Kai Yang, Phillip Levy

    Abstract: In this paper, we propose a new deep feature selection method based on deep architecture. Our method uses stacked auto-encoders for feature representation in higher-level abstraction. We developed and applied a novel feature learning approach to a specific precision medicine problem, which focuses on assessing and prioritizing risk factors for hypertension (HTN) in a vulnerable demographic subgrou…

    Submitted 19 April, 2017; originally announced April 2017.

  43. arXiv:1511.07944  [pdf, other]

    stat.ML

    Maximum Likelihood Estimation for Single Linkage Hierarchical Clustering

    Authors: Dekang Zhu, Dan P. Guralnik, Xuezhi Wang, Xiang Li, Bill Moran

    Abstract: We derive a statistical model for estimation of a dendrogram from single linkage hierarchical clustering (SLHC) that takes account of uncertainty through noise or corruption in the measurements of separation of data. Our focus is on just the estimation of the hierarchy of partitions afforded by the dendrogram, rather than the heights in the latter. The concept of estimating this "dendrogram struct…

    Submitted 24 November, 2015; originally announced November 2015.

    Comments: 15 pages, 6 figures

  44. arXiv:1511.07715  [pdf, ps, other]

    stat.ML

    Statistical Properties of the Single Linkage Hierarchical Clustering Estimator

    Authors: Dekang Zhu, Dan P. Guralnik, Xuezhi Wang, Xiang Li, Bill Moran

    Abstract: Distance-based hierarchical clustering (HC) methods are widely used in unsupervised data analysis but few authors take account of uncertainty in the distance data. We incorporate a statistical model of the uncertainty through corruption or noise in the pairwise distances and investigate the problem of estimating the HC as unknown parameters from measurements. Specifically, we focus on single linka…

    Submitted 31 August, 2016; v1 submitted 24 November, 2015; originally announced November 2015.

    Comments: 21 pages, 6 figures