Skip to main content

Showing 1–50 of 50 results for author: Zeng, Y

Searching in archive stat. Search in all archives.
.
  1. arXiv:2507.05990  [pdf, ps, other

    stat.ME

    Multivariate regression with missing response data for modelling regional DNA methylation QTLs

    Authors: Shomoita Alam, Yixiao Zeng, Sasha Bernatsky, Marie Hudson, Inés Colmegna, David A. Stephens, Celia M. T. Greenwood, Archer Y. Yang

    Abstract: Identifying genetic regulators of DNA methylation (mQTLs) with multivariate models enhances statistical power, but is challenged by missing data from bisulfite sequencing. Standard imputation-based methods can introduce bias, limiting reliable inference. We propose \texttt{missoNet}, a novel convex estimation framework that jointly estimates regression coefficients and the precision matrix from da… ▽ More

    Submitted 8 July, 2025; originally announced July 2025.

  2. arXiv:2506.02410  [pdf, ps, other

    stat.ME

    Testing for large-dimensional covariance matrix under differential privacy

    Authors: Shiwei Sang, Yicheng Zeng, Xuehu Zhu, Shurong Zheng

    Abstract: The increasing prevalence of high-dimensional data across various applications has raised significant privacy concerns in statistical inference. In this paper, we propose a differentially private integrated statistic for testing large-dimensional covariance structures, enabling accurate statistical insights while safeguarding privacy. First, we analyze the global sensitivity of sample eigenvalues… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

  3. arXiv:2504.07946  [pdf, other

    stat.ME stat.CO

    Characteristic function-based tests for spatial randomness

    Authors: Yiran Zeng, Dale L. Zimmerman

    Abstract: We introduce a new type of test for complete spatial randomness that applies to mapped point patterns in a rectangle or a cube of any dimension. This is the first test of its kind to be based on characteristic functions and utilizes a weighted L2-distance between the empirical and uniform characteristic functions. It is simple to calculate and does not require adjusting for edge effects. An effici… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: 24 pages, 4 figures

  4. arXiv:2504.04016  [pdf, ps, other

    stat.ML cs.LG

    Computational Efficient and Minimax Optimal Nonignorable Matrix Completion

    Authors: Yuanhong A, Guoyu Zhang, Yongcheng Zeng, Bo Zhang

    Abstract: While the matrix completion problem has attracted considerable attention over the decades, few works address the nonignorable missing issue and all have their limitations. In this article, we propose a nuclear norm regularized row- and column-wise matrix U-statistic loss function for the generalized nonignorable missing mechanism, a flexible and generally applicable missing mechanism which contain… ▽ More

    Submitted 26 June, 2025; v1 submitted 4 April, 2025; originally announced April 2025.

  5. arXiv:2502.06398  [pdf, other

    cs.LG stat.ML

    Learning Counterfactual Outcomes Under Rank Preservation

    Authors: Peng Wu, Haoxuan Li, Chunyuan Zheng, Yan Zeng, Jiawei Chen, Yang Liu, Ruocheng Guo, Kun Zhang

    Abstract: Counterfactual inference aims to estimate the counterfactual outcome at the individual level given knowledge of an observed treatment and the factual outcome, with broad applications in fields such as epidemiology, econometrics, and management science. Previous methods rely on a known structural causal model (SCM) or assume the homogeneity of the exogenous variable and strict monotonicity between… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

  6. arXiv:2501.16156  [pdf, other

    stat.ME

    Moving toward best practice when using propensity score weighting in survey observational studies

    Authors: Yukang Zeng, Fan Li, Guangyu Tong

    Abstract: Propensity score weighting is a common method for estimating treatment effects with survey data. The method is applied to minimize confounding using measured covariates that are often different between individuals in treatment and control. However, existing literature does not reach a consensus on the optimal use of survey weights for population-level inference in the propensity score weighting an… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

  7. arXiv:2501.11696  [pdf, other

    stat.ME

    Exact Bounds of Spearman's footrule in the Presence of Missing Data with Applications to Independence Testing

    Authors: Yijin Zeng, Niall M. Adams, Dean A. Bodenham

    Abstract: This work studies exact bounds of Spearman's footrule between two partially observed $n$-dimensional distinct real-valued vectors $X$ and $Y$. The lower bound is obtained by sequentially constructing imputations of the partially observed vectors, each with a non-increasing value of Spearman's footrule. The upper bound is found by first considering the set of all possible values of Spearman's footr… ▽ More

    Submitted 20 January, 2025; originally announced January 2025.

    Comments: 187 pages, 29 figures

  8. arXiv:2501.06429  [pdf, other

    cs.LG stat.ML

    Reliable Imputed-Sample Assisted Vertical Federated Learning

    Authors: Yaopei Zeng, Lei Liu, Shaoguo Liu, Hongjian Dou, Baoyuan Wu, Li Liu

    Abstract: Vertical Federated Learning (VFL) is a well-known FL variant that enables multiple parties to collaboratively train a model without sharing their raw data. Existing VFL approaches focus on overlapping samples among different parties, while their performance is constrained by the limited number of these samples, leaving numerous non-overlapping samples unexplored. Some previous work has explored te… ▽ More

    Submitted 10 January, 2025; originally announced January 2025.

  9. arXiv:2411.16315  [pdf, ps, other

    cs.LG math.ST stat.ML

    Local Learning for Covariate Selection in Nonparametric Causal Effect Estimation with Latent Variables

    Authors: Zheng Li, Feng Xie, Xichen Guo, Yan Zeng, Hao Zhang, Zhi Geng

    Abstract: Estimating causal effects from nonexperimental data is a fundamental problem in many fields of science. A key component of this task is selecting an appropriate set of covariates for confounding adjustment to avoid bias. Most existing methods for covariate selection often assume the absence of latent variables and rely on learning the global network structure among variables. However, identifying… ▽ More

    Submitted 19 May, 2025; v1 submitted 25 November, 2024; originally announced November 2024.

  10. arXiv:2411.12184  [pdf, ps, other

    stat.ME cs.AI cs.LG

    Testability of Instrumental Variables in Additive Nonlinear, Non-Constant Effects Models

    Authors: Xichen Guo, Zheng Li, Biwei Huang, Yan Zeng, Zhi Geng, Feng Xie

    Abstract: We address the issue of the testability of instrumental variables derived from observational data. Most existing testable implications are centered on scenarios where the treatment is a discrete variable, e.g., instrumental inequality (Pearl, 1995), or where the effect is assumed to be constant, e.g., instrumental variables condition based on the principle of independent mechanisms (Burauel, 2023)… ▽ More

    Submitted 18 November, 2024; originally announced November 2024.

  11. arXiv:2410.07395  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    LLM Embeddings Improve Test-time Adaptation to Tabular $Y|X$-Shifts

    Authors: Yibo Zeng, Jiashuo Liu, Henry Lam, Hongseok Namkoong

    Abstract: For tabular datasets, the change in the relationship between the label and covariates ($Y|X$-shifts) is common due to missing variables (a.k.a. confounders). Since it is impossible to generalize to a completely new and unknown domain, we study models that are easy to adapt to the target domain even with few labeled examples. We focus on building more informative representations of tabular data tha… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  12. arXiv:2408.00799  [pdf, other

    cs.IR cs.LG stat.ML

    Deep Uncertainty-Based Explore for Index Construction and Retrieval in Recommendation System

    Authors: Xin Jiang, Kaiqiang Wang, Yinlong Wang, Fengchang Lv, Taiyang Peng, Shuai Yang, Xianteng Wu, Pengye Zhang, Shuo Yuan, Yifan Zeng

    Abstract: In recommendation systems, the relevance and novelty of the final results are selected through a cascade system of Matching -> Ranking -> Strategy. The matching model serves as the starting point of the pipeline and determines the upper bound of the subsequent stages. Balancing the relevance and novelty of matching results is a crucial step in the design and optimization of recommendation systems,… ▽ More

    Submitted 5 August, 2024; v1 submitted 21 July, 2024; originally announced August 2024.

    Comments: accepted by cikm2024

  13. arXiv:2407.07933  [pdf, other

    stat.ME cs.LG stat.ML

    Identification and Estimation of the Bi-Directional MR with Some Invalid Instruments

    Authors: Feng Xie, Zhen Yao, Lin Xie, Yan Zeng, Zhi Geng

    Abstract: We consider the challenging problem of estimating causal effects from purely observational data in the bi-directional Mendelian randomization (MR), where some invalid instruments, as well as unmeasured confounding, usually exist. To address this problem, most existing methods attempt to find proper valid instrumental variables (IVs) for the target causal effect by expert knowledge or by assuming t… ▽ More

    Submitted 12 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: 27 pages, 6 tables, 7 figures

  14. arXiv:2405.15531  [pdf, other

    stat.ME

    MMD Two-sample Testing in the Presence of Arbitrarily Missing Data

    Authors: Yijin Zeng, Niall M. Adams, Dean A. Bodenham

    Abstract: In many real-world applications, it is common that a proportion of the data may be missing or only partially observed. We develop a novel two-sample testing method based on the Maximum Mean Discrepancy (MMD) which accounts for missing data in both samples, without making assumptions about the missingness mechanism. Our approach is based on deriving the mathematically precise bounds of the MMD test… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 13 pages, 3 figures, 1 table, and appendix

  15. arXiv:2405.03329  [pdf, other

    cs.LG stat.ML

    Policy Learning for Balancing Short-Term and Long-Term Rewards

    Authors: Peng Wu, Ziyu Shen, Feng Xie, Zhongyao Wang, Chunchen Liu, Yan Zeng

    Abstract: Empirical researchers and decision-makers spanning various domains frequently seek profound insights into the long-term impacts of interventions. While the significance of long-term outcomes is undeniable, an overemphasis on them may inadvertently overshadow short-term gains. Motivated by this, this paper formalizes a new framework for learning the optimal policy that effectively balances both lon… ▽ More

    Submitted 15 September, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  16. arXiv:2403.15327  [pdf, other

    stat.ME

    On two-sample testing for data with arbitrarily missing values

    Authors: Yijin Zeng, Niall M. Adams, Dean A. Bodenham

    Abstract: We develop a new rank-based approach for univariate two-sample testing in the presence of missing data which makes no assumptions about the missingness mechanism. This approach is a theoretical extension of the Wilcoxon-Mann-Whitney test that controls the Type I error by providing exact bounds for the test statistic after accounting for the number of missing values. Greater statistical power is sh… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 60 pages, 12 figures

    MSC Class: 62

  17. arXiv:2310.18572  [pdf, ps, other

    stat.AP

    Where to serve and return in Badminton Men's Double?

    Authors: Xuelin Zhu, Yu Sun, Yumin Zeng, Cong Xu

    Abstract: This study aims to analyze the service and return landing areas in badminton men's double, based on data extracted from 20 badminton matches. We find that most services land near the center-line, while returns tend to land in the crossing areas of the serving team's court. Using generalized logit models, we are able to predict the return landing area based on features of the service and return rou… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

  18. arXiv:2310.17513  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    The Expressive Power of Low-Rank Adaptation

    Authors: Yuchen Zeng, Kangwook Lee

    Abstract: Low-Rank Adaptation (LoRA), a parameter-efficient fine-tuning method that leverages low-rank adaptation of weight matrices, has emerged as a prevalent technique for fine-tuning pre-trained models such as large language models and diffusion models. Despite its huge success in practice, the theoretical underpinnings of LoRA have largely remained unexplored. This paper takes the first step to bridge… ▽ More

    Submitted 17 March, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: 40 pages, 5 figures

  19. arXiv:2310.06306  [pdf, other

    cs.LG stat.ML

    Ensemble Active Learning by Contextual Bandits for AI Incubation in Manufacturing

    Authors: Yingyan Zeng, Xiaoyu Chen, Ran Jin

    Abstract: It is challenging but important to save annotation efforts in streaming data acquisition to maintain data quality for supervised learning base learners. We propose an ensemble active learning method to actively acquire samples for annotation by contextual bandits, which is will enforce the exploration-exploitation balance and leading to improved AI modeling performance.

    Submitted 10 October, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

  20. arXiv:2305.06898  [pdf, other

    cs.SI physics.soc-ph stat.CO

    Identifying vital nodes through augmented random walks on higher-order networks

    Authors: Yujie Zeng, Yiming Huang, Xiao-Long Ren, Linyuan Lü

    Abstract: Empirical networks possess considerable heterogeneity of node connections, resulting in a small portion of nodes playing crucial roles in network structure and function. Yet, how to characterize nodes' influence and identify vital nodes is by far still unclear in the study of networks with higher-order interactions. In this paper, we introduce a multi-order graph obtained by incorporating the high… ▽ More

    Submitted 3 December, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  21. arXiv:2305.00054  [pdf, other

    cs.LG cs.AI stat.ML

    LAVA: Data Valuation without Pre-Specified Learning Algorithms

    Authors: Hoang Anh Just, Feiyang Kang, Jiachen T. Wang, Yi Zeng, Myeongseob Ko, Ming Jin, Ruoxi Jia

    Abstract: Traditionally, data valuation (DV) is posed as a problem of equitably splitting the validation performance of a learning algorithm among the training data. As a result, the calculated data values depend on many design choices of the underlying learning algorithm. However, this dependence is undesirable for many DV use cases, such as setting priorities over different data sources in a data acquisit… ▽ More

    Submitted 19 December, 2023; v1 submitted 28 April, 2023; originally announced May 2023.

    Comments: ICLR 2023 Spotlight Latest Updated Version: 2023/12/19

  22. arXiv:2302.01088  [pdf, other

    math.ST stat.ML

    Sketched Ridgeless Linear Regression: The Role of Downsampling

    Authors: Xin Chen, Yicheng Zeng, Siyue Yang, Qiang Sun

    Abstract: Overparametrization often helps improve the generalization performance. This paper presents a dual view of overparametrization suggesting that downsampling may also help generalize. Focusing on the proportional regime $m\asymp n \asymp p$, where $m$ represents the sketching size, $n$ is the sample size, and $p$ is the feature dimensionality, we investigate two out-of-sample prediction risks of the… ▽ More

    Submitted 13 October, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: Add more numerical experiments and some discussions, relax the Gaussian assumption of coefficient vector to moment conditions

  23. arXiv:2206.14421  [pdf, other

    cs.LG stat.ML

    Cyclical Kernel Adaptive Metropolis

    Authors: Jianan Canal Li, Yimeng Zeng, Wentao Guo

    Abstract: We propose cKAM, cyclical Kernel Adaptive Metropolis, which incorporates a cyclical stepsize scheme to allow control for exploration and sampling. We show that on a crafted bimodal distribution, existing Adaptive Metropolis type algorithms would fail to converge to the true posterior distribution. We point out that this is because adaptive samplers estimates the local/global covariance structure u… ▽ More

    Submitted 29 June, 2022; v1 submitted 29 June, 2022; originally announced June 2022.

  24. arXiv:2204.13916  [pdf, ps, other

    stat.ML cs.LG

    A study of tree-based methods and their combination

    Authors: Yinuo Zeng

    Abstract: Tree-based methods are popular machine learning techniques used in various fields. In this work, we review their foundations and a general framework the importance sampled learning ensemble (ISLE) that accelerates their fitting process. Furthermore, we describe a model combination strategy called the adaptive regression by mixing (ARM), which is feasible for tree-based methods via ISLE. Moreover,… ▽ More

    Submitted 29 April, 2022; originally announced April 2022.

  25. arXiv:2111.12545  [pdf, other

    cs.LG stat.CO

    ModelPred: A Framework for Predicting Trained Model from Training Data

    Authors: Yingyan Zeng, Jiachen T. Wang, Si Chen, Hoang Anh Just, Ran Jin, Ruoxi Jia

    Abstract: In this work, we propose ModelPred, a framework that helps to understand the impact of changes in training data on a trained model. This is critical for building trust in various stages of a machine learning pipeline: from cleaning poor-quality samples and tracking important ones to be collected during data preparation, to calibrating uncertainty of model prediction, to interpreting why certain be… ▽ More

    Submitted 23 December, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

  26. arXiv:2108.00874  [pdf, other

    cs.LG cs.IT eess.SP stat.ML

    Few-Shot Domain Adaptation For End-to-End Communication

    Authors: Jayaram Raghuram, Yijing Zeng, Dolores García Martí, Rafael Ruiz Ortiz, Somesh Jha, Joerg Widmer, Suman Banerjee

    Abstract: The problem of end-to-end learning of a communication system using an autoencoder -- consisting of an encoder, channel, and decoder modeled using neural networks -- has recently been shown to be a promising approach. A challenge faced in the practical adoption of this learning approach is that under changing channel conditions (e.g. a wireless link), it requires frequent retraining of the autoenco… ▽ More

    Submitted 25 July, 2022; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: 32 pages, 11 figures

  27. arXiv:2106.11180  [pdf, other

    math.OC cs.LG stat.ME

    Generalization Bounds with Minimal Dependency on Hypothesis Class via Distributionally Robust Optimization

    Authors: Yibo Zeng, Henry Lam

    Abstract: Established approaches to obtain generalization bounds in data-driven optimization and machine learning mostly build on solutions from empirical risk minimization (ERM), which depend crucially on the functional complexity of the hypothesis class. In this paper, we present an alternate route to obtain these bounds on the solution from distributionally robust optimization (DRO), a recent data-driven… ▽ More

    Submitted 12 October, 2022; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: Accepted by NeurIPS 2022

  28. arXiv:2105.04330  [pdf, ps, other

    econ.EM stat.AP

    Efficient Peer Effects Estimators with Group Effects

    Authors: Guido M. Kuersteiner, Ingmar R. Prucha, Ying Zeng

    Abstract: We study linear peer effects models where peers interact in groups, individual's outcomes are linear in the group mean outcome and characteristics, and group effects are random. Our specification is motivated by the moment conditions imposed in Graham 2008. We show that these moment conditions can be cast in terms of a linear random group effects model and lead to a class of GMM estimators that ar… ▽ More

    Submitted 25 April, 2022; v1 submitted 10 May, 2021; originally announced May 2021.

    MSC Class: 62F10; 62F12

  29. arXiv:2009.13038  [pdf, other

    cs.LG stat.ML

    A Robust graph attention network with dynamic adjusted Graph

    Authors: Xianchen Zhou, Yaoyun Zeng, Hongxia Wang

    Abstract: Graph Attention Networks(GATs) are useful deep learning models to deal with the graph data. However, recent works show that the classical GAT is vulnerable to adversarial attacks. It degrades dramatically with slight perturbations. Therefore, how to enhance the robustness of GAT is a critical problem. Robust GAT(RoGAT) is proposed in this paper to improve the robustness of GAT based on the revisio… ▽ More

    Submitted 4 August, 2022; v1 submitted 27 September, 2020; originally announced September 2020.

    Comments: 21 pages,13 figures

  30. arXiv:2009.11508  [pdf, other

    cs.LG stat.ML

    Improving Query Efficiency of Black-box Adversarial Attack

    Authors: Yang Bai, Yuyuan Zeng, Yong Jiang, Yisen Wang, Shu-Tao Xia, Weiwei Guo

    Abstract: Deep neural networks (DNNs) have demonstrated excellent performance on various tasks, however they are under the risk of adversarial examples that can be easily generated when the target model is accessible to an attacker (white-box setting). As plenty of machine learning models have been deployed via online services that only provide query outputs from inaccessible models (e.g. Google Cloud Visio… ▽ More

    Submitted 25 September, 2020; v1 submitted 24 September, 2020; originally announced September 2020.

    Comments: Accepted to ECCV2020

  31. arXiv:2009.09176  [pdf, other

    cs.LG stat.ML

    Causal Discovery with Multi-Domain LiNGAM for Latent Factors

    Authors: Yan Zeng, Shohei Shimizu, Ruichu Cai, Feng Xie, Michio Yamamoto, Zhifeng Hao

    Abstract: Discovering causal structures among latent factors from observed data is a particularly challenging problem. Despite some efforts for this problem, existing methods focus on the single-domain data only. In this paper, we propose Multi-Domain Linear Non-Gaussian Acyclic Models for Latent Factors (MD-LiNA), where the causal structure among latent factors of interest is shared for all domains, and we… ▽ More

    Submitted 22 April, 2022; v1 submitted 19 September, 2020; originally announced September 2020.

    Comments: 16 pages, 11 figures

  32. arXiv:2009.08697  [pdf, other

    cs.CR cs.LG stat.ML

    Fine-tuning Is Not Enough: A Simple yet Effective Watermark Removal Attack for DNN Models

    Authors: Shangwei Guo, Tianwei Zhang, Han Qiu, Yi Zeng, Tao Xiang, Yang Liu

    Abstract: Watermarking has become the tendency in protecting the intellectual property of DNN models. Recent works, from the adversary's perspective, attempted to subvert watermarking mechanisms by designing watermark removal attacks. However, these attacks mainly adopted sophisticated fine-tuning techniques, which have certain fatal drawbacks or unrealistic assumptions. In this paper, we propose a novel wa… ▽ More

    Submitted 17 May, 2021; v1 submitted 18 September, 2020; originally announced September 2020.

    Comments: 7 pages, 4 figures, accpeted by IJCAI 2021

  33. arXiv:2008.09983  [pdf, other

    cs.LG cs.DB stat.ML

    Leveraging Organizational Resources to Adapt Models to New Data Modalities

    Authors: Sahaana Suri, Raghuveer Chanda, Neslihan Bulut, Pradyumna Narayana, Yemao Zeng, Peter Bailis, Sugato Basu, Girija Narlikar, Christopher Re, Abishek Sethi

    Abstract: As applications in large organizations evolve, the machine learning (ML) models that power them must adapt the same predictive tasks to newly arising data modalities (e.g., a new video content launch in a social media application requires existing text or image models to extend to video). To solve this problem, organizations typically create ML pipelines from scratch. However, this fails to utiliz… ▽ More

    Submitted 23 August, 2020; originally announced August 2020.

    Journal ref: PVLDB,13(12): 3396-3410, 2020

  34. arXiv:1912.09132  [pdf, other

    cs.LG stat.ML

    Mean field theory for deep dropout networks: digging up gradient backpropagation deeply

    Authors: Wei Huang, Richard Yi Da Xu, Weitao Du, Yutian Zeng, Yunce Zhao

    Abstract: In recent years, the mean field theory has been applied to the study of neural networks and has achieved a great deal of success. The theory has been applied to various neural network structures, including CNNs, RNNs, Residual networks, and Batch normalization. Inevitably, recent work has also covered the use of dropout. The mean field theory shows that the existence of depth scales that limit the… ▽ More

    Submitted 13 April, 2020; v1 submitted 19 December, 2019; originally announced December 2019.

    Comments: 20 pages, 7 figures

    Journal ref: 24th European Conference on Artificial Intelligence - ECAI 2020

  35. arXiv:1910.14498  [pdf, other

    stat.ME

    Order Determination for Spiked Models

    Authors: Yicheng Zeng, Lixing Zhu

    Abstract: Motivated by dimension reduction in regression analysis and signal detection, we investigate the order determination for large dimension matrices including spiked models of which the numbers of covariates are proportional to the sample sizes for different models. Because the asymptotic behaviour of the estimated eigenvalues of the corresponding matrices differ completely from those in fixed dimens… ▽ More

    Submitted 31 October, 2019; originally announced October 2019.

    Comments: 42 pages, 4 figures

  36. arXiv:1906.03807  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Multiway clustering via tensor block models

    Authors: Miaoyan Wang, Yuchen Zeng

    Abstract: We consider the problem of identifying multiway block structure from a large noisy tensor. Such problems arise frequently in applications such as genomics, recommendation system, topic modeling, and sensor network localization. We propose a tensor block model, develop a unified least-square estimation, and obtain the theoretical accuracy guarantees for multiway clustering. The statistical converge… ▽ More

    Submitted 2 January, 2021; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: add the supplements

    MSC Class: 62H25; 62H12

    Journal ref: Advances in Neural Information Processing Systems 32 (NeurIPS 2019)

  37. arXiv:1901.03415  [pdf, other

    cs.LG stat.ML

    Context Aware Machine Learning

    Authors: Yun Zeng

    Abstract: We propose a principle for exploring context in machine learning models. Starting with a simple assumption that each observation may or may not depend on its context, a conditional probability distribution is decomposed into two parts: context-free and context-sensitive. Then by employing the log-linear word production model for relating random variables to their embedding space representation and… ▽ More

    Submitted 19 January, 2019; v1 submitted 10 January, 2019; originally announced January 2019.

  38. arXiv:1812.01101  [pdf, other

    physics.geo-ph cs.LG stat.ML

    Automatic Seismic Salt Interpretation with Deep Convolutional Neural Networks

    Authors: Yu Zeng, Kebei Jiang, Jie Chen

    Abstract: One of the most crucial tasks in seismic reflection imaging is to identify the salt bodies with high precision. Traditionally, this is accomplished by visually picking the salt/sediment boundaries, which requires a great amount of manual work and may introduce systematic bias. With recent progress of deep learning algorithm and growing computational power, a great deal of efforts have been made to… ▽ More

    Submitted 24 November, 2018; originally announced December 2018.

    Comments: 11 pages, 7 figures

    Journal ref: ICISDM 2019 - The 3rd International Conference on Information System and Data Mining

  39. arXiv:1808.09856  [pdf, other

    stat.ML cs.LG physics.geo-ph

    Application of Machine Learning in Rock Facies Classification with Physics-Motivated Feature Augmentation

    Authors: Jie Chen, Yu Zeng

    Abstract: With recent progress in algorithms and the availability of massive amounts of computation power, application of machine learning techniques is becoming a hot topic in the oil and gas industry. One of the most promising aspects to apply machine learning to the upstream field is the rock facies classification in reservoir characterization, which is crucial in determining the net pay thickness of res… ▽ More

    Submitted 29 August, 2018; originally announced August 2018.

    Comments: 8 pages, 7 figures

  40. arXiv:1804.01777  [pdf

    stat.AP

    Future Energy Consumption Prediction Based on Grey Forecast Model

    Authors: Yuan Zeng, Miao Luo, Yuzhong Liu

    Abstract: We use grey forecast model to predict the future energy consumption of four states in the U.S, and make some improvments to the model.

    Submitted 5 April, 2018; originally announced April 2018.

    Comments: 25 pages,21 figurea

  41. arXiv:1801.09125  [pdf, other

    cs.IT stat.ML

    Scalable Mutual Information Estimation using Dependence Graphs

    Authors: Morteza Noshad, Yu Zeng, Alfred O. Hero III

    Abstract: The Mutual Information (MI) is an often used measure of dependency between two random variables utilized in information theory, statistics and machine learning. Recently several MI estimators have been proposed that can achieve parametric MSE convergence rate. However, most of the previously proposed estimators have the high computational complexity of at least $O(N^2)$. We propose a unified metho… ▽ More

    Submitted 23 November, 2018; v1 submitted 27 January, 2018; originally announced January 2018.

    Comments: 19 Pages

  42. arXiv:1710.10944  [pdf, other

    cs.NE cs.LG q-bio.QM stat.ML

    A Supervised STDP-based Training Algorithm for Living Neural Networks

    Authors: Yuan Zeng, Kevin Devincentis, Yao Xiao, Zubayer Ibne Ferdous, Xiaochen Guo, Zhiyuan Yan, Yevgeny Berdichevsky

    Abstract: Neural networks have shown great potential in many applications like speech recognition, drug discovery, image classification, and object detection. Neural network models are inspired by biological neural networks, but they are optimized to perform machine learning tasks on digital computers. The proposed work explores the possibilities of using living neural networks in vitro as basic computation… ▽ More

    Submitted 21 March, 2018; v1 submitted 30 October, 2017; originally announced October 2017.

    Comments: 5 pages, 3 figures, Accepted by ICASSP 2018

  43. arXiv:1706.01833  [pdf

    stat.ML cs.LG q-fin.CP

    Online Adaptive Machine Learning Based Algorithm for Implied Volatility Surface Modeling

    Authors: Yaxiong Zeng, Diego Klabjan

    Abstract: In this work, we design a machine learning based method, online adaptive primal support vector regression (SVR), to model the implied volatility surface (IVS). The algorithm proposed is the first derivation and implementation of an online primal kernel SVR. It features enhancements that allow efficient online adaptive learning by embedding the idea of local fitness and budget maintenance to dynami… ▽ More

    Submitted 7 June, 2018; v1 submitted 6 June, 2017; originally announced June 2017.

    Comments: 34 Pages

  44. arXiv:1704.08742  [pdf, other

    stat.ML stat.CO

    Hybrid safe-strong rules for efficient optimization in lasso-type problems

    Authors: Yaohui Zeng, Tianbao Yang, Patrick Breheny

    Abstract: The lasso model has been widely used for model selection in data mining, machine learning, and high-dimensional statistical analysis. However, with the ultrahigh-dimensional, large-scale data sets now collected in many real-world applications, it is important to develop algorithms to solve the lasso that efficiently scale up to problems of this size. Discarding features from certain steps of the a… ▽ More

    Submitted 1 June, 2020; v1 submitted 27 April, 2017; originally announced April 2017.

    Comments: 31 pages, 4 figures

  45. Optimized Cost per Click in Taobao Display Advertising

    Authors: Han Zhu, Junqi Jin, Chang Tan, Fei Pan, Yifan Zeng, Han Li, Kun Gai

    Abstract: Taobao, as the largest online retail platform in the world, provides billions of online display advertising impressions for millions of advertisers every day. For commercial purposes, the advertisers bid for specific spots and target crowds to compete for business traffic. The platform chooses the most suitable ads to display in tens of milliseconds. Common pricing methods include cost per mille (… ▽ More

    Submitted 29 January, 2019; v1 submitted 27 February, 2017; originally announced March 2017.

    Comments: Accepted by KDD 2017

  46. arXiv:1702.06943  [pdf, other

    cs.LG cs.DB stat.ML

    Tuple-oriented Compression for Large-scale Mini-batch Stochastic Gradient Descent

    Authors: Fengan Li, Lingjiao Chen, Yijing Zeng, Arun Kumar, Jeffrey F. Naughton, Jignesh M. Patel, Xi Wu

    Abstract: Data compression is a popular technique for improving the efficiency of data processing workloads such as SQL queries and more recently, machine learning (ML) with classical batch gradient methods. But the efficacy of such ideas for mini-batch stochastic gradient descent (MGD), arguably the workhorse algorithm of modern ML, is an open question. MGD's unique data access pattern renders prior art, i… ▽ More

    Submitted 20 January, 2019; v1 submitted 22 February, 2017; originally announced February 2017.

    Comments: Accepted to Sigmod 2019

  47. arXiv:1701.05936  [pdf, other

    stat.CO stat.ML

    The biglasso Package: A Memory- and Computation-Efficient Solver for Lasso Model Fitting with Big Data in R

    Authors: Yaohui Zeng, Patrick Breheny

    Abstract: Penalized regression models such as the lasso have been extensively applied to analyzing high-dimensional data sets. However, due to memory limitations, existing R packages like glmnet and ncvreg are not capable of fitting lasso-type models for ultrahigh-dimensional, multi-gigabyte data sets that are increasingly seen in many areas such as genetics, genomics, biomedical imaging, and high-frequency… ▽ More

    Submitted 11 March, 2018; v1 submitted 20 January, 2017; originally announced January 2017.

    Comments: 20 pages, 6 figures

  48. Overlapping group logistic regression with applications to genetic pathway selection

    Authors: Yaohui Zeng, Patrick Breheny

    Abstract: Discovering important genes that account for the phenotype of interest has long been challenging in genomewide expression analysis. Analyses such as Gene Set Enrichment Analysis (GSEA) that incorporate pathway information have become widespread in hypothesis testing, but pathway-based approaches have been largely absent from regression methods due to the challenges of dealing with overlapping path… ▽ More

    Submitted 13 September, 2016; v1 submitted 17 October, 2015; originally announced October 2015.

    Journal ref: Cancer Informatics, 15:179-187, 2016

  49. arXiv:1306.3003  [pdf, ps, other

    cs.LG cs.CV stat.ML

    Non-parametric Power-law Data Clustering

    Authors: Xuhui Fan, Yiling Zeng, Longbing Cao

    Abstract: It has always been a great challenge for clustering algorithms to automatically determine the cluster numbers according to the distribution of datasets. Several approaches have been proposed to address this issue, including the recent promising work which incorporate Bayesian Nonparametrics into the $k$-means clustering procedure. This approach shows simplicity in implementation and solidity in th… ▽ More

    Submitted 12 June, 2013; originally announced June 2013.

  50. On the Performance of Spectrum Sensing Algorithms using Multiple Antennas

    Authors: Ying-Chang Liang, Guangming Pan, Yonghong Zeng

    Abstract: In recent years, some spectrum sensing algorithms using multiple antennas, such as the eigenvalue based detection (EBD), have attracted a lot of attention. In this paper, we are interested in deriving the asymptotic distributions of the test statistics of the EBD algorithms. Two EBD algorithms using sample covariance matrices are considered: maximum eigenvalue detection (MED) and condition number… ▽ More

    Submitted 18 August, 2010; originally announced August 2010.

    Comments: IEEE GlobeCom 2010