Showing 1–27 of 27 results for author: Yin, D

Searching in archive stat.
  1. Analysis of Full-scale Riser Responses in Field Conditions Based on Gaussian Mixture Model

    Authors: Jie Wu, Sølve Eidnes, Jingzhe Jin, Halvor Lie, Decao Yin, Elizabeth Passano, Svein Sævik, Signe Riemer-Sorensen

    Abstract: Offshore slender marine structures experience complex and combined load conditions from waves, current and vessel motions that may result in both wave frequency and vortex shedding response patterns. Field measurements often consist of records of environmental conditions and riser responses, typically with 30-minute intervals. These data can be represented in a high-dimensional parameter space. Ho…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Matches accepted version

    Journal ref: Journal of Fluids and Structures, Volume 116, 2023, 103793

  2. arXiv:2403.10070  [pdf, other]

    stat.ML cs.LG math.DS

    A Structure-Preserving Kernel Method for Learning Hamiltonian Systems

    Authors: Jianyu Hu, Juan-Pablo Ortega, Daiying Yin

    Abstract: A structure-preserving kernel ridge regression method is presented that allows the recovery of nonlinear Hamiltonian functions out of datasets made of noisy observations of Hamiltonian vector fields. The method proposes a closed-form solution that yields excellent numerical performances that surpass other techniques proposed in the literature in this setup. From the methodological point of view, t…

    Submitted 4 April, 2025; v1 submitted 15 March, 2024; originally announced March 2024.

  3. arXiv:2403.00019  [pdf, other]

    cs.LG stat.ML

    Transformer-based Parameter Estimation in Statistics

    Authors: Xiaoxin Yin, David S. Yin

    Abstract: Parameter estimation is one of the most important tasks in statistics, and is key to helping people understand the distribution behind a sample of observations. Traditionally parameter estimation is done either by closed-form solutions (e.g., maximum likelihood estimation for Gaussian distribution), or by iterative numerical methods such as Newton-Raphson method when closed-form solution does not…

    Submitted 27 February, 2024; originally announced March 2024.

  4. arXiv:2108.05533  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Efficient Local Planning with Linear Function Approximation

    Authors: Dong Yin, Botao Hao, Yasin Abbasi-Yadkori, Nevena Lazić, Csaba Szepesvári

    Abstract: We study query and computationally efficient planning algorithms with linear function approximation and a simulator. We assume that the agent only has local access to the simulator, meaning that the agent can only query the simulator at states that have been visited before. This setting is more practical than many prior works on reinforcement learning with a generative model. We propose two algori…

    Submitted 4 February, 2022; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: Algorithmic Learning Theory 2022

  5. arXiv:2102.12611  [pdf, other]

    cs.LG stat.ML

    Improved Regret Bound and Experience Replay in Regularized Policy Iteration

    Authors: Nevena Lazic, Dong Yin, Yasin Abbasi-Yadkori, Csaba Szepesvari

    Abstract: In this work, we study algorithms for learning in infinite-horizon undiscounted Markov decision processes (MDPs) with function approximation. We first show that the regret analysis of the Politex algorithm (a version of regularized policy iteration) can be sharpened from $O(T^{3/4})$ to $O(\sqrt{T})$ under nearly identical assumptions, and instantiate the bound with linear function approximation.…

    Submitted 24 February, 2021; originally announced February 2021.

  6. arXiv:2006.10974  [pdf, ps, other]

    cs.LG stat.ML

    Optimization and Generalization of Regularization-Based Continual Learning: a Loss Approximation Viewpoint

    Authors: Dong Yin, Mehrdad Farajtabar, Ang Li, Nir Levine, Alex Mott

    Abstract: Neural networks have achieved remarkable success in many cognitive tasks. However, when they are trained sequentially on multiple tasks without access to old data, their performance on early tasks tends to drop significantly. This problem is often referred to as catastrophic forgetting, a key challenge in continual learning of neural networks. The regularization-based approach is one of the primary…

    Submitted 8 February, 2021; v1 submitted 19 June, 2020; originally announced June 2020.

    Comments: Preliminary version with a different title presented at ICML Workshop on Continual Learning, 2020 (spotlight)

  7. arXiv:2006.04435  [pdf, other]

    cs.LG cs.AI stat.ML

    CAST: A Correlation-based Adaptive Spectral Clustering Algorithm on Multi-scale Data

    Authors: Xiang Li, Ben Kao, Caihua Shan, Dawei Yin, Martin Ester

    Abstract: We study the problem of applying spectral clustering to cluster multi-scale data, which is data whose clusters are of various sizes and densities. Traditional spectral clustering techniques discover clusters by processing a similarity matrix that reflects the proximity of objects. For multi-scale data, distance-based similarity is not effective because objects of a sparse cluster could be far apar…

    Submitted 8 June, 2020; originally announced June 2020.

  8. arXiv:2006.04088  [pdf, other]

    stat.ML cs.LG

    An Efficient Framework for Clustered Federated Learning

    Authors: Avishek Ghosh, Jichan Chung, Dong Yin, Kannan Ramchandran

    Abstract: We address the problem of federated learning (FL) where users are distributed and partitioned into clusters. This setup captures settings where different groups of users have their own objectives (learning tasks) but by aggregating their data with others in the same cluster (same learning task), they can leverage the strength in numbers in order to perform more efficient federated learning. For th…

    Submitted 8 June, 2021; v1 submitted 7 June, 2020; originally announced June 2020.

    Comments: Preliminary results appeared at NeurIPS 2020

  9. arXiv:2006.00945  [pdf, other]

    cs.LG stat.ML

    Robust Reinforcement Learning with Wasserstein Constraint

    Authors: Linfang Hou, Liang Pang, Xin Hong, Yanyan Lan, Zhiming Ma, Dawei Yin

    Abstract: Robust Reinforcement Learning aims to find the optimal policy with some extent of robustness to environmental dynamics. Existing learning algorithms usually enable the robustness through disturbing the current state or simulating environmental parameters in a heuristic way, which lack quantified robustness to the system dynamics (i.e. transition probability). To overcome this issue, we leverage Wa…

    Submitted 1 June, 2020; originally announced June 2020.

  10. arXiv:2001.06216  [pdf, other]

    cs.LG stat.ML

    GraphLIME: Local Interpretable Model Explanations for Graph Neural Networks

    Authors: Qiang Huang, Makoto Yamada, Yuan Tian, Dinesh Singh, Dawei Yin, Yi Chang

    Abstract: Graph structured data has wide applicability in various domains such as physics, chemistry, biology, computer vision, and social networks, to name a few. Recently, graph neural networks (GNN) were shown to be successful in effectively representing graph structured data because of their good performance and generalization ability. GNN is a deep learning based method that learns a node representatio…

    Submitted 27 September, 2020; v1 submitted 17 January, 2020; originally announced January 2020.

  11. arXiv:1907.09652  [pdf, other]

    stat.ML cs.LG

    Off-policy Learning for Multiple Loggers

    Authors: Li He, Long Xia, Wei Zeng, Zhi-Ming Ma, Yihong Zhao, Dawei Yin

    Abstract: It is well known that historical logs are used for evaluating and learning policies in interactive systems, e.g. recommendation, search, and online advertising. Since direct online policy learning usually harms user experiences, it is more crucial to apply off-policy learning in real-world applications instead. Though there are some existing works, most focus on learning with one…

    Submitted 5 August, 2019; v1 submitted 22 July, 2019; originally announced July 2019.

  12. arXiv:1907.03215  [pdf, other]

    cs.LG stat.ML

    Stochastic Gradient and Langevin Processes

    Authors: Xiang Cheng, Dong Yin, Peter L. Bartlett, Michael I. Jordan

    Abstract: We prove quantitative convergence rates at which discrete Langevin-like processes converge to the invariant distribution of a related stochastic differential equation. We study the setup where the additive noise can be non-Gaussian and state-dependent and the potential function can be non-convex. We show that the key properties of these processes depend on the potential function and the second mom…

    Submitted 18 November, 2020; v1 submitted 6 July, 2019; originally announced July 2019.

    Comments: ICML 2020, code available at https://github.com/dongyin92/noise_covariance

  13. arXiv:1906.08988  [pdf, other]

    cs.LG cs.CV stat.ML

    A Fourier Perspective on Model Robustness in Computer Vision

    Authors: Dong Yin, Raphael Gontijo Lopes, Jonathon Shlens, Ekin D. Cubuk, Justin Gilmer

    Abstract: Achieving robustness to distributional shift is a longstanding and challenging goal of computer vision. Data augmentation is a commonly used approach for improving robustness, however robustness gains are typically not uniform across corruption types. Indeed increasing performance in the presence of random noise is often met with reduced performance on other corruptions such as contrast change. Un…

    Submitted 16 September, 2020; v1 submitted 21 June, 2019; originally announced June 2019.

    Comments: NeurIPS 2019

  14. arXiv:1906.06629  [pdf, other]

    cs.LG stat.ML

    Robust Federated Learning in a Heterogeneous Environment

    Authors: Avishek Ghosh, Justin Hong, Dong Yin, Kannan Ramchandran

    Abstract: We study a recently proposed large-scale distributed learning paradigm, namely Federated Learning, where the worker machines are end users' own devices. Statistical and computational challenges arise in Federated Learning particularly in the presence of heterogeneous data distribution (i.e., data points on different devices belong to different distributions signifying different clusters) and Byzan…

    Submitted 9 October, 2019; v1 submitted 15 June, 2019; originally announced June 2019.

    Comments: Fixing technical issues. Please discard any previous version

  15. arXiv:1906.02611  [pdf, other]

    cs.LG cs.CV stat.ML

    Improving Robustness Without Sacrificing Accuracy with Patch Gaussian Augmentation

    Authors: Raphael Gontijo Lopes, Dong Yin, Ben Poole, Justin Gilmer, Ekin D. Cubuk

    Abstract: Deploying machine learning systems in the real world requires both high accuracy on clean data and robustness to naturally occurring corruptions. While architectural advances have led to improved accuracy, building robust models remains challenging. Prior work has argued that there is an inherent trade-off between robustness and accuracy, which is exemplified by standard data augmentation techniques su…

    Submitted 6 June, 2019; originally announced June 2019.

  16. arXiv:1810.11914  [pdf, other]

    cs.LG cs.CR cs.NE stat.ML

    Rademacher Complexity for Adversarially Robust Generalization

    Authors: Dong Yin, Kannan Ramchandran, Peter Bartlett

    Abstract: Many machine learning models are vulnerable to adversarial attacks; for example, adding adversarial perturbations that are imperceptible to humans can often make machine learning models produce wrong predictions with high confidence. Moreover, although we may obtain robust models on the training dataset via adversarial training, in some problems the learned models cannot generalize well to the tes…

    Submitted 29 July, 2020; v1 submitted 28 October, 2018; originally announced October 2018.

    Comments: ICML 2019

  17. arXiv:1810.10627  [pdf, other]

    cs.LG stat.ML

    Streaming Graph Neural Networks

    Authors: Yao Ma, Ziyi Guo, Zhaochun Ren, Eric Zhao, Jiliang Tang, Dawei Yin

    Abstract: Graphs are essential representations of many real-world data such as social networks. Recent years have witnessed the increasing efforts made to extend the neural network models to graph-structured data. These methods, which are usually known as the graph neural networks, have been applied to advance many graphs related tasks such as reasoning dynamics of the physical system, graph classification,…

    Submitted 6 November, 2018; v1 submitted 24 October, 2018; originally announced October 2018.

  18. arXiv:1808.06170  [pdf, other]

    cs.CL cs.LG stat.ML

    Linked Recurrent Neural Networks

    Authors: Zhiwei Wang, Yao Ma, Dawei Yin, Jiliang Tang

    Abstract: Recurrent Neural Networks (RNNs) have been proven to be effective in modeling sequential data and they have been applied to boost a variety of tasks such as document classification, speech recognition and machine translation. Most existing RNN models have been designed for sequences assumed to be identically and independently distributed (i.i.d.). However, in many real-world applications, sequen…

    Submitted 19 August, 2018; originally announced August 2018.

  19. arXiv:1806.05358  [pdf, ps, other]

    cs.LG cs.CR cs.DC math.OC stat.ML

    Defending Against Saddle Point Attack in Byzantine-Robust Distributed Learning

    Authors: Dong Yin, Yudong Chen, Kannan Ramchandran, Peter Bartlett

    Abstract: We study robust distributed learning that involves minimizing a non-convex loss function with saddle points. We consider the Byzantine setting where some worker machines have abnormal or even arbitrary and adversarial behavior. In this setting, the Byzantine machines may create fake local minima near a saddle point that is far away from any true local minimum, even when robust gradient estimators…

    Submitted 29 July, 2020; v1 submitted 14 June, 2018; originally announced June 2018.

    Comments: ICML 2019

  20. arXiv:1803.01498  [pdf, other]

    cs.LG cs.CR cs.DC stat.ML

    Byzantine-Robust Distributed Learning: Towards Optimal Statistical Rates

    Authors: Dong Yin, Yudong Chen, Kannan Ramchandran, Peter Bartlett

    Abstract: In large-scale distributed learning, security issues have become increasingly important. Particularly in a decentralized environment, some computing units may behave abnormally, or even exhibit Byzantine failures -- arbitrary and potentially adversarial behavior. In this paper, we develop distributed learning algorithms that are provably robust against such failures, with a focus on achieving opti…

    Submitted 25 February, 2021; v1 submitted 5 March, 2018; originally announced March 2018.

    Comments: ICML 2018

  21. arXiv:1802.06501  [pdf, other]

    cs.IR cs.LG stat.ML

    Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning

    Authors: Xiangyu Zhao, Liang Zhang, Zhuoye Ding, Long Xia, Jiliang Tang, Dawei Yin

    Abstract: Recommender systems play a crucial role in mitigating the problem of information overload by suggesting users' personalized items or services. The vast majority of traditional recommender systems consider the recommendation procedure as a static process and make recommendations following a fixed strategy. In this paper, we propose a novel recommender system with the capability of continuously impr…

    Submitted 9 August, 2018; v1 submitted 18 February, 2018; originally announced February 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1801.00209

  22. arXiv:1802.05315  [pdf, other]

    cs.LG stat.ML

    Online Learning for Non-Stationary A/B Tests

    Authors: Andrés Muñoz Medina, Sergei Vassilvitskii, Dong Yin

    Abstract: The rollout of new versions of a feature in modern applications is a manual multi-stage process, as the feature is released to ever larger groups of users, while its performance is carefully monitored. This kind of A/B testing is ubiquitous, but suboptimal, as the monitoring requires heavy human intervention, is not guaranteed to capture consistent, but short-term fluctuations in performance, and…

    Submitted 27 May, 2018; v1 submitted 14 February, 2018; originally announced February 2018.

  23. arXiv:1801.00209  [pdf, other]

    cs.LG stat.ML

    Deep Reinforcement Learning for List-wise Recommendations

    Authors: Xiangyu Zhao, Liang Zhang, Long Xia, Zhuoye Ding, Dawei Yin, Jiliang Tang

    Abstract: Recommender systems play a crucial role in mitigating the problem of information overload by suggesting users' personalized items or services. The vast majority of traditional recommender systems consider the recommendation procedure as a static process and make recommendations following a fixed strategy. In this paper, we propose a novel recommender system with the capability of continuously impr…

    Submitted 27 June, 2019; v1 submitted 30 December, 2017; originally announced January 2018.

  24. arXiv:1608.04048  [pdf, other]

    stat.ML

    Ultra High-Dimensional Nonlinear Feature Selection for Big Biological Data

    Authors: Makoto Yamada, Jiliang Tang, Jose Lugo-Martinez, Ermin Hodzic, Raunak Shrestha, Avishek Saha, Hua Ouyang, Dawei Yin, Hiroshi Mamitsuka, Cenk Sahinalp, Predrag Radivojac, Filippo Menczer, Yi Chang

    Abstract: Machine learning methods are used to discover complex nonlinear relationships in biological and medical data. However, sophisticated learning models are computationally unfeasible for data with millions of features. Here we introduce the first feature selection method for nonlinear learning problems that can scale up to large, ultra-high dimensional biological data. More specifically, we scale up…

    Submitted 13 August, 2016; originally announced August 2016.

    Comments: Substantially improved version of arXiv:1411.2331

  25. arXiv:1605.08346  [pdf, other]

    cs.IT stat.ML

    Distributed Sequence Memory of Multidimensional Inputs in Recurrent Networks

    Authors: Adam Charles, Dong Yin, Christopher Rozell

    Abstract: Recurrent neural networks (RNNs) have drawn interest from machine learning researchers because of their effectiveness at preserving past inputs for time-varying data processing tasks. To understand the success and limitations of RNNs, it is critical that we advance our analysis of their fundamental memory properties. We focus on echo state networks (ESNs), which are RNNs with simple memoryless nod…

    Submitted 27 January, 2017; v1 submitted 26 May, 2016; originally announced May 2016.

    Comments: 37 pages, 3 figures

    Journal ref: Journal of Machine Learning Research, 18:1-37 Jan. 2017

  26. arXiv:1412.2113  [pdf, other]

    stat.ML cs.LG

    Consistent Collective Matrix Completion under Joint Low Rank Structure

    Authors: Suriya Gunasekar, Makoto Yamada, Dawei Yin, Yi Chang

    Abstract: We address the collective matrix completion problem of jointly recovering a collection of matrices with shared structure from partial (and potentially noisy) observations. To ensure well-posedness of the problem, we impose a joint low rank structure, wherein each component matrix is low rank and the latent space of the low rank factors corresponding to each entity is shared across the entire coll…

    Submitted 7 April, 2015; v1 submitted 5 December, 2014; originally announced December 2014.

    Comments: 19 pages, 3 figures

  27. arXiv:1411.2331  [pdf, ps, other]

    stat.ML cs.LG

    N$^3$LARS: Minimum Redundancy Maximum Relevance Feature Selection for Large and High-dimensional Data

    Authors: Makoto Yamada, Avishek Saha, Hua Ouyang, Dawei Yin, Yi Chang

    Abstract: We propose a feature selection method that finds non-redundant features from large and high-dimensional data in a nonlinear way. Specifically, we propose a nonlinear extension of the non-negative least-angle regression (LARS) called N$^3$LARS, where the similarity between input and output is measured through the normalized version of the Hilbert-Schmidt Independence Criterion (HSIC). An advantag…

    Submitted 10 November, 2014; originally announced November 2014.

    Comments: arXiv admin note: text overlap with arXiv:1202.0515