Skip to main content

Showing 1–7 of 7 results for author: Zhuo, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2211.14227  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Bypass Exponential Time Preprocessing: Fast Neural Network Training via Weight-Data Correlation Preprocessing

    Authors: Josh Alman, Jiehao Liang, Zhao Song, Ruizhe Zhang, Danyang Zhuo

    Abstract: Over the last decade, deep neural networks have transformed our society, and they are already widely applied in various machine learning applications. State-of-art deep neural networks are becoming larger in size every year to deliver increasing model accuracy, and as a result, model training consumes substantial computing resources and will only consume more in the future. Using current training… ▽ More

    Submitted 25 November, 2022; originally announced November 2022.

  2. arXiv:2208.04508  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Training Overparametrized Neural Networks in Sublinear Time

    Authors: Yichuan Deng, Hang Hu, Zhao Song, Omri Weinstein, Danyang Zhuo

    Abstract: The success of deep learning comes at a tremendous computational and energy cost, and the scalability of training massively overparametrized neural networks is becoming a real barrier to the progress of artificial intelligence (AI). Despite the popularity and low cost-per-iteration of traditional backpropagation via gradient decent, stochastic gradient descent (SGD) has prohibitive convergence rat… ▽ More

    Submitted 7 February, 2024; v1 submitted 8 August, 2022; originally announced August 2022.

  3. arXiv:2208.03915  [pdf, ps, other

    cs.LG stat.ML

    Dynamic Maintenance of Kernel Density Estimation Data Structure: From Practice to Theory

    Authors: Jiehao Liang, Zhao Song, Zhaozhuo Xu, Junze Yin, Danyang Zhuo

    Abstract: Kernel density estimation (KDE) stands out as a challenging task in machine learning. The problem is defined in the following way: given a kernel function $f(x,y)$ and a set of points $\{x_1, x_2, \cdots, x_n \} \subset \mathbb{R}^d$, we would like to compute $\frac{1}{n}\sum_{i=1}^{n} f(x_i,y)$ for any query point $y \in \mathbb{R}^d$. Recently, there has been a growing trend of using data struct… ▽ More

    Submitted 13 February, 2024; v1 submitted 8 August, 2022; originally announced August 2022.

  4. arXiv:2105.05328  [pdf, other

    cs.LG stat.ML

    Comparing interpretability and explainability for feature selection

    Authors: Jack Dunn, Luca Mingardi, Ying Daisy Zhuo

    Abstract: A common approach for feature selection is to examine the variable importance scores for a machine learning model, as a way to understand which features are the most relevant for making predictions. Given the significance of feature selection, it is crucial for the calculated importance scores to reflect reality. Falsely overestimating the importance of irrelevant features can lead to false discov… ▽ More

    Submitted 11 May, 2021; originally announced May 2021.

  5. arXiv:2011.11877  [pdf, other

    cs.LG cs.CC cs.CR cs.DS stat.ML

    InstaHide's Sample Complexity When Mixing Two Private Images

    Authors: Baihe Huang, Zhao Song, Runzhou Tao, Junze Yin, Ruizhe Zhang, Danyang Zhuo

    Abstract: Training neural networks usually require large numbers of sensitive training data, and how to protect the privacy of training data has thus become a critical topic in deep learning research. InstaHide is a state-of-the-art scheme to protect training data privacy with only minor effects on test accuracy, and its security has become a salient question. In this paper, we systematically study recent a… ▽ More

    Submitted 5 February, 2024; v1 submitted 23 November, 2020; originally announced November 2020.

  6. arXiv:2011.11181  [pdf, other

    cs.LG cs.CR cs.DS stat.ML

    On InstaHide, Phase Retrieval, and Sparse Matrix Factorization

    Authors: Sitan Chen, Xiaoxiao Li, Zhao Song, Danyang Zhuo

    Abstract: In this work, we examine the security of InstaHide, a scheme recently proposed by [Huang, Song, Li and Arora, ICML'20] for preserving the security of private datasets in the context of distributed learning. To generate a synthetic training example to be shared among the distributed learners, InstaHide takes a convex combination of private feature vectors and randomly flips the sign of each entry o… ▽ More

    Submitted 24 March, 2021; v1 submitted 22 November, 2020; originally announced November 2020.

    Comments: 30 pages, to appear in ICLR 2021, v2: updated discussion of follow-up work

  7. arXiv:2006.06762  [pdf, other

    cs.LG cs.NE cs.PF cs.PL stat.ML

    Ansor: Generating High-Performance Tensor Programs for Deep Learning

    Authors: Lianmin Zheng, Chengfan Jia, Minmin Sun, Zhao Wu, Cody Hao Yu, Ameer Haj-Ali, Yida Wang, Jun Yang, Danyang Zhuo, Koushik Sen, Joseph E. Gonzalez, Ion Stoica

    Abstract: High-performance tensor programs are crucial to guarantee efficient execution of deep neural networks. However, obtaining performant tensor programs for different operators on various hardware platforms is notoriously challenging. Currently, deep learning systems rely on vendor-provided kernel libraries or various search strategies to get performant tensor programs. These approaches either require… ▽ More

    Submitted 15 October, 2023; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: OSDI 2020