Skip to main content

Showing 1–37 of 37 results for author: Sun, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2504.12175  [pdf, ps, other

    stat.ML cs.LG

    Approximation Bounds for Transformer Networks with Application to Regression

    Authors: Yuling Jiao, Yanming Lai, Defeng Sun, Yang Wang, Bokai Yan

    Abstract: We explore the approximation capabilities of Transformer networks for Hölder and Sobolev functions, and apply these results to address nonparametric regression estimation with dependent observations. First, we establish novel upper bounds for standard Transformer networks approximating sequence-to-sequence mappings whose component functions are Hölder continuous with smoothness index $γ\in (0,1]$.… ▽ More

    Submitted 16 April, 2025; originally announced April 2025.

  2. arXiv:2502.14424  [pdf, other

    stat.ML cs.AI cs.LG stat.ME

    Distribution Matching for Self-Supervised Transfer Learning

    Authors: Yuling Jiao, Wensen Ma, Defeng Sun, Hansheng Wang, Yang Wang

    Abstract: In this paper, we propose a novel self-supervised transfer learning method called Distribution Matching (DM), which drives the representation distribution toward a predefined reference distribution while preserving augmentation invariance. The design of DM results in a learned representation space that is intuitively structured and offers easily interpretable hyperparameters. Experimental results… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

  3. arXiv:2412.14222  [pdf, other

    cs.AI cs.CL cs.LG stat.OT

    A Survey on Large Language Model-based Agents for Statistics and Data Science

    Authors: Maojun Sun, Ruijian Han, Binyan Jiang, Houduo Qi, Defeng Sun, Yancheng Yuan, Jian Huang

    Abstract: In recent years, data science agents powered by Large Language Models (LLMs), known as "data agents," have shown significant potential to transform the traditional data analysis paradigm. This survey provides an overview of the evolution, capabilities, and applications of LLM-based data agents, highlighting their role in simplifying complex data tasks and lowering the entry barrier for users witho… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

  4. arXiv:2412.04773  [pdf, other

    stat.ME

    Robust and Optimal Tensor Estimation via Robust Gradient Descent

    Authors: Xiaoyu Zhang, Di Wang, Guodong Li, Defeng Sun

    Abstract: Low-rank tensor models are widely used in statistics and machine learning. However, most existing methods rely heavily on the assumption that data follows a sub-Gaussian distribution. To address the challenges associated with heavy-tailed distributions encountered in real-world applications, we propose a novel robust estimation procedure based on truncated gradient descent for general low-rank ten… ▽ More

    Submitted 5 December, 2024; originally announced December 2024.

    Comments: 47 pages, 3 figures

  5. arXiv:2410.23580   

    stat.AP stat.ME

    Bayesian Hierarchical Model for Synthesizing Registry and Survey Data on Female Breast Cancer Prevalence

    Authors: Qiao Wang, Chester Lee Schmaltz, Jeannette Jackson-Thompson, Dongchu Sun, Zhuoqiong He, Zhongheng Cai, Hwanhee Hong

    Abstract: In public health, it is critical for policymakers to assess the relationship between the disease prevalence and associated risk factors or clinical characteristics, facilitating effective resources allocation. However, for diseases like female breast cancer (FBC), reliable prevalence data at specific geographical levels, such as the county-level, are limited because the gold standard data typicall… ▽ More

    Submitted 30 October, 2024; originally announced October 2024.

    Comments: arXiv admin comment: This version has been removed by arXiv administrators as the submitter did not have the rights to agree to the license at the time of submission

  6. arXiv:2410.12117  [pdf, other

    stat.ME

    Empirical Bayes estimation via data fission

    Authors: Nikolaos Ignatiadis, Dennis L. Sun

    Abstract: We demonstrate how data fission, a method for creating synthetic replicates from single observations, can be applied to empirical Bayes estimation. This extends recent work on empirical Bayes with multiple replicates to the classical single-replicate setting. The key insight is that after data fission, empirical Bayes estimation can be cast as a general regression problem.

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: This note was prepared as a comment on "Data Fission: Splitting a Single Data Point," by James Leiner, Boyan Duan, Larry Wasserman, and Aaditya Ramdas, a discussion paper in the Journal of the American Statistical Association

  7. arXiv:2310.09384  [pdf, other

    stat.ME stat.AP

    Modeling Missing at Random Neuropsychological Test Scores Using a Mixture of Binomial Product Experts

    Authors: Daniel Suen, Yen-Chi Chen

    Abstract: Multivariate bounded discrete data arises in many fields. In the setting of longitudinal dementia studies, such data is collected when individuals complete neuropsychological tests. We outline a modeling and inference procedure that can model the joint distribution conditional on baseline covariates, leveraging previous work on mixtures of experts and latent class models. Furthermore, we illustrat… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 37 pages, 8 figures, 5 tables

  8. arXiv:2308.15549  [pdf, ps, other

    stat.ME math.ST

    Kernel meets sieve: transformed hazards models with sparse longitudinal covariates

    Authors: Dayu Sun, Zhuowei Sun, Xingqiu Zhao, Hongyuan Cao

    Abstract: We study the transformed hazards model with time-dependent covariates observed intermittently for the censored outcome. Existing work assumes the availability of the whole trajectory of the time-dependent covariates, which is unrealistic. We propose to combine kernel-weighted log-likelihood and sieve maximum log-likelihood estimation to conduct statistical inference. The method is robust and easy… ▽ More

    Submitted 17 September, 2023; v1 submitted 29 August, 2023; originally announced August 2023.

    MSC Class: 62N02 (primary); 62F12; 62E20 (secondary)

  9. arXiv:2308.01839  [pdf, other

    q-bio.QM cs.CV q-bio.GN stat.AP stat.ML

    Is your data alignable? Principled and interpretable alignability testing and integration of single-cell data

    Authors: Rong Ma, Eric D. Sun, David Donoho, James Zou

    Abstract: Single-cell data integration can provide a comprehensive molecular view of cells, and many algorithms have been developed to remove unwanted technical or biological variations and integrate heterogeneous single-cell datasets. Despite their wide usage, existing methods suffer from several fundamental limitations. In particular, we lack a rigorous statistical test for whether two high-dimensional si… ▽ More

    Submitted 29 February, 2024; v1 submitted 3 August, 2023; originally announced August 2023.

    Journal ref: Proceedings of the National Academy of Sciences, 2024, 121(10) e2313719121

  10. arXiv:2303.16841  [pdf, other

    cs.LG stat.ML

    Randomly Projected Convex Clustering Model: Motivation, Realization, and Cluster Recovery Guarantees

    Authors: Ziwen Wang, Yancheng Yuan, Jiaming Ma, Tieyong Zeng, Defeng Sun

    Abstract: In this paper, we propose a randomly projected convex clustering model for clustering a collection of $n$ high dimensional data points in $\mathbb{R}^d$ with $K$ hidden clusters. Compared to the convex clustering model for clustering original data with dimension $d$, we prove that, under some mild conditions, the perfect recovery of the cluster membership assignments of the convex clustering model… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

  11. arXiv:2301.01620  [pdf, ps, other

    stat.AP

    Anonymous Pattern Molecular Fingerprint and its Applications on Property Identification

    Authors: Xue Liu, Qian Cheng, Dan Sun, Xing Li, Wei Wei, Zhiming Zheng

    Abstract: Molecular fingerprints are significant cheminformatics tools to map molecules into vectorial space according to their characteristics in diverse functional groups, atom sequences, and other topological structures. In this paper, we set out to investigate a novel molecular fingerprint \emph{Anonymous-FP} that possesses abundant perception about the underlying interactions shaped in small, medium, a… ▽ More

    Submitted 4 January, 2023; originally announced January 2023.

    Comments: 11 pages

  12. arXiv:2210.13711  [pdf, other

    stat.ML cs.LG q-bio.QM stat.AP stat.ME

    A Spectral Method for Assessing and Combining Multiple Data Visualizations

    Authors: Rong Ma, Eric D. Sun, James Zou

    Abstract: Dimension reduction and data visualization aim to project a high-dimensional dataset to a low-dimensional space while capturing the intrinsic structures in the data. It is an indispensable part of modern data science, and many dimensional reduction and visualization algorithms have been developed. However, different algorithms have their own strengths and weaknesses, making it critically important… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Under revision of Nature Communications

  13. arXiv:2210.00415  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Metric Distribution to Vector: Constructing Data Representation via Broad-Scale Discrepancies

    Authors: Xue Liu, Dan Sun, Xiaobo Cao, Hao Ye, Wei Wei

    Abstract: Graph embedding provides a feasible methodology to conduct pattern classification for graph-structured data by mapping each data into the vectorial space. Various pioneering works are essentially coding method that concentrates on a vectorial representation about the inner properties of a graph in terms of the topological constitution, node attributions, link relations, etc. However, the classific… ▽ More

    Submitted 1 October, 2022; originally announced October 2022.

  14. arXiv:2111.02367  [pdf, other

    stat.ME

    Multistage Estimators for Missing Covariates and Incomplete Outcomes

    Authors: Daniel Suen, Yen-Chi Chen

    Abstract: We study problems with multiple missing covariates and partially observed responses. We develop a new framework to handle complex missing covariate scenarios via inverse probability weighting, regression adjustment, and a multiply-robust procedure. We apply our framework to three classical problems: the Cox model from survival analysis, missing response, and binary treatment from causal inference.… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: 92 pages, 12 figures

  15. arXiv:2106.13508  [pdf, other

    stat.CO math.OC stat.ME

    MARS: A second-order reduction algorithm for high-dimensional sparse precision matrices estimation

    Authors: Qian LI, Binyan Jiang, Defeng Sun

    Abstract: Estimation of the precision matrix (or inverse covariance matrix) is of great importance in statistical data analysis and machine learning. However, as the number of parameters scales quadratically with the dimension $p$, computation becomes very challenging when $p$ is large. In this paper, we propose an adaptive sieving reduction algorithm to generate a solution path for the estimation of precis… ▽ More

    Submitted 1 November, 2022; v1 submitted 25 June, 2021; originally announced June 2021.

  16. arXiv:2007.13140   

    stat.ML cs.LG stat.CO

    Fully Bayesian Analysis of the Relevance Vector Machine Classification for Imbalanced Data

    Authors: Wenyang Wang, Dongchu Sun, Zhuoqiong He

    Abstract: Relevance Vector Machine (RVM) is a supervised learning algorithm extended from Support Vector Machine (SVM) based on the Bayesian sparsity model. Compared with the regression problem, RVM classification is difficult to be conducted because there is no closed-form solution for the weight parameter posterior. Original RVM classification algorithm used Newton's method in optimization to obtain the m… ▽ More

    Submitted 27 October, 2022; v1 submitted 26 July, 2020; originally announced July 2020.

    Comments: The extended and final version of this paper has been published with open access modality in the CAAI Transactions on Intelligence Technology and can be found at link https://ietresearch.onlinelibrary.wiley.com/doi/full/10.1049/cit2.12111. Please refer to the TRIT published version in your scientific papers

  17. arXiv:2007.07366  [pdf, other

    cs.DC cs.LG stat.ML

    Serverless inferencing on Kubernetes

    Authors: Clive Cox, Dan Sun, Ellis Tarn, Animesh Singh, Rakesh Kelkar, David Goodwin

    Abstract: Organisations are increasingly putting machine learning models into production at scale. The increasing popularity of serverless scale-to-zero paradigms presents an opportunity for deploying machine learning models to help mitigate infrastructure costs when many models may not be in continuous use. We will discuss the KFServing project which builds on the KNative serverless paradigm to provide a s… ▽ More

    Submitted 24 July, 2020; v1 submitted 14 July, 2020; originally announced July 2020.

    Comments: 4 pages, 1 figure, presented at workshop on "Challenges in Deploying and Monitoring Machine Learning System" at ICML 2020

  18. arXiv:2006.05622  [pdf, other

    cs.LG stat.ML

    P-ADMMiRNN: Training RNN with Stable Convergence via An Efficient and Paralleled ADMM Approach

    Authors: Yu Tang, Zhigang Kan, Dequan Sun, Jingjing Xiao, Zhiquan Lai, Linbo Qiao, Dongsheng Li

    Abstract: It is hard to train Recurrent Neural Network (RNN) with stable convergence and avoid gradient vanishing and exploding problems, as the weights in the recurrent unit are repeated from iteration to iteration. Moreover, RNN is sensitive to the initialization of weights and bias, which brings difficulties in training. The Alternating Direction Method of Multipliers (ADMM) has become a promising algori… ▽ More

    Submitted 28 March, 2022; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: 13 pages, 12 figures

  19. arXiv:2005.12415  [pdf, other

    stat.ML cs.LG math.ST

    Robust Matrix Completion with Mixed Data Types

    Authors: Daqian Sun, Martin T. Wells

    Abstract: We consider the matrix completion problem of recovering a structured low rank matrix with partially observed entries with mixed data types. Vast majority of the solutions have proposed computationally feasible estimators with strong statistical guarantees for the case where the underlying distribution of data in the matrix is continuous. A few recent approaches have extended using similar ideas th… ▽ More

    Submitted 25 May, 2020; originally announced May 2020.

    Comments: 35 pages

  20. arXiv:2004.08115  [pdf, other

    math.OC stat.AP stat.CO stat.ML

    Estimation of sparse Gaussian graphical models with hidden clustering structure

    Authors: Meixia Lin, Defeng Sun, Kim-Chuan Toh, Chengjing Wang

    Abstract: Estimation of Gaussian graphical models is important in natural science when modeling the statistical relationships between variables in the form of a graph. The sparsity and clustering structure of the concentration matrix is enforced to reduce model complexity and describe inherent regularities. We propose a model to estimate the sparse Gaussian graphical models with hidden clustering structure,… ▽ More

    Submitted 17 April, 2020; originally announced April 2020.

  21. arXiv:2004.05988  [pdf, other

    cs.LG stat.ML

    ControlVAE: Controllable Variational Autoencoder

    Authors: Huajie Shao, Shuochao Yao, Dachun Sun, Aston Zhang, Shengzhong Liu, Dongxin Liu, Jun Wang, Tarek Abdelzaher

    Abstract: Variational Autoencoders (VAE) and their variants have been widely used in a variety of applications, such as dialog generation, image generation and disentangled representation learning. However, the existing VAE models have some limitations in different applications. For example, a VAE easily suffers from KL vanishing in language modeling and low reconstruction quality for disentangling. To addr… ▽ More

    Submitted 20 June, 2020; v1 submitted 13 April, 2020; originally announced April 2020.

    Comments: accepted by ICML2020

    Journal ref: 37th proceedings of ICML, 2020

  22. arXiv:2002.11410  [pdf, other

    math.OC stat.ML

    Efficient algorithms for multivariate shape-constrained convex regression problems

    Authors: Meixia Lin, Defeng Sun, Kim-Chuan Toh

    Abstract: Shape-constrained convex regression problem deals with fitting a convex function to the observed data, where additional constraints are imposed, such as component-wise monotonicity and uniform Lipschitz continuity. This paper provides a comprehensive mechanism for computing the least squares estimator of a multivariate shape-constrained convex regression function in $\mathbb{R}^d$. We prove that t… ▽ More

    Submitted 26 February, 2020; originally announced February 2020.

  23. arXiv:1911.05970  [pdf, other

    stat.ME

    Empirical Bayes mean estimation with nonparametric errors via order statistic regression on replicated data

    Authors: Nikolaos Ignatiadis, Sujayam Saha, Dennis L. Sun, Omkar Muralidharan

    Abstract: We study empirical Bayes estimation of the effect sizes of $N$ units from $K$ noisy observations on each unit. We show that it is possible to achieve near-Bayes optimal mean squared error, without any assumptions or knowledge about the effect size distribution or the noise. The noise distribution can be heteroskedastic and vary arbitrarily from unit to unit. Our proposal, which we call Aurora, lev… ▽ More

    Submitted 10 August, 2021; v1 submitted 14 November, 2019; originally announced November 2019.

  24. salmon: A Symbolic Linear Regression Package for Python

    Authors: Alex Boyd, Dennis L. Sun

    Abstract: One of the most attractive features of R is its linear modeling capabilities. We describe a Python package, salmon, that brings the best of R's linear modeling functionality to Python in a Pythonic way -- by providing composable objects for specifying and fitting linear models. This object-oriented design also enables other features that enhance ease-of-use, such as automatic visualizations and in… ▽ More

    Submitted 4 February, 2023; v1 submitted 2 November, 2019; originally announced November 2019.

    Comments: Accepted in the Journal of Statistical Software

    Journal ref: Journal of Statistical Software, 108(8), 1-26 (2024)

  25. arXiv:1906.05575  [pdf, other

    stat.ME stat.CO

    Direct Sampling of Bayesian Thin-Plate Splines for Spatial Smoothing

    Authors: Gentry White, Dongchu Sun, Paul Speckman

    Abstract: Radial basis functions are a common mathematical tool used to construct a smooth interpolating function from a set of data points. A spatial prior based on thin-plate spline radial basis functions can be easily implemented resulting in a posterior that can be sampled directly using Monte Carlo integration, avoiding the computational burden and potential inefficiency of an Monte Carlo Markov Chain… ▽ More

    Submitted 13 June, 2019; originally announced June 2019.

  26. arXiv:1904.01014  [pdf, other

    eess.IV cs.LG stat.ML

    Comparison of Possibilistic Fuzzy Local Information C-Means and Possibilistic K-Nearest Neighbors for Synthetic Aperture Sonar Image Segmentation

    Authors: Joshua Peeples, Matthew Cook, Daniel Suen, Alina Zare, James Keller

    Abstract: Synthetic aperture sonar (SAS) imagery can generate high resolution images of the seafloor. Thus, segmentation algorithms can be used to partition the images into different seafloor environments. In this paper, we compare two possibilistic segmentation approaches. Possibilistic approaches allow for the ability to detect novel or outlier environments as well as well known classes. The Possibilistic… ▽ More

    Submitted 1 April, 2019; originally announced April 2019.

    Journal ref: Proc. SPIE 110120, Detection and Sensing of Mines, Explosive Objects, and Obscured Targets XXIV (10 May 2019)

  27. arXiv:1903.11460  [pdf, ps, other

    math.OC cs.LG math.NA stat.CO stat.ML

    A sparse semismooth Newton based proximal majorization-minimization algorithm for nonconvex square-root-loss regression problems

    Authors: Peipei Tang, Chengjing Wang, Defeng Sun, Kim-Chuan Toh

    Abstract: In this paper, we consider high-dimensional nonconvex square-root-loss regression problems and introduce a proximal majorization-minimization (PMM) algorithm for these problems. Our key idea for making the proposed PMM to be efficient is to develop a sparse semismooth Newton method to solve the corresponding subproblems. By using the Kurdyka-Łojasiewicz property exhibited in the underlining proble… ▽ More

    Submitted 27 May, 2020; v1 submitted 27 March, 2019; originally announced March 2019.

    Comments: 34 pages, 8 tables

  28. arXiv:1810.02677  [pdf, other

    cs.LG math.OC stat.ML

    Convex Clustering: Model, Theoretical Guarantee and Efficient Algorithm

    Authors: Defeng Sun, Kim-Chuan Toh, Yancheng Yuan

    Abstract: Clustering is a fundamental problem in unsupervised learning. Popular methods like K-means, may suffer from poor performance as they are prone to get stuck in its local minima. Recently, the sum-of-norms (SON) model (also known as the clustering path) has been proposed in Pelckmans et al. (2005), Lindsten et al. (2011) and Hocking et al. (2011). The perfect recovery properties of the convex cluste… ▽ More

    Submitted 4 October, 2018; originally announced October 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1802.07091

  29. arXiv:1809.04249  [pdf, other

    math.OC stat.ML

    A Fast Globally Linearly Convergent Algorithm for the Computation of Wasserstein Barycenters

    Authors: Lei Yang, Jia Li, Defeng Sun, Kim-Chuan Toh

    Abstract: We consider the problem of computing a Wasserstein barycenter for a set of discrete probability distributions with finite supports, which finds many applications in areas such as statistics, machine learning and image processing. When the support points of the barycenter are pre-specified, this problem can be modeled as a linear programming (LP) problem whose size can be extremely large. To handle… ▽ More

    Submitted 26 December, 2020; v1 submitted 12 September, 2018; originally announced September 2018.

  30. arXiv:1808.07181  [pdf, other

    math.OC stat.ML

    Efficient sparse semismooth Newton methods for the clustered lasso problem

    Authors: Meixia Lin, Yong-Jin Liu, Defeng Sun, Kim-Chuan Toh

    Abstract: We focus on solving the clustered lasso problem, which is a least squares problem with the $\ell_1$-type penalties imposed on both the coefficients and their pairwise differences to learn the group structure of the regression parameters. Here we first reformulate the clustered lasso regularizer as a weighted ordered-lasso regularizer, which is essential in reducing the computational cost from… ▽ More

    Submitted 1 May, 2019; v1 submitted 21 August, 2018; originally announced August 2018.

  31. arXiv:1712.06424  [pdf, other

    cs.CV cs.LG stat.ML

    Learning to Write Stylized Chinese Characters by Reading a Handful of Examples

    Authors: Danyang Sun, Tongzheng Ren, Chongxun Li, Hang Su, Jun Zhu

    Abstract: Automatically writing stylized Chinese characters is an attractive yet challenging task due to its wide applicabilities. In this paper, we propose a novel framework named Style-Aware Variational Auto-Encoder (SA-VAE) to flexibly generate Chinese characters. Specifically, we propose to capture the different characteristics of a Chinese character by disentangling the latent features into content-rel… ▽ More

    Submitted 18 June, 2018; v1 submitted 6 December, 2017; originally announced December 2017.

    Comments: Accepted by IJCAI 2018

  32. arXiv:1410.3853  [pdf, other

    stat.AP physics.ed-ph

    Peer assessment enhances student learning

    Authors: Dennis L. Sun, Naftali Harris, Guenther Walther, Michael Baiocchi

    Abstract: Feedback has a powerful influence on learning, but it is also expensive to provide. In large classes, it may even be impossible for instructors to provide individualized feedback. Peer assessment has received attention lately as a way of providing personalized feedback that scales to large classes. Besides these obvious benefits, some researchers have also conjectured that students learn by peer a… ▽ More

    Submitted 14 October, 2014; originally announced October 2014.

  33. arXiv:1410.2597  [pdf, other

    math.ST stat.ME

    Optimal Inference After Model Selection

    Authors: William Fithian, Dennis Sun, Jonathan Taylor

    Abstract: To perform inference after model selection, we propose controlling the selective type I error; i.e., the error rate of a test given that it was performed. By doing so, we recover long-run frequency properties among selected hypotheses analogous to those that apply in the classical (non-adaptive) context. Our proposal is closely related to data splitting and has a similar intuitive justification, b… ▽ More

    Submitted 18 April, 2017; v1 submitted 9 October, 2014; originally announced October 2014.

  34. arXiv:1311.6238  [pdf, ps, other

    math.ST stat.ME stat.ML

    Exact post-selection inference, with application to the lasso

    Authors: Jason D. Lee, Dennis L. Sun, Yuekai Sun, Jonathan E. Taylor

    Abstract: We develop a general approach to valid inference after model selection. At the core of our framework is a result that characterizes the distribution of a post-selection estimator conditioned on the selection event. We specialize the approach to model selection by the lasso to form valid confidence intervals for the selected coefficients and test whether all relevant variables have been included in… ▽ More

    Submitted 3 May, 2016; v1 submitted 25 November, 2013; originally announced November 2013.

    Comments: Published at http://dx.doi.org/10.1214/15-AOS1371 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1371

    Journal ref: Annals of Statistics 2016, Vol. 44, No. 3, 907-927

  35. arXiv:1210.3709  [pdf, other

    math.OC cs.IT math.NA stat.ML

    A Rank-Corrected Procedure for Matrix Completion with Fixed Basis Coefficients

    Authors: Weimin Miao, Shaohua Pan, Defeng Sun

    Abstract: For the problems of low-rank matrix completion, the efficiency of the widely-used nuclear norm technique may be challenged under many circumstances, especially when certain basis coefficients are fixed, for example, the low-rank correlation matrix completion in various fields such as the financial market and the low-rank density matrix completion from the quantum state tomography. To seek a soluti… ▽ More

    Submitted 22 June, 2015; v1 submitted 13 October, 2012; originally announced October 2012.

    Comments: 51 pages, 4 figures

  36. arXiv:1209.2076  [pdf, other

    stat.AP

    Estimating a Signal from a Magnitude Spectrogram via Convex Optimization

    Authors: Dennis L. Sun, Julius O. Smith III

    Abstract: The problem of recovering a signal from the magnitude of its short-time Fourier transform (STFT) is a longstanding one in audio signal processing. Existing approaches rely on heuristics that often perform poorly because of the nonconvexity of the problem. We introduce a formulation of the problem that lends itself to a tractable convex program. We observe that our method yields better reconstructi… ▽ More

    Submitted 10 September, 2012; originally announced September 2012.

  37. Objective Bayesian analysis under sequential experimentation

    Authors: Dongchu Sun, James O. Berger

    Abstract: Objective priors for sequential experiments are considered. Common priors, such as the Jeffreys prior and the reference prior, will typically depend on the stopping rule used for the sequential experiment. New expressions for reference priors are obtained in various contexts, and computational issues involving such priors are considered.

    Submitted 20 May, 2008; originally announced May 2008.

    Comments: Published in at http://dx.doi.org/10.1214/074921708000000020 the IMS Collections (http://www.imstat.org/publications/imscollections.htm) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-COLL3-IMSCOLL302 MSC Class: 62L12; 62C10 (Primary) 62F15; 62L10 (Secondary)

    Journal ref: IMS Collections 2008, Vol. 3, 19-32