Skip to main content

Showing 1–25 of 25 results for author: Kolda, T G

Searching in archive math. Search in all archives.
.
  1. arXiv:2504.03937  [pdf, other

    math.NA math.AG math.OC

    The Fascinating World of 2 $\times$ 2 $\times$ 2 Tensors: Its Geometry and Optimization Challenges

    Authors: Gabriel H. Brown, Joe Kileel, Tamara G. Kolda

    Abstract: This educational article highlights the geometric and algebraic complexities that distinguish tensors from matrices, to supplement coverage in advanced courses on linear algebra, matrix analysis, and tensor decompositions. Using the case of real-valued 2 $\times$ 2 $\times$ 2 tensors, we show how tensors violate many well-known properties of matrices: (1) The rank of a matrix is bounded by its sma… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

  2. arXiv:2408.05677  [pdf, other

    math.NA cs.LG

    Tensor Decomposition Meets RKHS: Efficient Algorithms for Smooth and Misaligned Data

    Authors: Brett W. Larsen, Tamara G. Kolda, Anru R. Zhang, Alex H. Williams

    Abstract: The canonical polyadic (CP) tensor decomposition decomposes a multidimensional data array into a sum of outer products of finite-dimensional vectors. Instead, we can replace some or all of the vectors with continuous functions (infinite-dimensional vectors) from a reproducing kernel Hilbert space (RKHS). We refer to tensors with some infinite-dimensional modes as quasitensors, and the approach of… ▽ More

    Submitted 10 August, 2024; originally announced August 2024.

  3. arXiv:2305.06927  [pdf, other

    cs.LG math.OC stat.ML

    Convergence of Alternating Gradient Descent for Matrix Factorization

    Authors: Rachel Ward, Tamara G. Kolda

    Abstract: We consider alternating gradient descent (AGD) with fixed step size applied to the asymmetric matrix factorization objective. We show that, for a rank-$r$ matrix $\mathbf{A} \in \mathbb{R}^{m \times n}$, $T = C (\frac{σ_1(\mathbf{A})}{σ_r(\mathbf{A})})^2 \log(1/ε)$ iterations of alternating gradient descent suffice to reach an $ε$-optimal factorization… ▽ More

    Submitted 7 February, 2024; v1 submitted 11 May, 2023; originally announced May 2023.

  4. arXiv:2204.10824  [pdf, ps, other

    math.NA

    Scalable symmetric Tucker tensor decomposition

    Authors: Ruhui Jin, Joe Kileel, Tamara G. Kolda, Rachel Ward

    Abstract: We study the best low-rank Tucker decomposition of symmetric tensors. The motivating application is decomposing higher-order multivariate moments. Moment tensors have special structure and are important to various data science problems. We advocate for projected gradient descent (PGD) method and higher-order eigenvalue decomposition (HOEVD) approximation as computation schemes. Most importantly, w… ▽ More

    Submitted 10 June, 2023; v1 submitted 22 April, 2022; originally announced April 2022.

  5. arXiv:2202.06930  [pdf, other

    stat.ML cs.LG math.NA

    Tensor Moments of Gaussian Mixture Models: Theory and Applications

    Authors: João M. Pereira, Joe Kileel, Tamara G. Kolda

    Abstract: Gaussian mixture models (GMMs) are fundamental tools in statistical and data sciences. We study the moments of multivariate Gaussians and GMMs. The $d$-th moment of an $n$-dimensional random variable is a symmetric $d$-way tensor of size $n^d$, so working with moments naively is assumed to be prohibitively expensive for $d>2$ and larger values of $n$. In this work, we develop theory and numerical… ▽ More

    Submitted 21 March, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

  6. arXiv:2201.10638  [pdf, ps, other

    math.NA cs.DS

    Sketching Matrix Least Squares via Leverage Scores Estimates

    Authors: Brett W. Larsen, Tamara G. Kolda

    Abstract: We consider the matrix least squares problem of the form $\| \mathbf{A} \mathbf{X}-\mathbf{B} \|_F^2$ where the design matrix $\mathbf{A} \in \mathbb{R}^{N \times r}$ is tall and skinny with $N \gg r$. We propose to create a sketched version $\| \tilde{\mathbf{A}}\mathbf{X}-\tilde{\mathbf{B}} \|_F^2$ where the sketched matrices $\tilde{\mathbf{A}}$ and $\tilde{\mathbf{B}}$ contain weighted subsets… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: This is detailed and standalone derivation of a result that already appears in (arXiv:2006.16438, Appendix A). arXiv admin note: substantial text overlap with arXiv:2006.16438

  7. arXiv:2110.14514  [pdf, other

    math.NA cs.LG cs.MS

    Streaming Generalized Canonical Polyadic Tensor Decompositions

    Authors: Eric Phipps, Nick Johnson, Tamara G. Kolda

    Abstract: In this paper, we develop a method which we call OnlineGCP for computing the Generalized Canonical Polyadic (GCP) tensor decomposition of streaming data. GCP differs from traditional canonical polyadic (CP) tensor decompositions as it allows for arbitrary objective functions which the CP model attempts to minimize. This approach can provide better fits and more interpretable models when the observ… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

  8. arXiv:2006.16438  [pdf, other

    math.NA

    Practical Leverage-Based Sampling for Low-Rank Tensor Decomposition

    Authors: Brett W. Larsen, Tamara G. Kolda

    Abstract: The low-rank canonical polyadic tensor decomposition is useful in data analysis and can be computed by solving a sequence of overdetermined least squares subproblems. Motivated by consideration of sparse tensors, we propose sketching each subproblem using leverage scores to select a subset of the rows, with probabilistic guarantees on the solution accuracy. We randomly sample rows proportional to… ▽ More

    Submitted 3 January, 2022; v1 submitted 29 June, 2020; originally announced June 2020.

  9. Estimating Higher-Order Moments Using Symmetric Tensor Decomposition

    Authors: Samantha Sherman, Tamara G. Kolda

    Abstract: We consider the problem of decomposing higher-order moment tensors, i.e., the sum of symmetric outer products of data vectors. Such a decomposition can be used to estimate the means in a Gaussian mixture model and for other applications in machine learning. The $d$th-order empirical moment tensor of a set of $p$ observations of $n$ variables is a symmetric $d$-way tensor. Our goal is to find a low… ▽ More

    Submitted 21 April, 2020; v1 submitted 9 November, 2019; originally announced November 2019.

    Journal ref: SIAM Journal on Matrix Analysis and Applications, Vol. 41, No. 3, pp. 1369-1387, September 2020

  10. arXiv:1909.04801  [pdf, ps, other

    cs.IT math.NA math.PR

    Faster Johnson-Lindenstrauss Transforms via Kronecker Products

    Authors: Ruhui Jin, Tamara G. Kolda, Rachel Ward

    Abstract: The Kronecker product is an important matrix operation with a wide range of applications in supporting fast linear transforms, including signal processing, graph theory, quantum computing and deep learning. In this work, we introduce a generalization of the fast Johnson-Lindenstrauss projection for embedding vectors with Kronecker product structure, the Kronecker fast Johnson-Lindenstrauss transfo… ▽ More

    Submitted 30 July, 2020; v1 submitted 10 September, 2019; originally announced September 2019.

    Comments: Information and Inference: A Journal of the IMA, 2020

  11. arXiv:1906.01687  [pdf, other

    math.NA cs.LG stat.ML

    Stochastic Gradients for Large-Scale Tensor Decomposition

    Authors: Tamara G. Kolda, David Hong

    Abstract: Tensor decomposition is a well-known tool for multiway data analysis. This work proposes using stochastic gradients for efficient generalized canonical polyadic (GCP) tensor decomposition of large-scale tensors. GCP tensor decomposition is a recently proposed version of tensor decomposition that allows for a variety of loss functions such as Bernoulli loss for binary data or Huber loss for robust… ▽ More

    Submitted 7 July, 2020; v1 submitted 4 June, 2019; originally announced June 2019.

    Journal ref: SIAM Journal on Mathematics of Data Science, Vol. 2, No. 4, pp. 1066-1095, 2020

  12. arXiv:1808.07452  [pdf, other

    math.NA cs.LG

    Generalized Canonical Polyadic Tensor Decomposition

    Authors: David Hong, Tamara G. Kolda, Jed A. Duersch

    Abstract: Tensor decomposition is a fundamental unsupervised machine learning method in data science, with applications including network analysis and sensor data processing. This work develops a generalized canonical polyadic (GCP) low-rank tensor decomposition that allows other loss functions besides squared error. For instance, we can use logistic loss or Kullback-Leibler divergence, enabling tensor deco… ▽ More

    Submitted 21 January, 2019; v1 submitted 22 August, 2018; originally announced August 2018.

    Journal ref: SIAM Review, Vol. 62, No. 1, pp. 133-163, 2020

  13. A Practical Randomized CP Tensor Decomposition

    Authors: Casey Battaglino, Grey Ballard, Tamara G. Kolda

    Abstract: The CANDECOMP/PARAFAC (CP) decomposition is a leading method for the analysis of multiway data. The standard alternating least squares algorithm for the CP decomposition (CP-ALS) involves a series of highly overdetermined linear least squares problems. We extend randomized least squares methods to tensors and show the workload of CP-ALS can be drastically reduced without a sacrifice in quality. We… ▽ More

    Submitted 22 October, 2017; v1 submitted 23 January, 2017; originally announced January 2017.

    Journal ref: SIAM Journal on Matrix Analysis and Applications, Vol. 39, No. 2, pp. 876-901, 2018

  14. Parallel Tensor Compression for Large-Scale Scientific Data

    Authors: Woody Austin, Grey Ballard, Tamara G. Kolda

    Abstract: As parallel computing trends towards the exascale, scientific data produced by high-fidelity simulations are growing increasingly massive. For instance, a simulation on a three-dimensional spatial grid with 512 points per dimension that tracks 64 variables per grid point for 128 time steps yields 8~TB of data, assuming double precision. By viewing the data as a dense five-way tensor, we can comput… ▽ More

    Submitted 23 February, 2016; v1 submitted 22 October, 2015; originally announced October 2015.

    Journal ref: IPDPS'16: Proceedings of the 30th IEEE International Parallel and Distributed Processing Symposium, pp. 912-922, May 2016

  15. arXiv:1503.01375  [pdf, other

    math.NA

    Symmetric Orthogonal Tensor Decomposition is Trivial

    Authors: Tamara G. Kolda

    Abstract: We consider the problem of decomposing a real-valued symmetric tensor as the sum of outer products of real-valued, pairwise orthogonal vectors. Such decompositions do not generally exist, but we show that some symmetric tensor decomposition problems can be converted to orthogonal problems following the whitening procedure proposed by Anandkumar et al. (2012). If an orthogonal decomposition of an… ▽ More

    Submitted 4 March, 2015; originally announced March 2015.

  16. Numerical Optimization for Symmetric Tensor Decomposition

    Authors: Tamara G. Kolda

    Abstract: We consider the problem of decomposing a real-valued symmetric tensor as the sum of outer products of real-valued vectors. Algebraic methods exist for computing complex-valued decompositions of symmetric tensors, but here we focus on real-valued decompositions, both unconstrained and nonnegative, for problems with low-rank structure. We discuss when solutions exist and how to formulate the mathema… ▽ More

    Submitted 19 February, 2015; v1 submitted 16 October, 2014; originally announced October 2014.

    Journal ref: Mathematical Programming B, Vol. 151, No. 1, pp. 225-248, April 2015

  17. An Adaptive Shifted Power Method for Computing Generalized Tensor Eigenpairs

    Authors: Tamara G. Kolda, Jackson R. Mayo

    Abstract: Several tensor eigenpair definitions have been put forth in the past decade, but these can all be unified under generalized tensor eigenpair framework, introduced by Chang, Pearson, and Zhang (2009). Given mth-order, n-dimensional real-valued symmetric tensors A and B, the goal is to find $λ\in R$ and $x \in R^n$, $x \neq 0$, such that $Ax^{m-1} = λBx^{m-1}$. Different choices for B yield differen… ▽ More

    Submitted 9 June, 2014; v1 submitted 6 January, 2014; originally announced January 2014.

    MSC Class: 15A18; 15A69

    Journal ref: SIAM Journal on Matrix Analysis and Applications 35(4):1563-1581, December 2014

  18. Newton-Based Optimization for Kullback-Leibler Nonnegative Tensor Factorizations

    Authors: Samantha Hansen, Todd Plantenga, Tamara G. Kolda

    Abstract: Tensor factorizations with nonnegative constraints have found application in analyzing data from cyber traffic, social networks, and other areas. We consider application data best described as being generated by a Poisson process (e.g., count data), which leads to sparse tensors that can be modeled by sparse factor matrices. In this paper we investigate efficient techniques for computing an approp… ▽ More

    Submitted 10 November, 2014; v1 submitted 17 April, 2013; originally announced April 2013.

    Comments: Clarified notation in section 3.1.1, and used simpler score() function in section B.2

    Journal ref: Optimization Methods and Software, Vol. 30, No. 5, pp. 1002-1029, April 2015

  19. arXiv:1301.7744  [pdf, ps, other

    math.NA cs.MS

    Exploiting Symmetry in Tensors for High Performance: Multiplication with Symmetric Tensors

    Authors: Martin D. Schatz, Tze Meng Low, Robert A. van de Geijn, Tamara G. Kolda

    Abstract: Symmetric tensor operations arise in a wide variety of computations. However, the benefits of exploiting symmetry in order to reduce storage and computation is in conflict with a desire to simplify memory access patterns. In this paper, we propose a blocked data structure (Blocked Compact Symmetric Storage) wherein we consider the tensor by blocks and store only the unique blocks of a symmetric te… ▽ More

    Submitted 9 April, 2014; v1 submitted 31 January, 2013; originally announced January 2013.

    MSC Class: 15-02 (Primary)

    Journal ref: SIAM Journal on Scientific Computing, Vol. 36, No. 5, pp. C453-C479, September 2014

  20. On Tensors, Sparsity, and Nonnegative Factorizations

    Authors: Eric C. Chi, Tamara G. Kolda

    Abstract: Tensors have found application in a variety of fields, ranging from chemometrics to signal processing and beyond. In this paper, we consider the problem of multilinear modeling of sparse count data. Our goal is to develop a descriptive tensor factorization model of such data, along with appropriate algorithms and theory. To do so, we propose that the random variation is best described via a Poisso… ▽ More

    Submitted 14 August, 2012; v1 submitted 11 December, 2011; originally announced December 2011.

    Journal ref: SIAM Journal on Matrix Analysis and Applications 33(4):1272-1299, 2012

  21. arXiv:1105.3422  [pdf, other

    math.NA physics.data-an stat.ML

    All-at-once Optimization for Coupled Matrix and Tensor Factorizations

    Authors: Evrim Acar, Tamara G. Kolda, Daniel M. Dunlavy

    Abstract: Joint analysis of data from multiple sources has the potential to improve our understanding of the underlying structures in complex data sets. For instance, in restaurant recommendation systems, recommendations can be based on rating histories of customers. In addition to rating histories, customers' social networks (e.g., Facebook friendships) and restaurant categories information (e.g., Thai or… ▽ More

    Submitted 17 May, 2011; originally announced May 2011.

  22. arXiv:1010.3043  [pdf, other

    math.NA stat.CO stat.ME

    Making Tensor Factorizations Robust to Non-Gaussian Noise

    Authors: Eric C. Chi, Tamara G. Kolda

    Abstract: Tensors are multi-way arrays, and the Candecomp/Parafac (CP) tensor factorization has found application in many different domains. The CP model is typically fit using a least squares objective function, which is a maximum likelihood estimate under the assumption of i.i.d. Gaussian noise. We demonstrate that this loss function can actually be highly sensitive to non-Gaussian noise. Therefore, we pr… ▽ More

    Submitted 14 October, 2010; originally announced October 2010.

    Comments: Contributed presentation at the NIPS Workshop on Tensors, Kernels, and Machine Learning, Whistler, BC, Canada, December 10, 2010

  23. Shifted Power Method for Computing Tensor Eigenpairs

    Authors: Tamara G. Kolda, Jackson R. Mayo

    Abstract: Recent work on eigenvalues and eigenvectors for tensors of order m >= 3 has been motivated by applications in blind source separation, magnetic resonance imaging, molecular conformation, and more. In this paper, we consider methods for computing real symmetric-tensor eigenpairs of the form Ax^{m-1} = λx subject to ||x||=1, which is closely related to optimal rank-1 approximation of a symmetric ten… ▽ More

    Submitted 22 February, 2011; v1 submitted 7 July, 2010; originally announced July 2010.

    MSC Class: 15A18; 15A69

    Journal ref: SIAM Journal on Matrix Analysis and Applications, 32(4):1095-1124, 2011

  24. arXiv:1005.4006  [pdf, other

    math.NA physics.data-an stat.ML

    Temporal Link Prediction using Matrix and Tensor Factorizations

    Authors: Daniel M. Dunlavy, Tamara G. Kolda, Evrim Acar

    Abstract: The data in many disciplines such as social networks, web analysis, etc. is link-based, and the link structure can be exploited for many different data mining tasks. In this paper, we consider the problem of temporal link prediction: Given link data for times 1 through T, can we predict the links at time T+1? If our data has underlying periodic structure, can we predict out even further in time, i… ▽ More

    Submitted 19 June, 2010; v1 submitted 21 May, 2010; originally announced May 2010.

    Journal ref: ACM Transactions on Knowledge Discovery from Data 5(2):10 (27 pages), February 2011

  25. arXiv:1005.2197  [pdf, other

    math.NA physics.data-an

    Scalable Tensor Factorizations for Incomplete Data

    Authors: Evrim Acar, Tamara G. Kolda, Daniel M. Dunlavy, Morten Morup

    Abstract: The problem of incomplete data - i.e., data with missing or unknown values - in multi-way arrays is ubiquitous in biomedical signal processing, network traffic analysis, bibliometrics, social network analysis, chemometrics, computer vision, communication networks, etc. We consider the problem of how to factorize data sets with missing values with the goal of capturing the underlying latent struc… ▽ More

    Submitted 12 May, 2010; originally announced May 2010.

    ACM Class: G.1.3; G.1.6

    Journal ref: Chemometrics and Intelligent Laboratory Systems 106(1):41-56, Mar. 2011