Showing 1–13 of 13 results for author: Dunlavy, D M

  1. arXiv:2505.04957  [pdf, other]

    math.ST stat.ME

    The Poisson tensor completion non-parametric differential entropy estimator

    Authors: Daniel M. Dunlavy, Richard B. Lehoucq, Carolyn D. Mayer, Arvind Prasadan

    Abstract: We introduce the Poisson tensor completion (PTC) estimator, a non-parametric differential entropy estimator. The PTC estimator leverages inter-sample relationships to compute a low-rank Poisson tensor decomposition of the frequency histogram. Our crucial observation is that the histogram bins are an instance of a space partitioning of counts and thus can be identified with a spatial Poisson proces…

    Submitted 8 May, 2025; v1 submitted 8 May, 2025; originally announced May 2025.

    Comments: 14 pages, 8 figures

    Report number: SAND2025-05664R

  2. arXiv:2404.10085   

    cs.IT

    The Average Spectrum Norm and Near-Optimal Tensor Completion

    Authors: Oscar López, Richard Lehoucq, Carlos Llosa-Vite, Arvind Prasadan, Daniel M. Dunlavy

    Abstract: We introduce a new tensor norm, the average spectrum norm, to study sample complexity of tensor completion problems based on the canonical polyadic decomposition (CPD). Properties of the average spectrum norm and its dual norm are investigated, demonstrating their utility for low-rank tensor recovery analysis. Our novel approach significantly reduces the provable sample rate for CPD-based noisy te…

    Submitted 17 June, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: Error in Section 2.1.2

  3. arXiv:2310.10872  [pdf, other]

    cs.DC

    Computing Sparse Tensor Decompositions via Chapel and C++/MPI Interoperability without Intermediate I/O

    Authors: S. Isaac Geronimo Anderson, Daniel M. Dunlavy

    Abstract: We extend an existing approach for efficient use of shared mapped memory across Chapel and C++ for graph data stored as 1-D arrays to sparse tensor data stored using a combination of 2-D and 1-D arrays. We describe the specific extensions that provide use of shared mapped memory tensor data for a particular C++ tensor decomposition tool called GentenMPI. We then demonstrate our approach on several…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 9 pages, 2 tables

    Report number: SAND2023-11029R

  4. arXiv:2307.03276  [pdf, other]

    cs.DC

    Analyzing the Performance Portability of Tensor Decomposition

    Authors: S. Isaac Geronimo Anderson, Keita Teranishi, Daniel M. Dunlavy, Jee Choi

    Abstract: We employ pressure point analysis and roofline modeling to identify performance bottlenecks and determine an upper bound on the performance of the Canonical Polyadic Alternating Poisson Regression Multiplicative Update (CP-APR MU) algorithm in the SparTen software library. Our analyses reveal that a particular matrix computation, $Φ^{(n)}$, is the critical performance bottleneck in the SparTen CP-…

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: 28 pages, 19 figures

    ACM Class: C.1.2; C.1.4; D.4.8; G.4

  5. arXiv:2207.14341  [pdf, other]

    math.NA cs.MS

    Tensor Decompositions for Count Data that Leverage Stochastic and Deterministic Optimization

    Authors: Jeremy M. Myers, Daniel M. Dunlavy

    Abstract: There is growing interest to extend low-rank matrix decompositions to multi-way arrays, or tensors. One fundamental low-rank tensor decomposition is the canonical polyadic decomposition (CPD). The challenge of fitting a low-rank, nonnegative CPD model to Poisson-distributed count data is of particular interest. Several popular algorithms use local search methods to approximate the maximum likeliho…

    Submitted 11 July, 2024; v1 submitted 18 July, 2022; originally announced July 2022.

    Report number: SAND2023-08238R

    ACM Class: G.1.3; G.4

  6. arXiv:2201.10014  [pdf, other]

    stat.ME cs.MS math.NA math.ST stat.ML

    Zero-Truncated Poisson Regression for Sparse Multiway Count Data Corrupted by False Zeros

    Authors: Oscar López, Daniel M. Dunlavy, Richard B. Lehoucq

    Abstract: We propose a novel statistical inference methodology for multiway count data that is corrupted by false zeros that are indistinguishable from true zero counts. Our approach consists of zero-truncating the Poisson distribution to neglect all zero values. This simple truncated approach dispenses with the need to distinguish between true and false zero counts and reduces the amount of data to be proc…

    Submitted 11 April, 2023; v1 submitted 24 January, 2022; originally announced January 2022.

    Comments: 30 pages, 5 figures

    Report number: SAND2022-0803R

  7. arXiv:2012.01520  [pdf, other]

    math.NA cs.MS cs.PF stat.CO

    Parameter Sensitivity Analysis of the SparTen High Performance Sparse Tensor Decomposition Software: Extended Analysis

    Authors: Jeremy M. Myers, Daniel M. Dunlavy, Keita Teranishi, D. S. Hollman

    Abstract: Tensor decomposition models play an increasingly important role in modern data science applications. One problem of particular interest is fitting a low-rank Canonical Polyadic (CP) tensor decomposition model when the tensor has sparse structure and the tensor elements are nonnegative count data. SparTen is a high-performance C++ library which computes a low-rank decomposition using different solv…

    Submitted 2 December, 2020; originally announced December 2020.

    Comments: 33 pages, 13 figures

    Report number: SAND2020-11901R

  8. arXiv:2009.10644  [pdf, other]

    stat.ML cs.AI cs.CR cs.LG

    Using Neural Architecture Search for Improving Software Flaw Detection in Multimodal Deep Learning Models

    Authors: Alexis Cooper, Xin Zhou, Scott Heidbrink, Daniel M. Dunlavy

    Abstract: Software flaw detection using multimodal deep learning models has been demonstrated as a very competitive approach on benchmark problems. In this work, we demonstrate that even better performance can be achieved using neural architecture search (NAS) combined with multimodal learning models. We adapt a NAS framework aimed at investigating image classification to the problem of software flaw detect…

    Submitted 22 September, 2020; originally announced September 2020.

    Comments: 10 pages, 5 figures, 4 tables

    Report number: SAND2020-10141R

  9. arXiv:2009.04549  [pdf, other]

    cs.LG cs.AI cs.CR cs.SE stat.ML

    Multimodal Deep Learning for Flaw Detection in Software Programs

    Authors: Scott Heidbrink, Kathryn N. Rodhouse, Daniel M. Dunlavy

    Abstract: We explore the use of multiple deep learning models for detecting flaws in software programs. Current, standard approaches for flaw detection rely on a single representation of a software program (e.g., source code or a program binary). We illustrate that, by using techniques from multimodal deep learning, we can simultaneously leverage multiple representations of software programs to improve flaw…

    Submitted 9 September, 2020; originally announced September 2020.

    Comments: 13 pages, 2 figures, 5 tables

    Report number: SAND2020-9429R

  10. arXiv:1906.11133  [pdf, other]

    cs.CR cs.AI cs.LG stat.ML

    A Review of Machine Learning Applications in Fuzzing

    Authors: Gary J Saavedra, Kathryn N Rodhouse, Daniel M Dunlavy, Philip W Kegelmeyer

    Abstract: Fuzzing has played an important role in improving software development and testing over the course of several decades. Recent research in fuzzing has focused on applications of machine learning (ML), offering useful tools to overcome challenges in the fuzzing process. This review surveys the current research in applying ML to fuzzing. Specifically, this review discusses successful applications of…

    Submitted 9 October, 2019; v1 submitted 13 June, 2019; originally announced June 2019.

  11. arXiv:1105.3422  [pdf, other]

    math.NA physics.data-an stat.ML

    All-at-once Optimization for Coupled Matrix and Tensor Factorizations

    Authors: Evrim Acar, Tamara G. Kolda, Daniel M. Dunlavy

    Abstract: Joint analysis of data from multiple sources has the potential to improve our understanding of the underlying structures in complex data sets. For instance, in restaurant recommendation systems, recommendations can be based on rating histories of customers. In addition to rating histories, customers' social networks (e.g., Facebook friendships) and restaurant categories information (e.g., Thai or…

    Submitted 17 May, 2011; originally announced May 2011.

  12. arXiv:1005.4006  [pdf, other]

    math.NA physics.data-an stat.ML

    Temporal Link Prediction using Matrix and Tensor Factorizations

    Authors: Daniel M. Dunlavy, Tamara G. Kolda, Evrim Acar

    Abstract: The data in many disciplines such as social networks, web analysis, etc. is link-based, and the link structure can be exploited for many different data mining tasks. In this paper, we consider the problem of temporal link prediction: Given link data for times 1 through T, can we predict the links at time T+1? If our data has underlying periodic structure, can we predict out even further in time, i…

    Submitted 19 June, 2010; v1 submitted 21 May, 2010; originally announced May 2010.

    Journal ref: ACM Transactions on Knowledge Discovery from Data 5(2):10 (27 pages), February 2011

  13. arXiv:1005.2197  [pdf, other]

    math.NA physics.data-an

    Scalable Tensor Factorizations for Incomplete Data

    Authors: Evrim Acar, Tamara G. Kolda, Daniel M. Dunlavy, Morten Morup

    Abstract: The problem of incomplete data - i.e., data with missing or unknown values - in multi-way arrays is ubiquitous in biomedical signal processing, network traffic analysis, bibliometrics, social network analysis, chemometrics, computer vision, communication networks, etc. We consider the problem of how to factorize data sets with missing values with the goal of capturing the underlying latent struc…

    Submitted 12 May, 2010; originally announced May 2010.

    ACM Class: G.1.3; G.1.6

    Journal ref: Chemometrics and Intelligent Laboratory Systems 106(1):41-56, Mar. 2011