Skip to main content

Showing 1–2 of 2 results for author: Ghiringhelli, L M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.11404  [pdf, other

    stat.ML cond-mat.mtrl-sci cs.LG physics.comp-ph

    How big is Big Data?

    Authors: Daniel T. Speckhard, Tim Bechtel, Luca M. Ghiringhelli, Martin Kuban, Santiago Rigamonti, Claudia Draxl

    Abstract: Big data has ushered in a new wave of predictive power using machine learning models. In this work, we assess what {\it big} means in the context of typical materials-science machine-learning problems. This concerns not only data volume, but also data quality and veracity as much as infrastructure issues. With selected examples, we ask (i) how models generalize to similar datasets, (ii) how high-q… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

  2. arXiv:2001.11212  [pdf, other

    stat.ML cs.IT cs.LG physics.data-an

    TCMI: a non-parametric mutual-dependence estimator for multivariate continuous distributions

    Authors: Benjamin Regler, Matthias Scheffler, Luca M. Ghiringhelli

    Abstract: The identification of relevant features, i.e., the driving variables that determine a process or the properties of a system, is an essential part of the analysis of data sets with a large number of variables. A mathematical rigorous approach to quantifying the relevance of these features is mutual information. Mutual information determines the relevance of features in terms of their joint mutual d… ▽ More

    Submitted 30 July, 2022; v1 submitted 30 January, 2020; originally announced January 2020.

    Comments: 28 pages, 8 figures, 8 tables

    Journal ref: Data Mining and Knowledge Discovery (2022)