-
From Neurons to Neutrons: A Case Study in Interpretability
Authors:
Ouail Kitouni,
Niklas Nolte,
Víctor Samuel Pérez-Díaz,
Sokratis Trifinopoulos,
Mike Williams
Abstract:
Mechanistic Interpretability (MI) promises a path toward fully understanding how neural networks make their predictions. Prior work demonstrates that even when trained to perform simple arithmetic, models can implement a variety of algorithms (sometimes concurrently) depending on initialization and hyperparameters. Does this mean neuron-level interpretability techniques have limited applicability?…
▽ More
Mechanistic Interpretability (MI) promises a path toward fully understanding how neural networks make their predictions. Prior work demonstrates that even when trained to perform simple arithmetic, models can implement a variety of algorithms (sometimes concurrently) depending on initialization and hyperparameters. Does this mean neuron-level interpretability techniques have limited applicability? We argue that high-dimensional neural networks can learn low-dimensional representations of their training data that are useful beyond simply making good predictions. Such representations can be understood through the mechanistic interpretability lens and provide insights that are surprisingly faithful to human-derived domain knowledge. This indicates that such approaches to interpretability can be useful for deriving a new understanding of a problem from models trained to solve it. As a case study, we extract nuclear physics concepts by studying models trained to reproduce nuclear data.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Applied Machine Learning to Anomaly Detection in Enterprise Purchase Processes
Authors:
A. Herreros-Martínez,
R. Magdalena-Benedicto,
J. Vila-Francés,
A. J. Serrano-López,
S. Pérez-Díaz
Abstract:
In a context of a continuous digitalisation of processes, organisations must deal with the challenge of detecting anomalies that can reveal suspicious activities upon an increasing volume of data. To pursue this goal, audit engagements are carried out regularly, and internal auditors and purchase specialists are constantly looking for new methods to automate these processes. This work proposes a m…
▽ More
In a context of a continuous digitalisation of processes, organisations must deal with the challenge of detecting anomalies that can reveal suspicious activities upon an increasing volume of data. To pursue this goal, audit engagements are carried out regularly, and internal auditors and purchase specialists are constantly looking for new methods to automate these processes. This work proposes a methodology to prioritise the investigation of the cases detected in two large purchase datasets from real data. The goal is to contribute to the effectiveness of the companies' control efforts and to increase the performance of carrying out such tasks. A comprehensive Exploratory Data Analysis is carried out before using unsupervised Machine Learning techniques addressed to detect anomalies. A univariate approach has been applied through the z-Score index and the DBSCAN algorithm, while a multivariate analysis is implemented with the k-Means and Isolation Forest algorithms, and the Silhouette index, resulting in each method having a transaction candidates' proposal to be reviewed. An ensemble prioritisation of the candidates is provided jointly with a proposal of explicability methods (LIME, Shapley, SHAP) to help the company specialists in their understanding.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
Unsupervised Machine Learning for the Classification of Astrophysical X-ray Sources
Authors:
Víctor Samuel Pérez-Díaz,
Juan Rafael Martínez-Galarza,
Alexander Caicedo,
Raffaele D'Abrusco
Abstract:
The automatic classification of X-ray detections is a necessary step in extracting astrophysical information from compiled catalogs of astrophysical sources. Classification is useful for the study of individual objects, statistics for population studies, as well as for anomaly detection, i.e., the identification of new unexplored phenomena, including transients and spectrally extreme sources. Desp…
▽ More
The automatic classification of X-ray detections is a necessary step in extracting astrophysical information from compiled catalogs of astrophysical sources. Classification is useful for the study of individual objects, statistics for population studies, as well as for anomaly detection, i.e., the identification of new unexplored phenomena, including transients and spectrally extreme sources. Despite the importance of this task, classification remains challenging in X-ray astronomy due to the lack of optical counterparts and representative training sets. We develop an alternative methodology that employs an unsupervised machine learning approach to provide probabilistic classes to Chandra Source Catalog sources with a limited number of labeled sources, and without ancillary information from optical and infrared catalogs. We provide a catalog of probabilistic classes for 8,756 sources, comprising a total of 14,507 detections, and demonstrate the success of the method at identifying emission from young stellar objects, as well as distinguishing between small-scale and large-scale compact accretors with a significant level of confidence. We investigate the consistency between the distribution of features among classified objects and well-established astrophysical hypotheses such as the unified AGN model. This provides interpretability to the probabilistic classifier. Code and tables are available publicly through GitHub. We provide a web playground for readers to explore our final classification at https://umlcaxs-playground.streamlit.app.
△ Less
Submitted 22 January, 2024;
originally announced January 2024.
-
Determination and (re)parametrization of rational developable surfaces
Authors:
Sonia Perez-Diaz,
Li-Yong Shen
Abstract:
The developable surface is an important surface in computer aided design, geometric modeling and industrial manufactory. It is often given in the stan- dard parametric form, but it can also be in the implicit form which is commonly used in algebraic geometry. Not all algebraic developable surfaces have rational parametrizations. In this paper, we focus on the rational developable surfaces. For a g…
▽ More
The developable surface is an important surface in computer aided design, geometric modeling and industrial manufactory. It is often given in the stan- dard parametric form, but it can also be in the implicit form which is commonly used in algebraic geometry. Not all algebraic developable surfaces have rational parametrizations. In this paper, we focus on the rational developable surfaces. For a given algebraic surface, we first determine whether it is developable by geometric inspection, and we give a rational proper parametrization for the af- firmative case. For a rational parametric surface, we can also determine the developability and give a proper reparametrization for the developable surface.
△ Less
Submitted 10 May, 2013;
originally announced May 2013.
-
Numerical Reparametrization of Rational Parametric Plane Curves
Authors:
Sonia Perez-Diaz,
Li-Yong Shen
Abstract:
In this paper, we present an algorithm for reparametrizing algebraic plane curves from a numerical point of view. That is, we deal with mathematical objects that are assumed to be given approximately. More precisely, given a tolerance $ε>0$ and a rational parametrization $\cal P$ with perturbed float coefficients of a plane curve $\cal C$, we present an algorithm that computes a parametrization…
▽ More
In this paper, we present an algorithm for reparametrizing algebraic plane curves from a numerical point of view. That is, we deal with mathematical objects that are assumed to be given approximately. More precisely, given a tolerance $ε>0$ and a rational parametrization $\cal P$ with perturbed float coefficients of a plane curve $\cal C$, we present an algorithm that computes a parametrization $\cal Q$ of a new plane curve $\cal D$ such that ${\cal Q}$ is an {\it $ε$--proper reparametrization} of $\cal D$. In addition, the error bound is carefully discussed and we present a formula that measures the "closeness" between the input curve $\cal C$ and the output curve $\cal D$.
△ Less
Submitted 14 August, 2013; v1 submitted 10 May, 2013;
originally announced May 2013.