-
A discrete dislocation analysis of size-dependent plasticity in torsion
Authors:
A. Cruzado,
M. P. Ariza,
A. Needleman,
M. Ortiz,
A. A. Benzerga
Abstract:
A method for solving three dimensional discrete dislocation plasticity boundary-value problems using a monopole representation of the dislocations is presented. At each time step, the displacement, strain and stress fields in a finite body are obtained by superposition of infinite body dislocation fields and an image field that enforces the boundary conditions. The three dimensional infinite body…
▽ More
A method for solving three dimensional discrete dislocation plasticity boundary-value problems using a monopole representation of the dislocations is presented. At each time step, the displacement, strain and stress fields in a finite body are obtained by superposition of infinite body dislocation fields and an image field that enforces the boundary conditions. The three dimensional infinite body fields are obtained by representing dislocations as being comprised of points, termed monopoles, that carry dislocation line and Burgers vector information. The image fields are obtained from a three dimensional linear elastic finite element calculation. The implementation of the coupling of the monopole representation with the finite element method, including the interaction of curved dislocations with free surfaces, is presented in some detail because it differs significantly from an implementation with a line based dislocation representation. Numerical convergence and the modeling of dislocation loop nucleation for large scale computations are investigated. The monopole discrete dislocation plasticity framework is used to investigate the effect of size and initial dislocation density on the torsion of wires with diameters varying over three orders of magnitude. Depending on the initial dislocation source density and the wire diameter, three regimes of torsion-twist response are obtained: (i) for wires with a sufficiently small diameter, plastic deformation is nucleation controlled and is strongly size dependent; (ii) for wires with larger diameters dislocation plasticity is dislocation interaction controlled, with the emergence of geometrically necessary dislocations and dislocation pile-ups playing a key role, and is strongly size dependent; and (iii) for wires with sufficiently large diameters plastic deformation becomes less heterogeneous and the dependence on size is greatly diminished.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
RSMM: A Framework to Assess Maturity of Research Software Project
Authors:
Deekshitha,
Rena Bakhshi,
Jason Maassen,
Carlos Martinez Ortiz,
Rob van Nieuwpoort,
Slinger Jansen
Abstract:
The organizations and researchers producing research software face a common problem of making their software sustainable beyond funding provided by a single research project. This is addressed by research software engineers through building communities around their software, providing appropriate licensing, creating reliable and reproducible research software, making it sustainable and impactful,…
▽ More
The organizations and researchers producing research software face a common problem of making their software sustainable beyond funding provided by a single research project. This is addressed by research software engineers through building communities around their software, providing appropriate licensing, creating reliable and reproducible research software, making it sustainable and impactful, promoting, and ensuring that the research software is easy to adopt in research workflows, etc. As a result, numerous practices and guidelines exist to enhance research software quality, reusability, and sustainability. However, there is a lack of a unified framework to systematically integrate these practices and help organizations and research software developers refine their development and management processes. Our paper aims at bridging this gap by introducing a novel framework: RSMM. It is designed through systematic literature review and insights from interviews with research software project experts. In short, RSMM offers a structured pathway for evaluating and refining research software project management by categorizing 79 best practices into 17 capabilities across 4 focus areas. From assessing code quality and security to measuring impact, sustainability, and reproducibility, the model provides a complete evaluation of a research software project maturity. With RSMM, individuals as well as organizations involved in research software development gain a systematic approach to tackling various research software engineering challenges. By utilizing RSMM as a comprehensive checklist, organizations can systematically evaluate and refine their project management practices and organizational structure.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Towards Practicable Algorithms for Rewriting Graph Queries beyond DL-Lite
Authors:
Bianca Löhnert,
Nikolaus Augsten,
Cem Okulmus,
Magdalena Ortiz
Abstract:
Despite the many advantages that ontology-based data access (OBDA) has brought to a range of application domains, state-of-the-art OBDA systems still do not support popular graph database management systems such as Neo4j. Algorithms for query rewriting focus on languages like conjunctive queries and their unions, which are fragments of first-order logic and were developed for relational data. Such…
▽ More
Despite the many advantages that ontology-based data access (OBDA) has brought to a range of application domains, state-of-the-art OBDA systems still do not support popular graph database management systems such as Neo4j. Algorithms for query rewriting focus on languages like conjunctive queries and their unions, which are fragments of first-order logic and were developed for relational data. Such query languages are poorly suited for querying graph data. Moreover, they also limit the expressiveness of the ontology languages that admit rewritings, restricting them to those where the data complexity of reasoning is not higher than it is in first-order logic. In this paper, we propose a technique for rewriting a family of navigational queries for a suitably restricted fragment of ELHI that extends DL-Lite and that is NL-complete in data complexity. We implemented a proof-of-concept prototype that rewrites into Cypher queries, and tested it on a real-world cognitive neuroscience use case with promising results.
△ Less
Submitted 23 April, 2025; v1 submitted 28 May, 2024;
originally announced May 2024.
-
ConvDTW-ACS: Audio Segmentation for Track Type Detection During Car Manufacturing
Authors:
Álvaro López-Chilet,
Zhaoyi Liu,
Jon Ander Gómez,
Carlos Alvarez,
Marivi Alonso Ortiz,
Andres Orejuela Mesa,
David Newton,
Friedrich Wolf-Monheim,
Sam Michiels,
Danny Hughes
Abstract:
This paper proposes a method for Acoustic Constrained Segmentation (ACS) in audio recordings of vehicles driven through a production test track, delimiting the boundaries of surface types in the track. ACS is a variant of classical acoustic segmentation where the sequence of labels is known, contiguous and invariable, which is especially useful in this work as the test track has a standard configu…
▽ More
This paper proposes a method for Acoustic Constrained Segmentation (ACS) in audio recordings of vehicles driven through a production test track, delimiting the boundaries of surface types in the track. ACS is a variant of classical acoustic segmentation where the sequence of labels is known, contiguous and invariable, which is especially useful in this work as the test track has a standard configuration of surface types. The proposed ConvDTW-ACS method utilizes a Convolutional Neural Network for classifying overlapping image chunks extracted from the full audio spectrogram. Then, our custom Dynamic Time Warping algorithm aligns the sequence of predicted probabilities to the sequence of surface types in the track, from which timestamps of the surface type boundaries can be extracted. The method was evaluated on a real-world dataset collected from the Ford Manufacturing Plant in Valencia (Spain), achieving a mean error of 166 milliseconds when delimiting, within the audio, the boundaries of the surfaces in the track. The results demonstrate the effectiveness of the proposed method in accurately segmenting different surface types, which could enable the development of more specialized AI systems to improve the quality inspection process.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
Data-Driven Games in Computational Mechanics
Authors:
Kerstin Weinberg,
Laurent Strainier,
Sergio Conti,
Michael Ortiz
Abstract:
We resort to game theory in order to formulate Data-Driven methods for solid mechanics in which stress and strain players pursue different objectives. The objective of the stress player is to minimize the discrepancy to a material data set, whereas the objective of the strain player is to ensure the admissibility of the mechanical state, in the sense of compatibility and equilibrium. We show that,…
▽ More
We resort to game theory in order to formulate Data-Driven methods for solid mechanics in which stress and strain players pursue different objectives. The objective of the stress player is to minimize the discrepancy to a material data set, whereas the objective of the strain player is to ensure the admissibility of the mechanical state, in the sense of compatibility and equilibrium. We show that, unlike the cooperative Data-Driven games proposed in the past, the new non-cooperative Data-Driven games identify an effective material law from the data and reduce to conventional displacement boundary-value problems, which facilitates their practical implementation. However, unlike supervised machine learning methods, the proposed non-cooperative Data-Driven games are unsupervised, ansatz-free and parameter-free. In particular, the effective material law is learned from the data directly, without recourse to regression to a parameterized class of functions such as neural networks. We present analysis that elucidates sufficient conditions for convergence of the Data-Driven solutions with respect to the data. We also present selected examples of implementation and application that demonstrate the range and versatility of the approach.
△ Less
Submitted 26 May, 2023;
originally announced May 2023.
-
BioJam Camp: toward justice through bioengineering and biodesign co-learning with youth
Authors:
Callie Chappell,
Henry A. -A.,
Elvia B. O.,
Emily B.,
Bailey B.,
Jacqueline C. -M.,
Caroline Daws,
Cristian F.,
Emiliano G.,
Page Goddard,
Xavier G.,
Anne Hu,
Gabriela J.,
Kelley Langhans,
Briana Martin-Villa,
Penny M. -S.,
Jennifer M.,
Soyang N.,
Melissa Ortiz,
Aryana P.,
Trisha S,
Corinne Takara,
Emily T.,
Paloma Vazquez,
Rolando Perez
, et al. (1 additional authors not shown)
Abstract:
BioJam is a political, artistic, and educational project in which Bay Area artists, scientists, and educators collaborate with youth and communities of color to address historical exclusion of their communities in STEM fields and reframe what science can be. As an intergenerational collective, we co-learn on topics of culture (social and biological), community (cultural and ecological), and creati…
▽ More
BioJam is a political, artistic, and educational project in which Bay Area artists, scientists, and educators collaborate with youth and communities of color to address historical exclusion of their communities in STEM fields and reframe what science can be. As an intergenerational collective, we co-learn on topics of culture (social and biological), community (cultural and ecological), and creativity. We reject the notion that increasing the number of scientists of color requires inculcation in the ways of the dominant culture. Instead, we center cultural practices, traditional ways of knowing, storytelling, art, experiential learning, and community engagement to break down the framing that positions these practices as distinct from science. The goal of this work is to realize a future in which the practice of science is relatable, accessible, and liberatory.
△ Less
Submitted 1 November, 2022;
originally announced November 2022.
-
Exploring Attention GAN for Vehicle Motion Prediction
Authors:
Carlos Gómez-Huélamo,
Marcos V. Conde,
Miguel Ortiz,
Santiago Montiel,
Rafael Barea,
Luis M. Bergasa
Abstract:
The design of a safe and reliable Autonomous Driving stack (ADS) is one of the most challenging tasks of our era. These ADS are expected to be driven in highly dynamic environments with full autonomy, and a reliability greater than human beings. In that sense, to efficiently and safely navigate through arbitrarily complex traffic scenarios, ADS must have the ability to forecast the future trajecto…
▽ More
The design of a safe and reliable Autonomous Driving stack (ADS) is one of the most challenging tasks of our era. These ADS are expected to be driven in highly dynamic environments with full autonomy, and a reliability greater than human beings. In that sense, to efficiently and safely navigate through arbitrarily complex traffic scenarios, ADS must have the ability to forecast the future trajectories of surrounding actors. Current state-of-the-art models are typically based on Recurrent, Graph and Convolutional networks, achieving noticeable results in the context of vehicle prediction. In this paper we explore the influence of attention in generative models for motion prediction, considering both physical and social context to compute the most plausible trajectories. We first encode the past trajectories using a LSTM network, which serves as input to a Multi-Head Self-Attention module that computes the social context. On the other hand, we formulate a weighted interpolation to calculate the velocity and orientation in the last observation frame in order to calculate acceptable target points, extracted from the driveable of the HDMap information, which represents our physical context. Finally, the input of our generator is a white noise vector sampled from a multivariate normal distribution while the social and physical context are its conditions, in order to predict plausible trajectories. We validate our method using the Argoverse Motion Forecasting Benchmark 1.1, achieving competitive unimodal results.
△ Less
Submitted 26 September, 2022;
originally announced September 2022.
-
Model-Free Data-Driven Inference in Computational Mechanics
Authors:
Erik Prume,
Stefanie Reese,
Michael Ortiz
Abstract:
We extend the model-free Data-Driven computing paradigm to solids and structures that are stochastic due to intrinsic randomness in the material behavior. The behavior of such materials is characterized by a likelihood measure instead of a constitutive relation. We specifically assume that the material likelihood measure is known only through an empirical point-data set in material or phase space.…
▽ More
We extend the model-free Data-Driven computing paradigm to solids and structures that are stochastic due to intrinsic randomness in the material behavior. The behavior of such materials is characterized by a likelihood measure instead of a constitutive relation. We specifically assume that the material likelihood measure is known only through an empirical point-data set in material or phase space. The state of the solid or structure is additionally subject to compatibility and equilibrium constraints. The problem is then to infer the likelihood of a given structural outcome of interest. In this work, we present a Data-Driven method of inference that determines likelihoods of outcomes from the empirical material data and that requires no material or prior modeling. In particular, the computation of expectations is reduced to explicit sums over local material data sets and to quadratures over admissible states, i. e., states satisfying compatibility and equilibrium. The complexity of the material data-set sums is linear in the number of data points and in the number of members in the structure. Efficient population annealing procedures and fast search algorithms for accelerating the calculations are presented. The scope, cost and convergence properties of the method are assessed with the aid selected applications and benchmark tests.
△ Less
Submitted 28 October, 2022; v1 submitted 13 July, 2022;
originally announced July 2022.
-
Efficient entity-based reinforcement learning
Authors:
Vince Jankovics,
Michael Garcia Ortiz,
Eduardo Alonso
Abstract:
Recent deep reinforcement learning (DRL) successes rely on end-to-end learning from fixed-size observational inputs (e.g. image, state-variables). However, many challenging and interesting problems in decision making involve observations or intermediary representations which are best described as a set of entities: either the image-based approach would miss small but important details in the obser…
▽ More
Recent deep reinforcement learning (DRL) successes rely on end-to-end learning from fixed-size observational inputs (e.g. image, state-variables). However, many challenging and interesting problems in decision making involve observations or intermediary representations which are best described as a set of entities: either the image-based approach would miss small but important details in the observations (e.g. ojects on a radar, vehicles on satellite images, etc.), the number of sensed objects is not fixed (e.g. robotic manipulation), or the problem simply cannot be represented in a meaningful way as an image (e.g. power grid control, or logistics). This type of structured representations is not directly compatible with current DRL architectures, however, there has been an increase in machine learning techniques directly targeting structured information, potentially addressing this issue. We propose to combine recent advances in set representations with slot attention and graph neural networks to process structured data, broadening the range of applications of DRL algorithms. This approach allows to address entity-based problems in an efficient and scalable way. We show that it can improve training time and robustness significantly, and demonstrate their potential to handle structured as well as purely visual domains, on multiple environments from the Atari Learning Environment and Simple Playgrounds.
△ Less
Submitted 6 June, 2022;
originally announced June 2022.
-
Exploring Map-based Features for Efficient Attention-based Vehicle Motion Prediction
Authors:
Carlos Gómez-Huélamo,
Marcos V. Conde,
Miguel Ortiz
Abstract:
Motion prediction (MP) of multiple agents is a crucial task in arbitrarily complex environments, from social robots to self-driving cars. Current approaches tackle this problem using end-to-end networks, where the input data is usually a rendered top-view of the scene and the past trajectories of all the agents; leveraging this information is a must to obtain optimal performance. In that sense, a…
▽ More
Motion prediction (MP) of multiple agents is a crucial task in arbitrarily complex environments, from social robots to self-driving cars. Current approaches tackle this problem using end-to-end networks, where the input data is usually a rendered top-view of the scene and the past trajectories of all the agents; leveraging this information is a must to obtain optimal performance. In that sense, a reliable Autonomous Driving (AD) system must produce reasonable predictions on time, however, despite many of these approaches use simple ConvNets and LSTMs, models might not be efficient enough for real-time applications when using both sources of information (map and trajectory history). Moreover, the performance of these models highly depends on the amount of training data, which can be expensive (particularly the annotated HD maps). In this work, we explore how to achieve competitive performance on the Argoverse 1.0 Benchmark using efficient attention-based models, which take as input the past trajectories and map-based features from minimal map information to ensure efficient and reliable MP. These features represent interpretable information as the driveable area and plausible goal points, in opposition to black-box CNN-based methods for map processing.
△ Less
Submitted 10 June, 2022; v1 submitted 25 May, 2022;
originally announced May 2022.
-
Actions over Core-closed Knowledge Bases
Authors:
Claudia Cauli,
Magdalena Ortiz,
Nir Piterman
Abstract:
We present new results on the application of semantic- and knowledge-based reasoning techniques to the analysis of cloud deployments. In particular, to the security of Infrastructure as Code configuration files, encoded as description logic knowledge bases. We introduce an action language to model mutating actions; that is, actions that change the structural configuration of a given deployment by…
▽ More
We present new results on the application of semantic- and knowledge-based reasoning techniques to the analysis of cloud deployments. In particular, to the security of Infrastructure as Code configuration files, encoded as description logic knowledge bases. We introduce an action language to model mutating actions; that is, actions that change the structural configuration of a given deployment by adding, modifying, or deleting resources. We mainly focus on two problems: the problem of determining whether the execution of an action, no matter the parameters passed to it, will not cause the violation of some security requirement (static verification), and the problem of finding sequences of actions that would lead the deployment to a state where (un)desirable properties are (not) satisfied (plan existence and plan synthesis). For all these problems, we provide definitions, complexity results, and decision procedures.
△ Less
Submitted 25 February, 2022;
originally announced February 2022.
-
Evaluating generative networks using Gaussian mixtures of image features
Authors:
Lorenzo Luzi,
Carlos Ortiz Marrero,
Nile Wynar,
Richard G. Baraniuk,
Michael J. Henry
Abstract:
We develop a measure for evaluating the performance of generative networks given two sets of images. A popular performance measure currently used to do this is the Fréchet Inception Distance (FID). FID assumes that images featurized using the penultimate layer of Inception-v3 follow a Gaussian distribution, an assumption which cannot be violated if we wish to use FID as a metric. However, we show…
▽ More
We develop a measure for evaluating the performance of generative networks given two sets of images. A popular performance measure currently used to do this is the Fréchet Inception Distance (FID). FID assumes that images featurized using the penultimate layer of Inception-v3 follow a Gaussian distribution, an assumption which cannot be violated if we wish to use FID as a metric. However, we show that Inception-v3 features of the ImageNet dataset are not Gaussian; in particular, every single marginal is not Gaussian. To remedy this problem, we model the featurized images using Gaussian mixture models (GMMs) and compute the 2-Wasserstein distance restricted to GMMs. We define a performance measure, which we call WaM, on two sets of images by using Inception-v3 (or another classifier) to featurize the images, estimate two GMMs, and use the restricted $2$-Wasserstein distance to compare the GMMs. We experimentally show the advantages of WaM over FID, including how FID is more sensitive than WaM to imperceptible image perturbations. By modelling the non-Gaussian features obtained from Inception-v3 as GMMs and using a GMM metric, we can more accurately evaluate generative network performance.
△ Less
Submitted 22 July, 2022; v1 submitted 8 October, 2021;
originally announced October 2021.
-
A spatially adaptive phase-field model of fracture
Authors:
Dhananjay Phansalkar,
Kerstin Weinberg,
Michael Ortiz,
Sigrid Leyendecker
Abstract:
Phase-field models of fracture introduce smeared cracks of width commensurate with a regularisation length parameter $ε$ and obeying a minimum energy principle. Mesh adaptivity naturally suggests itself as a means of supplying spatial resolution were needed while simultaneously keeping the computational size of the model as small as possible. Here, a variational-based spatial adaptivity is propose…
▽ More
Phase-field models of fracture introduce smeared cracks of width commensurate with a regularisation length parameter $ε$ and obeying a minimum energy principle. Mesh adaptivity naturally suggests itself as a means of supplying spatial resolution were needed while simultaneously keeping the computational size of the model as small as possible. Here, a variational-based spatial adaptivity is proposed for a phase-field model of fracture.
The conventional phase-field model is generalised by allowing a spatial variation of the regularisation length $ε$ in the energy functional. The optimal spatial variation of the regularisation length then follows by energy minimisation in the same manner as the displacement and phase fields. The extended phase-field model is utilised as a basis for an adaptive mesh refinement strategy, whereby the mesh size is required to resolve the optimal length parameter locally. The resulting solution procedure is implemented in the framework of the finite element library FEniCS. Selected numerical tests suggest that the spatially adaptive phase-field model exhibits the same convergence rate as the conventional phase-field model, albeit with a vastly superior constant, which results in considerable computational savings.
△ Less
Submitted 21 September, 2021;
originally announced September 2021.
-
Efficient Data Structures for Model-free Data-Driven Computational Mechanics
Authors:
Robert Eggersmann,
Laurent Stainier,
Michael Ortiz,
Stefanie Reese
Abstract:
The data-driven computing paradigm initially introduced by Kirchdoerfer and Ortiz (2016) enables finite element computations in solid mechanics to be performed directly from material data sets, without an explicit material model. From a computational effort point of view, the most challenging task is the projection of admissible states at material points onto their closest states in the material d…
▽ More
The data-driven computing paradigm initially introduced by Kirchdoerfer and Ortiz (2016) enables finite element computations in solid mechanics to be performed directly from material data sets, without an explicit material model. From a computational effort point of view, the most challenging task is the projection of admissible states at material points onto their closest states in the material data set. In this study, we compare and develop several possible data structures for solving the nearest-neighbor problem. We show that approximate nearest-neighbor (ANN) algorithms can accelerate material data searches by several orders of magnitude relative to exact searching algorithms. The approximations are suggested by--and adapted to--the structure of the data-driven iterative solver and result in no significant loss of solution accuracy. We assess the performance of the ANN algorithm with respect to material data set size with the aid of a 3D elasticity test case. We show that computations on a single processor with up to one billion material data points are feasible within a few seconds execution time with a speedup of more than 106 with respect to exact k-d trees.
△ Less
Submitted 1 December, 2020;
originally announced December 2020.
-
Entanglement Induced Barren Plateaus
Authors:
Carlos Ortiz Marrero,
Mária Kieferová,
Nathan Wiebe
Abstract:
We argue that an excess in entanglement between the visible and hidden units in a Quantum Neural Network can hinder learning. In particular, we show that quantum neural networks that satisfy a volume-law in the entanglement entropy will give rise to models not suitable for learning with high probability. Using arguments from quantum thermodynamics, we then show that this volume law is typical and…
▽ More
We argue that an excess in entanglement between the visible and hidden units in a Quantum Neural Network can hinder learning. In particular, we show that quantum neural networks that satisfy a volume-law in the entanglement entropy will give rise to models not suitable for learning with high probability. Using arguments from quantum thermodynamics, we then show that this volume law is typical and that there exists a barren plateau in the optimization landscape due to entanglement. More precisely, we show that for any bounded objective function on the visible layers, the Lipshitz constants of the expectation value of that objective function will scale inversely with the dimension of the hidden-subsystem with high probability. We show how this can cause both gradient descent and gradient-free methods to fail. We note that similar problems can happen with quantum Boltzmann machines, although stronger assumptions on the coupling between the hidden/visible subspaces are necessary. We highlight how pretraining such generative models may provide a way to navigate these barren plateaus.
△ Less
Submitted 10 March, 2021; v1 submitted 29 October, 2020;
originally announced October 2020.
-
Modeling Atmospheric Data and Identifying Dynamics: Temporal Data-Driven Modeling of Air Pollutants
Authors:
Javier Rubio-Herrero,
Carlos Ortiz Marrero,
Wai-Tong Louis Fan
Abstract:
Atmospheric modeling has recently experienced a surge with the advent of deep learning. Most of these models, however, predict concentrations of pollutants following a data-driven approach in which the physical laws that govern their behaviors and relationships remain hidden. With the aid of real-world air quality data collected hourly in different stations throughout Madrid, we present an empiric…
▽ More
Atmospheric modeling has recently experienced a surge with the advent of deep learning. Most of these models, however, predict concentrations of pollutants following a data-driven approach in which the physical laws that govern their behaviors and relationships remain hidden. With the aid of real-world air quality data collected hourly in different stations throughout Madrid, we present an empirical approach using data-driven techniques with the following goals: (1) Find parsimonious systems of ordinary differential equations via sparse identification of nonlinear dynamics (SINDy) that model the concentration of pollutants and their changes over time; (2) assess the performance and limitations of our models using stability analysis; (3) reconstruct the time series of chemical pollutants not measured in certain stations using delay coordinate embedding results. Our results show that Akaike's Information Criterion can work well in conjunction with best subset regression as to find an equilibrium between sparsity and goodness of fit. We also find that, due to the complexity of the chemical system under study, identifying the dynamics of this system over longer periods of time require higher levels of data filtering and smoothing. Stability analysis for the reconstructed ordinary differential equations (ODEs) reveals that more than half of the physically relevant critical points are saddle points, suggesting that the system is unstable even under the idealized assumption that all environmental conditions are constant over time.
△ Less
Submitted 6 July, 2021; v1 submitted 13 October, 2020;
originally announced October 2020.
-
Generating Efficient DNN-Ensembles with Evolutionary Computation
Authors:
Marc Ortiz,
Florian Scheidegger,
Marc Casas,
Cristiano Malossi,
Eduard Ayguadé
Abstract:
In this work, we leverage ensemble learning as a tool for the creation of faster, smaller, and more accurate deep learning models. We demonstrate that we can jointly optimize for accuracy, inference time, and the number of parameters by combining DNN classifiers. To achieve this, we combine multiple ensemble strategies: bagging, boosting, and an ordered chain of classifiers. To reduce the number o…
▽ More
In this work, we leverage ensemble learning as a tool for the creation of faster, smaller, and more accurate deep learning models. We demonstrate that we can jointly optimize for accuracy, inference time, and the number of parameters by combining DNN classifiers. To achieve this, we combine multiple ensemble strategies: bagging, boosting, and an ordered chain of classifiers. To reduce the number of DNN ensemble evaluations during the search, we propose EARN, an evolutionary approach that optimizes the ensemble according to three objectives regarding the constraints specified by the user. We run EARN on 10 image classification datasets with an initial pool of 32 state-of-the-art DCNN on both CPU and GPU platforms, and we generate models with speedups up to $7.60\times$, reductions of parameters by $10\times$, or increases in accuracy up to $6.01\%$ regarding the best DNN in the pool. In addition, our method generates models that are $5.6\times$ faster than the state-of-the-art methods for automatic model generation.
△ Less
Submitted 3 May, 2021; v1 submitted 18 September, 2020;
originally announced September 2020.
-
Vehicle Telematics Via Exteroceptive Sensors: A Survey
Authors:
Fernando Molano Ortiz,
Matteo Sammarco,
Luís Henrique M. K. Costa,
Marcin Detyniecki
Abstract:
Whereas a very large number of sensors are available in the automotive field, currently just a few of them, mostly proprioceptive ones, are used in telematics, automotive insurance, and mobility safety research. In this paper, we show that exteroceptive sensors, like microphones or cameras, could replace proprioceptive ones in many fields. Our main motivation is to provide the reader with alternat…
▽ More
Whereas a very large number of sensors are available in the automotive field, currently just a few of them, mostly proprioceptive ones, are used in telematics, automotive insurance, and mobility safety research. In this paper, we show that exteroceptive sensors, like microphones or cameras, could replace proprioceptive ones in many fields. Our main motivation is to provide the reader with alternative ideas for the development of telematics applications when proprioceptive sensors are unusable for technological issues, privacy concerns, or lack of availability in commercial devices. We first introduce a taxonomy of sensors in telematics. Then, we review in detail all exteroceptive sensors of some interest for vehicle telematics, highlighting advantages, drawbacks, and availability in off-the-shelf devices. Successively, we present a list of notable telematics services and applications in research and industry like driving profiling or vehicular safety. For each of them, we report the most recent and important works relying on exteroceptive sensors, as long as the available datasets. We conclude showing open challenges using exteroceptive sensors both for industry and research.
△ Less
Submitted 27 August, 2020;
originally announced August 2020.
-
Considerations for Eye Tracking Experiments in Information Retrieval
Authors:
Michael Segundo Ortiz
Abstract:
In this survey I discuss ophthalmic neurophysiology and the experimental considerations that must be made to reduce possible noise in an eye-tracking data stream. I also review the history, experiments, technological benefits and limitations of eye-tracking within the information retrieval field. The concepts of aware and adaptive user interfaces are also explored that humbly make an attempt to sy…
▽ More
In this survey I discuss ophthalmic neurophysiology and the experimental considerations that must be made to reduce possible noise in an eye-tracking data stream. I also review the history, experiments, technological benefits and limitations of eye-tracking within the information retrieval field. The concepts of aware and adaptive user interfaces are also explored that humbly make an attempt to synthesize work from the fields of industrial engineering and psychophysiology with information retrieval.
△ Less
Submitted 24 July, 2020;
originally announced July 2020.
-
Estimating COVID-19 cases and reproduction number in Mexico
Authors:
Michelle Anzarut,
Luis Felipe González,
Sonia Mendizábal,
María Teresa Ortiz
Abstract:
In this report we fit a semi-mechanistic Bayesian hierarchical model to describe the Mexican COVID-19 epidemic. We obtain two epidemiological measures: the number of infections and the reproduction number. Estimations are based on death data. Hence, we expect our estimates to be more accurate than the attack rates estimated from the reported number of cases.
In this report we fit a semi-mechanistic Bayesian hierarchical model to describe the Mexican COVID-19 epidemic. We obtain two epidemiological measures: the number of infections and the reproduction number. Estimations are based on death data. Hence, we expect our estimates to be more accurate than the attack rates estimated from the reported number of cases.
△ Less
Submitted 17 July, 2020;
originally announced July 2020.
-
Data-driven fracture mechanics
Authors:
Pietro Carrara,
Laura De Lorenzis,
Laurent Stainier,
Michael Ortiz
Abstract:
We present a new data-driven paradigm for variational brittle fracture mechanics. The fracture-related material modeling assumptions are removed and the governing equations stemming from variational principles are combined with a set of discrete data points, leading to a model-free data-driven method of solution. The solution at a given load step is identified as the point within the data set that…
▽ More
We present a new data-driven paradigm for variational brittle fracture mechanics. The fracture-related material modeling assumptions are removed and the governing equations stemming from variational principles are combined with a set of discrete data points, leading to a model-free data-driven method of solution. The solution at a given load step is identified as the point within the data set that best satisfies either the Kuhn-Tucker conditions stemming from the variational fracture problem or global minimization of a suitable energy functional, leading to data-driven counterparts of both the local and the global minimization approaches of variational fracture mechanics. Both formulations are tested on different test configurations with and without noise and for Griffith and R-curve type fracture behavior.
△ Less
Submitted 10 September, 2020; v1 submitted 3 June, 2020;
originally announced June 2020.
-
An ExpTime Upper Bound for $\mathcal{ALC}$ with Integers (Extended Version)
Authors:
Nadia Labai,
Magdalena Ortiz,
Mantas Šimkus
Abstract:
Concrete domains, especially those that allow to compare features with numeric values, have long been recognized as a very desirable extension of description logics (DLs), and significant efforts have been invested into adding them to usual DLs while keeping the complexity of reasoning in check. For expressive DLs and in the presence of general TBoxes, for standard reasoning tasks like consistency…
▽ More
Concrete domains, especially those that allow to compare features with numeric values, have long been recognized as a very desirable extension of description logics (DLs), and significant efforts have been invested into adding them to usual DLs while keeping the complexity of reasoning in check. For expressive DLs and in the presence of general TBoxes, for standard reasoning tasks like consistency, the most general decidability results are for the so-called $ω$-admissible domains, which are required to be dense. Supporting non-dense domains for features that range over integers or natural numbers remained largely open, despite often being singled out as a highly desirable extension. The decidability of some extensions of $\mathcal{ALC}$ with non-dense domains has been shown, but existing results rely on powerful machinery that does not allow to infer any elementary bounds on the complexity of the problem. In this paper, we study an extension of $\mathcal{ALC}$ with a rich integer domain that allows for comparisons (between features, and between features and constants coded in unary), and prove that consistency can be solved using automata-theoretic techniques in single exponential time, and thus has no higher worst-case complexity than standard $\mathcal{ALC}$. Our upper bounds apply to some extensions of DLs with concrete domains known from the literature, support general TBoxes, and allow for comparing values along paths of ordinary (not necessarily functional) roles.
△ Less
Submitted 3 June, 2020;
originally announced June 2020.
-
Model-free Data-Driven Computational Mechanics Enhanced by Tensor Voting
Authors:
Robert Eggersmann,
Laurent Stainier,
Michael Ortiz,
Stefanie Reese
Abstract:
The data-driven computing paradigm initially introduced by Kirchdoerfer & Ortiz (2016) is extended by incorporating locally linear tangent spaces into the data set. These tangent spaces are constructed by means of the tensor voting method introduced by Mordohai & Medioni (2010) which improves the learning of the underlying structure of a data set. Tensor voting is an instance-based machine learnin…
▽ More
The data-driven computing paradigm initially introduced by Kirchdoerfer & Ortiz (2016) is extended by incorporating locally linear tangent spaces into the data set. These tangent spaces are constructed by means of the tensor voting method introduced by Mordohai & Medioni (2010) which improves the learning of the underlying structure of a data set. Tensor voting is an instance-based machine learning technique which accumulates votes from the nearest neighbors to build up second-order tensors encoding tangents and normals to the underlying data structure. The here proposed second-order data-driven paradigm is a plug-in method for distance-minimizing as well as entropy-maximizing data-driven schemes. Like its predecessor, the resulting method aims to minimize a suitably defined free energy over phase space subject to compatibility and equilibrium constraints. The method's implementation is straightforward and numerically efficient since the data structure analysis is performed in an offline step. Selected numerical examples are presented that establish the higher-order convergence properties of the data-driven solvers enhanced by tensor voting for ideal and noisy data sets.
△ Less
Submitted 7 April, 2020; v1 submitted 6 April, 2020;
originally announced April 2020.
-
Polynomial Rewritings from Expressive Description Logics with Closed Predicates to Variants of Datalog
Authors:
Shqiponja Ahmetaj,
Magdalena Ortiz,
Mantas Simkus
Abstract:
In many scenarios, complete and incomplete information coexist. For this reason, the knowledge representation and database communities have long shown interest in simultaneously supporting the closed- and the open-world views when reasoning about logic theories. Here we consider the setting of querying possibly incomplete data using logic theories, formalized as the evaluation of an ontology-media…
▽ More
In many scenarios, complete and incomplete information coexist. For this reason, the knowledge representation and database communities have long shown interest in simultaneously supporting the closed- and the open-world views when reasoning about logic theories. Here we consider the setting of querying possibly incomplete data using logic theories, formalized as the evaluation of an ontology-mediated query (OMQ) that pairs a query with a theory, sometimes called an ontology, expressing background knowledge. This can be further enriched by specifying a set of closed predicates from the theory that are to be interpreted under the closed-world assumption, while the rest are interpreted with the open-world view. In this way we can retrieve more precise answers to queries by leveraging the partial completeness of the data.
The central goal of this paper is to understand the relative expressiveness of OMQ languages in which the ontology is written in the expressive Description Logic (DL) ALCHOI and includes a set of closed predicates. We consider a restricted class of conjunctive queries. Our main result is to show that every query in this non-monotonic query language can be translated in polynomial time into Datalog with negation under the stable model semantics. To overcome the challenge that Datalog has no direct means to express the existential quantification present in ALCHOI, we define a two-player game that characterizes the satisfaction of the ontology, and design a Datalog query that can decide the existence of a winning strategy for the game. If there are no closed predicates, that is in the case of querying a plain ALCHOI knowledge base, our translation yields a positive disjunctive Datalog program of polynomial size. To the best of our knowledge, unlike previous translations for related fragments with expressive (non-Horn) DLs, these are the first polynomial time translations.
△ Less
Submitted 16 December, 2019;
originally announced December 2019.
-
Pebble-Intervals Automata and FO2 with Two Orders (Extended Version)
Authors:
Nadia Labai,
Tomer Kotek,
Magdalena Ortiz,
Helmut Veith
Abstract:
We introduce a novel automata model, called pebble-intervals automata (PIA), and study its power and closure properties. PIAs are tailored for a decidable fragment of FO that is important for reasoning about structures that use data values from infinite domains: the two-variable fragment with one total preorder and its induced successor relation, one linear order, and an arbitrary number of unary…
▽ More
We introduce a novel automata model, called pebble-intervals automata (PIA), and study its power and closure properties. PIAs are tailored for a decidable fragment of FO that is important for reasoning about structures that use data values from infinite domains: the two-variable fragment with one total preorder and its induced successor relation, one linear order, and an arbitrary number of unary relations. We prove that the string projection of every language of data words definable in the logic is accepted by a pebble-intervals automaton A, and obtain as a corollary an automata-theoretic proof of the EXPSPACE upper bound for finite satisfiability due to Schwentick and Zeume.
△ Less
Submitted 3 December, 2019; v1 submitted 30 November, 2019;
originally announced December 2019.
-
Robust Assessment of Real-World Adversarial Examples
Authors:
Brett Jefferson,
Carlos Ortiz Marrero
Abstract:
We explore rigorous, systematic, and controlled experimental evaluation of adversarial examples in the real world and propose a testing regimen for evaluation of real world adversarial objects. We show that for small scene/ environmental perturbations, large adversarial performance differences exist. Current state of adversarial reporting exists largely as a frequency count over a dynamic collecti…
▽ More
We explore rigorous, systematic, and controlled experimental evaluation of adversarial examples in the real world and propose a testing regimen for evaluation of real world adversarial objects. We show that for small scene/ environmental perturbations, large adversarial performance differences exist. Current state of adversarial reporting exists largely as a frequency count over a dynamic collections of scenes. Our work underscores the need for either a more complete report or a score that incorporates scene changes and baseline performance for models and environments tested by adversarial developers. We put forth a score that attempts to address the above issues in a straight-forward exemplar application for multiple generated adversary examples. We contribute the following: 1. a testbed for adversarial assessment, 2. a score for adversarial examples, and 3. a collection of additional evaluations on testbed data.
△ Less
Submitted 14 March, 2020; v1 submitted 23 November, 2019;
originally announced November 2019.
-
Hypernetwork Science via High-Order Hypergraph Walks
Authors:
Sinan G. Aksoy,
Cliff Joslyn,
Carlos Ortiz Marrero,
Brenda Praggastis,
Emilie Purvine
Abstract:
We propose high-order hypergraph walks as a framework to generalize graph-based network science techniques to hypergraphs. Edge incidence in hypergraphs is quantitative, yielding hypergraph walks with both length and width. Graph methods which then generalize to hypergraphs include connected component analyses, graph distance-based metrics such as closeness centrality, and motif-based measures suc…
▽ More
We propose high-order hypergraph walks as a framework to generalize graph-based network science techniques to hypergraphs. Edge incidence in hypergraphs is quantitative, yielding hypergraph walks with both length and width. Graph methods which then generalize to hypergraphs include connected component analyses, graph distance-based metrics such as closeness centrality, and motif-based measures such as clustering coefficients. We apply high-order analogs of these methods to real world hypernetworks, and show they reveal nuanced and interpretable structure that cannot be detected by graph-based methods. Lastly, we apply three generative models to the data and find that basic hypergraph properties, such as density and degree distributions, do not necessarily control these new structural measurements. Our work demonstrates how analyses of hypergraph-structured data are richer when utilizing tools tailored to capture hypergraph-native phenomena, and suggests one possible avenue towards that end.
△ Less
Submitted 8 June, 2020; v1 submitted 26 June, 2019;
originally announced June 2019.
-
Unsupervised Emergence of Egocentric Spatial Structure from Sensorimotor Prediction
Authors:
Alban Laflaquière,
Michael Garcia Ortiz
Abstract:
Despite its omnipresence in robotics application, the nature of spatial knowledge and the mechanisms that underlie its emergence in autonomous agents are still poorly understood. Recent theoretical works suggest that the Euclidean structure of space induces invariants in an agent's raw sensorimotor experience. We hypothesize that capturing these invariants is beneficial for sensorimotor prediction…
▽ More
Despite its omnipresence in robotics application, the nature of spatial knowledge and the mechanisms that underlie its emergence in autonomous agents are still poorly understood. Recent theoretical works suggest that the Euclidean structure of space induces invariants in an agent's raw sensorimotor experience. We hypothesize that capturing these invariants is beneficial for sensorimotor prediction and that, under certain exploratory conditions, a motor representation capturing the structure of the external space should emerge as a byproduct of learning to predict future sensory experiences. We propose a simple sensorimotor predictive scheme, apply it to different agents and types of exploration, and evaluate the pertinence of these hypotheses. We show that a naive agent can capture the topology and metric regularity of its sensor's position in an egocentric spatial frame without any a priori knowledge, nor extraneous supervision.
△ Less
Submitted 17 September, 2019; v1 submitted 4 June, 2019;
originally announced June 2019.
-
Ontology Focusing: Knowledge-enriched Databases on Demand
Authors:
Tomasz Gogacz,
Víctor Gutiérrez-Basulto,
Yazmín A. Ibáñez-García,
Filip Murlak,
Magdalena Ortiz,
Mantas Šimkus
Abstract:
We propose a novel framework to facilitate the on-demand design of data-centric systems by exploiting domain knowledge from an existing ontology. Its key ingredient is a process that we call focusing, which allows to obtain a schema for a (possibly knowledge-enriched) database semi-automatically, given an ontology and a specification of the scope of the desired system. We formalize the inputs and…
▽ More
We propose a novel framework to facilitate the on-demand design of data-centric systems by exploiting domain knowledge from an existing ontology. Its key ingredient is a process that we call focusing, which allows to obtain a schema for a (possibly knowledge-enriched) database semi-automatically, given an ontology and a specification of the scope of the desired system. We formalize the inputs and outputs of focusing, and identify relevant computational problems: finding a schema via focusing, testing its consistency, and answering queries in the knowledge-enriched databases it produces. These definitions are fully independent of the ontology language. We then instantiate the framework using selected description logics as ontology languages, and popular classes of queries for specifying the scope of the system. For several representative combinations, we study the decidability and complexity of the identified computational problems. As a by-product, we isolate (and solve) variants of classical decision problems in description logics, that are interesting in their own right.
△ Less
Submitted 30 March, 2019;
originally announced April 2019.
-
Model-Free Data-Driven Methods in Mechanics: Material Data Identification and Solvers
Authors:
Laurent Stainier,
Adrien Leygue,
Michael Ortiz
Abstract:
This paper presents an integrated model-free data-driven approach to solid mechanics, allowing to perform numerical simulations on structures on the basis of measures of displacement fields on representative samples, without postulating a specific constitutive model. A material data identification procedure, allowing to infer strain-stress pairs from displacement fields and boundary conditions, is…
▽ More
This paper presents an integrated model-free data-driven approach to solid mechanics, allowing to perform numerical simulations on structures on the basis of measures of displacement fields on representative samples, without postulating a specific constitutive model. A material data identification procedure, allowing to infer strain-stress pairs from displacement fields and boundary conditions, is used to build a material database from a set of mutiaxial tests on a non-conventional sample. This database is in turn used by a data-driven solver, based on an algorithm minimizing the distance between manifolds of compatible and balanced mechanical states and the given database, to predict the response of structures of the same material, with arbitrary geometry and boundary conditions. Examples illustrate this modelling cycle and demonstrate how the data-driven identification method allows importance sampling of the material state space, yielding faster convergence of simulation results with increasing database size, when compared to synthetic material databases with regular sampling patterns.
△ Less
Submitted 18 June, 2019; v1 submitted 15 March, 2019;
originally announced March 2019.
-
State representation learning with recurrent capsule networks
Authors:
Louis Annabi,
Michael Garcia Ortiz
Abstract:
Unsupervised learning of compact and relevant state representations has been proved very useful at solving complex reinforcement learning tasks. In this paper, we propose a recurrent capsule network that learns such representations by trying to predict the future observations in an agent's trajectory.
Unsupervised learning of compact and relevant state representations has been proved very useful at solving complex reinforcement learning tasks. In this paper, we propose a recurrent capsule network that learns such representations by trying to predict the future observations in an agent's trajectory.
△ Less
Submitted 22 February, 2019; v1 submitted 28 December, 2018;
originally announced December 2018.
-
Toward Exploratory Search in Biomedicine: Evaluating Document Clusters by MeSH as a Semantic Anchor
Authors:
Michael Segundo Ortiz,
Kazuhiro Seki,
Javed Mostafa
Abstract:
The current mode of biomedical literature search is severely limited in effectively finding information relevant to specialists. A potential approach to solving this problem is exploratory search, which allows users to interactively navigate through a vast document collection. As the first step toward exploratory search for specialists in biomedicine, this paper develops a methodology to evaluate…
▽ More
The current mode of biomedical literature search is severely limited in effectively finding information relevant to specialists. A potential approach to solving this problem is exploratory search, which allows users to interactively navigate through a vast document collection. As the first step toward exploratory search for specialists in biomedicine, this paper develops a methodology to evaluate quality of document clusters. For this purpose, we incorporate human expertise into data set creation and evaluation framework by leveraging MeSH terms as semantic anchors. In addition, we investigate the benefit of full-text data for improving cluster quality.
△ Less
Submitted 5 December, 2018;
originally announced December 2018.
-
Grounding Perception: A Developmental Approach to Sensorimotor Contingencies
Authors:
Alban Laflaquière,
Nikolas Hemion,
Michaël Garcia Ortiz,
Jean-Christophe Baillie
Abstract:
Sensorimotor contingency theory offers a promising account of the nature of perception, a topic rarely addressed in the robotics community. We propose a developmental framework to address the problem of the autonomous acquisition of sensorimotor contingencies by a naive robot. While exploring the world, the robot internally encodes contingencies as predictive models that capture the structure they…
▽ More
Sensorimotor contingency theory offers a promising account of the nature of perception, a topic rarely addressed in the robotics community. We propose a developmental framework to address the problem of the autonomous acquisition of sensorimotor contingencies by a naive robot. While exploring the world, the robot internally encodes contingencies as predictive models that capture the structure they imply in its sensorimotor experience. Three preliminary applications are presented to illustrate our approach to the acquisition of perceptive abilities: discovering the environment, discovering objects, and discovering a visual field.
△ Less
Submitted 3 October, 2018;
originally announced October 2018.
-
Unsupervised Emergence of Spatial Structure from Sensorimotor Prediction
Authors:
Alban Laflaquière,
Michael Garcia Ortiz
Abstract:
Despite its omnipresence in robotics application, the nature of spatial knowledge and the mechanisms that underlie its emergence in autonomous agents are still poorly understood. Recent theoretical work suggests that the concept of space can be grounded by capturing invariants induced by the structure of space in an agent's raw sensorimotor experience. Moreover, it is hypothesized that capturing t…
▽ More
Despite its omnipresence in robotics application, the nature of spatial knowledge and the mechanisms that underlie its emergence in autonomous agents are still poorly understood. Recent theoretical work suggests that the concept of space can be grounded by capturing invariants induced by the structure of space in an agent's raw sensorimotor experience. Moreover, it is hypothesized that capturing these invariants is beneficial for a naive agent trying to predict its sensorimotor experience. Under certain exploratory conditions, spatial representations should thus emerge as a byproduct of learning to predict. We propose a simple sensorimotor predictive scheme, apply it to different agents and types of exploration, and evaluate the pertinence of this hypothesis. We show that a naive agent can capture the topology and metric regularity of its spatial configuration without any a priori knowledge, nor extraneous supervision.
△ Less
Submitted 27 November, 2018; v1 submitted 2 October, 2018;
originally announced October 2018.
-
Measures of Cluster Informativeness for Medical Evidence Aggregation and Dissemination
Authors:
Michael Segundo Ortiz,
Sam Bubnovich,
Mengqian Wang,
Kazuhiro Seki Ph. D.,
Javed Mostafa Ph. D
Abstract:
The largest collection of medical evidence in the world is PubMed. However, the significant barrier in accessing and extracting information is information organization. A factor that contributes towards this barrier is managing medical controlled vocabularies that allow us to systematically and consistently organize, index, and search biomedical literature. Additionally, from users' perspective, t…
▽ More
The largest collection of medical evidence in the world is PubMed. However, the significant barrier in accessing and extracting information is information organization. A factor that contributes towards this barrier is managing medical controlled vocabularies that allow us to systematically and consistently organize, index, and search biomedical literature. Additionally, from users' perspective, to ultimately improve access, visualization is likely to play a powerful role. There is a strong link between information organization and information visualization, as many powerful visualizations depend on clustering methods. To improve visualization, therefore, one has to develop concrete and scalable measures for vocabularies used in indexing and their impact on document clustering. The focus of this study is on the development and evaluation of clustering methods. The paper concludes with demonstration of downstream network visualizations and their impact on discovering potentially valuable and latent genetic and molecular associations.
△ Less
Submitted 5 September, 2018;
originally announced September 2018.
-
Model-Free Data-Driven Inelasticity
Authors:
Robert Eggersmann,
Trenton Kirchdoerfer,
Stefanie Reese,
Laurent Stainier,
Michael Ortiz
Abstract:
We extend the Data-Driven formulation of problems in elasticity of Kirchdoerfer and Ortiz (2016) to inelasticity. This extension differs fundamentally from Data-Driven problems in elasticity in that the material data set evolves in time as a consequence of the history dependence of the material. We investigate three representational paradigms for the evolving material data sets: i) materials with…
▽ More
We extend the Data-Driven formulation of problems in elasticity of Kirchdoerfer and Ortiz (2016) to inelasticity. This extension differs fundamentally from Data-Driven problems in elasticity in that the material data set evolves in time as a consequence of the history dependence of the material. We investigate three representational paradigms for the evolving material data sets: i) materials with memory, i.e., conditioning the material data set to the past history of deformation; ii) differential materials, i.e., conditioning the material data set to short histories of stress and strain; and iii) history variables, i.e., conditioning the material data set to ad hoc variables encoding partial information about the history of stress and strain. We also consider combinations of the three paradigms thereof and investigate their ability to represent the evolving data sets of different classes of inelastic materials, including viscoelasticity, viscoplasticity and plasticity. We present selected numerical examples that demonstrate the range and scope of Data-Driven inelasticity and the numerical performance of implementations thereof.
△ Less
Submitted 10 September, 2018; v1 submitted 31 August, 2018;
originally announced August 2018.
-
Relaxing and Restraining Queries for OBDA
Authors:
Medina Andreşel,
Yazmin Ibáñez-García,
Magdalena Ortiz,
Mantas Šimkus
Abstract:
In ontology-based data access (OBDA), ontologies have been successfully employed for querying possibly unstructured and incomplete data. In this paper, we advocate using ontologies not only to formulate queries and compute their answers, but also for modifying queries by relaxing or restraining them, so that they can retrieve either more or less answers over a given dataset. Towards this goal, we…
▽ More
In ontology-based data access (OBDA), ontologies have been successfully employed for querying possibly unstructured and incomplete data. In this paper, we advocate using ontologies not only to formulate queries and compute their answers, but also for modifying queries by relaxing or restraining them, so that they can retrieve either more or less answers over a given dataset. Towards this goal, we first illustrate that some domain knowledge that could be naturally leveraged in OBDA can be expressed using complex role inclusions (CRI). Queries over ontologies with CRI are not first-order (FO) rewritable in general. We propose an extension of DL-Lite with CRI, and show that conjunctive queries over ontologies in this extension are FO rewritable. Our main contribution is a set of rules to relax and restrain conjunctive queries (CQs). Firstly, we define rules that use the ontology to produce CQs that are relaxations/restrictions over any dataset. Secondly, we introduce a set of data-driven rules, that leverage patterns in the current dataset, to obtain more fine-grained relaxations and restrictions.
△ Less
Submitted 8 August, 2018;
originally announced August 2018.
-
Learning Representations of Spatial Displacement through Sensorimotor Prediction
Authors:
Michael Garcia Ortiz,
Alban Laflaquière
Abstract:
Robots act in their environment through sequences of continuous motor commands. Because of the dimensionality of the motor space, as well as the infinite possible combinations of successive motor commands, agents need compact representations that capture the structure of the resulting displacements. In the case of an autonomous agent with no a priori knowledge about its sensorimotor apparatus, thi…
▽ More
Robots act in their environment through sequences of continuous motor commands. Because of the dimensionality of the motor space, as well as the infinite possible combinations of successive motor commands, agents need compact representations that capture the structure of the resulting displacements. In the case of an autonomous agent with no a priori knowledge about its sensorimotor apparatus, this compression has to be learned. We propose to use Recurrent Neural Networks to encode motor sequences into a compact representation, which is used to predict the consequence of motor sequences in term of sensory changes. We show that sensory prediction can successfully guide the compression of motor sequences into representations that are organized topologically in term of spatial displacement.
△ Less
Submitted 16 May, 2018;
originally announced May 2018.
-
Predicting Graph Categories from Structural Properties
Authors:
James P. Canning,
Emma E. Ingram,
Sammantha Nowak-Wolff,
Adriana M. Ortiz,
Nesreen K. Ahmed,
Ryan A. Rossi,
Karl R. B. Schmitt,
Sucheta Soundarajan
Abstract:
This paper has been withdrawn from arXiv.org due to a disagreement among the authors related to several peer-review comments received prior to submission on arXiv.org. Even though the current version of this paper is withdrawn, there was no disagreement between authors on the novel work in this paper.
One specific issue was the discussion of related work by Ikehara \& Clauset (found on page 8 of…
▽ More
This paper has been withdrawn from arXiv.org due to a disagreement among the authors related to several peer-review comments received prior to submission on arXiv.org. Even though the current version of this paper is withdrawn, there was no disagreement between authors on the novel work in this paper.
One specific issue was the discussion of related work by Ikehara \& Clauset (found on page 8 of the previously posted version). Peer-review comments on a similar version made ALL authors aware that the discussion misrepresented their work prior to submission to arXiv.org. However, some authors choose to post to arXiv a minimally updated version without the consent of all authors or properly addressing this attribution issue.
================
Original Paper Abstract:
Complex networks are often categorized according to the underlying phenomena that they represent such as molecular interactions, re-tweets, and brain activity. In this work, we investigate the problem of predicting the category (domain) of arbitrary networks. This includes complex networks from different domains as well as synthetically generated graphs from five different network models. A classification accuracy of $96.6\%$ is achieved using a random forest classifier with both real and synthetic networks. This work makes two important findings. First, our results indicate that complex networks from various domains have distinct structural properties that allow us to predict with high accuracy the category of a new previously unseen network. Second, synthetic graphs are trivial to classify as the classification model can predict with near-certainty the network model used to generate it. Overall, the results demonstrate that networks drawn from different domains (and network models) are trivial to distinguish using only a handful of simple structural properties.
△ Less
Submitted 1 August, 2019; v1 submitted 7 May, 2018;
originally announced May 2018.
-
Low-Precision Floating-Point Schemes for Neural Network Training
Authors:
Marc Ortiz,
Adrián Cristal,
Eduard Ayguadé,
Marc Casas
Abstract:
The use of low-precision fixed-point arithmetic along with stochastic rounding has been proposed as a promising alternative to the commonly used 32-bit floating point arithmetic to enhance training neural networks training in terms of performance and energy efficiency. In the first part of this paper, the behaviour of the 12-bit fixed-point arithmetic when training a convolutional neural network w…
▽ More
The use of low-precision fixed-point arithmetic along with stochastic rounding has been proposed as a promising alternative to the commonly used 32-bit floating point arithmetic to enhance training neural networks training in terms of performance and energy efficiency. In the first part of this paper, the behaviour of the 12-bit fixed-point arithmetic when training a convolutional neural network with the CIFAR-10 dataset is analysed, showing that such arithmetic is not the most appropriate for the training phase. After that, the paper presents and evaluates, under the same conditions, alternative low-precision arithmetics, starting with the 12-bit floating-point arithmetic. These two representations are then leveraged using local scaling in order to increase accuracy and get closer to the baseline 32-bit floating-point arithmetic. Finally, the paper introduces a simplified model in which both the outputs and the gradients of the neural networks are constrained to power-of-two values, just using 7 bits for their representation. The evaluation demonstrates a minimal loss in accuracy for the proposed Power-of-Two neural network, avoiding the use of multiplications and divisions and thereby, significantly reducing the training time as well as the energy consumption and memory requirements during the training and inference phases.
△ Less
Submitted 14 April, 2018;
originally announced April 2018.
-
Representation Learning in Partially Observable Environments using Sensorimotor Prediction
Authors:
Thibaut Kulak,
Michael Garcia Ortiz
Abstract:
In order to explore and act autonomously in an environment, an agent needs to learn from the sensorimotor information that is captured while acting. By extracting the regularities in this sensorimotor stream, it can learn a model of the world, which in turn can be used as a basis for action and exploration.
This requires the acquisition of compact representations from a possibly high dimensional…
▽ More
In order to explore and act autonomously in an environment, an agent needs to learn from the sensorimotor information that is captured while acting. By extracting the regularities in this sensorimotor stream, it can learn a model of the world, which in turn can be used as a basis for action and exploration.
This requires the acquisition of compact representations from a possibly high dimensional raw observation, which is noisy and ambiguous. In this paper, we learn sensory representations from sensorimotor prediction. We propose a model which integrates sensorimotor information over time, and project it in a sensory representation which is useful for prediction. We emphasize on a simple example the role of motor and memory for learning sensory representations.
△ Less
Submitted 26 April, 2018; v1 submitted 1 March, 2018;
originally announced March 2018.
-
Network Classification and Categorization
Authors:
James P. Canning,
Emma E. Ingram,
Sammantha Nowak-Wolff,
Adriana M. Ortiz,
Nesreen K. Ahmed,
Ryan A. Rossi,
Karl R. B. Schmitt,
Sucheta Soundarajan
Abstract:
To the best of our knowledge, this paper presents the first large-scale study that tests whether network categories (e.g., social networks vs. web graphs) are distinguishable from one another (using both categories of real-world networks and synthetic graphs). A classification accuracy of $94.2\%$ was achieved using a random forest classifier with both real and synthetic networks. This work makes…
▽ More
To the best of our knowledge, this paper presents the first large-scale study that tests whether network categories (e.g., social networks vs. web graphs) are distinguishable from one another (using both categories of real-world networks and synthetic graphs). A classification accuracy of $94.2\%$ was achieved using a random forest classifier with both real and synthetic networks. This work makes two important findings. First, real-world networks from various domains have distinct structural properties that allow us to predict with high accuracy the category of an arbitrary network. Second, classifying synthetic networks is trivial as our models can easily distinguish between synthetic graphs and the real-world networks they are supposed to model.
△ Less
Submitted 13 September, 2017;
originally announced September 2017.
-
How far is Facebook from me? Facebook network infrastructure analysis
Authors:
Reza Farahbakhsh,
Angel Cuevas,
Antonio M. Ortiz,
Xiao Han,
Noel Crespi
Abstract:
Facebook is today the most popular social network with more than one billion subscribers worldwide. To provide good quality of service (e.g., low access delay) to their clients, FB relies on Akamai, which provides a worldwide content distribution network with a large number of edge servers that are much closer to FB subscribers. In this article we aim to depict a global picture of the current FB n…
▽ More
Facebook is today the most popular social network with more than one billion subscribers worldwide. To provide good quality of service (e.g., low access delay) to their clients, FB relies on Akamai, which provides a worldwide content distribution network with a large number of edge servers that are much closer to FB subscribers. In this article we aim to depict a global picture of the current FB network infrastructure deployment taking into account both native FB servers and Akamai nodes. Toward this end, we have performed a measurement-based analysis during a period of two weeks using 463 Planet- Lab nodes distributed across 41 countries. Based on the obtained data we compare the average access delay that nodes in different countries experience accessing both native FB servers and Akamai nodes. In addition, we obtain a wide view of the deployment of Akamai nodes serving FB users worldwide. Finally, we analyze the geographical coverage of those nodes, and demonstrate that in most of the cases Akamai nodes located in a particular country service not only local FB subscribers, but also FB users located in nearby countries.
△ Less
Submitted 1 May, 2017;
originally announced May 2017.
-
Research Directions for Principles of Data Management (Dagstuhl Perspectives Workshop 16151)
Authors:
Serge Abiteboul,
Marcelo Arenas,
Pablo Barceló,
Meghyn Bienvenu,
Diego Calvanese,
Claire David,
Richard Hull,
Eyke Hüllermeier,
Benny Kimelfeld,
Leonid Libkin,
Wim Martens,
Tova Milo,
Filip Murlak,
Frank Neven,
Magdalena Ortiz,
Thomas Schwentick,
Julia Stoyanovich,
Jianwen Su,
Dan Suciu,
Victor Vianu,
Ke Yi
Abstract:
In April 2016, a community of researchers working in the area of Principles of Data Management (PDM) joined in a workshop at the Dagstuhl Castle in Germany. The workshop was organized jointly by the Executive Committee of the ACM Symposium on Principles of Database Systems (PODS) and the Council of the International Conference on Database Theory (ICDT). The mission of this workshop was to identify…
▽ More
In April 2016, a community of researchers working in the area of Principles of Data Management (PDM) joined in a workshop at the Dagstuhl Castle in Germany. The workshop was organized jointly by the Executive Committee of the ACM Symposium on Principles of Database Systems (PODS) and the Council of the International Conference on Database Theory (ICDT). The mission of this workshop was to identify and explore some of the most important research directions that have high relevance to society and to Computer Science today, and where the PDM community has the potential to make significant contributions. This report describes the family of research directions that the workshop focused on from three perspectives: potential practical relevance, results already obtained, and research questions that appear surmountable in the short and medium term.
△ Less
Submitted 31 January, 2017;
originally announced January 2017.
-
Managing Change in Graph-structured Data Using Description Logics (long version with appendix)
Authors:
Shqiponja Ahmetaj,
Diego Calvanese,
Magdalena Ortiz,
Mantas Simkus
Abstract:
In this paper, we consider the setting of graph-structured data that evolves as a result of operations carried out by users or applications. We study different reasoning problems, which range from ensuring the satisfaction of a given set of integrity constraints after a given sequence of updates, to deciding the (non-)existence of a sequence of actions that would take the data to an (un)desirable…
▽ More
In this paper, we consider the setting of graph-structured data that evolves as a result of operations carried out by users or applications. We study different reasoning problems, which range from ensuring the satisfaction of a given set of integrity constraints after a given sequence of updates, to deciding the (non-)existence of a sequence of actions that would take the data to an (un)desirable state, starting either from a specific data instance or from an incomplete description of it. We consider an action language in which actions are finite sequences of conditional insertions and deletions of nodes and labels, and use Description Logics for describing integrity constraints and (partial) states of the data. We then formalize the above data management problems as a static verification problem and several planning problems. We provide algorithms and tight complexity bounds for the formalized problems, both for an expressive DL and for a variant of DL-Lite.
△ Less
Submitted 29 May, 2014; v1 submitted 16 April, 2014;
originally announced April 2014.
-
Nested Regular Path Queries in Description Logics
Authors:
Meghyn Bienvenu,
Diego Calvanese,
Magdalena Ortiz,
Mantas Simkus
Abstract:
Two-way regular path queries (2RPQs) have received increased attention recently due to their ability to relate pairs of objects by flexibly navigating graph-structured data. They are present in property paths in SPARQL 1.1, the new standard RDF query language, and in the XML query language XPath. In line with XPath, we consider the extension of 2RPQs with nesting, which allows one to require that…
▽ More
Two-way regular path queries (2RPQs) have received increased attention recently due to their ability to relate pairs of objects by flexibly navigating graph-structured data. They are present in property paths in SPARQL 1.1, the new standard RDF query language, and in the XML query language XPath. In line with XPath, we consider the extension of 2RPQs with nesting, which allows one to require that objects along a path satisfy complex conditions, in turn expressed through (nested) 2RPQs. We study the computational complexity of answering nested 2RPQs and conjunctions thereof (CN2RPQs) in the presence of domain knowledge expressed in description logics (DLs). We establish tight complexity bounds in data and combined complexity for a variety of DLs, ranging from lightweight DLs (DL-Lite, EL) up to highly expressive ones. Interestingly, we are able to show that adding nesting to (C)2RPQs does not affect worst-case data complexity of query answering for any of the considered DLs. However, in the case of lightweight DLs, adding nesting to 2RPQs leads to a surprising jump in combined complexity, from P-complete to Exp-complete.
△ Less
Submitted 4 March, 2014; v1 submitted 27 February, 2014;
originally announced February 2014.
-
Reasoning about Explanations for Negative Query Answers in DL-Lite
Authors:
Diego Calvanese,
Magdalena Ortiz,
Mantas Simkus,
Giorgio Stefanoni
Abstract:
In order to meet usability requirements, most logic-based applications provide explanation facilities for reasoning services. This holds also for Description Logics, where research has focused on the explanation of both TBox reasoning and, more recently, query answering. Besides explaining the presence of a tuple in a query answer, it is important to explain also why a given tuple is missing. W…
▽ More
In order to meet usability requirements, most logic-based applications provide explanation facilities for reasoning services. This holds also for Description Logics, where research has focused on the explanation of both TBox reasoning and, more recently, query answering. Besides explaining the presence of a tuple in a query answer, it is important to explain also why a given tuple is missing. We address the latter problem for instance and conjunctive query answering over DL-Lite ontologies by adopting abductive reasoning; that is, we look for additions to the ABox that force a given tuple to be in the result. As reasoning tasks we consider existence and recognition of an explanation, and relevance and necessity of a given assertion for an explanation. We characterize the computational complexity of these problems for arbitrary, subset minimal, and cardinality minimal explanations.
△ Less
Submitted 3 February, 2014;
originally announced February 2014.
-
The Optimal Uncertainty Algorithm in the Mystic Framework
Authors:
M. McKerns,
H. Owhadi,
C. Scovel,
T. J. Sullivan,
M. Ortiz
Abstract:
We have recently proposed a rigorous framework for Uncertainty Quantification (UQ) in which UQ objectives and assumption/information set are brought into the forefront, providing a framework for the communication and comparison of UQ results. In particular, this framework does not implicitly impose inappropriate assumptions nor does it repudiate relevant information. This framework, which we call…
▽ More
We have recently proposed a rigorous framework for Uncertainty Quantification (UQ) in which UQ objectives and assumption/information set are brought into the forefront, providing a framework for the communication and comparison of UQ results. In particular, this framework does not implicitly impose inappropriate assumptions nor does it repudiate relevant information. This framework, which we call Optimal Uncertainty Quantification (OUQ), is based on the observation that given a set of assumptions and information, there exist bounds on uncertainties obtained as values of optimization problems and that these bounds are optimal. It provides a uniform environment for the optimal solution of the problems of validation, certification, experimental design, reduced order modeling, prediction, extrapolation, all under aleatoric and epistemic uncertainties. OUQ optimization problems are extremely large, and even though under general conditions they have finite-dimensional reductions, they must often be solved numerically. This general algorithmic framework for OUQ has been implemented in the mystic optimization framework. We describe this implementation, and demonstrate its use in the context of the Caltech surrogate model for hypervelocity impact.
△ Less
Submitted 6 February, 2012;
originally announced February 2012.
-
Optimal Uncertainty Quantification
Authors:
Houman Owhadi,
Clint Scovel,
Timothy John Sullivan,
Mike McKerns,
Michael Ortiz
Abstract:
We propose a rigorous framework for Uncertainty Quantification (UQ) in which the UQ objectives and the assumptions/information set are brought to the forefront. This framework, which we call \emph{Optimal Uncertainty Quantification} (OUQ), is based on the observation that, given a set of assumptions and information about the problem, there exist optimal bounds on uncertainties: these are obtained…
▽ More
We propose a rigorous framework for Uncertainty Quantification (UQ) in which the UQ objectives and the assumptions/information set are brought to the forefront. This framework, which we call \emph{Optimal Uncertainty Quantification} (OUQ), is based on the observation that, given a set of assumptions and information about the problem, there exist optimal bounds on uncertainties: these are obtained as values of well-defined optimization problems corresponding to extremizing probabilities of failure, or of deviations, subject to the constraints imposed by the scenarios compatible with the assumptions and information. In particular, this framework does not implicitly impose inappropriate assumptions, nor does it repudiate relevant information. Although OUQ optimization problems are extremely large, we show that under general conditions they have finite-dimensional reductions. As an application, we develop \emph{Optimal Concentration Inequalities} (OCI) of Hoeffding and McDiarmid type. Surprisingly, these results show that uncertainties in input parameters, which propagate to output uncertainties in the classical sensitivity analysis paradigm, may fail to do so if the transfer functions (or probability distributions) are imperfectly known. We show how, for hierarchical structures, this phenomenon may lead to the non-propagation of uncertainties or information across scales. In addition, a general algorithmic framework is developed for OUQ and is tested on the Caltech surrogate model for hypervelocity impact and on the seismic safety assessment of truss structures, suggesting the feasibility of the framework for important complex systems. The introduction of this paper provides both an overview of the paper and a self-contained mini-tutorial about basic concepts and issues of UQ.
△ Less
Submitted 23 May, 2012; v1 submitted 2 September, 2010;
originally announced September 2010.