-
CosmoBench: A Multiscale, Multiview, Multitask Cosmology Benchmark for Geometric Deep Learning
Authors:
Ningyuan Huang,
Richard Stiskalek,
Jun-Young Lee,
Adrian E. Bayer,
Charles C. Margossian,
Christian Kragh Jespersen,
Lucia A. Perez,
Lawrence K. Saul,
Francisco Villaescusa-Navarro
Abstract:
Cosmological simulations provide a wealth of data in the form of point clouds and directed trees. A crucial goal is to extract insights from this data that shed light on the nature and composition of the Universe. In this paper we introduce CosmoBench, a benchmark dataset curated from state-of-the-art cosmological simulations whose runs required more than 41 million core-hours and generated over two petabytes of data. CosmoBench is the largest dataset of its kind: it contains 34 thousand point clouds from simulations of dark matter halos and galaxies at three different length scales, as well as 25 thousand directed trees that record the formation history of halos on two different time scales. The data in CosmoBench can be used for multiple tasks -- to predict cosmological parameters from point clouds and merger trees, to predict the velocities of individual halos and galaxies from their collective positions, and to reconstruct merger trees on finer time scales from those on coarser time scales. We provide several baselines on these tasks, some based on established approaches from cosmological modeling and others rooted in machine learning. For the latter, we study different approaches -- from simple linear models that are minimally constrained by symmetries to much larger and more computationally demanding models in deep learning, such as graph neural networks. We find that least-squares fits with a handful of invariant features sometimes outperform deep architectures with many more parameters and far longer training times. Still, there remains tremendous potential to improve these baselines by combining machine learning and cosmology to fully exploit the data. CosmoBench sets the stage for bridging cosmology and geometric deep learning at scale. We invite the community to push the frontier of scientific discovery by engaging with this dataset, available at https://cosmobench.streamlit.app
Submitted 4 July, 2025;
originally announced July 2025.
-
The Odychess Approach: A Dialectical, Constructivist, and Adaptive Method for Teaching Chess with Generative Artificial Intelligence
Authors:
Ernesto Giralt Hernandez,
Lazaro Antonio Bueno Perez
Abstract:
Chess teaching has evolved through different approaches; however, traditional methodologies, often based on memorization, contrast with the new possibilities offered by generative artificial intelligence, a technology still little explored in this field. This study seeks to empirically validate the effectiveness of the Odychess Approach in improving chess knowledge, strategic understanding, and metacognitive skills in students. A quasi-experimental study was conducted with a pre-test/post-test design and a control group (N=60). The experimental intervention implemented the Odychess Approach, incorporating a Llama 3.3 language model that was specifically adapted using Parameter-Efficient Fine-Tuning (PEFT) techniques to act as a Socratic chess tutor. Quantitative assessment instruments were used to measure chess knowledge, strategic understanding, and metacognitive skills before and after the intervention. The results showed significant improvements in the experimental group compared to the control group in all three variables analyzed. The complementary qualitative analysis revealed greater analytical depth, more developed dialectical reasoning, and increased intrinsic motivation in students who participated in the Odychess-based intervention. The Odychess Approach represents an effective pedagogical methodology for teaching chess, demonstrating the potential of the synergistic integration of constructivist and dialectical principles with generative artificial intelligence. The implications of this work are relevant for educators and institutions interested in adopting innovative pedagogical technologies and for researchers in the field of AI applied to education, highlighting the transferability of the language model adaptation methodology to other educational domains.
Submitted 10 May, 2025;
originally announced May 2025.
-
Echoes of Discord: Forecasting Hater Reactions to Counterspeech
Authors:
Xiaoying Song,
Sharon Lisseth Perez,
Xinchen Yu,
Eduardo Blanco,
Lingzi Hong
Abstract:
Hate speech (HS) erodes the inclusiveness of online users and propagates negativity and division. Counterspeech has been recognized as a way to mitigate the harmful consequences. While some research has investigated the impact of user-generated counterspeech on social media platforms, few studies have examined and modeled haters' reactions toward counterspeech, despite the immediate alteration of haters' attitudes being an important aspect of counterspeech. This study fills the gap by analyzing the impact of counterspeech from the hater's perspective, focusing on whether the counterspeech leads the hater to reenter the conversation and whether the reentry is hateful. We compile the Reddit Echoes of Hate dataset (ReEco), which consists of triple-turn conversations featuring haters' reactions, to assess the impact of counterspeech. To predict haters' behaviors, we employ two strategies: a two-stage reaction predictor and a three-way classifier. The linguistic analysis sheds light on the language of counterspeech that elicits different hater reactions. Experimental results demonstrate that the three-way classification model outperforms the two-stage reaction predictor, which first predicts reentry and then determines the reentry type. We conclude the study with an assessment showing the most common errors made by the best-performing model.
Submitted 13 February, 2025; v1 submitted 27 January, 2025;
originally announced January 2025.
-
A Toolbox for Design of Experiments for Energy Systems in Co-Simulation and Hardware Tests
Authors:
Jan Sören Schwarz,
Leonard Enrique Ramos Perez,
Minh Cong Pham,
Kai Heussen,
Quoc Tuan Tran
Abstract:
In the context of highly complex energy system experiments, sensitivity analysis is becoming increasingly important for investigating the effects that changes in parameterization have on the outcome. It is therefore crucial to design an experiment so that it uses the available resources efficiently. This paper describes the functionality of a toolbox designed to support users in the design of experiments for (co-)simulation and hardware tests. It provides a structure for an object-oriented description of the parameterization and its variations, and generates samples from this description to provide a complete parameterization for the recommended experiment runs. After the runs have been executed, it can also be used to analyze the results, calculating and visualizing the effects. The paper also presents two application cases that show how the toolbox can be used in sensitivity analysis studies with the co-simulation framework mosaik and in a hybrid energy storage experiment.
Submitted 22 October, 2024;
originally announced October 2024.
-
PixLens: A Novel Framework for Disentangled Evaluation in Diffusion-Based Image Editing with Object Detection + SAM
Authors:
Stefan Stefanache,
Lluís Pastor Pérez,
Julen Costa Watanabe,
Ernesto Sanchez Tejedor,
Thomas Hofmann,
Enis Simsar
Abstract:
Evaluating diffusion-based image-editing models is a crucial task in the field of Generative AI. Specifically, it is imperative to assess their capacity to execute diverse editing tasks while preserving the image content and realism. While recent developments in generative models have opened up previously unheard-of possibilities for image editing, conducting a thorough evaluation of these models remains a challenging and open task. The absence of a standardized evaluation benchmark, primarily due to the inherent need for a post-edit reference image for evaluation, further complicates this issue. Currently, evaluations often rely on established models such as CLIP or require human intervention for a comprehensive understanding of the performance of these image editing models. Our benchmark, PixLens, provides a comprehensive evaluation of both edit quality and latent representation disentanglement, contributing to the advancement and refinement of existing methodologies in the field.
Submitted 8 October, 2024;
originally announced October 2024.
-
EDHOC is a New Security Handshake Standard: An Overview of Security Analysis
Authors:
Elsa López Pérez,
Inria Göran Selander,
John Preuß Mattsson,
Thomas Watteyne,
Mališa Vučinić
Abstract:
The paper wraps up the call for formal analysis of the new security handshake protocol EDHOC by providing an overview of the protocol as it was standardized, a summary of the formal security analyses conducted by the community, and a discussion of open avenues for future work.
Submitted 10 July, 2024;
originally announced July 2024.
-
Human-in-the-loop Reinforcement Learning for Data Quality Monitoring in Particle Physics Experiments
Authors:
Olivia Jullian Parra,
Julián García Pardiñas,
Lorenzo Del Pianta Pérez,
Maximilian Janisch,
Suzanne Klaver,
Thomas Lehéricy,
Nicola Serra
Abstract:
Data Quality Monitoring (DQM) is a crucial task in large particle physics experiments, since detector malfunctioning can compromise the data. DQM is currently performed by human shifters, which is costly and results in limited accuracy. In this work, we provide a proof-of-concept for applying human-in-the-loop Reinforcement Learning (RL) to automate the DQM process while adapting to operating conditions that change over time. We implement a prototype based on the Proximal Policy Optimization (PPO) algorithm and validate it on a simplified synthetic dataset. We demonstrate how a multi-agent system can be trained for continuous automated monitoring during data collection, with human intervention actively requested only when relevant. We show that random, unbiased noise in human classification can be reduced, leading to an improved accuracy over the baseline. Additionally, we propose data augmentation techniques to deal with scarce data and to accelerate the learning process. Finally, we discuss further steps needed to implement the approach in the real world, including protocols for periodic control of the algorithm's outputs.
Submitted 24 May, 2024;
originally announced May 2024.
-
Insight Gained from Migrating a Machine Learning Model to Intelligence Processing Units
Authors:
Hieu Le,
Zhenhua He,
Mai Le,
Dhruva K. Chakravorty,
Lisa M. Perez,
Akhil Chilumuru,
Yan Yao,
Jiefu Chen
Abstract:
The discoveries in this paper show that Intelligence Processing Units (IPUs) offer a viable accelerator alternative to GPUs for machine learning (ML) applications within the fields of materials science and battery research. We investigate the process of migrating a model from GPU to IPU and explore several optimization techniques, including pipelining and gradient accumulation, aimed at enhancing the performance of IPU-based models. Furthermore, we have effectively migrated a specialized model to the IPU platform. This model is employed for predicting effective conductivity, a parameter crucial in ion transport processes, which govern the performance of multiple charge and discharge cycles of batteries. The model utilizes a Convolutional Neural Network (CNN) architecture to perform prediction tasks for effective conductivity. The performance of this model on the IPU is found to be comparable to its execution on GPUs. We also analyze the utilization and performance of Graphcore's Bow IPU. Through benchmark tests, we observe significantly improved performance with the Bow IPU when compared to its predecessor, the Colossus IPU.
Submitted 16 April, 2024;
originally announced April 2024.
-
LtU-ILI: An All-in-One Framework for Implicit Inference in Astrophysics and Cosmology
Authors:
Matthew Ho,
Deaglan J. Bartlett,
Nicolas Chartier,
Carolina Cuesta-Lazaro,
Simon Ding,
Axel Lapel,
Pablo Lemos,
Christopher C. Lovell,
T. Lucas Makinen,
Chirag Modi,
Viraj Pandya,
Shivam Pandey,
Lucia A. Perez,
Benjamin Wandelt,
Greg L. Bryan
Abstract:
This paper presents the Learning the Universe Implicit Likelihood Inference (LtU-ILI) pipeline, a codebase for rapid, user-friendly, and cutting-edge machine learning (ML) inference in astrophysics and cosmology. The pipeline includes software for implementing various neural architectures, training schemata, priors, and density estimators in a manner easily adaptable to any research workflow. It includes comprehensive validation metrics to assess posterior estimate coverage, enhancing the reliability of inferred results. Additionally, the pipeline is easily parallelizable and is designed for efficient exploration of modeling hyperparameters. To demonstrate its capabilities, we present real applications across a range of astrophysics and cosmology problems, such as: estimating galaxy cluster masses from X-ray photometry; inferring cosmology from matter power spectra and halo point clouds; characterizing progenitors in gravitational wave signals; capturing physical dust parameters from galaxy colors and luminosities; and establishing properties of semi-analytic models of galaxy formation. We also include exhaustive benchmarking and comparisons of all implemented methods as well as discussions about the challenges and pitfalls of ML inference in astronomical sciences. All code and examples are made publicly available at https://github.com/maho3/ltu-ili.
Submitted 2 July, 2024; v1 submitted 6 February, 2024;
originally announced February 2024.
-
Field-level simulation-based inference with galaxy catalogs: the impact of systematic effects
Authors:
Natalí S. M. de Santi,
Francisco Villaescusa-Navarro,
L. Raul Abramo,
Helen Shao,
Lucia A. Perez,
Tiago Castro,
Yueying Ni,
Christopher C. Lovell,
Elena Hernandez-Martinez,
Federico Marinacci,
David N. Spergel,
Klaus Dolag,
Lars Hernquist,
Mark Vogelsberger
Abstract:
It has been recently shown that a powerful way to constrain cosmological parameters from galaxy redshift surveys is to train graph neural networks to perform field-level likelihood-free inference without imposing cuts on scale. In particular, de Santi et al. (2023) developed models that could accurately infer the value of $\Omega_{\rm m}$ from catalogs containing only the positions and radial velocities of galaxies, and that are robust to uncertainties in astrophysics and subgrid models. However, observations are affected by many effects, including 1) masking, 2) uncertainties in peculiar velocities and radial distances, and 3) different galaxy selections. Moreover, observations only allow us to measure redshift, intertwining galaxies' radial positions and velocities. In this paper we train and test our models on galaxy catalogs, created from thousands of state-of-the-art hydrodynamic simulations run with different codes from the CAMELS project, that incorporate these observational effects. We find that, although the presence of these effects degrades the precision and accuracy of the models and increases the fraction of catalogs where the model breaks down, the fraction of galaxy catalogs where the model performs well is over 90%, demonstrating the potential of these models to constrain cosmological parameters even when applied to real data.
Submitted 26 January, 2025; v1 submitted 23 October, 2023;
originally announced October 2023.
-
Neural Relational Inference with Fast Modular Meta-learning
Authors:
Ferran Alet,
Erica Weng,
Tomás Lozano Pérez,
Leslie Pack Kaelbling
Abstract:
Graph neural networks (GNNs) are effective models for many dynamical systems consisting of entities and relations. Although most GNN applications assume a single type of entity and relation, many situations involve multiple types of interactions. Relational inference is the problem of inferring these interactions and learning the dynamics from observational data. We frame relational inference as a modular meta-learning problem, where neural modules are trained to be composed in different ways to solve many tasks. This meta-learning framework allows us to implicitly encode time invariance and infer relations in the context of one another rather than independently, which increases inference capacity. Framing inference as the inner-loop optimization of meta-learning leads to a model-based approach that is more data-efficient and capable of estimating the state of entities that we do not observe directly, but whose existence can be inferred from their effect on observed entities. To address the large search space of graph neural network compositions, we meta-learn a proposal function that speeds up the inner-loop simulated annealing search within the modular meta-learning algorithm, providing a two-orders-of-magnitude increase in the size of problems that can be addressed.
Submitted 10 October, 2023;
originally announced October 2023.
-
A Novel Metric for mMIMO Base Station Association for Aerial Highway Systems
Authors:
Matteo Bernabè,
David López Pérez,
Nicola Piovesan,
Giovanni Geraci,
David Gesbert
Abstract:
In this article, we introduce a new metric for driving the serving cell selection process of a swarm of cellular connected unmanned aerial vehicles (CCUAVs) located on aerial highways when served by a massive multiple input multiple output (mMIMO) terrestrial network. Selecting the optimal serving cell from several suitable candidates is not straightforward. Relying solely on the traditional cell selection metric, which is based on reference signal received power (RSRP), can result in a scenario in which the serving cell cannot multiplex an appropriate number of CCUAVs due to the high correlation in the line of sight (LoS) channels. To overcome this issue, we introduce a new cell selection metric that captures not only signal strength but also spatial multiplexing capabilities. The proposed metric depends strongly on the relative position between the aerial highways and the antennas of the base station. The numerical analysis indicates that integrating the proposed metric yields better signal to interference plus noise ratio (SINR) performance on the aerial highways, resulting in a more reliable cellular connection for CCUAVs.
Submitted 24 March, 2023;
originally announced March 2023.
-
Expanding the Reach of Research Computing: A Landscape Study
Authors:
Dhruva K. Chakravorty,
Sarah K. Janes,
James V. Howell,
Lisa M. Perez,
Amy Schultz,
Marie Goldie,
Austin L. Gamble,
Rajiv Malkan,
Honggao Liu,
Daniel Mireles,
Yuanqi Jing,
Zhenhua He,
Tim Cockerill
Abstract:
Research computing continues to play an ever-increasing role in academia. Access to computing resources, however, varies greatly between institutions. Sustaining the growing need for computing skills and access to advanced cyberinfrastructure requires that computing resources be available to students at all levels of scholarship, including community colleges. The National Science Foundation-funded Building Research Innovation in Community Colleges (BRICCs) community set out to understand the challenges faced by administrators, researchers and faculty in building a sustainable research computing continuum that extends to smaller and two-year terminal degree granting institutions. BRICCs' purpose is to address the technology gaps and encourage the development of the curriculum needed to grow a computationally proficient research workforce. Toward these goals, we performed a landscape study that culminated in a community workshop. Here, we present our key findings from workshop discussions and identify next steps to be taken by BRICCs, funding agencies, and the broader cyberinfrastructure community.
Submitted 18 April, 2022; v1 submitted 14 April, 2022;
originally announced April 2022.
-
The CAMELS project: public data release
Authors:
Francisco Villaescusa-Navarro,
Shy Genel,
Daniel Anglés-Alcázar,
Lucia A. Perez,
Pablo Villanueva-Domingo,
Digvijay Wadekar,
Helen Shao,
Faizan G. Mohammad,
Sultan Hassan,
Emily Moser,
Erwin T. Lau,
Luis Fernando Machado Poletti Valle,
Andrina Nicola,
Leander Thiele,
Yongseok Jo,
Oliver H. E. Philcox,
Benjamin D. Oppenheimer,
Megan Tillman,
ChangHoon Hahn,
Neerav Kaushal,
Alice Pisani,
Matthew Gebhardt,
Ana Maria Delgado,
Joyce Caliendo,
Christina Kreisch
, et al. (22 additional authors not shown)
Abstract:
The Cosmology and Astrophysics with MachinE Learning Simulations (CAMELS) project was developed to combine cosmology with astrophysics through thousands of cosmological hydrodynamic simulations and machine learning. CAMELS contains 4,233 cosmological simulations: 2,049 N-body and 2,184 state-of-the-art hydrodynamic simulations that sample a vast volume in parameter space. In this paper we present the CAMELS public data release, describing the characteristics of the CAMELS simulations and a variety of data products generated from them, including halo, subhalo, galaxy, and void catalogues, power spectra, bispectra, Lyman-$\alpha$ spectra, probability distribution functions, halo radial profiles, and X-ray photon lists. We also release over one thousand catalogues that contain billions of galaxies from CAMELS-SAM: a large collection of N-body simulations that have been combined with the Santa Cruz Semi-Analytic Model. We release all the data, comprising more than 350 terabytes and containing 143,922 snapshots, millions of halos, galaxies and summary statistics. We provide further technical details on how to access, download, read, and process the data at https://camels.readthedocs.io.
Submitted 4 January, 2022;
originally announced January 2022.
-
The CAMELS Multifield Dataset: Learning the Universe's Fundamental Parameters with Artificial Intelligence
Authors:
Francisco Villaescusa-Navarro,
Shy Genel,
Daniel Angles-Alcazar,
Leander Thiele,
Romeel Dave,
Desika Narayanan,
Andrina Nicola,
Yin Li,
Pablo Villanueva-Domingo,
Benjamin Wandelt,
David N. Spergel,
Rachel S. Somerville,
Jose Manuel Zorrilla Matilla,
Faizan G. Mohammad,
Sultan Hassan,
Helen Shao,
Digvijay Wadekar,
Michael Eickenberg,
Kaze W. K. Wong,
Gabriella Contardo,
Yongseok Jo,
Emily Moser,
Erwin T. Lau,
Luis Fernando Machado Poletti Valle,
Lucia A. Perez
, et al. (3 additional authors not shown)
Abstract:
We present the Cosmology and Astrophysics with MachinE Learning Simulations (CAMELS) Multifield Dataset, CMD, a collection of hundreds of thousands of 2D maps and 3D grids containing many different properties of cosmic gas, dark matter, and stars from 2,000 distinct simulated universes at several cosmic times. The 2D maps and 3D grids represent cosmic regions that span $\sim$100 million light years and have been generated from thousands of state-of-the-art hydrodynamic and gravity-only N-body simulations from the CAMELS project. Designed to train machine learning models, CMD is the largest dataset of its kind, containing more than 70 terabytes of data. In this paper we describe CMD in detail and outline a few of its applications. We focus our attention on one such task, parameter inference, formulating the problems we face as a challenge to the community. We release all data and provide further technical details at https://camels-multifield-dataset.readthedocs.io.
Submitted 22 September, 2021;
originally announced September 2021.
-
ETA Prediction with Graph Neural Networks in Google Maps
Authors:
Austin Derrow-Pinion,
Jennifer She,
David Wong,
Oliver Lange,
Todd Hester,
Luis Perez,
Marc Nunkesser,
Seongjae Lee,
Xueying Guo,
Brett Wiltshire,
Peter W. Battaglia,
Vishal Gupta,
Ang Li,
Zhongwen Xu,
Alvaro Sanchez-Gonzalez,
Yujia Li,
Petar Veličković
Abstract:
Travel-time prediction constitutes a task of high importance in transportation networks, with web mapping services like Google Maps regularly serving vast quantities of travel time queries from users and enterprises alike. Further, such a task requires accounting for complex spatiotemporal interactions (modelling both the topological properties of the road network and anticipating events -- such as rush hours -- that may occur in the future). Hence, it is an ideal target for graph representation learning at scale. Here we present a graph neural network estimator for estimated time of arrival (ETA) which we have deployed in production at Google Maps. While our main architecture consists of standard GNN building blocks, we further detail the usage of training schedule methods such as MetaGradients in order to make our model robust and production-ready. We also provide prescriptive studies: ablating on various architectural decisions and training regimes, and qualitative analyses on real-world situations where our model provides a competitive edge. Our GNN proved powerful when deployed, significantly reducing negative ETA outcomes in several regions compared to the previous production baseline (40+% in cities like Sydney).
Submitted 25 August, 2021;
originally announced August 2021.
-
E2ETag: An End-to-End Trainable Method for Generating and Detecting Fiducial Markers
Authors:
J. Brennan Peace,
Eric Psota,
Yanfeng Liu,
Lance C. Pérez
Abstract:
Existing fiducial marker solutions are designed for efficient detection and decoding; however, their ability to stand out in natural environments is difficult to infer from relatively limited analysis. Furthermore, worsening performance in challenging image capture scenarios - such as poor exposure, motion blur, and off-axis viewing - sheds light on their limitations. E2ETag introduces an end-to-end trainable method for designing fiducial markers and a complementary detector. By introducing back-propagatable marker augmentation and superimposition into training, the method learns to generate markers that can be detected and classified in challenging real-world environments using a fully convolutional detector network. Results demonstrate that E2ETag outperforms existing methods in ideal conditions and performs much better in the presence of motion blur, contrast fluctuations, noise, and off-axis viewing angles. Source code and trained models are available at https://github.com/jbpeace/E2ETag.
Submitted 28 May, 2021;
originally announced May 2021.
-
Mastering Terra Mystica: Applying Self-Play to Multi-agent Cooperative Board Games
Authors:
Luis Perez
Abstract:
In this paper, we explore and compare multiple algorithms for solving the complex strategy game of Terra Mystica, hereafter abbreviated as TM. Previous work in the area of super-human game-play using AI has proven effective, with recent breakthroughs for generic algorithms in games such as Go, Chess, and Shogi \cite{AlphaZero}. We directly apply these breakthroughs to a novel state-representation of TM with the goal of creating an AI that will rival human players. Specifically, we present the initial results of applying AlphaZero to this state-representation and briefly analyze the strategies developed. We call this modified algorithm with our novel state-representation AlphaTM. In the end, we discuss the success and shortcomings of this method by comparing against multiple baselines and typical human scores. All code used for this paper is available on GitHub at https://github.com/kandluis/terrazero.
Submitted 21 February, 2021;
originally announced February 2021.
-
Automatic Code Generation using Pre-Trained Language Models
Authors:
Luis Perez,
Lizi Ottens,
Sudharshan Viswanathan
Abstract:
Recent advancements in natural language processing \cite{gpt2} \cite{BERT} have led to near-human performance in multiple natural language tasks. In this paper, we seek to understand whether similar techniques can be applied to a highly structured environment with strict syntax rules. Specifically, we propose an end-to-end machine learning model for code generation in the Python language built on top of pre-trained language models. We demonstrate that a fine-tuned model can perform well in code generation tasks, achieving a BLEU score of 0.22, an improvement of 46% over a reasonable sequence-to-sequence baseline. All results and related code used for training and data processing are available on GitHub.
Submitted 21 February, 2021;
originally announced February 2021.
-
Cloud Cover Nowcasting with Deep Learning
Authors:
Léa Berthomier,
Bruno Pradel,
Lior Perez
Abstract:
Nowcasting is a field of meteorology which aims at forecasting weather on a short term of up to a few hours. In the meteorology landscape, this field is rather specific as it requires particular techniques, such as data extrapolation, where conventional meteorology is generally based on physical modeling. In this paper, we focus on cloud cover nowcasting, which has various application areas such as satellite shots optimisation and photovoltaic energy production forecasting.
Following recent deep learning successes on multiple imagery tasks, we applied deep convolutional neural networks to Meteosat satellite images for cloud cover nowcasting. We present the results of several architectures specialized in image segmentation and time series prediction. We selected the best models according to machine learning metrics as well as meteorological metrics. All selected architectures showed significant improvements over persistence, and the well-known U-Net surpasses the AROME physical model.
Submitted 17 December, 2020; v1 submitted 24 September, 2020;
originally announced September 2020.
-
Towards Automatic Bayesian Optimization: A first step involving acquisition functions
Authors:
Eduardo C. Garrido Merchán,
Luis C. Jariego Pérez
Abstract:
Bayesian optimization is the state-of-the-art technique for optimizing black boxes, i.e., functions whose analytical expression and gradients are unavailable, that are expensive to evaluate, and whose evaluations are noisy. The most popular application of Bayesian optimization is the automatic hyperparameter tuning of machine learning algorithms, where we obtain the best configuration of a machine learning algorithm by optimizing an estimate of its generalization error. Despite being applied with success, Bayesian optimization methodologies also have hyperparameters that need to be configured, such as the probabilistic surrogate model or the acquisition function used. A bad choice of these hyperparameters leads to poor-quality results. Typically, these hyperparameters are tuned by making assumptions about the objective function that we want to evaluate, but there are scenarios where we do not have any prior information about the objective function. In this paper, we propose a first attempt at automatic Bayesian optimization by exploring several heuristics that automatically tune the acquisition function of Bayesian optimization. We illustrate the effectiveness of these heuristics on a set of benchmark problems and a hyperparameter tuning problem of a machine learning algorithm.
Submitted 12 January, 2021; v1 submitted 21 March, 2020;
originally announced March 2020.
-
Layered Embeddings for Amodal Instance Segmentation
Authors:
Yanfeng Liu,
Eric Psota,
Lance Pérez
Abstract:
The proposed method extends upon the representational output of semantic instance segmentation by explicitly including both visible and occluded parts. A fully convolutional network is trained to produce consistent pixel-level embedding across two layers such that, when clustered, the results convey the full spatial extent and depth ordering of each instance. Results demonstrate that the network can accurately estimate complete masks in the presence of occlusion and outperform leading top-down bounding-box approaches. Source code available at https://github.com/yanfengliu/layered_embeddings
Submitted 14 February, 2020;
originally announced February 2020.
-
Trend-Based Networking Driven by Big Data Telemetry for SDN and Traditional Networks
Authors:
Ankur Jain,
Arohi Gupta,
Ashutosh Gupta,
Dewang Gedia,
Leidy Pérez,
Levi Perigo,
Rahil Gandotra,
Sanjay Murthy
Abstract:
Organizations face a challenge of accurately analyzing network data and providing automated action based on the observed trend. This trend-based analytics is beneficial to minimize the downtime and improve the performance of the network services, but organizations use different network management tools to understand and visualize the network traffic with limited abilities to dynamically optimize the network. This research focuses on the development of an intelligent system that leverages big data telemetry analysis in Platform for Network Data Analytics (PNDA) to enable comprehensive trend-based networking decisions. The results include a graphical user interface (GUI) done via a web application for effortless management of all subsystems, and the system and application developed in this research demonstrate the true potential for a scalable system capable of effectively benchmarking the network to set the expected behavior for comparison and trend analysis. Moreover, this research provides a proof of concept of how trend analysis results are actioned in both a traditional network and a software-defined network (SDN) to achieve dynamic, automated load balancing.
Submitted 23 April, 2019;
originally announced April 2019.
-
A Robust Feature-aware Sparse Mesh Representation
Authors:
Lizeth J. Fuentes Perez,
Luciano A. Romero Calla,
Anselmo A. Montenegro,
Claudio Mura,
Renato Pajarola
Abstract:
The sparse representation of signals defined on Euclidean domains has been successfully applied in signal processing. Bringing the power of sparse representations to non-regular domains is still a challenge, but promising approaches have started emerging recently. In this paper, we investigate the problem of sparsely representing discrete surfaces and propose a new representation that is capable of providing tools for solving different geometry processing problems. The sparse discrete surface representation is obtained by combining innovative approaches into an integrated method. First, to deal with irregular mesh domains, we devised a new way to subdivide discrete meshes into a set of patches using a feature-aware seed sampling. Second, we achieve good surface approximation with over-fitting control by combining the power of a continuous global dictionary representation with a modified Orthogonal Matching Pursuit. The resulting discrete surface approximations preserve the shape features while being robust to over-fitting. Our results show that the method is quite promising for applications like surface re-sampling and mesh compression.
Submitted 24 November, 2020; v1 submitted 18 October, 2018;
originally announced October 2018.
-
A minimalistic approach for fast computation of geodesic distances on triangular meshes
Authors:
Luciano A. Romero Calla,
Lizeth J. Fuentes Perez,
Anselmo A. Montenegro
Abstract:
The computation of geodesic distances is an important research topic in Geometry Processing and 3D Shape Analysis as it is a basic component of many methods used in these areas. In this work, we present a minimalistic parallel algorithm based on front propagation to compute approximate geodesic distances on meshes. Our method is practical and simple to implement and does not require any heavy pre-processing. The convergence of our algorithm depends on the number of discrete level sets around the source points from which distance information propagates. To appropriately implement our method on GPUs taking into account memory coalescence problems, we take advantage of a graph representation based on a breadth-first search traversal that works harmoniously with our parallel front propagation approach. We report experiments that show how our method scales with the size of the problem. We compare the mean error and processing time obtained by our method with such measures computed using other methods. Our method produces results in competitive times with almost the same accuracy, especially for large meshes. We also demonstrate its use for solving two classical geometry processing problems: the regular sampling problem and the Voronoi tessellation on meshes.
Submitted 23 August, 2019; v1 submitted 18 October, 2018;
originally announced October 2018.
-
The equivalence between two classic algorithms for the assignment problem
Authors:
Carlos A. Alfaro,
Sergio L. Perez,
Carlos E. Valencia,
Marcos C. Vargas
Abstract:
We give a detailed review of two algorithms that solve the minimization case of the assignment problem: Bertsekas' auction algorithm and the Goldberg & Kennedy algorithm. We show that these algorithms are equivalent in the sense that both perform equivalent steps in the same order. We also present experimental results comparing the performance of three algorithms for the assignment problem. They show that the auction algorithm performs and scales better in practice than algorithms that are harder to implement but have better theoretical time complexity.
Submitted 8 October, 2018;
originally announced October 2018.
-
Network Service Orchestration: A Survey
Authors:
Nathan F. Saraiva de Sousa,
Danny A. Lachos Perez,
Raphael V. Rosa,
Mateus A. S. Santos,
Christian Esteve Rothenberg
Abstract:
Business models of network service providers are undergoing an evolving transformation fueled by vertical customer demands and technological advances such as 5G, Software Defined Networking (SDN), and Network Function Virtualization (NFV). Emerging scenarios call for agile network services consuming network, storage, and compute resources across heterogeneous infrastructures and administrative domains. Coordinating resource control and service creation across interconnected domains and diverse technologies becomes a grand challenge. Research and development efforts are being devoted to enabling orchestration processes to automate, coordinate, and manage the deployment and operation of network services. In this survey, we delve into the topic of Network Service Orchestration (NSO) by reviewing the historical background, relevant research projects, enabling technologies, and standardization activities. We define key concepts and propose a taxonomy of NSO approaches and solutions to pave the way towards a common understanding of the various ongoing efforts around the realization of diverse NSO application scenarios. Based on the analysis of the state of affairs, we present a series of open challenges and research opportunities, altogether contributing to a timely and comprehensive survey on the vibrant and strategic topic of network service orchestration.
Submitted 17 May, 2019; v1 submitted 17 March, 2018;
originally announced March 2018.
-
Promises and Caveats of Uplink IoT Ultra-Dense Networks
Authors:
Ming Ding,
David Lopez Perez
Abstract:
In this paper, by means of simulations, we evaluate the uplink (UL) performance of an Internet of Things (IoT) capable ultra-dense network (UDN) in terms of the coverage probability and the density of reliably working user equipments (UEs). From our study, we show the benefits and challenges that UL IoT UDNs will bring about in the future. In more detail, for a low-reliability criterion, such as achieving a UL signal-to-interference-plus-noise ratio (SINR) above 0 dB, the density of reliably working UEs grows quickly with network densification, showing the potential of UL IoT UDNs. In contrast, for a high-reliability criterion, such as achieving a UL SINR above 10 dB, the density of reliably working UEs remains low in UDNs due to excessive inter-cell interference, which should be considered when operating UL IoT UDNs. Moreover, considering the existence of a non-zero antenna height difference between base stations (BSs) and UEs, the density of reliably working UEs could even decrease as we deploy more BSs. This calls for the usage of sophisticated interference management schemes and/or beam steering/shaping technologies in UL IoT UDNs.
Submitted 20 January, 2018;
originally announced January 2018.
-
The Effectiveness of Data Augmentation in Image Classification using Deep Learning
Authors:
Luis Perez,
Jason Wang
Abstract:
In this paper, we explore and compare multiple solutions to the problem of data augmentation in image classification. Previous work has demonstrated the effectiveness of data augmentation through simple techniques, such as cropping, rotating, and flipping input images. We artificially constrain our access to data to a small subset of the ImageNet dataset, and compare each data augmentation technique in turn. Among the more successful data augmentation strategies are the traditional transformations mentioned above. We also experiment with GANs to generate images of different styles. Finally, we propose a method to allow a neural net to learn augmentations that best improve the classifier, which we call neural augmentation. We discuss the successes and shortcomings of this method on various datasets.
Submitted 13 December, 2017;
originally announced December 2017.
-
Predicting Yelp Star Reviews Based on Network Structure with Deep Learning
Authors:
Luis Perez
Abstract:
In this paper, we tackle the real-world problem of predicting Yelp star-review ratings based on business features (such as images, descriptions), user features (average previous ratings), and, of particular interest, network properties (which businesses a user has rated before). We compare multiple models on different sets of features -- from simple linear regression on network features only to deep learning models on network and item features.
In recent years, breakthroughs in deep learning have led to increased accuracy in common supervised learning tasks, such as image classification, captioning, and language understanding. However, the idea of combining deep learning with network features and structure appears to be novel. While the problem of predicting future interactions in a network has been studied at length, these approaches have often ignored either node-specific data or global structure.
We demonstrate that a mixed approach combining both node-level features and network information can effectively be used to predict Yelp-review star ratings. We evaluate on the Yelp dataset by splitting our data along the time dimension (as would naturally occur in the real world) and comparing our model against others which do not take advantage of the network structure and/or deep learning.
Submitted 11 December, 2017;
originally announced December 2017.
-
What is the Optimal Network Deployment for a Fixed Density of Antennas?
Authors:
Xuefeng Yao,
Ming Ding,
David Lopez Perez,
Zihuai Lin,
Guoqiang Mao
Abstract:
In this paper, we answer a fundamental question: when the total number of antennas per square kilometer is fixed, what is the optimal network deployment, a denser network with fewer antennas per base station (BS) or the opposite? To evaluate network performance, we consider a practical network scenario with a fixed antenna density and multiuser multiple-input-multiple-output (MU-MIMO) operations for single-antenna users. The number of antennas in each BS is calculated by dividing the antenna density by the BS density. Considering several practical network models, i.e., pilot contamination, a limited user equipment (UE) density, and a probabilistic line-of-sight (LoS)/non-line-of-sight (NLoS) path loss model, we evaluate the area spectral efficiency (ASE) performance. From our simulation results, we conclude that there exists an optimal BS density for a certain UE density to maximize the ASE performance when the antenna density is fixed. The intuition is that (i) by densifying the network with more BSs, we can achieve a receive power gain due to the smaller distance between the typical UE and its serving BS; (ii) by installing more antennas in each BS, we can achieve a beamforming gain for UEs using MU-MIMO, although such beamforming gain is degraded by pilot contamination; (iii) thus, a trade-off exists between the receive power gain and the beamforming gain, if we fix the antenna density in the network.
Submitted 24 October, 2017;
originally announced October 2017.
-
Ultra-Dense Networks: A New Look at the Proportional Fair Scheduler
Authors:
Ming Ding,
David Lopez Perez,
Amir H. Jafari,
Guoqiang Mao,
Zihuai Lin
Abstract:
In this paper, we theoretically study the proportional fair (PF) scheduler in the context of ultra-dense networks (UDNs). Analytical results are obtained for the coverage probability and the area spectral efficiency (ASE) performance of dense small cell networks (SCNs) with the PF scheduler employed at base stations (BSs). The key point of our analysis is that the typical user is no longer a random user as assumed in most studies in the literature. Instead, a user with the maximum PF metric is chosen by its serving BS as the typical user. By comparing the previous results of the round-robin (RR) scheduler with our new results of the PF scheduler, we quantify the loss of the multi-user diversity of the PF scheduler with the network densification, which casts a new look at the role of the PF scheduler in UDNs. Our conclusion is that the RR scheduler should be used in UDNs to simplify the radio resource management (RRM).
Submitted 26 September, 2017; v1 submitted 26 August, 2017;
originally announced August 2017.
-
Ultra-Dense Networks: Is There a Limit to Spatial Spectrum Reuse?
Authors:
Ming Ding,
David Lopez Perez,
Guoqiang Mao,
Zihuai Lin
Abstract:
The aggressive spatial spectrum reuse (SSR) by network densification using smaller cells has successfully driven the wireless communication industry onward in the past decades. In our future journey toward ultra-dense networks (UDNs), a fundamental question needs to be answered. Is there a limit to SSR? In other words, when we deploy thousands or millions of small cell base stations (BSs) per square kilometer, is activating all BSs on the same time/frequency resource the best strategy? In this paper, we present theoretical analyses to answer this question. In particular, we find that both the signal and interference powers become bounded in practical UDNs with a non-zero BS-to-UE antenna height difference and a finite UE density, which leads to a constant capacity scaling law. As a result, there exists an optimal SSR density that can maximize the network capacity. Hence, the limit to SSR should be considered in the operation of future UDNs.
Submitted 15 October, 2017; v1 submitted 2 April, 2017;
originally announced April 2017.
-
A Comparison of Algorithms for Intrusion Detection on Batch and Data Stream Environments
Authors:
Jorge Luis Rivero Pérez,
Bernardete Ribeiro,
Kadir Hector Ortiz
Abstract:
Intrusion detection in computer networks has some deficiencies from a machine learning perspective, given the nature of the application. The principal problem is the modest performance of detection systems based on learning algorithms under the constraints imposed by real environments. This article focuses on the machine learning approach for network intrusion detection in batch and data stream environments. First, we propose and describe three variants of KDD99 dataset preprocessing, including attribute selection. Second, thorough experimentation is performed by evaluating and comparing representative batch learning algorithms on the variants obtained from KDD99 preprocessing. Finally, since network traffic is a constant data stream that can present concept drift with a high rate of false positives, and since few studies address intrusion detection in streaming environments, we compare various representative data stream classification algorithms. This research allows determining the algorithms that perform better on the proposed variants of KDD99 for both batch and data stream environments.
Submitted 3 January, 2017;
originally announced January 2017.
-
What Is the True Value of Dynamic TDD: A MAC Layer Perspective
Authors:
Ming Ding,
David Lopez Perez,
Guoqiang Mao,
Zihuai Lin
Abstract:
Small cell networks (SCNs) are envisioned to embrace dynamic time division duplexing (TDD) in order to tailor downlink (DL)/uplink (UL) subframe resources to quick variations and burstiness of DL/UL traffic. The study of dynamic TDD is particularly important because it provides valuable insights on the full duplex transmission technology, which has been identified as one of the candidate technologies for the 5th-generation (5G) networks. Up to now, the existing works on dynamic TDD have shown that the UL of dynamic TDD suffers from severe performance degradation due to the strong DL-to-UL interference in the physical (PHY) layer. This conclusion raises a fundamental question: Despite such obvious technology disadvantage, what is the true value of dynamic TDD? In this paper, we answer this question from a media access control (MAC) layer viewpoint and present analytical results on the DL/UL time resource utilization (TRU) of synchronous dynamic TDD, which has been widely adopted in the existing 4th-generation (4G) systems. Our analytical results shed new light on the dynamic TDD in future synchronous 5G networks.
Submitted 26 September, 2017; v1 submitted 9 November, 2016;
originally announced November 2016.
-
Please Lower Small Cell Antenna Heights in 5G
Authors:
Ming Ding,
David Lopez Perez
Abstract:
In this paper, we present a new and significant theoretical discovery. If the absolute height difference between base station (BS) antenna and user equipment (UE) antenna is larger than zero, then the network capacity performance in terms of the area spectral efficiency (ASE) will continuously decrease as the BS density increases for ultra-dense (UD) small cell networks (SCNs). This performance behavior has a tremendous impact on the deployment of UD SCNs in the 5th-generation (5G) era. Network operators may invest large amounts of money in deploying more network infrastructure to only obtain an even worse network performance. Our study results reveal that it is a must to lower the SCN BS antenna height to the UE antenna height to fully achieve the capacity gains of UD SCNs in 5G. However, this requires a revolutionized approach of BS architecture and deployment, which is explored in this paper too.
Submitted 26 September, 2017; v1 submitted 6 November, 2016;
originally announced November 2016.
-
Proposal of Data Processing Platform for Direct Marketing Data
Authors:
Jorge Luis Rivero Pérez,
Yaimara Peñate Santana,
Pedro Harenton Martínez López
Abstract:
Data mining has been widely used to identify potential customers for a new product or service. This article surveys previous work on the application of data mining methodologies to software projects, specifically direct marketing projects. Several data sets of demographic and historical customer purchase data available for evaluating algorithms in this area, some of them very recent, are described. The main contribution of this paper is the proposal of a platform for distributed data stream processing for the processes of targeting customers and building the required predictive response models, thus facilitating several of the functional requirements for development environments.
Submitted 5 September, 2016;
originally announced September 2016.
-
Study on the Idle Mode Capability with LoS and NLoS Transmissions
Authors:
Ming Ding,
David Lopez Perez,
Guoqiang Mao,
Zihuai Lin
Abstract:
In this paper, we study the impact of the base station (BS) idle mode capability (IMC) on the network performance in dense small cell networks (SCNs). Different from existing works, we consider a sophisticated path loss model incorporating both line-of-sight (LoS) and non-line-of-sight (NLoS) transmissions. Analytical results are obtained for the coverage probability and the area spectral efficiency (ASE) performance for SCNs with IMCs at the BSs. The upper bound, the lower bound and the approximate expression of the activated BS density are also derived. The performance impact of the IMC is shown to be significant. As the BS density surpasses the UE density, thus creating a surplus of BSs, the coverage probability will continuously increase toward one. For the practical regime of the BS density, the results derived from our analysis are distinctively different from existing results, and thus shed new light on the deployment and the operation of future dense SCNs.
Submitted 26 September, 2017; v1 submitted 23 August, 2016;
originally announced August 2016.
-
Attribute Learning for Network Intrusion Detection
Authors:
Jorge Luis Rivero Pérez,
Bernardete Ribeiro
Abstract:
Network intrusion detection is one of the most visible uses for Big Data analytics. One of the main problems in this application is the constant rise of new attacks. This scenario, characterized by the fact that not enough labeled examples are available for the new classes of attacks, is hardly addressed by traditional machine learning approaches. New findings on the capabilities of the Zero-Shot learning (ZSL) approach make it an interesting solution for this problem because it has the ability to classify instances of unseen classes. ZSL inherently has two stages: the attribute learning stage and the inference stage. In this paper we propose a new algorithm for the attribute learning stage of ZSL. The idea is to learn new values for the attributes based on decision trees (DT). Our results show that, based on the rules extracted from the DT, a better distribution for the attribute values can be found. We also propose an experimental setup for the evaluation of ZSL on network intrusion detection (NID).
Submitted 28 July, 2016;
originally announced July 2016.
-
Mahalanobis Distance Metric Learning Algorithm for Instance-based Data Stream Classification
Authors:
Jorge Luis Rivero Perez,
Bernardete Ribeiro,
Carlos Morell Perez
Abstract:
With today's massive data challenges and the rapid growth of technology, stream mining has recently received considerable attention. To address the large number of scenarios in which this phenomenon manifests itself, suitable tools are required in various research fields. Instance-based data stream algorithms generally employ the Euclidean distance for the classification task underlying this problem. A novel way to look into this issue is to take advantage of a more flexible metric due to the increased requirements imposed by the data stream scenario. In this paper we present a new algorithm that learns a Mahalanobis metric using similarity and dissimilarity constraints in an online manner. This approach hybridizes a Mahalanobis distance metric learning algorithm and a k-NN data stream classification algorithm with concept drift detection. First, some basic aspects of Mahalanobis distance metric learning are described, taking into account key properties as well as online distance metric learning algorithms. Second, we implement specific evaluation methodologies and comparative metrics such as the Q statistic for data stream classification algorithms. Finally, our algorithm is evaluated on different datasets by comparing its results with one of the best state-of-the-art instance-based data stream classification algorithms. The results demonstrate that our proposal is better
Submitted 17 April, 2016;
originally announced April 2016.
-
DNA-GA: A New Approach of Network Performance Analysis
Authors:
Ming Ding,
David Lopez Perez,
Guoqiang Mao,
Zihuai Lin
Abstract:
In this paper, we propose a new approach of network performance analysis, which is based on our previous works on the deterministic network analysis using the Gaussian approximation (DNA-GA). First, we extend our previous works to a signal-to-interference ratio (SIR) analysis, which makes our DNA-GA analysis a formal microscopic analysis tool. Second, we show two approaches for upgrading the DNA-GA analysis to a macroscopic analysis tool. Finally, we perform a comparison between the proposed DNA-GA analysis and the existing macroscopic analysis based on stochastic geometry. Our results show that the DNA-GA analysis possesses a few special features: (i) shadow fading is naturally considered in the DNA-GA analysis; (ii) the DNA-GA analysis can handle non-uniform user distributions and any type of multi-path fading; (iii) the shape and/or the size of cell coverage areas in the DNA-GA analysis can be made arbitrary for the treatment of hotspot network scenarios. Thus, DNA-GA analysis is very useful for the network performance analysis of the 5th generation (5G) systems with general cell deployment and user distribution, both on a microscopic level and on a macroscopic level.
Submitted 26 September, 2017; v1 submitted 16 December, 2015;
originally announced December 2015.
-
Where Is My Puppy? Retrieving Lost Dogs by Facial Features
Authors:
Thierry Pinheiro Moreira,
Mauricio Lisboa Perez,
Rafael de Oliveira Werneck,
Eduardo Valle
Abstract:
A pet that goes missing is among many people's worst fears: a moment of distraction is enough for a dog or a cat to wander off from home. Some measures help match lost animals to their owners; but automated visual recognition is one that - although convenient, highly available, and low-cost - is surprisingly overlooked. In this paper, we inaugurate that promising avenue by pursuing face recognition for dogs. We contrast four ready-to-use human facial recognizers (EigenFaces, FisherFaces, LBPH, and a Sparse method) with two original solutions based upon convolutional neural networks: BARK (inspired by architecture-optimized networks employed for human facial recognition) and WOOF (based upon off-the-shelf OverFeat features). Human facial recognizers perform poorly for dogs (up to 60.5% accuracy), showing that dog facial recognition is not a trivial extension of human facial recognition. The convolutional network solutions work much better, with BARK attaining up to 81.1% accuracy, and WOOF, 89.4%. The tests were conducted on two datasets: Flickr-dog, with 42 dogs of two breeds (pugs and huskies); and Snoopybook, with 18 mongrel dogs.
Submitted 1 August, 2016; v1 submitted 9 October, 2015;
originally announced October 2015.
-
Approximation of Uplink Inter-Cell Interference in FDMA Small Cell Networks
Authors:
Ming Ding,
David Lopez Perez,
Guoqiang Mao,
Zihuai Lin
Abstract:
In this paper, for the first time, we analytically prove that the uplink (UL) inter-cell interference in frequency division multiple access (FDMA) small cell networks (SCNs) can be well approximated by a lognormal distribution under a certain condition. The lognormal approximation is vital because it allows tractable network performance analysis with closed-form expressions. The derived condition, under which the lognormal approximation applies, does not pose particular requirements on the shapes/sizes of user equipment (UE) distribution areas as in previous works. Instead, our results show that if a path-loss-related random variable (RV) associated with the UE distribution area has a low ratio of the 3rd absolute moment to the variance, the lognormal approximation will hold. Analytical and simulation results show that the derived condition can be readily satisfied in future dense/ultra-dense SCNs, indicating that our conclusions are very useful for network performance analysis of the 5th generation (5G) systems with more general cell deployment beyond the widely used Poisson deployment.
Submitted 26 September, 2017; v1 submitted 8 May, 2015;
originally announced May 2015.
-
A software for learning Information Theory basics with emphasis on Entropy of Spanish
Authors:
Fabio G. Guerrero,
Lucio A. Perez
Abstract:
In this paper, a tutorial software to learn Information Theory basics in a practical way is reported. The software, called IT-tutor-UV, makes use of a modern existing Spanish corpus for the modeling of the source. Both the source and the channel coding are also included in this educational tool as part of the learning experience. Entropy values of the Spanish language obtained with the IT-tutor-UV are discussed and compared to others that were previously calculated under limited conditions.
Submitted 20 September, 2007;
originally announced September 2007.