-
A Novel Frame Identification and Synchronization Technique for Smartphone Visible Light Communication Systems Based on Convolutional Neural Networks
Authors:
Vaigai Nayaki Yokar,
Hoa Le-Minh,
Xicong Li,
Wai Lok Woo,
Luis Nero Alves,
Stanislav Zvanovec,
Tran The Son,
Zabih Ghassemlooy
Abstract:
This paper proposes a novel, robust, and lightweight supervised Convolutional Neural Network (CNN)-based technique for frame identification and synchronization, designed to enhance short-link communication performance in a screen-to-camera (S2C) based visible light communication (VLC) system. Developed using Python and the TensorFlow Keras framework, the proposed CNN model was trained through three real-time experimental investigations conducted in Jupyter Notebook. These experiments incorporated a dataset created from scratch to address various real-time challenges in S2C communication, including blurring, cropping, and rotated images in mobility scenarios. Overhead frames were introduced for synchronization, which leads to enhanced system performance. The experimental results demonstrate that the proposed model achieves an overall accuracy of approximately 98.74%, highlighting its effectiveness in identifying and synchronizing frames in S2C VLC systems.
Submitted 28 June, 2025;
originally announced June 2025.
-
High Resolution Tree Height Mapping of the Amazon Forest using Planet NICFI Images and LiDAR-Informed U-Net Model
Authors:
Fabien H Wagner,
Ricardo Dalagnol,
Griffin Carter,
Mayumi CM Hirye,
Shivraj Gill,
Le Bienfaiteur Sagang Takougoum,
Samuel Favrichon,
Michael Keller,
Jean PHB Ometto,
Lorena Alves,
Cynthia Creze,
Stephanie P George-Chacon,
Shuang Li,
Zhihua Liu,
Adugna Mullissa,
Yan Yang,
Erone G Santos,
Sarah R Worden,
Martin Brandt,
Philippe Ciais,
Stephen C Hagen,
Sassan Saatchi
Abstract:
Tree canopy height is one of the most important indicators of forest biomass, productivity, and ecosystem structure, but it is challenging to measure accurately from the ground and from space. Here, we used a U-Net model adapted for regression to map the mean tree canopy height in the Amazon forest from Planet NICFI images at ~4.78 m spatial resolution for the period 2020-2024. The U-Net model was trained using canopy height models computed from aerial LiDAR data as a reference, along with their corresponding Planet NICFI images. Predictions of tree heights on the validation sample exhibited a mean error of 3.68 m and showed relatively low systematic bias across the entire range of tree heights present in the Amazon forest. Our model successfully estimated canopy heights up to 40-50 m without much saturation, outperforming existing canopy height products from global models in this region. We determined that the Amazon forest has an average canopy height of ~22 m. Events such as logging or deforestation could be detected from changes in tree height, and encouraging results were obtained to monitor the height of regenerating forests. These findings demonstrate the potential for large-scale mapping and monitoring of tree height for old and regenerating Amazon forests using Planet NICFI imagery.
Submitted 17 January, 2025;
originally announced January 2025.
-
Optimizing Parking Space Classification: Distilling Ensembles into Lightweight Classifiers
Authors:
Paulo Luza Alves,
André Hochuli,
Luiz Eduardo de Oliveira,
Paulo Lisboa de Almeida
Abstract:
When deploying large-scale machine learning models for smart city applications, such as image-based parking lot monitoring, data often must be sent to a central server to perform classification tasks. This is challenging for the city's infrastructure, where image-based applications require transmitting large volumes of data, necessitating complex network and hardware infrastructures to process the data. To address this issue in image-based parking space classification, we propose creating a robust ensemble of classifiers to serve as Teacher models. These Teacher models are distilled into lightweight and specialized Student models that can be deployed directly on edge devices. The knowledge is distilled to the Student models through pseudo-labeled samples generated by the Teacher model, which are utilized to fine-tune the Student models on the target scenario. Our results show that the Student models, with 26 times fewer parameters than the Teacher models, achieved an average accuracy of 96.6% on the target test datasets, surpassing the Teacher models, which attained an average accuracy of 95.3%.
Submitted 7 October, 2024;
originally announced October 2024.
-
EU cost action on future generation optical wireless communication technologies -- newfocus ca19111, a white paper
Authors:
M A Khalighi,
Z Ghassemlooy,
S Zvanovec,
N Stevens,
L N Alves,
A Shrestha,
M Uysal,
A M Vegni,
P D Diamantoulakis,
V K Papanikolaou,
G K Karagiannidis,
B Ortega,
V Almenar,
O Bouchet,
L Ladid
Abstract:
The EU COST Action NEWFOCUS is focused on investigating radical solutions with the potential to impact the design of future wireless networks. It aims to address some of the challenges in OWC and establish it as an efficient technology that can satisfy the demanding requirements of backhaul and access network levels in 5G networks. This also includes the use of hybrid links that associate OWC with radiofrequency or wired/fiber-based technologies. The focus of this White Paper is on the use of optical wireless communication (OWC) as an enabling technology in a range of areas outlined in HE's Pillar II, including Health, Manufacturing, Intelligent Transportation Systems (ITS), Unmanned Aerial Vehicles, and Network and Protocol.
Submitted 18 July, 2022;
originally announced October 2022.
-
Should Social Robots in Retail Manipulate Customers?
Authors:
Oliver Bendel,
Liliana Margarida Dos Santos Alves
Abstract:
Against the backdrop of structural changes in the retail trade, social robots have found their way into retail stores and shopping malls in order to attract, welcome, and greet customers; to inform them, advise them, and persuade them to make a purchase. Salespeople often have a broad knowledge of their product and rely on offering competent and honest advice, whether it be on shoes, clothing, or kitchen appliances. However, some frequently use sales tricks to secure purchases. The question arises of how consulting and sales robots should "behave". Should they behave like human advisors and salespeople, i.e., occasionally manipulate customers? Or should they be more honest and reliable than us? This article tries to answer these questions. After explaining the basics, it evaluates a study in this context and gives recommendations for companies that want to use consulting and sales robots. Ultimately, fair, honest, and trustworthy robots in retail are a win-win situation for all concerned.
Submitted 17 June, 2022;
originally announced June 2022.
-
OpenKBP-Opt: An international and reproducible evaluation of 76 knowledge-based planning pipelines
Authors:
Aaron Babier,
Rafid Mahmood,
Binghao Zhang,
Victor G. L. Alves,
Ana Maria Barragán-Montero,
Joel Beaudry,
Carlos E. Cardenas,
Yankui Chang,
Zijie Chen,
Jaehee Chun,
Kelly Diaz,
Harold David Eraso,
Erik Faustmann,
Sibaji Gaj,
Skylar Gay,
Mary Gronberg,
Bingqi Guo,
Junjun He,
Gerd Heilemann,
Sanchit Hira,
Yuliang Huang,
Fuxin Ji,
Dashan Jiang,
Jean Carlo Jimenez Giraldo,
Hoyeon Lee
, et al. (34 additional authors not shown)
Abstract:
We establish an open framework for developing plan optimization models for knowledge-based planning (KBP) in radiotherapy. Our framework includes reference plans for 100 patients with head-and-neck cancer and high-quality dose predictions from 19 KBP models that were developed by different research groups during the OpenKBP Grand Challenge. The dose predictions were input to four optimization models to form 76 unique KBP pipelines that generated 7600 plans. The predictions and plans were compared to the reference plans via: dose score, which is the average mean absolute voxel-by-voxel difference in dose a model achieved; the deviation in dose-volume histogram (DVH) criterion; and the frequency of clinical planning criteria satisfaction. We also performed a theoretical investigation to justify our dose mimicking models. The range in rank order correlation of the dose score between predictions and their KBP pipelines was 0.50 to 0.62, which indicates that the quality of the predictions is generally positively correlated with the quality of the plans. Additionally, compared to the input predictions, the KBP-generated plans performed significantly better (P<0.05; one-sided Wilcoxon test) on 18 of 23 DVH criteria. Similarly, each optimization model generated plans that satisfied a higher percentage of criteria than the reference plans. Lastly, our theoretical investigation demonstrated that the dose mimicking models generated plans that are also optimal for a conventional planning model. This was the largest international effort to date for evaluating the combination of KBP prediction and optimization models. In the interest of reproducibility, our data and code are freely available at https://github.com/ababier/open-kbp-opt.
Submitted 16 February, 2022;
originally announced February 2022.
-
DeepDoseNet: A Deep Learning model for 3D Dose Prediction in Radiation Therapy
Authors:
Mumtaz Hussain Soomro,
Victor Gabriel Leandro Alves,
Hamidreza Nourzadeh,
Jeffrey V. Siebers
Abstract:
The DeepDoseNet 3D dose prediction model based on ResNet and Dilated DenseNet is proposed. The 340 head-and-neck datasets from the 2020 AAPM OpenKBP challenge were utilized, with 200 for training, 40 for validation, and 100 for testing. Structures include 56Gy, 63Gy, 70Gy PTVs, and brainstem, spinal cord, right parotid, left parotid, larynx, esophagus, and mandible OARs. Mean squared error (MSE) loss, mean absolute error (MAE) loss, and MAE plus dose-volume histogram (DVH) based loss functions were investigated. Each model's performance was compared using a 3D dose score, $\bar{S_{D}}$ (mean absolute difference between ground truth and predicted 3D dose distributions), and a DVH score, $\bar{S_{DVH}}$ (mean absolute difference between ground truth and predicted dose-volume metrics). Furthermore, DVH metrics Mean[Gy] and D0.1cc[Gy] for OARs and D99%, D95%, D1% for PTVs were computed. DeepDoseNet with the MAE plus DVH-based loss function had the best dose score performance of the OpenKBP entries. The MAE+DVH model had the lowest prediction error (P<0.0001, Wilcoxon test) on validation and test datasets (validation: $\bar{S_{D}}$=2.3Gy, $\bar{S_{DVH}}$=1.9Gy; test: $\bar{S_{D}}$=2.0Gy, $\bar{S_{DVH}}$=1.6Gy), followed by the MAE model (validation: $\bar{S_{D}}$=3.6Gy, $\bar{S_{DVH}}$=2.4Gy; test: $\bar{S_{D}}$=3.5Gy, $\bar{S_{DVH}}$=2.3Gy). The MSE model had the highest prediction error (validation: $\bar{S_{D}}$=3.7Gy, $\bar{S_{DVH}}$=3.2Gy; test: $\bar{S_{D}}$=3.6Gy, $\bar{S_{DVH}}$=3.0Gy). No significant difference was found among models in terms of Mean[Gy], but the MAE+DVH model significantly outperformed the MAE and MSE models in terms of D0.1cc[Gy], particularly for mandible and parotids on both validation (P<0.01) and test (P<0.0001) datasets. MAE+DVH outperformed (P<0.0001) in terms of D99%, D95%, D1% for targets. MAE+DVH reduced $\bar{S_{D}}$ by ~60% and $\bar{S_{DVH}}$ by ~70%.
Submitted 29 October, 2021;
originally announced November 2021.
-
EEG functional connectivity and deep learning for automatic diagnosis of brain disorders: Alzheimer's disease and schizophrenia
Authors:
Caroline L. Alves,
Aruane M. Pineda,
Kirstin Roster,
Christiane Thielemann,
Francisco A. Rodrigues
Abstract:
Mental disorders are among the leading causes of disability worldwide. The first step in treating these conditions is to obtain an accurate diagnosis, but the absence of established clinical tests makes this task challenging. Machine learning algorithms can provide a possible solution to this problem, as we describe in this work. We present a method for the automatic diagnosis of mental disorders based on the matrix of connections obtained from EEG time series and deep learning. We show that our approach can classify patients with Alzheimer's disease and schizophrenia with a high level of accuracy. Comparison with the traditional approaches, which use raw EEG time series, shows that our method provides the highest precision. Therefore, the application of deep neural networks to data from brain connections is a very promising method for the diagnosis of neurological disorders.
Submitted 7 October, 2021;
originally announced October 2021.
-
OARnet: Automated organs-at-risk delineation in Head and Neck CT images
Authors:
Mumtaz Hussain Soomro,
Hamidreza Nourzadeh,
Victor Gabriel Leandro Alves,
Wookjin Choi,
Jeffrey V. Siebers
Abstract:
A 3D deep learning model (OARnet) is developed and used to delineate 28 H&N OARs on CT images. OARnet utilizes a densely connected network to detect the OAR bounding-box, then delineates the OAR within the box. It reuses information from any layer to subsequent layers and uses skip connections to combine information from different dense block levels to progressively improve delineation accuracy. Training uses up to 28 expert manually delineated (MD) OARs from 165 CTs. Dice similarity coefficient (DSC) and the 95th percentile Hausdorff distance (HD95) with respect to MD are assessed for 70 other CTs. Mean, maximum, and root-mean-square dose differences with respect to MD are assessed for 56 of the 70 CTs. OARnet is compared with UaNet, AnatomyNet, and Multi-Atlas Segmentation (MAS). Wilcoxon signed-rank tests using 95% confidence intervals are used to assess significance. Wilcoxon signed-rank tests show that, compared with UaNet, OARnet improves (p<0.05) the DSC (23/28 OARs) and HD95 (17/28). OARnet outperforms both AnatomyNet and MAS for DSC (28/28) and HD95 (27/28). Compared with UaNet, OARnet improves median DSC up to 0.05 and HD95 up to 1.5mm. Compared with AnatomyNet and MAS, OARnet improves median (DSC, HD95) by up to (0.08, 2.7mm) and (0.17, 6.3mm). Dosimetrically, OARnet outperforms UaNet (Dmax 7/28; Dmean 10/28), AnatomyNet (Dmax 21/28; Dmean 24/28), and MAS (Dmax 22/28; Dmean 21/28). The DenseNet architecture is optimized using a hybrid approach that performs OAR-specific bounding box detection followed by feature recognition. Compared with other auto-delineation methods, OARnet is better than or equal to UaNet for all but one geometric (Temporal Lobe L, HD95) and one dosimetric (Eye L, mean dose) endpoint for the 28 H&N OARs, and is better than or equal to both AnatomyNet and MAS for all OARs.
Submitted 23 November, 2021; v1 submitted 31 August, 2021;
originally announced August 2021.
-
An Empirical Study on Refactoring-Inducing Pull Requests
Authors:
Flávia Coelho,
Nikolaos Tsantalis,
Tiago Massoni,
Everton L. G. Alves
Abstract:
Background: Pull-based development has shaped the practice of Modern Code Review (MCR), in which reviewers can contribute code improvements, such as refactorings, through comments and commits in Pull Requests (PRs). Past MCR studies uniformly treat all PRs, regardless of whether they induce refactoring or not. We define a PR as refactoring-inducing when refactoring edits are performed after the initial commit(s), as either a result of discussion among reviewers or spontaneous actions carried out by the PR developer. Aims: This mixed study (quantitative and qualitative) explores code reviewing-related aspects intending to characterize refactoring-inducing PRs. Method: We hypothesize that refactoring-inducing PRs have different characteristics from non-refactoring-inducing ones and thus deserve special attention and treatment from researchers, practitioners, and tool builders. To investigate our hypothesis, we mined a sample of 1,845 Apache's merged PRs from GitHub, mined refactoring edits in these PRs, and ran a comparative study between refactoring-inducing and non-refactoring-inducing PRs. We also manually examined 2,096 review comments and 1,891 detected refactorings from 228 refactoring-inducing PRs. Results: We found 30.2% of refactoring-inducing PRs in our sample and that they significantly differ from non-refactoring-inducing ones in terms of number of commits, code churn, number of file changes, number of review comments, length of discussion, and time to merge. However, we found no statistical evidence that the number of reviewers is related to refactoring-inducement. Our qualitative analysis revealed that at least one refactoring edit was induced by review in 133 (58.3%) of the refactoring-inducing PRs examined. Conclusions: Our findings suggest directions for researchers, practitioners, and tool builders to improve practices around pull-based code review.
Submitted 24 August, 2021;
originally announced August 2021.
-
Network-Coded Cooperative LoRa Network with D2D Communication
Authors:
L. H. O. Alves,
J. L. Rebelatto,
R. D. Souza,
G. Brante
Abstract:
We evaluate the outage probability and the energy efficiency (EE) of a LoRa network when the end-devices (EDs) are capable of exchanging messages with a device-to-device (D2D) technology. Our proposal is to assess the performance of the network when operating under the concept of network-coded cooperation (NCC), where the EDs are capable of transmitting linear combinations of more than one frame (including frames of another ED), performed over a non-binary finite field. We consider both connection and collision probabilities when modelling the outage probability of an ED, as well as a realistic power consumption model to evaluate the EE. Our results indicate that the proposed scheme, which we refer to as NCC-LoRa, provides considerable gains in terms of both outage probability and EE when compared to a regular LoRa network, even when considering the additional consumption of D2D communication.
Submitted 7 December, 2021; v1 submitted 6 July, 2021;
originally announced July 2021.
-
A continuous integration and web framework in support of the ATLAS Publication Process
Authors:
Juan Pedro Araque Espinosa,
Gabriel Baldi Levcovitz,
Riccardo-Maria Bianchi,
Ian Brock,
Tancredi Carli,
Nuno Filipe Castro,
Alessandra Ciocio,
Maurizio Colautti,
Ana Carolina Da Silva Menezes,
Gabriel De Oliveira da Fonseca,
Leandro Domingues Macedo Alves,
Andreas Hoecker,
Bruno Lange Ramos,
Gabriela Lemos Lúcidi Pinhão,
Carmen Maidantchik,
Fairouz Malek,
Robert McPherson,
Gianluca Picco,
Marcelo Teixeira Dos Santos
Abstract:
The ATLAS collaboration defines methods, establishes procedures, and organises advisory groups to manage the publication processes of scientific papers, conference papers, and public notes. All stages are managed through web systems, computing programs, and tools that are designed and developed by the collaboration. A framework called FENCE is integrated into the CERN GitLab software repository to automatically configure workspaces where each analysis can be documented by the analysis team and managed by the relevant coordinators. Continuous integration is used to guide the writers in applying consistent and correct formatting when preparing papers to be submitted to scientific journals. Additional software assures the correctness of other aspects of each paper, such as the lists of collaboration authors, funding agencies, and foundations. The framework and the workflow therein provide automatic and easy support to the researchers and facilitate each phase of the publication process, allowing authors to focus on the article contents. The framework and its integration with the most up-to-date and efficient tools have consequently provided a more professional and efficient automated work environment to the whole collaboration.
Submitted 28 January, 2021; v1 submitted 14 May, 2020;
originally announced May 2020.
-
Exogenous Shocks Lead to Increased Responsiveness and Shifts in Sentimental Resilience in Online Discussions
Authors:
Chathika Gunaratne,
Subash K. Ray,
Caroline Lourenco Alves,
Maria Waldl
Abstract:
The effects of real-world events on the dynamics and sentiment expressed through online conversations are not entirely understood. In particular, the dynamics of highly polarized communities, deeply invested in the outcome of a particular event, naturally tend to have a contrasting emotional sensitivity to exogenous events. In this study, we analyze Twitter conversations during the Mexico vs Germany group match of the 2018 FIFA World Cup, and investigate the effect of the live game outcomes on the conversation dynamics and sentiment. We find that the exogenous events influence conversation volume and virality less, while having high influence on user responsiveness. Interestingly, we observe a shift in the influence that exogenous events have on fans immediately following the only goal scored in the game. The emotional resilience of fans of the advantaged team increased following this exogenous shock. In contrast, the sentiment of the disadvantaged team was left more susceptible to further exogenous events following the exogenous shock. These results support the fact that user engagement and emotional resilience of an online population holding highly polarized stances can be manipulated through an exogenous event of high importance.
Submitted 9 April, 2020;
originally announced April 2020.
-
Reconstructing commuters network using machine learning and urban indicators
Authors:
Gabriel Spadon,
Andre C. P. L. F. de Carvalho,
Jose F. Rodrigues-Jr,
Luiz G. A. Alves
Abstract:
Human mobility has a significant impact on several layers of society, from infrastructural planning and economics to the spread of diseases and crime. Representing the system as a complex network, in which nodes are assigned to regions (e.g., a city) and links indicate the flow of people between two of them, physics-inspired models have been proposed to quantify the number of people migrating from one city to the other. Despite the advances made by these models, our ability to predict the number of commuters and reconstruct mobility networks remains limited. Here, we propose an alternative approach using machine learning and 22 urban indicators to predict the flow of people and reconstruct the intercity commuters network. Our results reveal that predictions based on machine learning algorithms and urban indicators can reconstruct the commuters network with 90.4% accuracy and describe 77.6% of the variance observed in the flow of people between cities. We also identify essential features to recover the network structure and the urban indicators mostly related to commuting patterns. As previously reported, distance plays a significant role in commuting, but other indicators, such as Gross Domestic Product (GDP) and unemployment rate, are also driving forces for people to commute. We believe that our results shed new light on the modeling of migration and reinforce the role of urban indicators in commuting patterns. Also, because link-prediction and network reconstruction are still open challenges in network science, our results have implications in other areas, like economics, social sciences, and biology, where node attributes can give us information about the existence of links connecting entities in the network.
Submitted 9 August, 2019;
originally announced August 2019.
-
Centrality anomalies in complex networks as a result of model over-simplification
Authors:
Luiz G. A. Alves,
Alberto Aleta,
Francisco A. Rodrigues,
Yamir Moreno,
Luis A. Nunes Amaral
Abstract:
Tremendous advances have been made in our understanding of the properties and evolution of complex networks. These advances were initially driven by information-poor empirical networks and theoretical analysis of unweighted and undirected graphs. Recently, information-rich empirical data on complex networks have supported the development of more sophisticated models that include edge directionality and weight properties, and multiple layers. Many studies still focus on unweighted undirected descriptions of networks, prompting an essential question: how to identify when a model is simpler than it must be? Here, we argue that the presence of centrality anomalies in complex networks is a result of model over-simplification. Specifically, we investigate the well-known anomaly in betweenness centrality for transportation networks, according to which highly connected nodes are not necessarily the most central. Using a broad class of network models with weights and spatial constraints and four large data sets of transportation networks, we show that the unweighted projection of the structure of these networks can exhibit a significant fraction of anomalous nodes compared to a random null model. However, the weighted projection of these networks, compared with an appropriate null model, significantly reduces the fraction of anomalies observed, suggesting that centrality anomalies are a symptom of model over-simplification. Because lack of information-rich data is a common challenge when dealing with complex networks and can cause anomalies that misestimate the role of nodes in the system, we argue that sufficiently sophisticated models be used when anomalies are detected.
Submitted 13 March, 2020; v1 submitted 2 February, 2019;
originally announced February 2019.
-
The dynamical structure of political corruption networks
Authors:
Haroldo V. Ribeiro,
Luiz G. A. Alves,
Alvaro F. Martins,
Ervin K. Lenzi,
Matjaz Perc
Abstract:
Corruptive behaviour in politics limits economic growth, embezzles public funds, and promotes socio-economic inequality in modern democracies. We analyse well-documented political corruption scandals in Brazil over the past 27 years, focusing on the dynamical structure of networks where two individuals are connected if they were involved in the same scandal. Our research reveals that corruption runs in small groups that rarely comprise more than eight people, in networks that have hubs and a modular structure that encompasses more than one corruption scandal. We observe abrupt changes in the size of the largest connected component and in the degree distribution, which are due to the coalescence of different modules when new scandals come to light or when governments change. We show further that the dynamical structure of political corruption networks can be used for successfully predicting partners in future scandals. We discuss the important role of network science in detecting and mitigating political corruption.
Submitted 5 January, 2018;
originally announced January 2018.
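The network construction described in the abstract above (two individuals are linked if they appear in the same scandal) can be illustrated with a minimal sketch. The scandal names, person labels, and helper functions below are hypothetical and are not taken from the paper; the sketch only shows how the largest connected component can jump in size when a newly revealed scandal bridges two previously separate groups.

```python
from collections import defaultdict
from itertools import combinations


def build_network(scandals):
    """Link every pair of people who appear in the same scandal."""
    adj = defaultdict(set)
    for people in scandals.values():
        for person in people:
            adj[person]  # register the node even if it has no partners
        for a, b in combinations(sorted(people), 2):
            adj[a].add(b)
            adj[b].add(a)
    return adj


def largest_component_size(adj):
    """Size of the largest connected component, found by depth-first search."""
    seen = set()
    best = 0
    for start in adj:
        if start in seen:
            continue
        stack = [start]
        seen.add(start)
        size = 0
        while stack:
            node = stack.pop()
            size += 1
            for neighbour in adj[node]:
                if neighbour not in seen:
                    seen.add(neighbour)
                    stack.append(neighbour)
        best = max(best, size)
    return best


# Hypothetical toy data: scandals listed in the order they came to light.
scandals = {
    "s1": {"p1", "p2", "p3"},
    "s2": {"p4", "p5"},
    "s3": {"p3", "p4"},  # bridges the two earlier groups
}

# Track the largest component as each scandal is revealed.
sizes = []
revealed = {}
for name, people in scandals.items():
    revealed[name] = people
    sizes.append(largest_component_size(build_network(revealed)))

print(sizes)
```

In this toy example the third scandal merges the two earlier modules, which is the kind of coalescence the abstract attributes to abrupt changes in the largest component.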
-
INDIGO-DataCloud: A data and computing platform to facilitate seamless access to e-infrastructures
Authors:
INDIGO-DataCloud Collaboration,
Davide Salomoni,
Isabel Campos,
Luciano Gaido,
Jesus Marco de Lucas,
Peter Solagna,
Jorge Gomes,
Ludek Matyska,
Patrick Fuhrman,
Marcus Hardt,
Giacinto Donvito,
Lukasz Dutka,
Marcin Plociennik,
Roberto Barbera,
Ignacio Blanquer,
Andrea Ceccanti,
Mario David,
Cristina Duma,
Alvaro López-García,
Germán Moltó,
Pablo Orviz,
Zdenek Sustr,
Matthew Viljoen,
Fernando Aguilar
, et al. (40 additional authors not shown)
Abstract:
This paper describes the achievements of the H2020 project INDIGO-DataCloud. The project has provided e-infrastructures with tools, applications and cloud framework enhancements to manage the demanding requirements of scientific communities, either locally or through enhanced interfaces. The middleware developed makes it possible to federate hybrid resources and to easily write, port, and run scientific applications in the cloud. In particular, we have extended existing PaaS (Platform as a Service) solutions, allowing public and private e-infrastructures, including those provided by EGI, EUDAT, and Helix Nebula, to integrate their existing services and make them available through AAI services compliant with GEANT interfederation policies, thus guaranteeing transparency and trust in the provisioning of such services. Our middleware facilitates the execution of applications using containers on Cloud and Grid based infrastructures, as well as on HPC clusters. Our developments are freely downloadable as open source components, and are already being integrated into many scientific applications.
Submitted 5 February, 2019; v1 submitted 6 November, 2017;
originally announced November 2017.
-
Enabling rootless Linux Containers in multi-user environments: the udocker tool
Authors:
Jorge Gomes,
Isabel Campos,
Emanuele Bagnaschi,
Mario David,
Luis Alves,
Joao Martins,
Joao Pina,
Alvaro Lopez-Garcia,
Pablo Orviz
Abstract:
Containers are increasingly used as a means to distribute and run Linux services and applications. In this paper we describe the architectural design and implementation of udocker, a tool which enables the user to execute Linux containers in user mode. We also present a few practical applications, using a range of scientific codes characterized by different requirements: from single core execution to MPI parallel execution and execution on GPGPUs.
Submitted 1 June, 2018; v1 submitted 6 November, 2017;
originally announced November 2017.
-
A Hint-Based Technique for System Level Model-Based Test Case Prioritization
Authors:
João Felipe Silva Ouriques,
Emanuela Gadelha Cartaxo,
Everton Leandro Galdino Alves,
Patrícia Duarte Lima Machado
Abstract:
Test Case Prioritization (TCP) techniques aim at proposing new test case execution orders to favor the achievement of a certain testing goal, such as fault detection. Current TCP research focuses mainly on code-based regression testing; however, in the Model-Based Testing (MBT) context, we still need more investigation. General TCP techniques do not use historical information, since this information is often unavailable. Therefore, techniques use different sources of information to guide prioritization. We propose a novel technique that guides its operation using provided hints, the Hint-Based Adaptive Random Prioritization (HARP). Hints are indications or suggestions provided by developers about error-prone functionalities. As hints may be hard to collect automatically, we also propose an approach for collecting them. To validate our findings, we performed an experiment measuring the effect of introducing hints to HARP. It shows that hints can improve HARP's performance compared to its baseline. Then, we investigated the ability of developers/managers to provide good hints and used them in a case study. This analysis showed that developers and managers are able to provide useful hints, which improve HARP's fault detection compared to its baseline. Nonetheless, the provided hints should be consensual among the development team members.
Submitted 10 August, 2017;
originally announced August 2017.
-
The H-index Paradox: Your Coauthors Have a Higher H-index than You Do
Authors:
Fabrício Benevenuto,
Alberto H. F. Laender,
Bruno L. Alves
Abstract:
One interesting phenomenon that emerges from the typical structure of social networks is the friendship paradox. It states that your friends have on average more friends than you do. Recent efforts have explored variations of it, with numerous implications for the dynamics of social networks. However, the friendship paradox and its variations consider only the topological structure of the networks and neglect many other characteristics that are correlated with node degree. In this article, we take the case of scientific collaborations to investigate whether a similar paradox also arises in terms of a researcher's scientific productivity as measured by her H-index. The H-index is a widely used metric in academia to capture both the quality and the quantity of a researcher's scientific output. It is likely that a researcher may use her coauthors' H-indexes as a way to infer whether her own H-index is adequate in her research area. Nevertheless, in this article, we show that the average H-index of a researcher's coauthors is usually higher than her own H-index. We present empirical evidence of this paradox and discuss some of its potential consequences.
Submitted 19 October, 2015; v1 submitted 15 October, 2015;
originally announced October 2015.
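The comparison described in the abstract above can be sketched in a few lines. The citation counts and coauthorship links below are hypothetical toy data, not the paper's dataset; the sketch computes each researcher's H-index (the largest h such that h of their papers have at least h citations each) and checks for whom the coauthors' average H-index exceeds their own.

```python
from statistics import mean


def h_index(citations):
    """Largest h such that h papers have at least h citations each."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, cites in enumerate(ranked, start=1):
        if cites >= rank:
            h = rank
        else:
            break
    return h


# Hypothetical toy data: citation counts per paper for each researcher.
citations = {
    "A": [50, 40, 30, 20, 10, 5],
    "B": [8, 4, 1],
    "C": [3, 3, 3],
    "D": [1],
}

# Hypothetical coauthorship network: a star centred on researcher A.
coauthors = {
    "A": {"B", "C", "D"},
    "B": {"A"},
    "C": {"A"},
    "D": {"A"},
}

h = {name: h_index(cites) for name, cites in citations.items()}

# The paradox holds for a researcher when the coauthors' mean H-index
# is strictly higher than the researcher's own H-index.
paradox_holds = {
    name: mean(h[c] for c in coauthors[name]) > h[name]
    for name in coauthors
}
fraction = sum(paradox_holds.values()) / len(paradox_holds)

print(h)
print(paradox_holds)
print(fraction)
```

In this toy star network the paradox holds for everyone except the hub, because each peripheral researcher's only coauthor is the highly cited one.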
-
A new Level-set based Protocol for Accurate Bone Segmentation from CT Imaging
Authors:
Manuel Pinheiro,
J. L. Alves
Abstract:
This work proposes a medical image segmentation pipeline for accurate bone segmentation from CT imaging. It is a two-step methodology, with a pre-segmentation step and a segmentation refinement step. First, the user performs a rough segmentation of the desired region of interest. Next, a fully automatic refinement step is applied to the pre-segmented data. The automatic segmentation refinement is composed of several sub-steps, namely image deconvolution, image cropping, and interpolation. The user-defined pre-segmentation is then refined over the deconvolved, cropped, and up-sampled version of the image. The algorithm is applied to the segmentation of CT images of a composite femur bone, reconstructed with different reconstruction protocols. Segmentation outcomes are validated against a gold standard model obtained with a Nikon Metris LK V20 coordinate measuring machine with an LC60-D digital line scanner that guarantees an accuracy of 28 $\mu$m. High sub-pixel accuracy models were obtained for all tested datasets. The algorithm is able to produce high quality segmentation of the composite femur regardless of the surface meshing strategy used.
Submitted 12 May, 2015;
originally announced May 2015.