-
Predicting temperatures in Brazilian states capitals via Machine Learning
Authors:
Sidney T. da Silva,
Enrique C. Gabrick,
Ana Luiza R. de Moraes,
Ricardo L. Viana,
Antonio M. Batista,
Iberê L. Caldas,
Jürgen Kurths
Abstract:
Climate change refers to substantial long-term variations in weather patterns. In this work, we employ a Machine Learning (ML) technique, the Random Forest (RF) algorithm, to forecast the monthly average temperature for the Brazilian state capitals (27 cities) and for the whole country, from January 1961 to December 2022. To forecast the temperature at month $k$, we consider as features in RF: $i)$ global emissions of carbon dioxide (CO$_2$), methane (CH$_4$), and nitrous oxide (N$_2$O) at month $k$; $ii)$ temperatures from the previous three months, i.e., months $(k-1)$, $(k-2)$, and $(k-3)$; $iii)$ the combination of $i$ and $ii$. By investigating breakpoints in the time series, we find that 24 cities and the gases present breakpoints in the 1980s and 1990s. After the breakpoints, both the temperature and the gas emissions increase. We then group the cities according to their geographical position and employ the RF algorithm to forecast the temperature from 2010-08 to 2022-12. All three feature sets, $i$, $ii$, and $iii$, yield a very precise forecast, with a normalized root mean squared error (NRMSE) below 0.083 for the considered cases. In our simulations, the best-forecast region is the Northeast using $iii$ (NRMSE = 0.012). Furthermore, we investigate the forecasting of anomalous temperature data by removing the annual component of each time series. In this case, the best forecast is obtained with strategy $i$, and the best region is again the Northeast (NRMSE = 0.090).
Submitted 2 May, 2025;
originally announced May 2025.
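Illustrative sketch (not from the paper): feature set $ii$ and the NRMSE score described in the abstract above can be pictured as follows. The helper names `make_lag_features` and `nrmse` are hypothetical, the temperature values are made up, and normalizing the RMSE by the observed range is an assumption, since the abstract does not state which normalization the authors use. The rows of `X` would then be passed to a regressor such as a Random Forest.

```python
from math import sqrt


def make_lag_features(series, n_lags=3):
    """Build (X, y) where each row of X holds the n_lags values preceding y.

    Row order is (k-1), (k-2), ..., (k-n_lags), mirroring feature set ii.
    """
    X, y = [], []
    for k in range(n_lags, len(series)):
        X.append([series[k - j] for j in range(1, n_lags + 1)])
        y.append(series[k])
    return X, y


def nrmse(observed, predicted):
    """RMSE divided by the observed range (assumed normalization)."""
    if not observed or len(observed) != len(predicted):
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((o - p) ** 2 for o, p in zip(observed, predicted)) / len(observed)
    span = max(observed) - min(observed)
    if span == 0:
        raise ValueError("observed values have zero range")
    return sqrt(mse) / span


# Made-up monthly mean temperatures (degrees Celsius), for illustration only.
temps = [25.1, 25.4, 24.8, 23.9, 22.7, 21.9, 21.5, 22.3]
X, y = make_lag_features(temps)
```

Other normalizations (by the mean or the standard deviation of the observations) are also in common use, so scores computed this way are not necessarily comparable with the values reported above.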
-
Comparative Analysis of Deepfake Detection Models: New Approaches and Perspectives
Authors:
Matheus Martins Batista
Abstract:
The growing threat posed by deepfake videos, capable of manipulating realities and disseminating misinformation, drives the urgent need for effective detection methods. This work investigates and compares different approaches for identifying deepfakes, focusing on the GenConViT model and its performance relative to other architectures present in the DeepfakeBenchmark. To contextualize the research, the social and legal impacts of deepfakes are addressed, as well as the technical fundamentals of their creation and detection, including digital image processing, machine learning, and artificial neural networks, with emphasis on Convolutional Neural Networks (CNNs), Generative Adversarial Networks (GANs), and Transformers. The models were evaluated using relevant metrics and recent datasets established in the literature, such as WildDeepfake and DeepSpeak, aiming to identify the most effective tools in the battle against misinformation and media manipulation. The results indicate that GenConViT, after fine-tuning, exhibited superior performance in terms of accuracy (93.82%) and generalization capacity, surpassing the other architectures in the DeepfakeBenchmark on the DeepSpeak dataset. This study advances deepfake detection techniques and supports the development of more robust and effective solutions against the dissemination of false information.
Submitted 2 April, 2025;
originally announced April 2025.
-
Before-after safety analysis of a shared space implementation
Authors:
Federico Orsini,
Mariana Batista,
Bernhard Friedrich,
Massimiliano Gastaldi,
Riccardo Rossi
Abstract:
Shared spaces aim to reduce the dominance of motor vehicles by promoting pedestrian and cyclist activity and minimizing segregation between road users. Although they are intended to improve the safety of vulnerable road users, only a few works in the literature have focused on before-after safety evaluations, mainly analyzing changes in users' trajectories and speeds, traffic volumes, and conflict counts, which, while useful, cannot unequivocally quantify road safety. Here, we propose a more advanced methodology, based on surrogate measures of safety and Extreme Value Theory, to assess road safety before and after the implementation of a shared space. The aim is to produce a crash risk estimate in different scenarios, obtaining a quantitative and comprehensive indicator that practitioners can use to evaluate the safety of urban design solutions. A real-world case study illustrates the proposed procedure. Video data were collected on two separate days, before and after a shared space implementation, and were semi-automatically processed to extract road users' trajectories. Analysis of traffic volumes, trajectories, speeds, and yield ratios made it possible to understand the spatial behavior of road users in the two scenarios. Traffic conflicts, identified with an innovative surrogate measure of safety called time to avoided collision point (TTAC), were then used to estimate a Lomax distribution and thereby model the probabilistic relationship between conflicts and crashes, eventually yielding a crash risk estimate. Results show that the analyzed shared space significantly reduced the risk of crashes, and these findings are consistent with the observed changes in users' speed and spatial behavior. The analyzed case study and its limitations were useful in highlighting the methodology's main features and suggesting practical prescriptions for practitioners.
Submitted 3 July, 2023;
originally announced July 2023.
-
CrimAnalyzer: Understanding Crime Patterns in São Paulo City
Authors:
Germain Garcia-Zanabria,
Jaqueline Alvarenga Silveira,
Jorge Poco,
Afonso Paiva,
Marcelo Batista Nery,
Claudio T. Silva,
Sergio Franca Adorno de Abreu,
Luis Gustavo Nonato
Abstract:
São Paulo is the largest city in South America, with high crime rates. The number and type of crimes vary considerably around the city, assuming different patterns depending on urban and social characteristics. In this scenario, tools that enable the exploration of particular locations of the city are very important for domain experts to understand how urban features such as mobility, passersby behavior, and urban infrastructure can influence the quantity and type of crimes. In this work, we present CrimAnalyzer, a visualization-assisted analytic tool that allows users to analyze crime behavior in specific regions of a city, providing new methodologies to identify local crime hotspots and their corresponding patterns over time. CrimAnalyzer was developed in response to demand from experts in criminology, and it deals with three major challenges: i) flexibility to explore local regions and understand their crime patterns, ii) identification not only of prevalent hotspots in terms of number of crimes but also of hotspots where crimes are frequent but not in large numbers, and iii) understanding the dynamics of crime patterns over time. The effectiveness and usefulness of the proposed system are demonstrated by qualitative and quantitative comparisons as well as case studies involving real data and run by domain experts.
Submitted 3 October, 2020;
originally announced October 2020.