-
Stochastic Fractional Neural Operators: A Symmetrized Approach to Modeling Turbulence in Complex Fluid Dynamics
Authors:
Rômulo Damasclin Chaves dos Santos,
Jorge Henrique de Oliveira Sales
Abstract:
In this work, we introduce a new class of neural network operators designed to handle problems where memory effects and randomness play a central role. In this work, we introduce a new class of neural network operators designed to handle problems where memory effects and randomness play a central role. These operators merge symmetrized activation functions, Caputo-type fractional derivatives, and…
▽ More
In this work, we introduce a new class of neural network operators designed to handle problems where memory effects and randomness play a central role. In this work, we introduce a new class of neural network operators designed to handle problems where memory effects and randomness play a central role. These operators merge symmetrized activation functions, Caputo-type fractional derivatives, and stochastic perturbations introduced via Itô type noise. The result is a powerful framework capable of approximating functions that evolve over time with both long-term memory and uncertain dynamics. We develop the mathematical foundations of these operators, proving three key theorems of Voronovskaya type. These results describe the asymptotic behavior of the operators, their convergence in the mean-square sense, and their consistency under fractional regularity assumptions. All estimates explicitly account for the influence of the memory parameter $α$ and the noise level $σ$. As a practical application, we apply the proposed theory to the fractional Navier-Stokes equations with stochastic forcing, a model often used to describe turbulence in fluid flows with memory. Our approach provides theoretical guarantees for the approximation quality and suggests that these neural operators can serve as effective tools in the analysis and simulation of complex systems. By blending ideas from neural networks, fractional calculus, and stochastic analysis, this research opens new perspectives for modeling turbulent phenomena and other multiscale processes where memory and randomness are fundamental. The results lay the groundwork for hybrid learning-based methods with strong analytical backing.
△ Less
Submitted 12 May, 2025;
originally announced May 2025.
-
Revolutionizing Fractional Calculus with Neural Networks: Voronovskaya-Damasclin Theory for Next-Generation AI Systems
Authors:
Rômulo Damasclin Chaves dos Santos,
Jorge Henrique de Oliveira Sales
Abstract:
This work introduces rigorous convergence rates for neural network operators activated by symmetrized and perturbed hyperbolic tangent functions, utilizing novel Voronovskaya-Damasclin asymptotic expansions. We analyze basic, Kantorovich, and quadrature-type operators over infinite domains, extending classical approximation theory to fractional calculus via Caputo derivatives. Key innovations incl…
▽ More
This work introduces rigorous convergence rates for neural network operators activated by symmetrized and perturbed hyperbolic tangent functions, utilizing novel Voronovskaya-Damasclin asymptotic expansions. We analyze basic, Kantorovich, and quadrature-type operators over infinite domains, extending classical approximation theory to fractional calculus via Caputo derivatives. Key innovations include parameterized activation functions with asymmetry control, symmetrized density operators, and fractional Taylor expansions for error analysis. The main theorem demonstrates that Kantorovich operators achieve \(o(n^{-β(N-\varepsilon)})\) convergence rates, while basic operators exhibit \(\mathcal{O}(n^{-βN})\) error decay. For deep networks, we prove \(\mathcal{O}(L^{-β(N-\varepsilon)})\) approximation bounds. Stability results under parameter perturbations highlight operator robustness. By integrating neural approximation theory with fractional calculus, this work provides foundational mathematical insights and deployable engineering solutions, with potential applications in complex system modeling and signal processing.
△ Less
Submitted 1 April, 2025;
originally announced April 2025.
-
Impacto de Treinamento em Programação Competitiva no Ensino Médio: Resultados e Desafios
Authors:
Camila da Cruz Santos,
Sarah Souto dos Santos,
Crishna Irion,
Giullia Rodrigues de Menezes,
Rafael Dias Araújo,
João Henrique de Souza Pereira
Abstract:
This article presents an ongoing research aiming to develop an effective methodology for teaching programming, focusing on participation in the Brazilian Informatics Olympiad (OBI), for elementary and high school students. The training conducted with students from the Federal Institute and state schools, demonstrates the importance of programming training programs as a way to promote interest in c…
▽ More
This article presents an ongoing research aiming to develop an effective methodology for teaching programming, focusing on participation in the Brazilian Informatics Olympiad (OBI), for elementary and high school students. The training conducted with students from the Federal Institute and state schools, demonstrates the importance of programming training programs as a way to promote interest in computing, stimulate the development of computational skills, and increase participation in competitions such as the OBI. The next steps of the research include conducting more training cycles and analyzing the results obtained in the competitions.
△ Less
Submitted 31 January, 2025;
originally announced March 2025.
-
Promoting Gender Equality in Competitive Programming: Strategies and Impacts of Affirmative Actions in Programming Marathons in Brazil
Authors:
Crishna Irion,
Camila da Cruz Santos,
Luiz Claudio Theodoro,
Rafael Dias Araujo,
Joao Henrique de Souza Pereira
Abstract:
In the context of Computing, competitive programming is a relevant area that aims to have students, usually in teams, solve programming challenges, developing skills and competencies in the field. However, female participation remains significantly low and notably distant compared to male participation, even with proven intellectual equity between genders. This research aims to present strategies…
▽ More
In the context of Computing, competitive programming is a relevant area that aims to have students, usually in teams, solve programming challenges, developing skills and competencies in the field. However, female participation remains significantly low and notably distant compared to male participation, even with proven intellectual equity between genders. This research aims to present strategies used to improve female participation in Programming Marathons in Brasil. The developed research is documentary, applied, and exploratory, with actions that generate results for female participation, with affirmative and inclusion actions, an important step towards gender equity in competitive programming.
△ Less
Submitted 21 February, 2025;
originally announced February 2025.
-
Extension of Symmetrized Neural Network Operators with Fractional and Mixed Activation Functions
Authors:
Rômulo Damasclin Chaves dos Santos,
Jorge Henrique de Oliveira Sales
Abstract:
We propose a novel extension to symmetrized neural network operators by incorporating fractional and mixed activation functions. This study addresses the limitations of existing models in approximating higher-order smooth functions, particularly in complex and high-dimensional spaces. Our framework introduces a fractional exponent in the activation functions, allowing adaptive non-linear approxima…
▽ More
We propose a novel extension to symmetrized neural network operators by incorporating fractional and mixed activation functions. This study addresses the limitations of existing models in approximating higher-order smooth functions, particularly in complex and high-dimensional spaces. Our framework introduces a fractional exponent in the activation functions, allowing adaptive non-linear approximations with improved accuracy. We define new density functions based on $q$-deformed and $θ$-parametrized logistic models and derive advanced Jackson-type inequalities that establish uniform convergence rates. Additionally, we provide a rigorous mathematical foundation for the proposed operators, supported by numerical validations demonstrating their efficiency in handling oscillatory and fractional components. The results extend the applicability of neural network approximation theory to broader functional spaces, paving the way for applications in solving partial differential equations and modeling complex systems.
△ Less
Submitted 17 January, 2025;
originally announced January 2025.
-
Emílias Podcast -- Mulheres na Computação: Ampliando Horizontes e Inspirando Carreiras em STEM
Authors:
Nathálya Chaves Dos Santos,
Adolfo Gustavo Serra Seca Neto
Abstract:
On October 3, 2024, the "Emílias Podcast -- Women in Computing" celebrates its 5th anniversary, standing out as a platform that promotes the participation of women in STEM (an acronym for "science, technology, engineering, and mathematics"). The podcast aims to provide a space for women in computing and related fields to share their experiences and highlight the various opportunities in Informatio…
▽ More
On October 3, 2024, the "Emílias Podcast -- Women in Computing" celebrates its 5th anniversary, standing out as a platform that promotes the participation of women in STEM (an acronym for "science, technology, engineering, and mathematics"). The podcast aims to provide a space for women in computing and related fields to share their experiences and highlight the various opportunities in Information and Communication Technology (ICT). The methodology included a feedback survey with interviewees, conducted via Google Forms, to assess their experience and determine whether they would recommend the podcast. In addition, we analyzed audience data, which showed consistent growth over the five years. The results revealed that 100% of the interviewees would recommend "Emílias Podcast," reflecting a high level of satisfaction with the project. The average participation experience rating was 4.7 on a scale of 1 to 5, highlighting positive aspects such as the quality of the script, the interview conduction, and the networking opportunities. The audience data also underscore the podcast's impact: with over 10,000 accumulated downloads and plays, it is primarily listened to by people aged 23 to 44, with 50.9% of the audience being female, demonstrating its relevance and reach. In conclusion, the feedback from interviewees and the audience data reinforce the podcast's positive impact and its crucial role in the inclusion of women in technology. The results highlight the importance of promoting the field and its opportunities, contributing to a more inclusive and inspiring future. The data analysis demonstrates the podcast's effectiveness in engaging and expanding its audience, establishing it as a significant example of social impact in ICT.
△ Less
Submitted 6 October, 2024;
originally announced October 2024.
-
Enhancing E-Learning System Through Learning Management System (LMS) Technologies: Reshape The Learner Experience
Authors:
Cecilia P. Abaricia,
Manuel Luis C. Delos Santos
Abstract:
This paper aims to determine how the LMS Web portal application reshapes the learner experience through the developed E-Learning Management System using Data Mining Algorithm.
The methodology that the researchers used is descriptive research involving the interpretation of the meaning or significance of what is described. Gather data from questionnaires, surveys, observations concerned with the…
▽ More
This paper aims to determine how the LMS Web portal application reshapes the learner experience through the developed E-Learning Management System using Data Mining Algorithm.
The methodology that the researchers used is descriptive research involving the interpretation of the meaning or significance of what is described. Gather data from questionnaires, surveys, observations concerned with the study, and the chi-square formula for the statistical treatment of data.
The findings of the study, the extent that LMS Web portal application reshapes the learner experience in terms of the following variables with the Average Weighted Mean (AWM): Flexible engagement of Learners in any device is highly satisfied; Personalize learning tracker is highly satisfied; Collaborating with the Learning Expert is highly satisfied; Provides user-friendly Teaching Tools is satisfied; Evident Learner Progress and Involvement and is satisfied.
In the final analysis, this E-Learning System can fit any educational needs as follows: chat, virtual classes, supportive resources for the students, individual and group monitoring, and assessment using LMS as maximum efficiency. Moreover, this platform can be used to deliver hybrid learning.
△ Less
Submitted 31 August, 2023;
originally announced September 2023.
-
ICARUS: An Android-Based Unmanned Aerial Vehicle (UAV) Search and Rescue Eye in the Sky
Authors:
Manuel Luis C. Delos Santos,
Jerum B. Dasalla,
Jomar C. Feliciano,
Dustin Red B. Cabatay
Abstract:
The purpose of this paper is to develop an unmanned aerial vehicle (UAV) using a quadcopter with the capability of video surveillance, map coordinates, a deployable parachute with a medicine kit or a food pack as a payload, a collision warning system, remotely controlled, integrated with an android application to assist in search and rescue operations.
Applied research for the development of the…
▽ More
The purpose of this paper is to develop an unmanned aerial vehicle (UAV) using a quadcopter with the capability of video surveillance, map coordinates, a deployable parachute with a medicine kit or a food pack as a payload, a collision warning system, remotely controlled, integrated with an android application to assist in search and rescue operations.
Applied research for the development of the functional prototype, quantitative and descriptive statistics to summarize data by describing the relationship between variables in a sample or population. The quadcopter underwent an evaluation using a survey instrument to test its acceptability using predefined variables to select respondents within Caloocan City and Quezon City, Philippines.
Demographic profiles and known issues and concerns were answered by 30 respondents. The results were summarized and distributed in Tables 1 and 2.
In terms of demographic profiles, the number of SAR operators within the specified areas is distributed equally, most are male, single, and within the age bracket of 31 and above. In issues and concerns, the most common type of search and rescue was ground search and rescue. Human error is the primary cause of most injuries in operating units. The prototype was useful and everyone agreed, in terms of acceptability, drone technology will improve search and rescue operations.
The innovative way of utilizing Android and drone technology is a new step towards the improvement of SAR operations in the Philippines.
The LiPo battery must be replaced with a higher capacity and the drone operator should undergo a training course and secure a permit from the Civil Aviation Authority of the Philippines (CAAP).
△ Less
Submitted 28 August, 2023;
originally announced August 2023.
-
Neurosymbolic AI and its Taxonomy: a survey
Authors:
Wandemberg Gibaut,
Leonardo Pereira,
Fabio Grassiotto,
Alexandre Osorio,
Eder Gadioli,
Amparo Munoz,
Sildolfo Gomes,
Claudio dos Santos
Abstract:
Neurosymbolic AI deals with models that combine symbolic processing, like classic AI, and neural networks, as it's a very established area. These models are emerging as an effort toward Artificial General Intelligence (AGI) by both exploring an alternative to just increasing datasets' and models' sizes and combining Learning over the data distribution, Reasoning on prior and learned knowledge, and…
▽ More
Neurosymbolic AI deals with models that combine symbolic processing, like classic AI, and neural networks, as it's a very established area. These models are emerging as an effort toward Artificial General Intelligence (AGI) by both exploring an alternative to just increasing datasets' and models' sizes and combining Learning over the data distribution, Reasoning on prior and learned knowledge, and by symbiotically using them. This survey investigates research papers in this area during recent years and brings classification and comparison between the presented models as well as applications.
△ Less
Submitted 17 May, 2023; v1 submitted 12 May, 2023;
originally announced May 2023.
-
Smart Face Shield: A Sensor-Based Wearable Face Shield Utilizing Computer Vision Algorithms
Authors:
Manuel Luis C. Delos Santos,
Ronaldo S. Tinio,
Darwin B. Diaz,
Karlene Emily I. Tolosa
Abstract:
The study aims the development of a wearable device to combat the onslaught of covid-19. Likewise, to enhance the regular face shield available in the market. Furthermore, to raise awareness of the health and safety protocols initiated by the government and its affiliates in the enforcement of social distancing with the integration of computer vision algorithms. The wearable device was composed of…
▽ More
The study aims the development of a wearable device to combat the onslaught of covid-19. Likewise, to enhance the regular face shield available in the market. Furthermore, to raise awareness of the health and safety protocols initiated by the government and its affiliates in the enforcement of social distancing with the integration of computer vision algorithms. The wearable device was composed of various hardware and software components such as a transparent polycarbonate face shield, microprocessor, sensors, camera, thin-film transistor on-screen display, jumper wires, power bank, and python programming language. The algorithm incorporated in the study was object detection under computer vision machine learning. The front camera with OpenCV technology determines the distance of a person in front of the user. Utilizing TensorFlow, the target object identifies and detects the image or live feed to get its bounding boxes. The focal length lens requires the determination of the distance from the camera to the target object. To get the focal length, multiply the pixel width by the known distance and divide it by the known width (Rosebrock, 2020). The deployment of unit testing ensures that the parameters are valid in terms of design and specifications.
△ Less
Submitted 17 December, 2022;
originally announced December 2022.
-
Understanding the Energy Consumption of HPC Scale Artificial Intelligence
Authors:
Danilo Carastan dos Santos
Abstract:
This paper contributes towards better understanding the energy consumption trade-offs of HPC scale Artificial Intelligence (AI), and more specifically Deep Learning (DL) algorithms. For this task we developed benchmark-tracker, a benchmark tool to evaluate the speed and energy consumption of DL algorithms in HPC environments. We exploited hardware counters and Python libraries to collect energy in…
▽ More
This paper contributes towards better understanding the energy consumption trade-offs of HPC scale Artificial Intelligence (AI), and more specifically Deep Learning (DL) algorithms. For this task we developed benchmark-tracker, a benchmark tool to evaluate the speed and energy consumption of DL algorithms in HPC environments. We exploited hardware counters and Python libraries to collect energy information through software, which enabled us to instrument a known AI benchmark tool, and to evaluate the energy consumption of numerous DL algorithms and models. Through an experimental campaign, we show a case example of the potential of benchmark-tracker to measure the computing speed and the energy consumption for training and inference DL algorithms, and also the potential of Benchmark-Tracker to help better understanding the energy behavior of DL algorithms in HPC platforms. This work is a step forward to better understand the energy consumption of Deep Learning in HPC, and it also contributes with a new tool to help HPC DL developers to better balance the HPC infrastructure in terms of speed and energy consumption.
△ Less
Submitted 14 November, 2022;
originally announced December 2022.
-
Applied Computer Vision on 2-Dimensional Lung X-Ray Images for Assisted Medical Diagnosis of Pneumonia
Authors:
Ralph Joseph S. D. Ligueran,
Manuel Luis C. Delos Santos,
Ronaldo S. Tinio,
Emmanuel H. Valencia
Abstract:
This study focuses on the application of a specific subfield of artificial intelligence referred to as computer vision in the analysis of 2-dimensional lung x-ray images for the assisted medical diagnosis of ordinary pneumonia.
A convolutional neural network algorithm was implemented in a Python-coded, Flask-based web application that can analyze x-ray images for the detection of ordinary pneumo…
▽ More
This study focuses on the application of a specific subfield of artificial intelligence referred to as computer vision in the analysis of 2-dimensional lung x-ray images for the assisted medical diagnosis of ordinary pneumonia.
A convolutional neural network algorithm was implemented in a Python-coded, Flask-based web application that can analyze x-ray images for the detection of ordinary pneumonia. Since convolutional neural network algorithms rely on machine learning for the identification and detection of patterns, a technique referred to as transfer learning was implemented to train the neural network in the identification and detection of patterns within the dataset. Open-source lung x-ray images were used as training data to create a knowledge base that served as the core element of the web application and the experimental design employed a 5-Trial Confirmatory Test for the validation of the web application.
The results of the 5-Trial Confirmatory Test show the calculation of Diagnostic Precision Percentage per Trial, General Diagnostic Precision Percentage, and General Diagnostic Error Percentage while the Confusion Matrix further shows the relationship between the label and the corresponding diagnosis result of the web application on each test images.
The developed web application can be used by medical practitioners in A.I.-assisted diagnosis of ordinary pneumonia, and by researchers in the fields of computer science and bioinformatics.
△ Less
Submitted 27 July, 2022;
originally announced July 2022.
-
An adaptive music generation architecture for games based on the deep learning Transformer mode
Authors:
Gustavo Amaral Costa dos Santos,
Augusto Baffa,
Jean-Pierre Briot,
Bruno Feijó,
Antonio Luz Furtado
Abstract:
This paper presents an architecture for generating music for video games based on the Transformer deep learning model. Our motivation is to be able to customize the generation according to the taste of the player, who can select a corpus of training examples, corresponding to his preferred musical style. The system generates various musical layers, following the standard layering strategy currentl…
▽ More
This paper presents an architecture for generating music for video games based on the Transformer deep learning model. Our motivation is to be able to customize the generation according to the taste of the player, who can select a corpus of training examples, corresponding to his preferred musical style. The system generates various musical layers, following the standard layering strategy currently used by composers designing video game music. To adapt the music generated to the game play and to the player(s) situation, we are using an arousal-valence model of emotions, in order to control the selection of musical layers. We discuss current limitations and prospects for the future, such as collaborative and interactive control of the musical components.
△ Less
Submitted 10 September, 2022; v1 submitted 4 July, 2022;
originally announced July 2022.
-
Item Matching using Text Description and Similarity Search
Authors:
Ana Paula Appel,
Anderson Luis de Paula Silva,
Adriana Reigota Silva,
Caique Dutra Santos,
Thiago Logo da Silva,
Rafael Poggi de Araujo,
Luiz Carlos Faray de Aquino
Abstract:
In this paper, we focus on the problem of item matching using only the description. Those specific items not only lack a unique code but also contain short text descriptions, making the item matching process difficult. Our goal is to compare products using only the description provided by the purchase process. Therefore, evaluating other characteristics and differences can uncover possible flaws d…
▽ More
In this paper, we focus on the problem of item matching using only the description. Those specific items not only lack a unique code but also contain short text descriptions, making the item matching process difficult. Our goal is to compare products using only the description provided by the purchase process. Therefore, evaluating other characteristics and differences can uncover possible flaws during the acquiring phase. However, the text of the items that we were working on was very small, with numbers due to the nature of the products and we have a limited amount of time to develop the solution which was 8 weeks. As result, we showed that working using a well-oriented methodology we were able to deliver a successful MVP and achieve the results expected with up to 55% match.
△ Less
Submitted 1 July, 2022; v1 submitted 28 June, 2022;
originally announced June 2022.
-
Predicting Pollution Level Using Random Forest: A Case Study of Marilao River in Bulacan Province, Philippines
Authors:
Jayson M. Victoriano,
Manuel Luis C. Delos Santos,
Albert A. Vinluan,
Jennifer T. Carpio
Abstract:
This study aims to predict the pollution level that threatens the Marilao River, located in the province of Bulacan, Philippines. The inhabitants of this area are now being exposed to pollution. Contamination of this waterway comes from both formal and informal industries, such as a used lead-acid battery, open dumpsites metal refining, and other toxic metals. Using various water quality parameter…
▽ More
This study aims to predict the pollution level that threatens the Marilao River, located in the province of Bulacan, Philippines. The inhabitants of this area are now being exposed to pollution. Contamination of this waterway comes from both formal and informal industries, such as a used lead-acid battery, open dumpsites metal refining, and other toxic metals. Using various water quality parameters like Dissolved Oxygen (DO), Potential of Hydrogen (pH), Biochemical Oxygen Demand (BOD) and Total Suspended Solids (TSS) were the basis for predicting the pollution level. This study used the Data Mining technique based on the sample data collected from January of 2013 to November of 2017. These were used as a training data and test results to predict the river condition with its corresponding pollution level classification indicated with the used of colors such as Green for Normal, Yellow for Average, Orange for Polluted and Red for Highly Polluted. The model got an accuracy of 91.75% with a Kappa value of 0.8115, interpreted as Strong in terms of the level of agreement.
△ Less
Submitted 12 February, 2022;
originally announced February 2022.
-
Combining Embeddings and Fuzzy Time Series for High-Dimensional Time Series Forecasting in Internet of Energy Applications
Authors:
Hugo Vinicius Bitencourt,
Luiz Augusto Facury de Souza,
Matheus Cascalho dos Santos,
Petrônio Cândido de Lima e Silva,
Frederico Gadelha Guimarães
Abstract:
The prediction of residential power usage is essential in assisting a smart grid to manage and preserve energy to ensure efficient use. An accurate energy forecasting at the customer level will reflect directly into efficiency improvements across the power grid system, however forecasting building energy use is a complex task due to many influencing factors, such as meteorological and occupancy pa…
▽ More
The prediction of residential power usage is essential in assisting a smart grid to manage and preserve energy to ensure efficient use. An accurate energy forecasting at the customer level will reflect directly into efficiency improvements across the power grid system, however forecasting building energy use is a complex task due to many influencing factors, such as meteorological and occupancy patterns. In addiction, high-dimensional time series increasingly arise in the Internet of Energy (IoE), given the emergence of multi-sensor environments and the two way communication between energy consumers and the smart grid. Therefore, methods that are capable of computing high-dimensional time series are of great value in smart building and IoE applications. Fuzzy Time Series (FTS) models stand out as data-driven non-parametric models of easy implementation and high accuracy. Unfortunately, the existing FTS models can be unfeasible if all features were used to train the model. We present a new methodology for handling high-dimensional time series, by projecting the original high-dimensional data into a low dimensional embedding space and using multivariate FTS approach in this low dimensional representation. Combining these techniques enables a better representation of the complex content of multivariate time series and more accurate forecasts.
△ Less
Submitted 3 December, 2021;
originally announced December 2021.
-
A Software Architecture for Autonomous Vehicles: Team LRM-B Entry in the First CARLA Autonomous Driving Challenge
Authors:
Luis Alberto Rosero,
Iago Pacheco Gomes,
Júnior Anderson Rodrigues da Silva,
Tiago Cesar dos Santos,
Angelica Tiemi Mizuno Nakamura,
Jean Amaro,
Denis Fernando Wolf,
Fernando Santos Osório
Abstract:
The objective of the first CARLA autonomous driving challenge was to deploy autonomous driving systems to lead with complex traffic scenarios where all participants faced the same challenging traffic situations. According to the organizers, this competition emerges as a way to democratize and to accelerate the research and development of autonomous vehicles around the world using the CARLA simulat…
▽ More
The objective of the first CARLA autonomous driving challenge was to deploy autonomous driving systems to lead with complex traffic scenarios where all participants faced the same challenging traffic situations. According to the organizers, this competition emerges as a way to democratize and to accelerate the research and development of autonomous vehicles around the world using the CARLA simulator contributing to the development of the autonomous vehicle area. Therefore, this paper presents the architecture design for the navigation of an autonomous vehicle in a simulated urban environment that attempts to commit the least number of traffic infractions, which used as the baseline the original architecture of the platform for autonomous navigation CaRINA 2. Our agent traveled in simulated scenarios for several hours, demonstrating his capabilities, winning three out of the four tracks of the challenge, and being ranked second in the remaining track.
Our architecture was made towards meeting the requirements of CARLA Autonomous Driving Challenge and has components for obstacle detection using 3D point clouds, traffic signs detection and classification which employs Convolutional Neural Networks (CNN) and depth information, risk assessment with collision detection using short-term motion prediction, decision-making with Markov Decision Process (MDP), and control using Model Predictive Control (MPC).
△ Less
Submitted 23 October, 2020;
originally announced October 2020.
-
Individual Factors that Influence Effort and Contributions on Wikipedia
Authors:
Luiz F. Pinto,
Carlos Denner dos Santos,
Silvia Onoyama
Abstract:
In this work, we aim to analyze how attitude, self-efficacy, and altruism influence effort and active contributions on Wikipedia. We propose a new conceptual model based on the theory of planned behavior and findings from the literature on online communities. This model differs from other models that have been previously proposed by considering altruism in its various facets (identification, recip…
▽ More
In this work, we aim to analyze how attitude, self-efficacy, and altruism influence effort and active contributions on Wikipedia. We propose a new conceptual model based on the theory of planned behavior and findings from the literature on online communities. This model differs from other models that have been previously proposed by considering altruism in its various facets (identification, reciprocity, and reputation), and by treating effort as a factor prior to performance results, which is measured in terms of active contributions, according to the organizational literature. To fulfill the study specific objectives, Wikipedia surveyed community members and collected secondary data. After excluding outliers, we obtained a final sample with 212 participants. We applied exploratory factor analysis and structural equation modeling, which resulted in a model with satisfactory fit indices. The results indicate that effort influences active contributions, and attitude, altruism by reputation, and altruism by identification influence effort. None of the proposed factors are directly related to active contributions. Experience directly influences self-efficacy while it positively moderates the relation between effort and active contributions. Finally, we present the conclusions via several implications for the literature as well as suggestions for future research.
△ Less
Submitted 14 July, 2020;
originally announced July 2020.
-
The influence of sponsors on organizational structure of free software communities
Authors:
Daniel Esashika,
Carlos Denner dos Santos
Abstract:
Initially, free software communities are characterized by selfmanagement, however, they were also influenced by public and private organizations that identified potential gains in the use of the geographically distributed production model. In this context, this research aims to answer the following questions: Do sponsors influence the organizational structures of free software communities by promo…
▽ More
Initially, free software communities are characterized by selfmanagement, however, they were also influenced by public and private organizations that identified potential gains in the use of the geographically distributed production model. In this context, this research aims to answer the following questions: Do sponsors influence the organizational structures of free software communities by promoting differences between sponsored and non-sponsored communities? What strategies are adopted by the sponsor to influence the organizational structure of free software communities? Two constructs are central to the study: organizational structure and sponsorship. For this research, we adopted case study methodology and three free software communities were studied. In the analysis of the results it was evidenced that sponsors influence decision making, definition of community key roles, and a formalization of norms. In turn, nonsponsored communities were characterized by the centralization and informality of the norms. We conclude that differences were identified in the organizational structure of sponsored and nonsponsored free software communities, and this differentiation was influenced by sponsors. In addition, it was possible to describe strategies and mechanisms used by sponsors to influence the community organizational structure.
△ Less
Submitted 5 July, 2020;
originally announced July 2020.
-
Bucking the Trend: An Agentive Perspective of Managerial Influence on Blogs Attractiveness
Authors:
Carlos Denner dos Santos,
Isadora Castro,
George Kuk,
Silvia Onoyama,
Marina Moreira
Abstract:
Blog management is central to the digitalization of work. However, existing theories tend to focus on environmental influence rather than managerial control of a blogs attractiveness at a microlevel. This study provides an agentive account of the adaptive behaviours exerted by the bloggers through the ways they use contents of their blogs to locate and harness their structural network positions of…
▽ More
Blog management is central to the digitalization of work. However, existing theories tend to focus on environmental influence rather than managerial control of a blogs attractiveness at a microlevel. This study provides an agentive account of the adaptive behaviours exerted by the bloggers through the ways they use contents of their blogs to locate and harness their structural network positions of a blogosphere. We collated individual characteristics of 165 bloggers who blogged about economics, and then analysed the ways they maintained the contents of their blogs. We used network analysis and monomial logistic regression to test our model predictions. Our findings show that in contrast to less attractive blogs, bloggers who are mindful of their peers contents as a means of maintaining network positions attract a significantly higher level of traffic to their blogs. This agentive perspective offers practical insights into how nodal preferences can be reversed in blog management. We conclude the paper by discussing contributions to theory and future research.
△ Less
Submitted 30 June, 2020;
originally announced June 2020.
-
Accelerating Antimicrobial Discovery with Controllable Deep Generative Models and Molecular Dynamics
Authors:
Payel Das,
Tom Sercu,
Kahini Wadhawan,
Inkit Padhi,
Sebastian Gehrmann,
Flaviu Cipcigan,
Vijil Chenthamarakshan,
Hendrik Strobelt,
Cicero dos Santos,
Pin-Yu Chen,
Yi Yan Yang,
Jeremy Tan,
James Hedrick,
Jason Crain,
Aleksandra Mojsilovic
Abstract:
De novo therapeutic design is challenged by a vast chemical repertoire and multiple constraints, e.g., high broad-spectrum potency and low toxicity. We propose CLaSS (Controlled Latent attribute Space Sampling) - an efficient computational method for attribute-controlled generation of molecules, which leverages guidance from classifiers trained on an informative latent space of molecules modeled u…
▽ More
De novo therapeutic design is challenged by a vast chemical repertoire and multiple constraints, e.g., high broad-spectrum potency and low toxicity. We propose CLaSS (Controlled Latent attribute Space Sampling) - an efficient computational method for attribute-controlled generation of molecules, which leverages guidance from classifiers trained on an informative latent space of molecules modeled using a deep generative autoencoder. We screen the generated molecules for additional key attributes by using deep learning classifiers in conjunction with novel features derived from atomistic simulations. The proposed approach is demonstrated for designing non-toxic antimicrobial peptides (AMPs) with strong broad-spectrum potency, which are emerging drug candidates for tackling antibiotic resistance. Synthesis and testing of only twenty designed sequences identified two novel and minimalist AMPs with high potency against diverse Gram-positive and Gram-negative pathogens, including one multidrug-resistant and one antibiotic-resistant K. pneumoniae, via membrane pore formation. Both antimicrobials exhibit low in vitro and in vivo toxicity and mitigate the onset of drug resistance. The proposed approach thus presents a viable path for faster and efficient discovery of potent and selective broad-spectrum antimicrobials.
△ Less
Submitted 25 February, 2021; v1 submitted 22 May, 2020;
originally announced May 2020.
-
Sobolev Independence Criterion
Authors:
Youssef Mroueh,
Tom Sercu,
Mattia Rigotti,
Inkit Padhi,
Cicero Dos Santos
Abstract:
We propose the Sobolev Independence Criterion (SIC), an interpretable dependency measure between a high dimensional random variable X and a response variable Y . SIC decomposes to the sum of feature importance scores and hence can be used for nonlinear feature selection. SIC can be seen as a gradient regularized Integral Probability Metric (IPM) between the joint distribution of the two random var…
▽ More
We propose the Sobolev Independence Criterion (SIC), an interpretable dependency measure between a high dimensional random variable X and a response variable Y . SIC decomposes to the sum of feature importance scores and hence can be used for nonlinear feature selection. SIC can be seen as a gradient regularized Integral Probability Metric (IPM) between the joint distribution of the two random variables and the product of their marginals. We use sparsity inducing gradient penalties to promote input sparsity of the critic of the IPM. In the kernel version we show that SIC can be cast as a convex optimization problem by introducing auxiliary variables that play an important role in feature selection as they are normalized feature importance scores. We then present a neural version of SIC where the critic is parameterized as a homogeneous neural network, improving its representation power as well as its interpretability. We conduct experiments validating SIC for feature selection in synthetic and real-world experiments. We show that SIC enables reliable and interpretable discoveries, when used in conjunction with the holdout randomization test and knockoffs to control the False Discovery Rate. Code is available at http://github.com/ibm/sic.
△ Less
Submitted 30 October, 2019;
originally announced October 2019.
-
Dynamic Gesture Recognition by Using CNNs and Star RGB: a Temporal Information Condensation
Authors:
Clebeson Canuto dos Santos,
Jorge Leonid Aching Samatelo,
Raquel Frizera Vassallo
Abstract:
Due to the advance of technologies, machines are increasingly present in people's daily lives. Thus, there has been more and more effort to develop interfaces, such as dynamic gestures, that provide an intuitive way of interaction. Currently, the most common trend is to use multimodal data, as depth and skeleton information, to enable dynamic gesture recognition. However, using only color informat…
▽ More
Due to the advance of technologies, machines are increasingly present in people's daily lives. Thus, there has been more and more effort to develop interfaces, such as dynamic gestures, that provide an intuitive way of interaction. Currently, the most common trend is to use multimodal data, as depth and skeleton information, to enable dynamic gesture recognition. However, using only color information would be more interesting, since RGB cameras are usually available in almost every public place, and could be used for gesture recognition without the need of installing other equipment. The main problem with such approach is the difficulty of representing spatio-temporal information using just color. With this in mind, we propose a technique capable of condensing a dynamic gesture, shown in a video, in just one RGB image. We call this technique star RGB. This image is then passed to a classifier formed by two Resnet CNNs, a soft-attention ensemble, and a fully connected layer, which indicates the class of the gesture present in the input video. Experiments were carried out using both Montalbano and GRIT datasets. For Montalbano dataset, the proposed approach achieved an accuracy of 94.58%. Such result reaches the state-of-the-art when considering this dataset and only color information. Regarding the GRIT dataset, our proposal achieves more than 98% of accuracy, recall, precision, and F1-score, outperforming the reference approach by more than 6%.
△ Less
Submitted 8 September, 2019; v1 submitted 9 April, 2019;
originally announced April 2019.
-
Wasserstein Barycenter Model Ensembling
Authors:
Pierre Dognin,
Igor Melnyk,
Youssef Mroueh,
Jerret Ross,
Cicero Dos Santos,
Tom Sercu
Abstract:
In this paper we propose to perform model ensembling in a multiclass or a multilabel learning setting using Wasserstein (W.) barycenters. Optimal transport metrics, such as the Wasserstein distance, allow incorporating semantic side information such as word embeddings. Using W. barycenters to find the consensus between models allows us to balance confidence and semantics in finding the agreement b…
▽ More
In this paper we propose to perform model ensembling in a multiclass or a multilabel learning setting using Wasserstein (W.) barycenters. Optimal transport metrics, such as the Wasserstein distance, allow incorporating semantic side information such as word embeddings. Using W. barycenters to find the consensus between models allows us to balance confidence and semantics in finding the agreement between the models. We show applications of Wasserstein ensembling in attribute-based classification, multilabel learning and image captioning generation. These results show that the W. ensembling is a viable alternative to the basic geometric or arithmetic mean ensembling.
△ Less
Submitted 13 February, 2019;
originally announced February 2019.
-
PepCVAE: Semi-Supervised Targeted Design of Antimicrobial Peptide Sequences
Authors:
Payel Das,
Kahini Wadhawan,
Oscar Chang,
Tom Sercu,
Cicero Dos Santos,
Matthew Riemer,
Vijil Chenthamarakshan,
Inkit Padhi,
Aleksandra Mojsilovic
Abstract:
Given the emerging global threat of antimicrobial resistance, new methods for next-generation antimicrobial design are urgently needed. We report a peptide generation framework PepCVAE, based on a semi-supervised variational autoencoder (VAE) model, for designing novel antimicrobial peptide (AMP) sequences. Our model learns a rich latent space of the biological peptide context by taking advantage…
▽ More
Given the emerging global threat of antimicrobial resistance, new methods for next-generation antimicrobial design are urgently needed. We report a peptide generation framework PepCVAE, based on a semi-supervised variational autoencoder (VAE) model, for designing novel antimicrobial peptide (AMP) sequences. Our model learns a rich latent space of the biological peptide context by taking advantage of abundant, unlabeled peptide sequences. The model further learns a disentangled antimicrobial attribute space by using the feedback from a jointly trained AMP classifier that uses limited labeled instances. The disentangled representation allows for controllable generation of AMPs. Extensive analysis of the PepCVAE-generated sequences reveals superior performance of our model in comparison to a plain VAE, as PepCVAE generates novel AMP sequences with higher long-range diversity, while being closer to the training distribution of biological peptides. These features are highly desired in next-generation antimicrobial design.
△ Less
Submitted 13 November, 2018; v1 submitted 17 October, 2018;
originally announced October 2018.
-
Driving Simulator Platform for Development and Evaluation of Safety and Emergency Systems
Authors:
Andrés E. Gómez,
Tiago C. dos Santos,
Carlos M. Massera,
Arthur de M. Neto,
Denis F. Wolf
Abstract:
According to data from the United Nations, more than 3000 people have died each day in the world due to road traffic collision. Considering recent researches, the human error may be considered as the main responsible for these fatalities. Because of this, researchers seek alternatives to transfer the vehicle control from people to autonomous systems. However, providing this technological innovatio…
▽ More
According to data from the United Nations, more than 3000 people have died each day in the world due to road traffic collision. Considering recent researches, the human error may be considered as the main responsible for these fatalities. Because of this, researchers seek alternatives to transfer the vehicle control from people to autonomous systems. However, providing this technological innovation for the people may demand complex challenges in the legal, economic and technological areas. Consequently, carmakers and researchers have divided the driving automation in safety and emergency systems that improve the driver perception on the road. This may reduce the human error. Therefore, the main contribution of this study is to propose a driving simulator platform to develop and evaluate safety and emergency systems, in the first design stage. This driving simulator platform has an advantage: a flexible software structure.This allows in the simulation one adaptation for development or evaluation of a system. The proposed driving simulator platform was tested in two applications: cooperative vehicle system development and the influence evaluation of a Driving Assistance System (\textit{DAS}) on a driver. In the cooperative vehicle system development, the results obtained show that the increment of the time delay in the communication among vehicles ($V2V$) is determinant for the system performance. On the other hand, in the influence evaluation of a \textit{DAS} in a driver, it was possible to conclude that the \textit{DAS'} model does not have the level of influence necessary in a driver to avoid an accident.
△ Less
Submitted 1 February, 2018;
originally announced February 2018.
-
Improved Neural Relation Detection for Knowledge Base Question Answering
Authors:
Mo Yu,
Wenpeng Yin,
Kazi Saidul Hasan,
Cicero dos Santos,
Bing Xiang,
Bowen Zhou
Abstract:
Relation detection is a core component for many NLP applications including Knowledge Base Question Answering (KBQA). In this paper, we propose a hierarchical recurrent neural network enhanced by residual learning that detects KB relations given an input question. Our method uses deep residual bidirectional LSTMs to compare questions and relation names via different hierarchies of abstraction. Addi…
▽ More
Relation detection is a core component for many NLP applications including Knowledge Base Question Answering (KBQA). In this paper, we propose a hierarchical recurrent neural network enhanced by residual learning that detects KB relations given an input question. Our method uses deep residual bidirectional LSTMs to compare questions and relation names via different hierarchies of abstraction. Additionally, we propose a simple KBQA system that integrates entity linking and our proposed relation detector to enable one enhance another. Experimental results evidence that our approach achieves not only outstanding relation detection performance, but more importantly, it helps our KBQA system to achieve state-of-the-art accuracy for both single-relation (SimpleQuestions) and multi-relation (WebQSP) QA benchmarks.
△ Less
Submitted 27 May, 2017; v1 submitted 20 April, 2017;
originally announced April 2017.
-
Attentive Pooling Networks
Authors:
Cicero dos Santos,
Ming Tan,
Bing Xiang,
Bowen Zhou
Abstract:
In this work, we propose Attentive Pooling (AP), a two-way attention mechanism for discriminative model training. In the context of pair-wise ranking or classification with neural networks, AP enables the pooling layer to be aware of the current input pair, in a way that information from the two input items can directly influence the computation of each other's representations. Along with such rep…
▽ More
In this work, we propose Attentive Pooling (AP), a two-way attention mechanism for discriminative model training. In the context of pair-wise ranking or classification with neural networks, AP enables the pooling layer to be aware of the current input pair, in a way that information from the two input items can directly influence the computation of each other's representations. Along with such representations of the paired inputs, AP jointly learns a similarity measure over projected segments (e.g. trigrams) of the pair, and subsequently, derives the corresponding attention vector for each input to guide the pooling. Our two-way attention mechanism is a general framework independent of the underlying representation learning, and it has been applied to both convolutional neural networks (CNNs) and recurrent neural networks (RNNs) in our studies. The empirical results, from three very different benchmark tasks of question answering/answer selection, demonstrate that our proposed models outperform a variety of strong baselines and achieve state-of-the-art performance in all the benchmarks.
△ Less
Submitted 10 February, 2016;
originally announced February 2016.
-
LSTM-based Deep Learning Models for Non-factoid Answer Selection
Authors:
Ming Tan,
Cicero dos Santos,
Bing Xiang,
Bowen Zhou
Abstract:
In this paper, we apply a general deep learning (DL) framework for the answer selection task, which does not depend on manually defined features or linguistic tools. The basic framework is to build the embeddings of questions and answers based on bidirectional long short-term memory (biLSTM) models, and measure their closeness by cosine similarity. We further extend this basic model in two directi…
▽ More
In this paper, we apply a general deep learning (DL) framework for the answer selection task, which does not depend on manually defined features or linguistic tools. The basic framework is to build the embeddings of questions and answers based on bidirectional long short-term memory (biLSTM) models, and measure their closeness by cosine similarity. We further extend this basic model in two directions. One direction is to define a more composite representation for questions and answers by combining convolutional neural network with the basic framework. The other direction is to utilize a simple but efficient attention mechanism in order to generate the answer representation according to the question context. Several variations of models are provided. The models are examined by two datasets, including TREC-QA and InsuranceQA. Experimental results demonstrate that the proposed models substantially outperform several strong baselines.
△ Less
Submitted 28 March, 2016; v1 submitted 12 November, 2015;
originally announced November 2015.