Skip to main content

Showing 1–50 of 51 results for author: Oliveira, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.08487  [pdf, ps, other

    cs.CL cs.AI

    Enhancing Essay Cohesion Assessment: A Novel Item Response Theory Approach

    Authors: Bruno Alexandre Rosa, Hilário Oliveira, Luiz Rodrigues, Eduardo Araujo Oliveira, Rafael Ferreira Mello

    Abstract: Essays are considered a valuable mechanism for evaluating learning outcomes in writing. Textual cohesion is an essential characteristic of a text, as it facilitates the establishment of meaning between its parts. Automatically scoring cohesion in essays presents a challenge in the field of educational artificial intelligence. The machine learning algorithms used to evaluate texts generally do not… ▽ More

    Submitted 11 July, 2025; originally announced July 2025.

    Comments: 24 pages, 4 tables

  2. arXiv:2506.13384  [pdf

    cs.AI cs.CY stat.ME stat.OT

    Delving Into the Psychology of Machines: Exploring the Structure of Self-Regulated Learning via LLM-Generated Survey Responses

    Authors: Leonie V. D. E. Vogelsmeier, Eduardo Oliveira, Kamila Misiejuk, Sonsoles López-Pernas, Mohammed Saqr

    Abstract: Large language models (LLMs) offer the potential to simulate human-like responses and behaviors, creating new opportunities for psychological science. In the context of self-regulated learning (SRL), if LLMs can reliably simulate survey responses at scale and speed, they could be used to test intervention scenarios, refine theoretical models, augment sparse datasets, and represent hard-to-reach po… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  3. arXiv:2506.09381  [pdf

    cs.CL

    Binary classification for perceived quality of headlines and links on worldwide news websites, 2018-2024

    Authors: Austin McCutcheon, Thiago E. A. de Oliveira, Aleksandr Zheleznov, Chris Brogly

    Abstract: The proliferation of online news enables potential widespread publication of perceived low-quality news headlines/links. As a result, we investigated whether it was possible to automatically distinguish perceived lower-quality news headlines/links from perceived higher-quality headlines/links. We evaluated twelve machine learning models on a binary, balanced dataset of 57,544,214 worldwide news we… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

  4. arXiv:2505.15916  [pdf, ps, other

    cs.CL cs.AI

    BR-TaxQA-R: A Dataset for Question Answering with References for Brazilian Personal Income Tax Law, including case law

    Authors: Juvenal Domingos Júnior, Augusto Faria, E. Seiti de Oliveira, Erick de Brito, Matheus Teotonio, Andre Assumpção, Diedre Carmo, Roberto Lotufo, Jayr Pereira

    Abstract: This paper presents BR-TaxQA-R, a novel dataset designed to support question answering with references in the context of Brazilian personal income tax law. The dataset contains 715 questions from the 2024 official Q\&A document published by Brazil's Internal Revenue Service, enriched with statutory norms and administrative rulings from the Conselho Administrativo de Recursos Fiscais (CARF). We imp… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  5. arXiv:2505.08828  [pdf

    cs.CL cs.AI cs.CY

    Human-AI Collaboration or Academic Misconduct? Measuring AI Use in Student Writing Through Stylometric Evidence

    Authors: Eduardo Araujo Oliveira, Madhavi Mohoni, Sonsoles López-Pernas, Mohammed Saqr

    Abstract: As human-AI collaboration becomes increasingly prevalent in educational contexts, understanding and measuring the extent and nature of such interactions pose significant challenges. This research investigates the use of authorship verification (AV) techniques not as a punitive measure, but as a means to quantify AI assistance in academic writing, with a focus on promoting transparency, interpretab… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: 19 pages, 10 figures, 11 tables

  6. arXiv:2504.13867  [pdf, other

    cs.HC

    Mapping Executive Function Tasks for Children: A Scoping Review for Designing a Research-Oriented Platform

    Authors: Matheus Rodrigues Felizardo, Nuno Miguel Feixa Rodrigues, António Coelho, Sónia Silva Sousa, Adriana Sampaio, Eva Ferreira de Oliveira

    Abstract: Background: Executive functions (EFs) are cognitive processes essential for controlling impulses, staying focused, thinking before acting, and managing information. Childhood is a critical period for EF development, but there is a lack of standardized tools that combine EF tasks with physical activity in a gamified approach. Objectives: This scoping review maps EF tasks for children, identifies co… ▽ More

    Submitted 28 March, 2025; originally announced April 2025.

  7. arXiv:2503.22741  [pdf, other

    cs.CY cs.LG

    Concept Map Assessment Through Structure Classification

    Authors: Laís P. V. Vossen, Isabela Gasparini, Elaine H. T. Oliveira, Berrit Czinczel, Ute Harms, Lukas Menzel, Sebastian Gombert, Knut Neumann, Hendrik Drachsler

    Abstract: Due to their versatility, concept maps are used in various educational settings and serve as tools that enable educators to comprehend students' knowledge construction. An essential component for analyzing a concept map is its structure, which can be categorized into three distinct types: spoke, network, and chain. Understanding the predominant structure in a map offers insights into the student's… ▽ More

    Submitted 26 March, 2025; originally announced March 2025.

  8. arXiv:2501.14249  [pdf, other

    cs.LG cs.AI cs.CL

    Humanity's Last Exam

    Authors: Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, Sean Shi, Michael Choi, Anish Agrawal, Arnav Chopra, Adam Khoja, Ryan Kim, Richard Ren, Jason Hausenloy, Oliver Zhang, Mantas Mazeika, Dmitry Dodonov, Tung Nguyen, Jaeho Lee, Daron Anderson, Mikhail Doroshenko, Alun Cennyth Stokes , et al. (1084 additional authors not shown)

    Abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90\% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of… ▽ More

    Submitted 19 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 29 pages, 6 figures

  9. arXiv:2412.02863  [pdf, ps, other

    cs.RO cs.AI cs.LG

    Proximal Control of UAVs with Federated Learning for Human-Robot Collaborative Domains

    Authors: Lucas Nogueira Nobrega, Ewerton de Oliveira, Martin Saska, Tiago Nascimento

    Abstract: The human-robot interaction (HRI) is a growing area of research. In HRI, complex command (action) classification is still an open problem that usually prevents the real applicability of such a technique. The literature presents some works that use neural networks to detect these actions. However, occlusion is still a major issue in HRI, especially when using uncrewed aerial vehicles (UAVs), since,… ▽ More

    Submitted 25 June, 2025; v1 submitted 3 December, 2024; originally announced December 2024.

    Comments: version 2

  10. arXiv:2411.00158  [pdf, other

    cs.CV cs.LG

    Using Deep Neural Networks to Quantify Parking Dwell Time

    Authors: Marcelo Eduardo Marques Ribas, Heloisa Benedet Mendes, Luiz Eduardo Soares de Oliveira, Luiz Antonio Zanlorensi, Paulo Ricardo Lisboa de Almeida

    Abstract: In smart cities, it is common practice to define a maximum length of stay for a given parking space to increase the space's rotativity and discourage the usage of individual transportation solutions. However, automatically determining individual car dwell times from images faces challenges, such as images collected from low-resolution cameras, lighting variations, and weather effects. In this work… ▽ More

    Submitted 31 October, 2024; originally announced November 2024.

    Comments: Paper accepted to the 2024 International Conference on Machine Learning and Applications

  11. Investigating Student Reasoning in Method-Level Code Refactoring: A Think-Aloud Study

    Authors: Eduardo Carneiro Oliveira, Hieke Keuning, Johan Jeuring

    Abstract: Producing code of good quality is an essential skill in software development. Code quality is an aspect of software quality that concerns the directly observable properties of code, such as decomposition, modularization, and code flow. Code quality can often be improved by means of code refactoring -- an internal change made to code that does not alter its observable behavior. According to the ACM… ▽ More

    Submitted 5 November, 2024; v1 submitted 28 October, 2024; originally announced October 2024.

  12. arXiv:2410.14705  [pdf, other

    cs.CV cs.LG

    Optimizing Parking Space Classification: Distilling Ensembles into Lightweight Classifiers

    Authors: Paulo Luza Alves, André Hochuli, Luiz Eduardo de Oliveira, Paulo Lisboa de Almeida

    Abstract: When deploying large-scale machine learning models for smart city applications, such as image-based parking lot monitoring, data often must be sent to a central server to perform classification tasks. This is challenging for the city's infrastructure, where image-based applications require transmitting large volumes of data, necessitating complex network and hardware infrastructures to process the… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

    Comments: Accepted for presentation at the International Conference on Machine Learning and Applications (ICMLA) 2024

  13. A Small Claims Court for the NLP: Judging Legal Text Classification Strategies With Small Datasets

    Authors: Mariana Yukari Noguti, Edduardo Vellasques, Luiz Eduardo Soares Oliveira

    Abstract: Recent advances in language modelling has significantly decreased the need of labelled data in text classification tasks. Transformer-based models, pre-trained on unlabeled data, can outmatch the performance of models trained from scratch for each task. However, the amount of labelled data need to fine-tune such type of model is still considerably high for domains requiring expert-level annotators… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

  14. arXiv:2407.18985  [pdf, other

    cs.SD eess.AS

    Implementation and Applications of WakeWords Integrated with Speaker Recognition: A Case Study

    Authors: Alexandre Costa Ferro Filho, Elisa Ayumi Masasi de Oliveira, Iago Alves Brito, Pedro Martins Bittencourt

    Abstract: This paper explores the application of artificial intelligence techniques in audio and voice processing, focusing on the integration of wake words and speaker recognition for secure access in embedded systems. With the growing prevalence of voice-activated devices such as Amazon Alexa, ensuring secure and user-specific interactions has become paramount. Our study aims to enhance the security frame… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

  15. arXiv:2405.09787  [pdf, other

    eess.IV cs.CV cs.LG

    Analysis of the BraTS 2023 Intracranial Meningioma Segmentation Challenge

    Authors: Dominic LaBella, Ujjwal Baid, Omaditya Khanna, Shan McBurney-Lin, Ryan McLean, Pierre Nedelec, Arif Rashid, Nourel Hoda Tahon, Talissa Altes, Radhika Bhalerao, Yaseen Dhemesh, Devon Godfrey, Fathi Hilal, Scott Floyd, Anastasia Janas, Anahita Fathi Kazerooni, John Kirkpatrick, Collin Kent, Florian Kofler, Kevin Leu, Nazanin Maleki, Bjoern Menze, Maxence Pajot, Zachary J. Reitman, Jeffrey D. Rudie , et al. (97 additional authors not shown)

    Abstract: We describe the design and results from the BraTS 2023 Intracranial Meningioma Segmentation Challenge. The BraTS Meningioma Challenge differed from prior BraTS Glioma challenges in that it focused on meningiomas, which are typically benign extra-axial tumors with diverse radiologic and anatomical presentation and a propensity for multiplicity. Nine participating teams each developed deep-learning… ▽ More

    Submitted 7 March, 2025; v1 submitted 15 May, 2024; originally announced May 2024.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) https://melba-journal.org/2025:003 22 pages, 6 tables, 12 figures, MICCAI, MELBA

    Journal ref: Machine.Learning.for.Biomedical.Imaging. 3 (2025)

  16. arXiv:2404.06976  [pdf, other

    cs.IR

    Quati: A Brazilian Portuguese Information Retrieval Dataset from Native Speakers

    Authors: Mirelle Bueno, Eduardo Seiti de Oliveira, Rodrigo Nogueira, Roberto A. Lotufo, Jayr Alencar Pereira

    Abstract: Despite Portuguese being one of the most spoken languages in the world, there is a lack of high-quality information retrieval datasets in that language. We present Quati, a dataset specifically designed for the Brazilian Portuguese language. It comprises a collection of queries formulated by native speakers and a curated set of documents sourced from a selection of high-quality Brazilian Portugues… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 22 pages

  17. arXiv:2402.18511  [pdf

    cs.RO

    Leveraging Compliant Tactile Perception for Haptic Blind Surface Reconstruction

    Authors: Laurent Yves Emile Ramos Cheret, Vinicius Prado da Fonseca, Thiago Eustaquio Alves de Oliveira

    Abstract: Non-flat surfaces pose difficulties for robots operating in unstructured environments. Reconstructions of uneven surfaces may only be partially possible due to non-compliant end-effectors and limitations on vision systems such as transparency, reflections, and occlusions. This study achieves blind surface reconstruction by harnessing the robotic manipulator's kinematic data and a compliant tactile… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 7 pages, 9 figures, 2024 IEEE International Conference on Robotics and Automation (ICRA 2024)

  18. arXiv:2306.06834  [pdf, other

    cs.SE

    Motivational models for validating agile requirements in Software Engineering subjects

    Authors: Eduardo A. Oliveira, Leon Sterling

    Abstract: This paper describes how motivational models can be used to cross check agile requirements artifacts to improve consistency and completeness of software requirements. Motivational models provide a high level understanding of the purposes of a software system. They complement personas and user stories which focus more on user needs rather than on system features. We present an exploratory case stud… ▽ More

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: 9 pages, 2 figures, SERP'21 - The 19th International Conference on Software Engineering Research and Practice

  19. arXiv:2208.10602  [pdf, other

    cs.CR cs.NI

    ABL: An original active blacklist based on a modification of the SMTP

    Authors: Pablo M. Oliveira, Mateus B. Vieira, Isaac C. Ferreira, João P. R. R. Leite, Edvard M. Oliveira, Bruno T. Kuehne, Edmilson M. Moreira, Otávio A. S. Carpinteiro

    Abstract: This paper presents a novel Active Blacklist (ABL) based on a modification of the Simple Mail Transfer Protocol (SMTP). ABL was implemented in the Mail Transfer Agent (MTA) Postfix of the e-mail server Zimbra and assessed exhaustively in a series of experiments. The modified server Zimbra showed computational performance and costs similar to those of the original server Zimbra when receiving legit… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: 18 pages, 6 figures, 5 tables

  20. arXiv:2206.08537  [pdf, ps, other

    cs.CV cs.LG

    Large-Margin Representation Learning for Texture Classification

    Authors: Jonathan de Matos, Luiz Eduardo Soares de Oliveira, Alceu de Souza Britto Junior, Alessandro Lameiras Koerich

    Abstract: This paper presents a novel approach combining convolutional layers (CLs) and large-margin metric learning for training supervised models on small datasets for texture classification. The core of such an approach is a loss function that computes the distances between instances of interest and support vectors. The objective is to update the weights of CLs iteratively to learn a representation with… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: 7 pages

  21. arXiv:2202.08176  [pdf, other

    cs.LG cs.AI

    Bias and unfairness in machine learning models: a systematic literature review

    Authors: Tiago Palma Pagano, Rafael Bessa Loureiro, Fernanda Vitória Nascimento Lisboa, Gustavo Oliveira Ramos Cruz, Rodrigo Matos Peixoto, Guilherme Aragão de Sousa Guimarães, Lucas Lisboa dos Santos, Maira Matos Araujo, Marco Cruz, Ewerton Lopes Silva de Oliveira, Ingrid Winkler, Erick Giovani Sperandio Nascimento

    Abstract: One of the difficulties of artificial intelligence is to ensure that model decisions are fair and free of bias. In research, datasets, metrics, techniques, and tools are applied to detect and mitigate algorithmic unfairness and bias. This study aims to examine existing knowledge on bias and unfairness in Machine Learning models, identifying mitigation methods, fairness metrics, and supporting tool… ▽ More

    Submitted 3 November, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

  22. arXiv:2102.03889  [pdf, other

    cs.CV

    Machine Learning Methods for Histopathological Image Analysis: A Review

    Authors: Jonathan de Matos, Steve Tsham Mpinda Ataky, Alceu de Souza Britto Jr., Luiz Eduardo Soares de Oliveira, Alessandro Lameiras Koerich

    Abstract: Histopathological images (HIs) are the gold standard for evaluating some types of tumors for cancer diagnosis. The analysis of such images is not only time and resource consuming, but also very challenging even for experienced pathologists, resulting in inter- and intra-observer disagreements. One of the ways of accelerating such an analysis is to use computer-aided diagnosis (CAD) systems. In thi… ▽ More

    Submitted 7 February, 2021; originally announced February 2021.

    Comments: 45 pages. arXiv admin note: text overlap with arXiv:1904.07900

  23. arXiv:2011.13676  [pdf

    cs.NI

    A Survey on Blockchain and Edge Computing applied to the Internet of Vehicles

    Authors: Anderson Queiroz, Eduardo Oliveira, Maria Barbosa, Kelvin Dias

    Abstract: With the advent of Intelligent Transportation Systems (ITS), data from diverse sensors either embedded into the vehicles or present along with the smart city infrastructure, are of utmost importance and require both processing power and efficient trust mechanisms for information exchange in vehicle-to-everything (V2X) communications. To accomplish these requirements, both edge computing and blockc… ▽ More

    Submitted 1 December, 2020; v1 submitted 27 November, 2020; originally announced November 2020.

    Comments: 6 pages, 2 pictures and 1 table (IEEE International Conference on Advanced Networks and Telecommunications Systems - Workshop on New Advances on Vehicle-to-Everything (V2X) Communications and Networking, 14-17 December 2020)

  24. arXiv:2011.00160  [pdf, other

    cs.CV cs.AI cs.LG

    Automatic Chronic Degenerative Diseases Identification Using Enteric Nervous System Images

    Authors: Gustavo Z. Felipe, Jacqueline N. Zanoni, Camila C. Sehaber-Sierakowski, Gleison D. P. Bossolani, Sara R. G. Souza, Franklin C. Flores, Luiz E. S. Oliveira, Rodolfo M. Pereira, Yandre M. G. Costa

    Abstract: Studies recently accomplished on the Enteric Nervous System have shown that chronic degenerative diseases affect the Enteric Glial Cells (EGC) and, thus, the development of recognition methods able to identify whether or not the EGC are affected by these type of diseases may be helpful in its diagnoses. In this work, we propose the use of pattern recognition and machine learning techniques to eval… ▽ More

    Submitted 30 October, 2020; originally announced November 2020.

  25. arXiv:2009.10808  [pdf

    cs.LG stat.AP

    Using Machine Learning to Develop a Novel COVID-19 Vulnerability Index (C19VI)

    Authors: Anuj Tiwari, Arya V. Dadhania, Vijay Avin Balaji Ragunathrao, Edson R. A. Oliveira

    Abstract: COVID19 is now one of the most leading causes of death in the United States. Systemic health, social and economic disparities have put the minorities and economically poor communities at a higher risk than others. There is an immediate requirement to develop a reliable measure of county-level vulnerabilities that can capture the heterogeneity of both vulnerable communities and the COVID19 pandemic… ▽ More

    Submitted 22 September, 2020; originally announced September 2020.

  26. arXiv:2009.04346  [pdf, other

    cs.AI cs.LG cs.NI

    A Methodological Approach to Model CBR-based Systems

    Authors: Eliseu M. Oliveira, Rafael F. Reale, Joberto S. B. Martins

    Abstract: Artificial intelligence (AI) has been used in various areas to support system optimization and find solutions where the complexity makes it challenging to use algorithmic and heuristics. Case-based Reasoning (CBR) is an AI technique intensively exploited in domains like management, medicine, design, construction, retail and smart grid. CBR is a technique for problem-solving and captures new knowle… ▽ More

    Submitted 9 September, 2020; originally announced September 2020.

    Comments: pp 1-16, 3 figures

    ACM Class: I.2.1; F.4.1

    Journal ref: Journal of Computer and Communications, September 2020, ISSN Online: 2327-5227

  27. Predicting MOOCs Dropout Using Only Two Easily Obtainable Features from the First Week's Activities

    Authors: Ahmed Alamri, Mohammad Alshehri, Alexandra I. Cristea, Filipe D. Pereira, Elaine Oliveira, Lei Shi, Craig Stewart

    Abstract: While Massive Open Online Course (MOOCs) platforms provide knowledge in a new and unique way, the very high number of dropouts is a significant drawback. Several features are considered to contribute towards learner attrition or lack of interest, which may lead to disengagement or total dropout. The jury is still out on which factors are the most appropriate predictors. However, the literature agr… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

    Comments: Intelligent Tutoring Systems. ITS 2019. Lecture Notes in Computer Science, vol 11528. Springer, Cham

  28. arXiv:2006.14711  [pdf, ps, other

    cs.CY

    New Metrics for Learning Evaluation in Digital Education Platforms

    Authors: Gabriel Leitão, Juan Colonna, Edwin Monteiro, Elaine Oliveira, Raimundo Barreto

    Abstract: Technology applied in education can provide great benefits and overcome challenges by facilitating access to learning objects anywhere and anytime. However, technology alone is not enough, since it requires suitable planning and learning methodologies. Using technology can be problematic, especially in determining whether learning has occurred or not. Futhermore, if learning has not occured, techn… ▽ More

    Submitted 22 September, 2022; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: 12 pages, 6 figures, 12 tables

  29. arXiv:2006.09197  [pdf, other

    cs.CV cs.CG

    Dense Non-Rigid Structure from Motion: A Manifold Viewpoint

    Authors: Suryansh Kumar, Luc Van Gool, Carlos E. P. de Oliveira, Anoop Cherian, Yuchao Dai, Hongdong Li

    Abstract: Non-Rigid Structure-from-Motion (NRSfM) problem aims to recover 3D geometry of a deforming object from its 2D feature correspondences across multiple frames. Classical approaches to this problem assume a small number of feature points and, ignore the local non-linearities of the shape deformation, and therefore, struggles to reliably model non-linear deformations. Furthermore, available dense NRSf… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

    Comments: A comprehensive version that combines our cvpr 2018 and cvpr 2019 work (Still under development and refinement, Initial Version). 13 Figures, 1 Table. arXiv admin note: text overlap with arXiv:1902.01077

  30. arXiv:2005.09110  [pdf, other

    cs.CV cs.LG

    Two-View Fine-grained Classification of Plant Species

    Authors: Voncarlos M. Araujo, Alceu S. Britto Jr., Luiz E. S. Oliveira, Alessandro L. Koerich

    Abstract: Automatic plant classification is a challenging problem due to the wide biodiversity of the existing plant species in a fine-grained scenario. Powerful deep learning architectures have been used to improve the classification performance in such a fine-grained problem, but usually building models that are highly dependent on a large training dataset and which are not scalable. In this paper, we pro… ▽ More

    Submitted 4 October, 2021; v1 submitted 18 May, 2020; originally announced May 2020.

  31. An End-to-End Approach for Recognition of Modern and Historical Handwritten Numeral Strings

    Authors: Andre G. Hochuli, Alceu S. Britto Jr., Jean P. Barddal, Luiz E. S. Oliveira, Robert Sabourin

    Abstract: An end-to-end solution for handwritten numeral string recognition is proposed, in which the numeral string is considered as composed of objects automatically detected and recognized by a YoLo-based model. The main contribution of this paper is to avoid heuristic-based methods for string preprocessing and segmentation, the need for task-oriented classifiers, and also the use of specific constraints… ▽ More

    Submitted 28 March, 2020; originally announced April 2020.

  32. arXiv:2002.00072  [pdf, other

    eess.IV cs.LG stat.ML

    Data Augmentation for Histopathological Images Based on Gaussian-Laplacian Pyramid Blending

    Authors: Steve Tsham Mpinda Ataky, Jonathan de Matos, Alceu de S. Britto Jr., Luiz E. S. Oliveira, Alessandro L. Koerich

    Abstract: Data imbalance is a major problem that affects several machine learning (ML) algorithms. Such a problem is troublesome because most of the ML algorithms attempt to optimize a loss function that does not take into account the data imbalance. Accordingly, the ML algorithm simply generates a trivial model that is biased toward predicting the most frequent class in the training data. In the case of hi… ▽ More

    Submitted 16 May, 2020; v1 submitted 31 January, 2020; originally announced February 2020.

    Comments: 8 pages

    Journal ref: IEEE International Joint Conference on Neural Networks (IJCNN 2020), Glasgow, UK

  33. SemClinBr -- a multi institutional and multi specialty semantically annotated corpus for Portuguese clinical NLP tasks

    Authors: Lucas Emanuel Silva e Oliveira, Ana Carolina Peters, Adalniza Moura Pucca da Silva, Caroline P. Gebeluca, Yohan Bonescki Gumiel, Lilian Mie Mukai Cintho, Deborah Ribeiro Carvalho, Sadid A. Hasan, Claudia Maria Cabral Moro

    Abstract: The high volume of research focusing on extracting patient's information from electronic health records (EHR) has led to an increase in the demand for annotated corpora, which are a very valuable resource for both the development and evaluation of natural language processing (NLP) algorithms. The absence of a multi-purpose clinical corpus outside the scope of the English language, especially in Br… ▽ More

    Submitted 27 January, 2020; originally announced January 2020.

  34. arXiv:1907.09404  [pdf, other

    cs.CV cs.LG cs.MM

    Deep Learning Approaches for Image Retrieval and Pattern Spotting in Ancient Documents

    Authors: Kelly Lais Wiggers, Alceu de Souza Britto Junior, Alessandro Lameiras Koerich, Laurent Heutte, Luiz Eduardo Soares de Oliveira

    Abstract: This paper describes two approaches for content-based image retrieval and pattern spotting in document images using deep learning. The first approach uses a pre-trained CNN model to cope with the lack of training data, which is fine-tuned to achieve a compact yet discriminant representation of queries and image candidates. The second approach uses a Siamese Convolution Neural Network trained on a… ▽ More

    Submitted 22 July, 2019; originally announced July 2019.

    Comments: The paper is under consideration at Pattern Recognition Letters

  35. arXiv:1907.07762  [pdf, other

    cs.CY eess.SY

    Agro 4.0: A Green Information System for Sustainable Agroecosystem Management

    Authors: Eugênio Pacceli Reis da Fonseca, Evandro Caldeira, Heitor Soares Ramos Filho, Leonardo Barbosa e Oliveira, Adriano César Machado Pereira, Pierre Santos Vilela

    Abstract: Agriculture is one of the most critical activities developed today by humankind and is in constant technical evolution to supply food and other essential products to everlasting and increasing demand. New machines, seeds, and fertilizers were developed to increase the productivity of cultivated areas. It is estimated that by 2050 we will have a population of 9 billion people and the production of… ▽ More

    Submitted 11 July, 2019; originally announced July 2019.

  36. arXiv:1905.12005  [pdf, other

    cs.CV eess.IV

    Texture CNN for Histopathological Image Classification

    Authors: Jonathan de Matos, Alceu de S. Britto Jr., Luiz E. S. de Oliveira, Alessandro L. Koerich

    Abstract: Biopsies are the gold standard for breast cancer diagnosis. This task can be improved by the use of Computer Aided Diagnosis (CAD) systems, reducing the time of diagnosis and reducing the inter and intra-observer variability. The advances in computing have brought this type of system closer to reality. However, datasets of Histopathological Images (HI) from biopsies are quite small and unbalanced… ▽ More

    Submitted 28 May, 2019; originally announced May 2019.

  37. arXiv:1904.07900  [pdf, other

    cs.CV cs.LG

    Histopathologic Image Processing: A Review

    Authors: Jonathan de Matos, Alceu de Souza Britto Jr., Luiz E. S. Oliveira, Alessandro L. Koerich

    Abstract: Histopathologic Images (HI) are the gold standard for evaluation of some tumors. However, the analysis of such images is challenging even for experienced pathologists, resulting in problems of inter and intra observer. Besides that, the analysis is time and resource consuming. One of the ways to accelerate such an analysis is by using Computer Aided Diagnosis systems. In this work we present a lit… ▽ More

    Submitted 16 April, 2019; originally announced April 2019.

  38. arXiv:1904.07834  [pdf, other

    cs.CV cs.LG

    Double Transfer Learning for Breast Cancer Histopathologic Image Classification

    Authors: Jonathan de Matos, Alceu de S. Britto Jr., Luiz E. S. Oliveira, Alessandro L. Koerich

    Abstract: This work proposes a classification approach for breast cancer histopathologic images (HI) that uses transfer learning to extract features from HI using an Inception-v3 CNN pre-trained with ImageNet dataset. We also use transfer learning on training a support vector machine (SVM) classifier on a tissue labeled colorectal cancer dataset aiming to filter the patches from a breast cancer HI and remov… ▽ More

    Submitted 16 April, 2019; originally announced April 2019.

  39. Cognitive Management of Bandwidth Allocation Models with Case-Based Reasoning -- Evidences Towards Dynamic BAM Reconfiguration

    Authors: Eliseu M. Oliveira, Rafael Freitas Reale, Joberto S. B. Martins

    Abstract: Management is a complex task in today's heterogeneous and large scale networks like Cloud, IoT, vehicular and MPLS networks. Likewise, researchers and developers envision the use of artificial intelligence techniques to create cognitive and autonomic management tools that aim better assist and enhance the management process cycle. Bandwidth allocation models (BAMs) are a resource allocation soluti… ▽ More

    Submitted 1 April, 2019; originally announced April 2019.

    Comments: IEEE Symposium on Computers and Communications - ISCC 2018

  40. Evaluating CBR Similarity Functions for BAM Switching in Networks with Dynamic Traffic Profile

    Authors: Eliseu Oliveira, Rafael Freitas, Joberto Martins

    Abstract: In an increasingly complex scenario for network management, a solution that allows configuration in more autonomous way with less intervention of the network manager is expected. This paper presents an evaluation of similarity functions that are necessary in the context of using a learning strategy for finding solutions. The learning approach considered is based on Case-Based Reasoning (CBR) and i… ▽ More

    Submitted 8 June, 2018; originally announced June 2018.

    Comments: https://lrsm.ibisc.univ-evry.fr/Advance2017/

  41. arXiv:1804.09279  [pdf, other

    cs.CV

    Segmentation-Free Approaches for Handwritten Numeral String Recognition

    Authors: Andre G Hochuli, Luiz E S Oliveira, Alceu S Britto Jr, Robert Sabourin

    Abstract: This paper presents segmentation-free strategies for the recognition of handwritten numeral strings of unknown length. A synthetic dataset of touching numeral strings of sizes 2-, 3- and 4-digits was created to train end-to-end solutions based on Convolutional Neural Networks. A robust experimental protocol is used to show that the proposed segmentation-free methods may reach the state-of-the-art… ▽ More

    Submitted 27 April, 2018; v1 submitted 24 April, 2018; originally announced April 2018.

    Comments: Paper accepted for publication on IJCNN 2018

  42. arXiv:1711.07295  [pdf, other

    cs.DB cs.DC

    Bitmap Filter: Speeding up Exact Set Similarity Joins with Bitwise Operations

    Authors: Edans F. O. Sandes, George Teodoro, Alba C. M. A. Melo

    Abstract: The Exact Set Similarity Join problem aims to find all similar sets between two collections of sets, with respect to a threshold and a similarity function such as overlap, Jaccard, dice or cosine. The naive approach verifies all pairs of sets and it is often considered impractical due the high number of combinations. So, Exact Set Similarity Join algorithms are usually based on the Filter-Verifica… ▽ More

    Submitted 20 November, 2017; originally announced November 2017.

    Comments: 13 pages, 14 figures

  43. arXiv:1710.02763  [pdf, other

    cs.CY

    Paperclickers: Affordable Solution for Classroom Response Systems

    Authors: Eduardo Oliveira, Jomara Bindá, Renato Lopes, Eduardo Valle

    Abstract: We propose a low-cost classroom response system requiring a single mobile device for the teacher and cards with printed codes for the students. We aim at broadening the adoption of active learning techniques in developing countries, offering a tool for easy implementation. We embody the solution as a smartphone application, describing the development history, pitfalls, and lessons learned that mig… ▽ More

    Submitted 7 October, 2017; originally announced October 2017.

    Comments: 12 pages, 13 figures

    MSC Class: 97U50

  44. arXiv:1709.00947  [pdf, other

    cs.CL cs.LG

    Learning Word Embeddings from the Portuguese Twitter Stream: A Study of some Practical Aspects

    Authors: Pedro Saleiro, Luís Sarmento, Eduarda Mendes Rodrigues, Carlos Soares, Eugénio Oliveira

    Abstract: This paper describes a preliminary study for producing and distributing a large-scale database of embeddings from the Portuguese Twitter stream. We start by experimenting with a relatively small sample and focusing on three challenges: volume of training data, vocabulary size and intrinsic evaluation metrics. Using a single GPU, we were able to scale up vocabulary size from 2048 words embedded and… ▽ More

    Submitted 4 September, 2017; originally announced September 2017.

  45. arXiv:1704.05091  [pdf, ps, other

    cs.CL cs.IR

    FEUP at SemEval-2017 Task 5: Predicting Sentiment Polarity and Intensity with Financial Word Embeddings

    Authors: Pedro Saleiro, Eduarda Mendes Rodrigues, Carlos Soares, Eugénio Oliveira

    Abstract: This paper presents the approach developed at the Faculty of Engineering of University of Porto, to participate in SemEval 2017, Task 5: Fine-grained Sentiment Analysis on Financial Microblogs and News. The task consisted in predicting a real continuous variable from -1.0 to +1.0 representing the polarity and intensity of sentiment concerning companies/stocks mentioned in short texts. We modeled t… ▽ More

    Submitted 17 April, 2017; originally announced April 2017.

  46. arXiv:1704.00326  [pdf, other

    cs.CV

    People Counting in Crowded and Outdoor Scenes using a Hybrid Multi-Camera Approach

    Authors: Fabio Dittrich, Luiz E. S. de Oliveira, Alceu S. Britto Jr., Alessandro L. Koerich

    Abstract: This paper presents two novel approaches for people counting in crowded and open environments that combine the information gathered by multiple views. Multiple camera are used to expand the field of view as well as to mitigate the problem of occlusion that commonly affects the performance of counting methods using single cameras. The first approach is regarded as a direct approach and it attempts… ▽ More

    Submitted 8 May, 2017; v1 submitted 2 April, 2017; originally announced April 2017.

  47. arXiv:1610.09936  [pdf, ps, other

    physics.soc-ph cs.SI

    Human Mobility in Large Cities as a Proxy for Crime

    Authors: Carlos Caminha, Vasco Furtado, Tarcisio H. C. Pequeno, Caio Ponte, Hygor P. M. Melo, Erneson A. Oliveira, José S. Andrade Jr

    Abstract: We investigate at the subscale of the neighborhoods of a highly populated city the incidence of property crimes in terms of both the resident and the floating population. Our results show that a relevant allometric relation could only be observed between property crimes and floating population. More precisely, the evidence of a superlinear behavior indicates that a disproportional number of proper… ▽ More

    Submitted 27 October, 2016; originally announced October 2016.

    Comments: 17 pages, 8 Figures

  48. arXiv:1601.00855  [pdf, other

    cs.IR

    TimeMachine: Entity-centric Search and Visualization of News Archives

    Authors: Pedro Saleiro, Jorge Teixeira, Carlos Soares, Eugénio Oliveira

    Abstract: We present a dynamic web tool that allows interactive search and visualization of large news archives using an entity-centric approach. Users are able to search entities using keyword phrases expressing news stories or events and the system retrieves the most relevant entities to the user query based on automatically extracted and indexed entity profiles. From the computational journalism perspect… ▽ More

    Submitted 5 January, 2016; originally announced January 2016.

    Comments: Advances in Information Retrieval: 38th European Conference on IR Research, ECIR 2016, Padua, Italy, March 20-23, 2016

  49. arXiv:1512.05448  [pdf, other

    math.OC cs.DS math.CO

    ADMM for the SDP relaxation of the QAP

    Authors: Danilo Elias Oliveira, Henry Wolkowicz, Yangyang Xu

    Abstract: The semidefinite programming (SDP) relaxation has proven to be extremely strong for many hard discrete optimization problems. This is in particular true for the quadratic assignment problem (QAP), arguably one of the hardest NP-hard discrete optimization problems. There are several difficulties that arise in efficiently solving the SDP relaxation, e.g.,~increased dimension; inefficiency of the cur… ▽ More

    Submitted 16 December, 2015; originally announced December 2015.

    Comments: 12 pages, 1 table

    MSC Class: 90C22; 90B80; 90C46; 90-08

  50. arXiv:1408.2889  [pdf, other

    cs.LG cs.NE

    A Classifier-free Ensemble Selection Method based on Data Diversity in Random Subspaces

    Authors: Albert H. R. Ko, Robert Sabourin, Alceu S. Britto Jr, Luiz E. S. Oliveira

    Abstract: The Ensemble of Classifiers (EoC) has been shown to be effective in improving the performance of single classifiers by combining their outputs, and one of the most important properties involved in the selection of the best EoC from a pool of classifiers is considered to be classifier diversity. In general, classifier diversity does not occur randomly, but is generated systematically by various ens… ▽ More

    Submitted 12 August, 2014; originally announced August 2014.

    ACM Class: I.5.2; I.5.3