Showing 1–14 of 14 results for author: Dao, K

  1. arXiv:2502.18493 [pdf, other]

    cs.CE cs.AI

    Rule-based autocorrection of Piping and Instrumentation Diagrams (P&IDs) on graphs

    Authors: Lukas Schulze Balhorn, Niels Seijsener, Kevin Dao, Minji Kim, Dominik P. Goldstein, Ge H. M. Driessen, Artur M. Schweidtmann

    Abstract: A piping and instrumentation diagram (P&ID) is a central reference document in chemical process engineering. Currently, chemical engineers manually review P&IDs through visual inspection to find and rectify errors. However, engineering projects can involve hundreds to thousands of P&ID pages, creating a significant revision workload. This study proposes a rule-based method to support engineers wit…

    Submitted 18 February, 2025; originally announced February 2025.

  2. arXiv:2502.12386 [pdf, other]

    stat.AP cs.AI

    Bridging the Data Gap in AI Reliability Research and Establishing DR-AIR, a Comprehensive Data Repository for AI Reliability

    Authors: Simin Zheng, Jared M. Clark, Fatemeh Salboukh, Priscila Silva, Karen da Mata, Fenglian Pan, Jie Min, Jiayi Lian, Caleb B. King, Lance Fiondella, Jian Liu, Xinwei Deng, Yili Hong

    Abstract: Artificial intelligence (AI) technology and systems have been advancing rapidly. However, ensuring the reliability of these systems is crucial for fostering public confidence in their use. This necessitates the modeling and analysis of reliability data specific to AI systems. A major challenge in AI reliability research, particularly for those in academia, is the lack of readily available AI relia…

    Submitted 17 February, 2025; originally announced February 2025.

    Comments: 34 pages, 12 figures

  3. arXiv:2501.07102 [pdf, other]

    cs.CL cs.AI cs.SD eess.AS

    AdaCS: Adaptive Normalization for Enhanced Code-Switching ASR

    Authors: The Chuong Chu, Vu Tuan Dat Pham, Kien Dao, Hoang Nguyen, Quoc Hung Truong

    Abstract: Intra-sentential code-switching (CS) refers to the alternation between languages that happens within a single utterance and is a significant challenge for Automatic Speech Recognition (ASR) systems. For example, when a Vietnamese speaker uses foreign proper names or specialized terms within their speech. ASR systems often struggle to accurately transcribe intra-sentential CS due to their training…

    Submitted 13 January, 2025; originally announced January 2025.

    Comments: Accepted at ICASSP 2025

  4. arXiv:2308.06309 [pdf, other]

    eess.SY cs.LG cs.PF

    Predicting Resilience with Neural Networks

    Authors: Karen da Mata, Priscila Silva, Lance Fiondella

    Abstract: Resilience engineering studies the ability of a system to survive and recover from disruptive events, which finds applications in several domains. Most studies emphasize resilience metrics to quantify system performance, whereas recent studies propose statistical modeling approaches to project system recovery time after degradation. Moreover, past studies are either performed on data after recover…

    Submitted 11 August, 2023; originally announced August 2023.

  5. arXiv:2308.02914 [pdf, other]

    cs.AI q-fin.GN

    Anomaly Detection in Global Financial Markets with Graph Neural Networks and Nonextensive Entropy

    Authors: Kleyton da Costa

    Abstract: Anomaly detection is a challenging task, particularly in systems with many variables. Anomalies are outliers that statistically differ from the analyzed data and can arise from rare events, malfunctions, or system misuse. This study investigated the ability to detect anomalies in global financial markets through Graph Neural Networks (GNN) considering an uncertainty scenario measured by a nonexten…

    Submitted 8 August, 2023; v1 submitted 5 August, 2023; originally announced August 2023.

  6. arXiv:2302.12094 [pdf, other]

    cs.LG cs.AI

    Evaluating Explainability in Machine Learning Predictions through Explainer-Agnostic Metrics

    Authors: Cristian Munoz, Kleyton da Costa, Bernardo Modenesi, Adriano Koshiyama

    Abstract: The rapid integration of artificial intelligence (AI) into various industries has introduced new challenges in governance and regulation, particularly regarding the understanding of complex AI systems. A critical demand from decision-makers is the ability to explain the results of machine learning models, which is essential for fostering trust and ensuring ethical AI practices. In this paper, we d…

    Submitted 6 November, 2024; v1 submitted 23 February, 2023; originally announced February 2023.

  7. arXiv:2212.10913 [pdf]

    cs.CR cs.LG

    Ensemble learning techniques for intrusion detection system in the context of cybersecurity

    Authors: Andricson Abeline Moreira, Carlos A. C. Tojeiro, Carlos J. Reis, Gustavo Henrique Massaro, Igor Andrade Brito, Kelton A. P. da Costa

    Abstract: Recently, there has been an interest in improving the resources available in Intrusion Detection System (IDS) techniques. In this sense, several studies related to cybersecurity show that the environment invasions and information kidnapping are increasingly recurrent and complex. The criticality of the business involving operations in an environment using computing resources does not allow the vul…

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: In Portuguese. CIACA - Conferencia Ibero-Americana Computação Aplicada 2022 Proceedings

  8. Extractive Text Summarization Using Generalized Additive Models with Interactions for Sentence Selection

    Authors: Vinícius Camargo da Silva, João Paulo Papa, Kelton Augusto Pontara da Costa

    Abstract: Automatic Text Summarization (ATS) is becoming relevant with the growth of textual data; however, with the popularization of public large-scale datasets, some recent machine learning approaches have focused on dense models and architectures that, despite producing notable results, usually turn out in models difficult to interpret. Given the challenge behind interpretable learning-based text summar…

    Submitted 20 December, 2022; originally announced December 2022.

  9. arXiv:2208.10415 [pdf, other]

    cs.DB

    NLDS-QL: From natural language data science questions to queries on graphs: analysing patients conditions & treatments

    Authors: Genoveva Vargas-Solar, Karim Dao, Mirian Halfeld Ferrari Alves

    Abstract: This paper introduces NLDS-QL, a translator of data science questions expressed in natural language (NL) into data science queries on graph databases. Our translator is based on a simplified NL described by a grammar that specifies sentences combining keywords to refer to operations on graphs with the vocabulary of the graph schema. The demonstration proposed in this paper shows NLDS-QL in action…

    Submitted 22 August, 2022; originally announced August 2022.

  10. arXiv:2204.03572 [pdf, other]

    eess.IV cs.CV cs.LG

    A Pathology-Based Machine Learning Method to Assist in Epithelial Dysplasia Diagnosis

    Authors: Karoline da Rocha, José C. M. Bermudez, Elena R. C. Rivero, Márcio H. Costa

    Abstract: The Epithelial Dysplasia (ED) is a tissue alteration commonly present in lesions preceding oral cancer, being its presence one of the most important factors in the progression toward carcinoma. This study proposes a method to design a low computational cost classification system to support the detection of dysplastic epithelia, contributing to reduce the variability of pathologist assessments. We…

    Submitted 7 April, 2022; originally announced April 2022.

  11. arXiv:2203.02728 [pdf, other]

    cs.CV

    An End-to-End Approach for Seam Carving Detection using Deep Neural Networks

    Authors: Thierry P. Moreira, Marcos Cleison S. Santana, Leandro A. Passos, João Paulo Papa, Kelton Augusto P. da Costa

    Abstract: Seam carving is a computational method capable of resizing images for both reduction and expansion based on its content, instead of the image geometry. Although the technique is mostly employed to deal with redundant information, i.e., regions composed of pixels with similar intensity, it can also be used for tampering images by inserting or removing relevant objects. Therefore, detecting such a p…

    Submitted 5 March, 2022; originally announced March 2022.

  12. A Review of Deep Learning-based Approaches for Deepfake Content Detection

    Authors: Leandro A. Passos, Danilo Jodas, Kelton A. P. da Costa, Luis A. Souza Júnior, Douglas Rodrigues, Javier Del Ser, David Camacho, João Paulo Papa

    Abstract: Recent advancements in deep learning generative models have raised concerns as they can create highly convincing counterfeit images and videos. This poses a threat to people's integrity and can lead to social instability. To address this issue, there is a pressing need to develop new computational models that can efficiently detect forged content and alert users to potential image and video manipu…

    Submitted 15 February, 2024; v1 submitted 12 February, 2022; originally announced February 2022.

  13. arXiv:1605.06491 [pdf, ps, other]

    physics.soc-ph cs.GT q-bio.PE

    Evolutionary mixed games in structured populations: Cooperation and the benefits of heterogeneity

    Authors: Marco A. Amaral, Lucas Wardil, Matjaz Perc, Jafferson K. L. da Silva

    Abstract: Evolutionary games on networks traditionally involve the same game at each interaction. Here we depart from this assumption by considering mixed games, where the game played at each interaction is drawn uniformly at random from a set of two different games. While in well-mixed populations the random mixture of the two games is always equivalent to the average single game, in structured populations…

    Submitted 20 May, 2016; originally announced May 2016.

    Comments: 8 two-column pages, 9 figures; accepted for publication in Physical Review E

    Journal ref: Phys. Rev. E 93 (2016) 042304

  14. Benchmarking Usability and Performance of Multicore Languages

    Authors: Sebastian Nanz, Scott West, Kaue Soares da Silveira, Bertrand Meyer

    Abstract: Developers face a wide choice of programming languages and libraries supporting multicore computing. Ever more diverse paradigms for expressing parallelism and synchronization become available while their influence on usability and performance remains largely unclear. This paper describes an experiment comparing four markedly different approaches to parallel programming: Chapel, Cilk, Go, and Thre…

    Submitted 23 October, 2014; v1 submitted 12 February, 2013; originally announced February 2013.

    Journal ref: Proceedings of the 7th ACM-IEEE International Symposium Empirical Software Engineering and Measurement (ESEM'13), pages 183-192. IEEE, 2013