Skip to main content

Showing 1–10 of 10 results for author: Mersha, M

.
  1. Explainable AI: XAI-Guided Context-Aware Data Augmentation

    Authors: Melkamu Abay Mersha, Mesay Gemeda Yigezu, Atnafu Lambebo Tonja, Hassan Shakil, Samer Iskander, Olga Kolesnikova, Jugal Kalita

    Abstract: Explainable AI (XAI) has emerged as a powerful tool for improving the performance of AI models, going beyond providing model transparency and interpretability. The scarcity of labeled data remains a fundamental challenge in developing robust and generalizable AI models, particularly for low-resource languages. Conventional data augmentation techniques introduce noise, cause semantic drift, disrupt… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  2. arXiv:2503.05050  [pdf, other

    cs.CL cs.AI cs.LG

    A Unified Framework with Novel Metrics for Evaluating the Effectiveness of XAI Techniques in LLMs

    Authors: Melkamu Abay Mersha, Mesay Gemeda Yigezu, Hassan Shakil, Ali K. AlShami, Sanghyun Byun, Jugal Kalita

    Abstract: The increasing complexity of LLMs presents significant challenges to their transparency and interpretability, necessitating the use of eXplainable AI (XAI) techniques to enhance trustworthiness and usability. This study introduces a comprehensive evaluation framework with four novel metrics for assessing the effectiveness of five XAI techniques across five LLMs and two downstream tasks. We apply t… ▽ More

    Submitted 7 April, 2025; v1 submitted 6 March, 2025; originally announced March 2025.

    Comments: arXiv admin note: substantial text overlap with arXiv:2501.15374

  3. arXiv:2501.15374  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Evaluating the Effectiveness of XAI Techniques for Encoder-Based Language Models

    Authors: Melkamu Abay Mersha, Mesay Gemeda Yigezu, Jugal Kalita

    Abstract: The black-box nature of large language models (LLMs) necessitates the development of eXplainable AI (XAI) techniques for transparency and trustworthiness. However, evaluating these techniques remains a challenge. This study presents a general evaluation framework using four key metrics: Human-reasoning Agreement (HA), Robustness, Consistency, and Contrastivity. We assess the effectiveness of six e… ▽ More

    Submitted 25 January, 2025; originally announced January 2025.

    Journal ref: 310(2025)113042

  4. SMART-Vision: Survey of Modern Action Recognition Techniques in Vision

    Authors: Ali K. AlShami, Ryan Rabinowitz, Khang Lam, Yousra Shleibik, Melkamu Mersha, Terrance Boult, Jugal Kalita

    Abstract: Human Action Recognition (HAR) is a challenging domain in computer vision, involving recognizing complex patterns by analyzing the spatiotemporal dynamics of individuals' movements in videos. These patterns arise in sequential data, such as video frames, which are often essential to accurately distinguish actions that would be ambiguous in a single image. HAR has garnered considerable interest due… ▽ More

    Submitted 22 January, 2025; originally announced January 2025.

    Journal ref: Multimedia Tools and Applications, Springer, 2024, pp. 1-72

  5. arXiv:2412.18036  [pdf, other

    cs.CL cs.AI

    Explainability in Neural Networks for Natural Language Processing Tasks

    Authors: Melkamu Mersha, Mingiziem Bitewa, Tsion Abay, Jugal Kalita

    Abstract: Neural networks are widely regarded as black-box models, creating significant challenges in understanding their inner workings, especially in natural language processing (NLP) applications. To address this opacity, model explanation techniques like Local Interpretable Model-Agnostic Explanations (LIME) have emerged as essential tools for providing insights into the behavior of these complex system… ▽ More

    Submitted 8 January, 2025; v1 submitted 23 December, 2024; originally announced December 2024.

  6. arXiv:2412.17203  [pdf, other

    cs.AR

    Agile TLB Prefetching and Prediction Replacement Policy

    Authors: Melkamu Mersha, Tsion Abay, Mingziem Bitewa, Gedare Bloom

    Abstract: Virtual-to-physical address translation is a critical performance bottleneck in paging-based virtual memory systems. The Translation Lookaside Buffer (TLB) accelerates address translation by caching frequently accessed mappings, but TLB misses lead to costly page walks. Hardware and software techniques address this challenge. Hardware approaches enhance TLB reach through system-level support, whil… ▽ More

    Submitted 22 December, 2024; originally announced December 2024.

  7. arXiv:2410.02609  [pdf, other

    cs.CL

    Ethio-Fake: Cutting-Edge Approaches to Combat Fake News in Under-Resourced Languages Using Explainable AI

    Authors: Mesay Gemeda Yigezu, Melkamu Abay Mersha, Girma Yohannis Bade, Jugal Kalita, Olga Kolesnikova, Alexander Gelbukh

    Abstract: The proliferation of fake news has emerged as a significant threat to the integrity of information dissemination, particularly on social media platforms. Misinformation can spread quickly due to the ease of creating and disseminating content, affecting public opinion and sociopolitical events. Identifying false information is therefore essential to reducing its negative consequences and maintainin… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

    Journal ref: ACLing 2024: 6th International Conference on AI in Computational Linguistics

  8. arXiv:2410.00134  [pdf, other

    cs.CL cs.AI

    Semantic-Driven Topic Modeling Using Transformer-Based Embeddings and Clustering Algorithms

    Authors: Melkamu Abay Mersha, Mesay Gemeda yigezu, Jugal Kalita

    Abstract: Topic modeling is a powerful technique to discover hidden topics and patterns within a collection of documents without prior knowledge. Traditional topic modeling and clustering-based techniques encounter challenges in capturing contextual semantic information. This study introduces an innovative end-to-end semantic-driven topic modeling technique for the topic extraction process, utilizing advanc… ▽ More

    Submitted 30 September, 2024; originally announced October 2024.

    Journal ref: ACLing2024 6th International Conference on AI in Computational Linguistics

  9. arXiv:2409.00265  [pdf, other

    cs.AI cs.CL cs.CY cs.LG

    Explainable Artificial Intelligence: A Survey of Needs, Techniques, Applications, and Future Direction

    Authors: Melkamu Mersha, Khang Lam, Joseph Wood, Ali AlShami, Jugal Kalita

    Abstract: Artificial intelligence models encounter significant challenges due to their black-box nature, particularly in safety-critical domains such as healthcare, finance, and autonomous vehicles. Explainable Artificial Intelligence (XAI) addresses these challenges by providing explanations for how these models make decisions and predictions, ensuring transparency, accountability, and fairness. Existing s… ▽ More

    Submitted 12 January, 2025; v1 submitted 30 August, 2024; originally announced September 2024.

    Journal ref: 599(2024)128111

  10. arXiv:2312.04764  [pdf, other

    cs.CL

    First Attempt at Building Parallel Corpora for Machine Translation of Northeast India's Very Low-Resource Languages

    Authors: Atnafu Lambebo Tonja, Melkamu Mersha, Ananya Kalita, Olga Kolesnikova, Jugal Kalita

    Abstract: This paper presents the creation of initial bilingual corpora for thirteen very low-resource languages of India, all from Northeast India. It also presents the results of initial translation efforts in these languages. It creates the first-ever parallel corpora for these languages and provides initial benchmark neural machine translation results for these languages. We intend to extend these corpo… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: Accepted to ICON 2023