Skip to main content

Showing 1–50 of 55 results for author: Cruz, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.01789  [pdf, ps, other

    cs.LG cs.AI cs.CL cs.CV eess.AS

    Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability

    Authors: Genta Indra Winata, David Anugraha, Emmy Liu, Alham Fikri Aji, Shou-Yi Hung, Aditya Parashar, Patrick Amadeus Irawan, Ruochen Zhang, Zheng-Xin Yong, Jan Christian Blaise Cruz, Niklas Muennighoff, Seungone Kim, Hanyang Zhao, Sudipta Kar, Kezia Erina Suryoraharjo, M. Farid Adilazuarda, En-Shiun Annie Lee, Ayu Purwarianti, Derry Tanti Wijaya, Monojit Choudhury

    Abstract: High-quality datasets are fundamental to training and evaluating machine learning models, yet their creation-especially with accurate human annotations-remains a significant challenge. Many dataset paper submissions lack originality, diversity, or rigorous quality control, and these shortcomings are often overlooked during peer review. Submissions also frequently omit essential details about datas… ▽ More

    Submitted 3 June, 2025; v1 submitted 2 June, 2025; originally announced June 2025.

    Comments: Preprint

  2. arXiv:2505.24456  [pdf, ps, other

    cs.CL

    CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation

    Authors: Emilio Villa-Cueva, Sholpan Bolatzhanova, Diana Turmakhan, Kareem Elzeky, Henok Biadglign Ademtew, Alham Fikri Aji, Israel Abebe Azime, Jinheon Baek, Frederico Belcavello, Fermin Cristobal, Jan Christian Blaise Cruz, Mary Dabre, Raj Dabre, Toqeer Ehsan, Naome A Etori, Fauzan Farooqui, Jiahui Geng, Guido Ivetta, Thanmay Jayakumar, Soyeong Jeong, Zheng Wei Lim, Aishik Mandal, Sofia Martinelli, Mihail Minkov Mihaylov, Daniil Orel , et al. (9 additional authors not shown)

    Abstract: Cultural content poses challenges for machine translation systems due to the differences in conceptualizations between cultures, where language alone may fail to convey sufficient context to capture region-specific meanings. In this work, we investigate whether images can act as cultural context in multimodal translation. We introduce CaMMT, a human-curated benchmark of over 5,800 triples of image… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

  3. arXiv:2503.07920  [pdf, other

    cs.CV cs.AI cs.CL

    Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

    Authors: Samuel Cahyawijaya, Holy Lovenia, Joel Ruben Antony Moniz, Tack Hwa Wong, Mohammad Rifqi Farhansyah, Thant Thiri Maung, Frederikus Hudi, David Anugraha, Muhammad Ravi Shulthan Habibi, Muhammad Reza Qorib, Amit Agarwal, Joseph Marvin Imperial, Hitesh Laxmichand Patel, Vicky Feliren, Bahrul Ilmi Nasution, Manuel Antonio Rufino, Genta Indra Winata, Rian Adam Rajagede, Carlos Rafael Catalan, Mohamed Fazli Imam, Priyaranjan Pattnayak, Salsabila Zahirah Pranida, Kevin Pratama, Yeshil Bangera, Adisai Na-Thalang , et al. (67 additional authors not shown)

    Abstract: Southeast Asia (SEA) is a region of extraordinary linguistic and cultural diversity, yet it remains significantly underrepresented in vision-language (VL) research. This often results in artificial intelligence (AI) models that fail to capture SEA cultural nuances. To fill this gap, we present SEA-VL, an open-source initiative dedicated to developing high-quality, culturally relevant data for SEA… ▽ More

    Submitted 18 March, 2025; v1 submitted 10 March, 2025; originally announced March 2025.

    Comments: [SEA-VL Dataset] https://huggingface.co/collections/SEACrowd/sea-vl-multicultural-vl-dataset-for-southeast-asia-67cf223d0c341d4ba2b236e7 [Appendix J] https://github.com/SEACrowd/seacrowd.github.io/blob/master/docs/SEA_VL_Appendix_J.pdf

  4. arXiv:2502.15701  [pdf

    cs.IR cs.AI cs.CL

    Political Events using RAG with LLMs

    Authors: Muhammad Arslan, Saba Munawar, Christophe Cruz

    Abstract: In the contemporary digital landscape, media content stands as the foundation for political news analysis, offering invaluable insights sourced from various channels like news articles, social media updates, speeches, and reports. Natural Language Processing (NLP) has revolutionized Political Information Extraction (IE), automating tasks such as Event Extraction (EE) from these diverse media outle… ▽ More

    Submitted 6 January, 2025; originally announced February 2025.

  5. arXiv:2502.15700  [pdf

    cs.IR cs.AI cs.CL

    Sustainable Digitalization of Business with Multi-Agent RAG and LLM

    Authors: Muhammad Arslan, Saba Munawar, Christophe Cruz

    Abstract: Businesses heavily rely on data sourced from various channels like news articles, financial reports, and consumer reviews to drive their operations, enabling informed decision-making and identifying opportunities. However, traditional manual methods for data extraction are often time-consuming and resource-intensive, prompting the adoption of digital transformation initiatives to enhance efficienc… ▽ More

    Submitted 6 January, 2025; originally announced February 2025.

  6. arXiv:2502.11269  [pdf, other

    cs.AI cs.LG cs.SC

    Unlocking the Potential of Generative AI through Neuro-Symbolic Architectures: Benefits and Limitations

    Authors: Oualid Bougzime, Samir Jabbar, Christophe Cruz, Frédéric Demoly

    Abstract: Neuro-symbolic artificial intelligence (NSAI) represents a transformative approach in artificial intelligence (AI) by combining deep learning's ability to handle large-scale and unstructured data with the structured reasoning of symbolic methods. By leveraging their complementary strengths, NSAI enhances generalization, reasoning, and scalability while addressing key challenges such as transparenc… ▽ More

    Submitted 16 February, 2025; originally announced February 2025.

    Comments: 54 pages, 7 figures

  7. arXiv:2502.05239  [pdf, other

    cs.CL cs.AI

    Enhancing Knowledge Graph Construction: Evaluating with Emphasis on Hallucination, Omission, and Graph Similarity Metrics

    Authors: Hussam Ghanem, Christophe Cruz

    Abstract: Recent advancements in large language models have demonstrated significant potential in the automated construction of knowledge graphs from unstructured text. This paper builds upon our previous work [16], which evaluated various models using metrics like precision, recall, F1 score, triple matching, and graph matching, and introduces a refined approach to address the critical issues of hallucinat… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

    Journal ref: Sixth International Knowledge Graph and Semantic Web Conference (KGSWC 2024), Dec 2024, Paris, France

  8. arXiv:2501.12660  [pdf, other

    cs.CL

    Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation

    Authors: Jan Christian Blaise Cruz, Alham Fikri Aji

    Abstract: In this paper, we propose the use of simple knowledge distillation to produce smaller and more efficient single-language transformers from Massively Multilingual Transformers (MMTs) to alleviate tradeoffs associated with the use of such in low-resource settings. Using Tagalog as a case study, we show that these smaller single-language models perform on-par with strong baselines in a variety of ben… ▽ More

    Submitted 22 January, 2025; originally announced January 2025.

    Comments: LoResLM Workshop @ COLING 2025

  9. arXiv:2410.21573  [pdf, other

    cs.CL cs.AI

    Thank You, Stingray: Multilingual Large Language Models Can Not (Yet) Disambiguate Cross-Lingual Word Sense

    Authors: Samuel Cahyawijaya, Ruochen Zhang, Holy Lovenia, Jan Christian Blaise Cruz, Elisa Gilbert, Hiroki Nomoto, Alham Fikri Aji

    Abstract: Multilingual large language models (LLMs) have gained prominence, but concerns arise regarding their reliability beyond English. This study addresses the gap in cross-lingual semantic evaluation by introducing a novel benchmark for cross-lingual sense disambiguation, StingrayBench. In this paper, we demonstrate using false friends -- words that are orthographically similar but have completely diff… ▽ More

    Submitted 30 October, 2024; v1 submitted 28 October, 2024; originally announced October 2024.

  10. arXiv:2410.16331  [pdf, other

    quant-ph cs.ET cs.LG

    Exploring Quantum Neural Networks for Demand Forecasting

    Authors: Gleydson Fernandes de Jesus, Maria Heloísa Fraga da Silva, Otto Menegasso Pires, Lucas Cruz da Silva, Clebson dos Santos Cruz, Valéria Loureiro da Silva

    Abstract: Forecasting demand for assets and services can be addressed in various markets, providing a competitive advantage when the predictive models used demonstrate high accuracy. However, the training of machine learning models incurs high computational costs, which may limit the training of prediction models based on available computational capacity. In this context, this paper presents an approach for… ▽ More

    Submitted 19 October, 2024; originally announced October 2024.

    Comments: 22 pages, 13 figures, 10 tables

  11. arXiv:2410.12705  [pdf, other

    cs.CL cs.AI cs.CV

    WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines

    Authors: Genta Indra Winata, Frederikus Hudi, Patrick Amadeus Irawan, David Anugraha, Rifki Afina Putri, Yutong Wang, Adam Nohejl, Ubaidillah Ariq Prathama, Nedjma Ousidhoum, Afifa Amriani, Anar Rzayev, Anirban Das, Ashmari Pramodya, Aulia Adila, Bryan Wilie, Candy Olivia Mawalim, Ching Lam Cheng, Daud Abolade, Emmanuele Chersoni, Enrico Santus, Fariz Ikhwantri, Garry Kuwanto, Hanyang Zhao, Haryo Akbarianto Wibowo, Holy Lovenia , et al. (26 additional authors not shown)

    Abstract: Vision Language Models (VLMs) often struggle with culture-specific knowledge, particularly in languages other than English and in underrepresented cultural contexts. To evaluate their understanding of such knowledge, we introduce WorldCuisines, a massive-scale benchmark for multilingual and multicultural, visually grounded language understanding. This benchmark includes a visual question answering… ▽ More

    Submitted 8 May, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: Best Theme Paper at NAACL 2025

  12. arXiv:2410.00349  [pdf, other

    cs.RO

    Data Augmentation for 3DMM-based Arousal-Valence Prediction for HRI

    Authors: Christian Arzate Cruz, Yotam Sechayk, Takeo Igarashi, Randy Gomez

    Abstract: Humans use multiple communication channels to interact with each other. For instance, body gestures or facial expressions are commonly used to convey an intent. The use of such non-verbal cues has motivated the development of prediction models. One such approach is predicting arousal and valence (AV) from facial expressions. However, making these models accurate for human-robot interaction (HRI) s… ▽ More

    Submitted 30 September, 2024; originally announced October 2024.

  13. arXiv:2406.10118  [pdf, other

    cs.CL

    SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

    Authors: Holy Lovenia, Rahmad Mahendra, Salsabil Maulana Akbar, Lester James V. Miranda, Jennifer Santoso, Elyanah Aco, Akhdan Fadhilah, Jonibek Mansurov, Joseph Marvin Imperial, Onno P. Kampman, Joel Ruben Antony Moniz, Muhammad Ravi Shulthan Habibi, Frederikus Hudi, Railey Montalan, Ryan Ignatius, Joanito Agili Lopo, William Nixon, Börje F. Karlsson, James Jaya, Ryandito Diandaru, Yuze Gao, Patrick Amadeus, Bin Wang, Jan Christian Blaise Cruz, Chenxi Whitehouse , et al. (36 additional authors not shown)

    Abstract: Southeast Asia (SEA) is a region rich in linguistic diversity and cultural variety, with over 1,300 indigenous languages and a population of 671 million people. However, prevailing AI models suffer from a significant lack of representation of texts, images, and audio datasets from SEA, compromising the quality of AI models for SEA languages. Evaluating models for SEA languages is challenging due t… ▽ More

    Submitted 10 March, 2025; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: https://seacrowd.github.io/ Published in EMNLP 2024

  14. arXiv:2406.05967  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

    Authors: David Romero, Chenyang Lyu, Haryo Akbarianto Wibowo, Teresa Lynn, Injy Hamed, Aditya Nanda Kishore, Aishik Mandal, Alina Dragonetti, Artem Abzaliev, Atnafu Lambebo Tonja, Bontu Fufa Balcha, Chenxi Whitehouse, Christian Salamea, Dan John Velasco, David Ifeoluwa Adelani, David Le Meur, Emilio Villa-Cueva, Fajri Koto, Fauzan Farooqui, Frederico Belcavello, Ganzorig Batnasan, Gisela Vallejo, Grainne Caulfield, Guido Ivetta, Haiyue Song , et al. (51 additional authors not shown)

    Abstract: Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data. However, most of the current VQA models use datasets that are primarily focused on English and a few major world languages, with images that are typically Western-centric. While recen… ▽ More

    Submitted 4 November, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

    Comments: 38th Conference on Neural Information Processing Systems (NeurIPS 2024) Track on Datasets and Benchmarks

  15. arXiv:2404.02565  [pdf, other

    cs.HC

    Spatial Summation of Localized Pressure for Haptic Sensory Prostheses

    Authors: Sreela Kodali, Cihualpilli Camino Cruz, Thomas C. Bulea, Kevin S. Rao Diana Bharucha-Goebel, Alexander T. Chesler, Carsten G. Bonnemann, Allison M. Okamura

    Abstract: A host of medical conditions, including amputations, diabetes, stroke, and genetic disease, result in loss of touch sensation. Because most types of sensory loss have no pharmacological treatment or rehabilitative therapy, we propose a haptic sensory prosthesis that provides substitutive feedback. The wrist and forearm are compelling locations for feedback due to available skin area and not occlud… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 2 pages, 2 figures, 2024 IEEE Haptics Symposium Work-in-Progress Paper

  16. arXiv:2403.07769  [pdf

    cs.AI cs.CL cs.CY cs.MA

    Transforming Competition into Collaboration: The Revolutionary Role of Multi-Agent Systems and Language Models in Modern Organizations

    Authors: Carlos Jose Xavier Cruz

    Abstract: This article explores the dynamic influence of computational entities based on multi-agent systems theory (SMA) combined with large language models (LLM), which are characterized by their ability to simulate complex human interactions, as a possibility to revolutionize human user interaction from the use of specialized artificial agents to support everything from operational organizational process… ▽ More

    Submitted 15 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  17. arXiv:2401.06161  [pdf

    cs.CY cs.AI

    Trustworthy human-centric based Automated Decision-Making Systems

    Authors: Marcelino Cabrera, Carlos Cruz, Pavel Novoa-Hernández, David A. Pelta, José Luis Verdegay

    Abstract: Automated Decision-Making Systems (ADS) have become pervasive across various fields, activities, and occupations, to enhance performance. However, this widespread adoption introduces potential risks, including the misuse of ADS. Such misuse may manifest when ADS is employed in situations where it is unnecessary or when essential requirements, conditions, and terms are overlooked, leading to uninte… ▽ More

    Submitted 22 December, 2023; originally announced January 2024.

    Comments: 16 pages, 1 Table

  18. arXiv:2310.16322  [pdf, other

    cs.CL

    Samsung R&D Institute Philippines at WMT 2023

    Authors: Jan Christian Blaise Cruz

    Abstract: In this paper, we describe the constrained MT systems submitted by Samsung R&D Institute Philippines to the WMT 2023 General Translation Task for two directions: en$\rightarrow$he and he$\rightarrow$en. Our systems comprise of Transformer-based sequence-to-sequence models that are trained with a mix of best practices: comprehensive data preprocessing pipelines, synthetic backtranslated data, and t… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: To appear in Proceedings of the Eighth Conference on Machine Translation 2023 (WMT)

  19. arXiv:2308.05609  [pdf, ps, other

    cs.CL cs.IR cs.PF

    LASIGE and UNICAGE solution to the NASA LitCoin NLP Competition

    Authors: Pedro Ruas, Diana F. Sousa, André Neves, Carlos Cruz, Francisco M. Couto

    Abstract: Biomedical Natural Language Processing (NLP) tends to become cumbersome for most researchers, frequently due to the amount and heterogeneity of text to be processed. To address this challenge, the industry is continuously developing highly efficient tools and creating more flexible engineering solutions. This work presents the integration between industry data engineering solutions for efficient d… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

  20. arXiv:2307.10296  [pdf, other

    eess.IV cs.CV cs.LG

    Towards Automated Semantic Segmentation in Mammography Images

    Authors: Cesar A. Sierra-Franco, Jan Hurtado, Victor de A. Thomaz, Leonardo C. da Cruz, Santiago V. Silva, Alberto B. Raposo

    Abstract: Mammography images are widely used to detect non-palpable breast lesions or nodules, preventing cancer and providing the opportunity to plan interventions when necessary. The identification of some structures of interest is essential to make a diagnosis and evaluate image adequacy. Thus, computer-aided detection systems can be helpful in assisting medical interpretation by automatically segmenting… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: 6 pages

  21. arXiv:2307.01548  [pdf, other

    cs.AI

    Knowledge Graph for NLG in the context of conversational agents

    Authors: Hussam Ghanem, Massinissa Atmani, Christophe Cruz

    Abstract: The use of knowledge graphs (KGs) enhances the accuracy and comprehensiveness of the responses provided by a conversational agent. While generating answers during conversations consists in generating text from these KGs, it is still regarded as a challenging task that has gained significant attention in recent years. In this document, we provide a review of different architectures used for knowled… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

    Journal ref: French Regional Conference on Complex Systems (FRCCS 2023), May 2023, Le Havre, France

  22. Pseudo-Labeling Enhanced by Privileged Information and Its Application to In Situ Sequencing Images

    Authors: Marzieh Haghighi, Mario C. Cruz, Erin Weisbart, Beth A. Cimini, Avtar Singh, Julia Bauman, Maria E. Lozada, Sanam L. Kavari, James T. Neal, Paul C. Blainey, Anne E. Carpenter, Shantanu Singh

    Abstract: Various strategies for label-scarce object detection have been explored by the computer vision research community. These strategies mainly rely on assumptions that are specific to natural images and not directly applicable to the biological and biomedical vision domains. For example, most semi-supervised learning strategies rely on a small set of labeled data as a confident source of ground truth.… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: This paper has been accepted for publication at IJCAI 2023

    Journal ref: Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence (IJCAI), Main Track, Pages 4775-4784, 2023

  23. arXiv:2306.10034  [pdf

    cs.IR cs.LG

    Unlocking Insights into Business Trajectories with Transformer-based Spatio-temporal Data Analysis

    Authors: Muhammad Arslan, Christophe Cruz

    Abstract: The world of business is constantly evolving and staying ahead of the curve requires a deep understanding of market trends and performance. This article addresses this requirement by modeling business trajectories using news articles data.

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: Presented in the conference Spatial Analysis and GEOmatics 2023 SAGEO

  24. arXiv:2306.07046  [pdf

    cs.IR cs.LG

    Imbalanced Multi-label Classification for Business-related Text with Moderately Large Label Spaces

    Authors: Muhammad Arslan, Christophe Cruz

    Abstract: In this study, we compared the performance of four different methods for multi label text classification using a specific imbalanced business dataset. The four methods we evaluated were fine tuned BERT, Binary Relevance, Classifier Chains, and Label Powerset. The results show that fine tuned BERT outperforms the other three methods by a significant margin, achieving high values of accuracy, F1 Sco… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Journal ref: https://easychair.org/smart-program/FRCCS2023/2023-06-01.html

  25. arXiv:2305.14235  [pdf, other

    cs.CL cs.AI

    Multilingual Large Language Models Are Not (Yet) Code-Switchers

    Authors: Ruochen Zhang, Samuel Cahyawijaya, Jan Christian Blaise Cruz, Genta Indra Winata, Alham Fikri Aji

    Abstract: Multilingual Large Language Models (LLMs) have recently shown great capabilities in a wide range of tasks, exhibiting state-of-the-art performance through zero-shot or few-shot prompting methods. While there have been extensive studies on their abilities in monolingual tasks, the investigation of their potential in the context of code-switching (CSW), the practice of alternating languages within a… ▽ More

    Submitted 23 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted at EMNLP 2023

  26. arXiv:2303.13592  [pdf, other

    cs.CL cs.AI

    Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages

    Authors: Zheng-Xin Yong, Ruochen Zhang, Jessica Zosa Forde, Skyler Wang, Arjun Subramonian, Holy Lovenia, Samuel Cahyawijaya, Genta Indra Winata, Lintang Sutawika, Jan Christian Blaise Cruz, Yin Lin Tan, Long Phan, Rowena Garcia, Thamar Solorio, Alham Fikri Aji

    Abstract: While code-mixing is a common linguistic practice in many parts of the world, collecting high-quality and low-cost code-mixed data remains a challenge for natural language processing (NLP) research. The recent proliferation of Large Language Models (LLMs) compels one to ask: how capable are these systems in generating code-mixed data? In this paper, we explore prompting multilingual LLMs in a zero… ▽ More

    Submitted 12 September, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Updating Authors

  27. arXiv:2301.05122  [pdf, other

    quant-ph cs.CC cs.DS

    Quantum algorithm for finding minimum values in a Quantum Random Access Memory

    Authors: Anton S. Albino, Lucas Q. Galvão, Ethan Hansen, Mauro Q. Nooblath Neto, Clebson Cruz

    Abstract: Finding the minimum value in an unordered database is a common and fundamental task in computer science. However, the optimal classical deterministic algorithm can find the minimum value with a time complexity that grows linearly with the number of elements in the database. In this paper, we present the proposal of a quantum algorithm for finding the minimum value of a database, which is quadratic… ▽ More

    Submitted 12 January, 2023; originally announced January 2023.

  28. arXiv:2212.13656  [pdf, other

    cs.DC

    Smart meter data processing: a showcase for simple and efficient textual processing

    Authors: Miguel Ferreira, André Neves, Rodrigo Gorjão, Carlos Cruz, Miguel L. Pardal

    Abstract: The increase in the production and collection of data from devices is an ongoing trend due to the roll-out of more cyber-physical applications. Smart meters, because of their importance in power grids, are a class of such devices whose produced data requires meticulous processing. In this paper, we use Unicage, a data processing system based on classic Unix shell scripting, that delivers excellent… ▽ More

    Submitted 27 December, 2022; originally announced December 2022.

    Comments: 11 pages, 5 figures, 1 table, 9 listings. Accepted after review for the 1st Workshop on High-Performance and Reliable Big Data (HPBD 2021), which was held virtually on September 20th 2021, and was co-located with the 40th International Symposium on Reliable Distributed Systems (SRDS 2021)

  29. arXiv:2205.00952  [pdf, other

    cs.CV

    Leaf Tar Spot Detection Using RGB Images

    Authors: Sriram Baireddy, Da-Young Lee, Carlos Gongora-Canul, Christian D. Cruz, Edward J. Delp

    Abstract: Tar spot disease is a fungal disease that appears as a series of black circular spots containing spores on corn leaves. Tar spot has proven to be an impactful disease in terms of reducing crop yield. To quantify disease progression, experts usually have to visually phenotype leaves from the plant. This process is very time-consuming and is difficult to incorporate in any high-throughput phenotypin… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

  30. arXiv:2204.03251  [pdf, other

    cs.CL

    Towards Automatic Construction of Filipino WordNet: Word Sense Induction and Synset Induction Using Sentence Embeddings

    Authors: Dan John Velasco, Axel Alba, Trisha Gail Pelagio, Bryce Anthony Ramirez, Unisse Chua, Briane Paul Samson, Jan Christian Blaise Cruz, Charibeth Cheng

    Abstract: Wordnets are indispensable tools for various natural language processing applications. Unfortunately, wordnets get outdated, and producing or updating wordnets can be slow and costly in terms of time and resources. This problem intensifies for low-resource languages. This study proposes a method for word sense induction and synset induction using only two linguistic resources, namely, an unlabeled… ▽ More

    Submitted 19 October, 2023; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: To appear in SEALP 2023. Formerly titled "Automatic WordNet Construction using Word Sense Induction through Sentence Embeddings"

  31. arXiv:2204.02653  [pdf, ps, other

    cs.CL

    Using Synthetic Data for Conversational Response Generation in Low-resource Settings

    Authors: Gabriel Louis Tan, Adrian Paule Ty, Schuyler Ng, Denzel Adrian Co, Jan Christian Blaise Cruz, Charibeth Cheng

    Abstract: Response generation is a task in natural language processing (NLP) where a model is trained to respond to human statements. Conversational response generators take this one step further with the ability to respond within the context of previous responses. While there are existing techniques for training such models, they all require an abundance of conversational data which are not always availabl… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

  32. arXiv:2111.10513  [pdf, other

    cs.CL

    Data Processing Matters: SRPH-Konvergen AI's Machine Translation System for WMT'21

    Authors: Lintang Sutawika, Jan Christian Blaise Cruz

    Abstract: In this paper, we describe the submission of the joint Samsung Research Philippines-Konvergen AI team for the WMT'21 Large Scale Multilingual Translation Task - Small Track 2. We submit a standard Seq2Seq Transformer model to the shared task without any training or architecture tricks, relying mainly on the strength of our data preprocessing techniques to boost performance. Our final submission mo… ▽ More

    Submitted 19 November, 2021; originally announced November 2021.

    Comments: In Proceedings of the Sixth Conference on Machine Translation (WMT)

  33. arXiv:2111.06053  [pdf, other

    cs.CL

    Improving Large-scale Language Models and Resources for Filipino

    Authors: Jan Christian Blaise Cruz, Charibeth Cheng

    Abstract: In this paper, we improve on existing language resources for the low-resource Filipino language in two ways. First, we outline the construction of the TLUnified dataset, a large-scale pretraining corpus that serves as an improvement over smaller existing pretraining datasets for the language in terms of scale and topic variety. Second, we pretrain new Transformer language models following the RoBE… ▽ More

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: Resources are available at blaisecruz.com/resources

  34. arXiv:2105.12949  [pdf, other

    cs.HC

    A Survey on Interactive Reinforcement Learning: Design Principles and Open Challenges

    Authors: Christian Arzate Cruz, Takeo Igarashi

    Abstract: Interactive reinforcement learning (RL) has been successfully used in various applications in different fields, which has also motivated HCI researchers to contribute in this area. In this paper, we survey interactive RL to empower human-computer interaction (HCI) researchers with the technical background in RL needed to design new interaction techniques and propose new applications. We elucidate… ▽ More

    Submitted 27 May, 2021; originally announced May 2021.

  35. arXiv:2105.12944  [pdf, other

    cs.HC

    MarioMix: Creating Aligned Playstyles for Bots with Interactive Reinforcement Learning

    Authors: Christian Arzate Cruz, Takeo Igarashi

    Abstract: In this paper, we propose a generic framework that enables game developers without knowledge of machine learning to create bot behaviors with playstyles that align with their preferences. Our framework is based on interactive reinforcement learning (RL), and we used it to create a behavior authoring tool called MarioMix. This tool enables non-experts to create bots with varied playstyles for the g… ▽ More

    Submitted 27 May, 2021; originally announced May 2021.

  36. arXiv:2105.12938  [pdf, other

    cs.HC

    Interactive Explanations: Diagnosis and Repair of Reinforcement Learning Based Agent Behaviors

    Authors: Christian Arzate Cruz, Takeo Igarashi

    Abstract: Reinforcement learning techniques successfully generate convincing agent behaviors, but it is still difficult to tailor the behavior to align with a user's specific preferences. What is missing is a communication method for the system to explain the behavior and for the user to repair it. In this paper, we present a novel interaction method that uses interactive explanations using templates of nat… ▽ More

    Submitted 27 May, 2021; originally announced May 2021.

  37. arXiv:2010.11574  [pdf, other

    cs.CL

    Exploiting News Article Structure for Automatic Corpus Generation of Entailment Datasets

    Authors: Jan Christian Blaise Cruz, Jose Kristian Resabal, James Lin, Dan John Velasco, Charibeth Cheng

    Abstract: Transformers represent the state-of-the-art in Natural Language Processing (NLP) in recent years, proving effective even in tasks done in low-resource languages. While pretrained transformers for these languages can be made, it is challenging to measure their true performance and capacity due to the lack of hard benchmark datasets, as well as the difficulty and cost of producing them. In this pape… ▽ More

    Submitted 13 August, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: To appear in PRICAI 2021. Formerly titled "Investigating the True Performance of Transformers in Low-Resource Languages: A Case Study in Automatic Corpus Creation." Code and data available at https://github.com/jcblaisecruz02/Filipino-Text-Benchmarks

  38. arXiv:2005.02068  [pdf, other

    cs.CL

    Establishing Baselines for Text Classification in Low-Resource Languages

    Authors: Jan Christian Blaise Cruz, Charibeth Cheng

    Abstract: While transformer-based finetuning techniques have proven effective in tasks that involve low-resource, low-data environments, a lack of properly established baselines and benchmark datasets make it hard to compare different approaches that are aimed at tackling the low-resource setting. In this work, we provide three contributions. First, we introduce two previously unreleased datasets as benchma… ▽ More

    Submitted 5 May, 2020; originally announced May 2020.

    Comments: We release all our models, finetuning code, and data at https://github.com/jcblaisecruz02/Filipino-Text-Benchmarks

  39. arXiv:2005.01107  [pdf, other

    cs.CL

    Simplifying Paragraph-level Question Generation via Transformer Language Models

    Authors: Luis Enrico Lopez, Diane Kathryn Cruz, Jan Christian Blaise Cruz, Charibeth Cheng

    Abstract: Question generation (QG) is a natural language generation task where a model is trained to ask questions corresponding to some input text. Most recent approaches frame QG as a sequence-to-sequence problem and rely on additional features and mechanisms to increase performance; however, these often increase model complexity, and can rely on auxiliary data unavailable in practical use. A single Trans… ▽ More

    Submitted 13 August, 2021; v1 submitted 3 May, 2020; originally announced May 2020.

    Comments: To appear in PRICAI 2021. Formerly titled "Transformer-based End-to-End Question Generation."

  40. arXiv:2003.00762  [pdf, other

    eess.IV cs.LG

    Flashlight CNN Image Denoising

    Authors: Pham Huu Thanh Binh, Cristóvão Cruz, Karen Egiazarian

    Abstract: This paper proposes a learning-based denoising method called FlashLight CNN (FLCNN) that implements a deep neural network for image denoising. The proposed approach is based on deep residual networks and inception networks and it is able to leverage many more parameters than residual networks alone for denoising grayscale images corrupted by additive white Gaussian noise (AWGN). FlashLight CNN dem… ▽ More

    Submitted 2 July, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

  41. arXiv:1911.01279  [pdf

    cs.CY eess.SP

    Automated Smart Wick System-Based Microfarm Using Internet of Things

    Authors: R. Jorda, Jr., C. Alcabasa, A. Buhay, E. C. Dela Cruz, J. P. Mendoza, A. Tolentino, L. K. Tolentino, E. Fernandez, A. Thio-ac, J. Velasco, N. Arago

    Abstract: This paper presents a study conducted to allow urban farmers to remotely monitor their farm through the design and development of an Internet of Things-based (IoT) microfarm prototype which utilized wick system as planting method. The system involves the detection of three environmental parameters namely, light intensity, soil moisture and temperature through the use of respective sensors which we… ▽ More

    Submitted 30 October, 2019; originally announced November 2019.

    Journal ref: Lecture Notes on Research and Innovation in Computer Engineering and Computer Sciences, 2019

  42. Localization of Fake News Detection via Multitask Transfer Learning

    Authors: Jan Christian Blaise Cruz, Julianne Agatha Tan, Charibeth Cheng

    Abstract: The use of the internet as a fast medium of spreading fake news reinforces the need for computational tools that combat it. Techniques that train fake news classifiers exist, but they all assume an abundance of resources including large labeled datasets and expert-curated corpora, which low-resource languages may not have. In this work, we make two main contributions: First, we alleviate resource… ▽ More

    Submitted 15 May, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: Published in the LREC 2020 Proceedings. Models and data available at https://github.com/jcblaisecruz02/Tagalog-fake-news

    Journal ref: In Proceedings of The 12th Language Resources and Evaluation Conference, pp.2589-2597 (2020)

  43. arXiv:1907.07286  [pdf, other

    math.CO cs.DM

    Vertex arboricity of cographs

    Authors: Sebastián González Hermosillo de la Maza, Pavol Hell, César Hernández Cruz, Seyyed Aliasghar Hosseini, Payam Valadkhan

    Abstract: Arboricity is a graph parameter akin to chromatic number, in that it seeks to partition the vertices into the smallest number of sparse subgraphs. Where for the chromatic number we are partitioning the vertices into independent sets, for the arboricity we want to partition the vertices into cycle-free subsets (i.e., forests). Arboricity is NP-hard in general, and our focus is on the arboricity of… ▽ More

    Submitted 16 July, 2019; originally announced July 2019.

    Comments: 14 pages, 1 figure

    MSC Class: 05C70; 05C75

  44. Evaluating Language Model Finetuning Techniques for Low-resource Languages

    Authors: Jan Christian Blaise Cruz, Charibeth Cheng

    Abstract: Unlike mainstream languages (such as English and French), low-resource languages often suffer from a lack of expert-annotated corpora and benchmark resources that make it hard to apply state-of-the-art techniques directly. In this paper, we alleviate this scarcity problem for the low-resourced Filipino language in two ways. First, we introduce a new benchmark language modeling dataset in Filipino… ▽ More

    Submitted 30 June, 2019; originally announced July 2019.

    Comments: Pretrained models and datasets available at https://github.com/jcblaisecruz02/Tagalog-BERT

  45. Nonlocality-Reinforced Convolutional Neural Networks for Image Denoising

    Authors: Cristóvão Cruz, Alessandro Foi, Vladimir Katkovnik, Karen Egiazarian

    Abstract: We introduce a paradigm for nonlocal sparsity reinforced deep convolutional neural network denoising. It is a combination of a local multiscale denoising by a convolutional neural network (CNN) based denoiser and a nonlocal denoising based on a nonlocal filter (NLF) exploiting the mutual similarities between groups of patches. CNN models are leveraged with noise levels that progressively decrease… ▽ More

    Submitted 21 June, 2018; v1 submitted 6 March, 2018; originally announced March 2018.

    Comments: Accepted for publication in IEEE SPL

  46. Single Image Super-Resolution based on Wiener Filter in Similarity Domain

    Authors: Cristóvão Cruz, Rakesh Mehta, Vladimir Katkovnik, Karen Egiazarian

    Abstract: Single image super resolution (SISR) is an ill-posed problem aiming at estimating a plausible high resolution (HR) image from a single low resolution (LR) image. Current state-of-the-art SISR methods are patch-based. They use either external data or internal self-similarity to learn a prior for a HR image. External data based methods utilize large number of patches from the training data, while se… ▽ More

    Submitted 29 November, 2017; v1 submitted 13 April, 2017; originally announced April 2017.

    Comments: Paper accepted for publication on IEEE Transactions on Image Processing

  47. arXiv:1412.0854  [pdf, other

    cs.AI

    Semantic HMC for Big Data Analysis

    Authors: Thomas Hassan, Rafael Peixoto, Christophe Cruz, Aurlie Bertaux, Nuno Silva

    Abstract: Analyzing Big Data can help corporations to im-prove their efficiency. In this work we present a new vision to derive Value from Big Data using a Semantic Hierarchical Multi-label Classification called Semantic HMC based in a non-supervised Ontology learning process. We also proposea Semantic HMC process, using scalable Machine-Learning techniques and Rule-based reasoning.

    Submitted 2 December, 2014; originally announced December 2014.

  48. arXiv:1301.5349  [pdf

    cs.CG cs.AI

    Toward the Automatic Generation of a Semantic VRML Model from Unorganized 3D Point Clouds

    Authors: Helmi Ben Hmida, Christophe Cruz, Christophe Nicolle, Frank Boochs

    Abstract: This paper presents our experience regarding the creation of 3D semantic facility model out of unorganized 3D point clouds. Thus, a knowledge-based detection approach of objects using the OWL ontology language is presented. This knowledge is used to define SWRL detection rules. In addition, the combination of 3D processing built-ins and topological Built-Ins in SWRL rules aims at combining geometr… ▽ More

    Submitted 21 January, 2013; originally announced January 2013.

    Comments: arXiv admin note: substantial text overlap with arXiv:1301.4991, arXiv:1301.4783

    Journal ref: The Fifth International Conference on Advances in Semantic Processing, Lisbon : Portugal (2011)

  49. arXiv:1301.4992  [pdf

    cs.AI

    From 9-IM Topological Operators to Qualitative Spatial Relations using 3D Selective Nef Complexes and Logic Rules for bodies

    Authors: Helmi Ben Hmida, Christophe Cruz, Frank Boochs, Christophe Nicolle

    Abstract: This paper presents a method to compute automatically topological relations using SWRL rules. The calculation of these rules is based on the definition of a Selective Nef Complexes Nef Polyhedra structure generated from standard Polyhedron. The Selective Nef Complexes is a data model providing a set of binary Boolean operators such as Union, Difference, Intersection and Symmetric difference, and u… ▽ More

    Submitted 21 January, 2013; originally announced January 2013.

    Comments: arXiv admin note: substantial text overlap with arXiv:1301.4780

    Journal ref: International Conference on Knowledge Engineering and Ontology Development, Barcelone : Spain (2012)

  50. arXiv:1301.4991  [pdf

    cs.AI

    Knowledge Base Approach for 3D Objects Detection in Point Clouds Using 3D Processing and Specialists Knowledge

    Authors: Helmi Ben Hmida, Christophe Cruz, Frank Boochs, Christophe Nicolle

    Abstract: This paper presents a knowledge-based detection of objects approach using the OWL ontology language, the Semantic Web Rule Language, and 3D processing built-ins aiming at combining geometrical analysis of 3D point clouds and specialist's knowledge. Here, we share our experience regarding the creation of 3D semantic facility model out of unorganized 3D point clouds. Thus, a knowledge-based detectio… ▽ More

    Submitted 21 January, 2013; originally announced January 2013.

    Comments: ISSN: 1942-2679. arXiv admin note: text overlap with arXiv:1301.4783

    Journal ref: International Journal On Advances in Intelligent Systems 5, 1 et 2 (2012) 1-14