Skip to main content

Showing 1–50 of 65 results for author: Hung, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.00409  [pdf

    eess.AS cs.AI cs.LG

    Perceptual Implications of Automatic Anonymization in Pathological Speech

    Authors: Soroosh Tayebi Arasteh, Saba Afza, Tri-Thien Nguyen, Lukas Buess, Maryam Parvin, Tomas Arias-Vergara, Paula Andrea Perez-Toro, Hiu Ching Hung, Mahshad Lotfinia, Thomas Gorges, Elmar Noeth, Maria Schuster, Seung Hee Yang, Andreas Maier

    Abstract: Automatic anonymization techniques are essential for ethical sharing of pathological speech data, yet their perceptual consequences remain understudied. This study presents the first comprehensive human-centered analysis of anonymized pathological speech, using a structured perceptual protocol involving ten native and non-native German listeners with diverse linguistic, clinical, and technical bac… ▽ More

    Submitted 1 May, 2025; originally announced May 2025.

  2. arXiv:2504.19854  [pdf, other

    cs.RO cs.AI cs.CV

    NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks

    Authors: Chia-Yu Hung, Qi Sun, Pengfei Hong, Amir Zadeh, Chuan Li, U-Xuan Tan, Navonil Majumder, Soujanya Poria

    Abstract: Existing Visual-Language-Action (VLA) models have shown promising performance in zero-shot scenarios, demonstrating impressive task execution and reasoning capabilities. However, a significant challenge arises from the limitations of visual encoding, which can result in failures during tasks such as object grasping. Moreover, these models typically suffer from high computational overhead due to th… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

  3. arXiv:2504.17146  [pdf, other

    cs.CY cs.SI

    Utilizing Dynamic Time Warping for Pandemic Surveillance: Understanding the Relationship between Google Trends Network Metrics and COVID-19 Incidences

    Authors: Michael T. Lopez II, Cheska Elise Hung, Maria Regina Justina E. Estuar

    Abstract: The premise of network statistics derived from Google Trends data to foresee COVID-19 disease progression is gaining momentum in infodemiology. This approach was applied in Metro Manila, National Capital Region, Philippines. Through dynamic time warping (DTW), the temporal alignment was quantified between network metrics and COVID-19 case trajectories, and systematically explored 320 parameter con… ▽ More

    Submitted 9 May, 2025; v1 submitted 23 April, 2025; originally announced April 2025.

    Comments: Pre-print conference submission to IEEE AMLDS 2025 (see website here: https://amlds.site/index.html). This full paper has been accepted for presentation and publication. It has 8 pages, 2 tables, and 2 figures

    ACM Class: J.3; I.5.3

  4. arXiv:2504.05317  [pdf, other

    cs.IR cs.AI cs.CL cs.LG

    On Synthesizing Data for Context Attribution in Question Answering

    Authors: Gorjan Radevski, Kiril Gashteovski, Shahbaz Syed, Christopher Malon, Sebastien Nicolas, Chia-Chien Hung, Timo Sztyler, Verena Heußer, Wiem Ben Rim, Masafumi Enomoto, Kunihiro Takeoka, Masafumi Oyamada, Goran Glavaš, Carolin Lawrence

    Abstract: Question Answering (QA) accounts for a significant portion of LLM usage "in the wild". However, LLMs sometimes produce false or misleading responses, also known as "hallucinations". Therefore, grounding the generated answers in contextually provided information -- i.e., providing evidence for the generated text -- is paramount for LLMs' trustworthiness. Providing this information is the task of co… ▽ More

    Submitted 21 February, 2025; originally announced April 2025.

  5. arXiv:2503.21162  [pdf, other

    cs.CY cs.IR

    Network Density Analysis of Health Seeking Behavior in Metro Manila: A Retrospective Analysis on COVID-19 Google Trends Data

    Authors: Michael T. Lopez II, Cheska Elise Hung, Maria Regina Justina E. Estuar

    Abstract: This study examined the temporal aspect of COVID-19-related health-seeking behavior in Metro Manila, National Capital Region, Philippines through a network density analysis of Google Trends data. A total of 15 keywords across five categories (English symptoms, Filipino symptoms, face wearing, quarantine, and new normal) were examined using both 15-day and 30-day rolling windows from March 2020 to… ▽ More

    Submitted 28 March, 2025; v1 submitted 27 March, 2025; originally announced March 2025.

    Comments: Pre-print conference submission to ICMHI 2025 (see website here: https://www.icmhi.org/index.html), which it has been accepted. This has 12 pages, and 2 figures

    ACM Class: I.6.3; J.3

  6. arXiv:2502.19175  [pdf, other

    cs.CL cs.AI

    MEDDxAgent: A Unified Modular Agent Framework for Explainable Automatic Differential Diagnosis

    Authors: Daniel Rose, Chia-Chien Hung, Marco Lepri, Israa Alqassem, Kiril Gashteovski, Carolin Lawrence

    Abstract: Differential Diagnosis (DDx) is a fundamental yet complex aspect of clinical decision-making, in which physicians iteratively refine a ranked list of possible diseases based on symptoms, antecedents, and medical knowledge. While recent advances in large language models have shown promise in supporting DDx, existing approaches face key limitations, including single-dataset evaluations, isolated opt… ▽ More

    Submitted 26 February, 2025; originally announced February 2025.

  7. arXiv:2502.05731  [pdf, other

    cs.HC

    Visual Text Mining with Progressive Taxonomy Construction for Environmental Studies

    Authors: Sam Yu-Te Lee, Cheng-Wei Hung, Mei-Hua Yuan, Kwan-Liu Ma

    Abstract: Environmental experts have developed the DPSIR (Driver, Pressure, State, Impact, Response) framework to systematically study and communicate key relationships between society and the environment. Using this framework requires experts to construct a DPSIR taxonomy from a corpus, annotate the documents, and identify DPSIR variables and relationships, which is laborious and inflexible. Automating it… ▽ More

    Submitted 8 February, 2025; originally announced February 2025.

  8. arXiv:2412.21037  [pdf, other

    cs.SD cs.AI cs.CL eess.AS

    TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization

    Authors: Chia-Yu Hung, Navonil Majumder, Zhifeng Kong, Ambuj Mehrish, Amir Ali Bagherzadeh, Chuan Li, Rafael Valle, Bryan Catanzaro, Soujanya Poria

    Abstract: We introduce TangoFlux, an efficient Text-to-Audio (TTA) generative model with 515M parameters, capable of generating up to 30 seconds of 44.1kHz audio in just 3.7 seconds on a single A40 GPU. A key challenge in aligning TTA models lies in the difficulty of creating preference pairs, as TTA lacks structured mechanisms like verifiable rewards or gold-standard answers available for Large Language Mo… ▽ More

    Submitted 10 April, 2025; v1 submitted 30 December, 2024; originally announced December 2024.

    Comments: https://tangoflux.github.io/

  9. arXiv:2412.04947  [pdf, other

    cs.CL

    C$^2$LEVA: Toward Comprehensive and Contamination-Free Language Model Evaluation

    Authors: Yanyang Li, Tin Long Wong, Cheung To Hung, Jianqiao Zhao, Duo Zheng, Ka Wai Liu, Michael R. Lyu, Liwei Wang

    Abstract: Recent advances in large language models (LLMs) have shown significant promise, yet their evaluation raises concerns, particularly regarding data contamination due to the lack of access to proprietary training data. To address this issue, we present C$^2$LEVA, a comprehensive bilingual benchmark featuring systematic contamination prevention. C$^2$LEVA firstly offers a holistic evaluation encompass… ▽ More

    Submitted 15 December, 2024; v1 submitted 6 December, 2024; originally announced December 2024.

  10. arXiv:2410.11526  [pdf

    cs.HC cs.CL

    Human-LLM Collaborative Construction of a Cantonese Emotion Lexicon

    Authors: Yusong Zhang, Dong Dong, Chi-tim Hung, Leonard Heyerdahl, Tamara Giles-Vernick, Eng-kiong Yeoh

    Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities in language understanding and generation. Advanced utilization of the knowledge embedded in LLMs for automated annotation has consistently been explored. This study proposed to develop an emotion lexicon for Cantonese, a low-resource language, through collaborative efforts between LLM and human annotators. By integrating emotio… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 13 pages

  11. arXiv:2407.13702  [pdf, other

    cs.CL

    ANHALTEN: Cross-Lingual Transfer for German Token-Level Reference-Free Hallucination Detection

    Authors: Janek Herrlein, Chia-Chien Hung, Goran Glavaš

    Abstract: Research on token-level reference-free hallucination detection has predominantly focused on English, primarily due to the scarcity of robust datasets in other languages. This has hindered systematic investigations into the effectiveness of cross-lingual transfer for this important NLP application. To address this gap, we introduce ANHALTEN, a new evaluation dataset that extends the English halluci… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: ACL 2024 Student Research Workshop

  12. arXiv:2406.15193  [pdf, other

    cs.CL

    Inference Time Alignment with Reward-Guided Tree Search

    Authors: Chia-Yu Hung, Navonil Majumder, Ambuj Mehrish, Soujanya Poria

    Abstract: Inference-time computation methods enhance the performance of Large Language Models (LLMs) by leveraging additional computational resources to achieve superior results. Common techniques, such as Best-of-N sampling, Majority Voting, and variants of tree-search algorithms have proven to be effective in boosting the performance of LLMs. These approaches strategically trade increased computational re… ▽ More

    Submitted 26 November, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

  13. arXiv:2405.16450  [pdf, other

    cs.LG cs.AI cs.PL

    Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search

    Authors: Max Liu, Chan-Hung Yu, Wei-Hsu Lee, Cheng-Wei Hung, Yen-Chun Chen, Shao-Hua Sun

    Abstract: Programmatic reinforcement learning (PRL) has been explored for representing policies through programs as a means to achieve interpretability and generalization. Despite promising outcomes, current state-of-the-art PRL methods are hindered by sample inefficiency, necessitating tens of millions of program-environment interactions. To tackle this challenge, we introduce a novel LLM-guided search fra… ▽ More

    Submitted 11 March, 2025; v1 submitted 26 May, 2024; originally announced May 2024.

  14. arXiv:2404.09956  [pdf, other

    cs.SD cs.AI cs.CL eess.AS

    Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization

    Authors: Navonil Majumder, Chia-Yu Hung, Deepanway Ghosal, Wei-Ning Hsu, Rada Mihalcea, Soujanya Poria

    Abstract: Generative multimodal content is increasingly prevalent in much of the content creation arena, as it has the potential to allow artists and media personnel to create pre-production mockups by quickly bringing their ideas to life. The generation of audio from text prompts is an important aspect of such processes in the music and film industry. Many of the recent diffusion-based text-to-audio models… ▽ More

    Submitted 17 July, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted at ACM MM 2024

  15. arXiv:2404.08820  [pdf

    cs.CV cs.LG

    Single-image driven 3d viewpoint training data augmentation for effective wine label recognition

    Authors: Yueh-Cheng Huang, Hsin-Yi Chen, Cheng-Jui Hung, Jen-Hui Chuang, Jenq-Neng Hwang

    Abstract: Confronting the critical challenge of insufficient training data in the field of complex image recognition, this paper introduces a novel 3D viewpoint augmentation technique specifically tailored for wine label recognition. This method enhances deep learning model performance by generating visually realistic training samples from a single real-world wine label image, overcoming the challenges pose… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  16. arXiv:2403.15759  [pdf

    cs.CY

    Deep Learning Approach to Forecasting COVID-19 Cases in Residential Buildings of Hong Kong Public Housing Estates: The Role of Environment and Sociodemographics

    Authors: E. Leung, J. Guan, KO. Kwok, CT. Hung, CC. Ching, KC. Chong, CHK. Yam, T. Sun, WH. Tsang, EK. Yeoh, A. Lee

    Abstract: Introduction: The current study investigates the complex association between COVID-19 and the studied districts' socioecology (e.g. internal and external built environment, sociodemographic profiles, etc.) to quantify their contributions to the early outbreaks and epidemic resurgence of COVID-19. Methods: We aligned the analytic model's architecture with the hierarchical structure of the resident'… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  17. arXiv:2403.13842  [pdf

    cs.LG physics.soc-ph

    Analyzing the Variations in Emergency Department Boarding and Testing the Transferability of Forecasting Models across COVID-19 Pandemic Waves in Hong Kong: Hybrid CNN-LSTM approach to quantifying building-level socioecological risk

    Authors: Eman Leung, Jingjing Guan, Kin On Kwok, CT Hung, CC. Ching, CK. Chung, Hector Tsang, EK Yeoh, Albert Lee

    Abstract: Emergency department's (ED) boarding (defined as ED waiting time greater than four hours) has been linked to poor patient outcomes and health system performance. Yet, effective forecasting models is rare before COVID-19, lacking during the peri-COVID era. Here, a hybrid convolutional neural network (CNN)-Long short-term memory (LSTM) model was applied to public-domain data sourced from Hong Kong's… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  18. arXiv:2402.18883  [pdf, other

    cs.DS

    Efficient Processing of Subsequent Densest Subgraph Query

    Authors: Chia-Yang Hung, Chih-Ya Shen

    Abstract: Dense subgraph extraction is a fundamental problem in graph analysis and data mining, aimed at identifying cohesive and densely connected substructures within a given graph. It plays a crucial role in various domains, including social network analysis, biological network analysis, recommendation systems, and community detection. However, extracting a subgraph with the highest node similarity is a… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 11 pages

    MSC Class: 68W27

  19. arXiv:2401.15554  [pdf

    cs.CV

    Pericoronary adipose tissue feature analysis in CT calcium score images with comparison to coronary CTA

    Authors: Yingnan Song, Hao Wu, Juhwan Lee, Justin Kim, Ammar Hoori, Tao Hu, Vladislav Zimin, Mohamed Makhlouf, Sadeer Al-Kindi, Sanjay Rajagopalan, Chun-Ho Yun, Chung-Lieh Hung, David L. Wilson

    Abstract: We investigated the feasibility and advantages of using non-contrast CT calcium score (CTCS) images to assess pericoronary adipose tissue (PCAT) and its association with major adverse cardiovascular events (MACE). PCAT features from coronary CTA (CCTA) have been shown to be associated with cardiovascular risk but are potentially confounded by iodine. If PCAT in CTCS images can be similarly analyze… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: 24 pages,10 figures

  20. arXiv:2401.11095  [pdf, other

    cs.HC cs.SD eess.AS

    SoundShift: Exploring Sound Manipulations for Accessible Mixed-Reality Awareness

    Authors: Ruei-Che Chang, Chia-Sheng Hung, Bing-Yu Chen, Dhruv Jain, Anhong Guo

    Abstract: Mixed-reality (MR) soundscapes blend real-world sound with virtual audio from hearing devices, presenting intricate auditory information that is hard to discern and differentiate. This is particularly challenging for blind or visually impaired individuals, who rely on sounds and descriptions in their everyday lives. To understand how complex audio information is consumed, we analyzed online forum… ▽ More

    Submitted 26 May, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: DIS 2024

  21. arXiv:2311.14966  [pdf, other

    cs.CL

    Walking a Tightrope -- Evaluating Large Language Models in High-Risk Domains

    Authors: Chia-Chien Hung, Wiem Ben Rim, Lindsay Frost, Lars Bruckner, Carolin Lawrence

    Abstract: High-risk domains pose unique challenges that require language models to provide accurate and safe responses. Despite the great success of large language models (LLMs), such as ChatGPT and its variants, their performance in high-risk domains remains unclear. Our study delves into an in-depth analysis of the performance of instruction-tuned LLMs, focusing on factual accuracy and safety adherence. T… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023 Workshop on Benchmarking Generalisation in NLP (GenBench)

  22. arXiv:2311.07993  [pdf, other

    cs.CV

    Explicit Change Relation Learning for Change Detection in VHR Remote Sensing Images

    Authors: Dalong Zheng, Zebin Wu, Jia Liu, Chih-Cheng Hung, Zhihui Wei

    Abstract: Change detection has always been a concerned task in the interpretation of remote sensing images. It is essentially a unique binary classification task with two inputs, and there is a change relationship between these two inputs. At present, the mining of change relationship features is usually implicit in the network architectures that contain single-branch or two-branch encoders. However, due to… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  23. arXiv:2310.14909  [pdf, other

    cs.CL cs.AI cs.LG

    Linking Surface Facts to Large-Scale Knowledge Graphs

    Authors: Gorjan Radevski, Kiril Gashteovski, Chia-Chien Hung, Carolin Lawrence, Goran Glavaš

    Abstract: Open Information Extraction (OIE) methods extract facts from natural language text in the form of ("subject"; "relation"; "object") triples. These facts are, however, merely surface forms, the ambiguity of which impedes their downstream usage; e.g., the surface phrase "Michael Jordan" may refer to either the former basketball player or the university professor. Knowledge Graphs (KGs), on the other… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  24. arXiv:2310.08123  [pdf, other

    cs.CL

    Who Wrote it and Why? Prompting Large-Language Models for Authorship Verification

    Authors: Chia-Yu Hung, Zhiqiang Hu, Yujia Hu, Roy Ka-Wei Lee

    Abstract: Authorship verification (AV) is a fundamental task in natural language processing (NLP) and computational linguistics, with applications in forensic analysis, plagiarism detection, and identification of deceptive content. Existing AV techniques, including traditional stylometric and deep learning approaches, face limitations in terms of data requirements and lack of explainability. To address thes… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: 7 pages,1 figure

  25. arXiv:2309.09658  [pdf

    cs.CL

    A Novel Method of Fuzzy Topic Modeling based on Transformer Processing

    Authors: Ching-Hsun Tseng, Shin-Jye Lee, Po-Wei Cheng, Chien Lee, Chih-Chieh Hung

    Abstract: Topic modeling is admittedly a convenient way to monitor markets trend. Conventionally, Latent Dirichlet Allocation, LDA, is considered a must-do model to gain this type of information. By given the merit of deducing keyword with token conditional probability in LDA, we can know the most possible or essential topic. However, the results are not intuitive because the given topics cannot wholly fit… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: Asian Journal of Information and Communications, Vol.12, No. 1, 125-140

  26. arXiv:2306.15593  [pdf

    cs.CV

    Cardiac CT perfusion imaging of pericoronary adipose tissue (PCAT) highlights potential confounds in coronary CTA

    Authors: Hao Wu, Yingnan Song, Ammar Hoori, Ananya Subramaniam, Juhwan Lee, Justin Kim, Tao Hu, Sadeer Al-Kindi, Wei-Ming Huang, Chun-Ho Yun, Chung-Lieh Hung, Sanjay Rajagopalan, David L. Wilson

    Abstract: Features of pericoronary adipose tissue (PCAT) assessed from coronary computed tomography angiography (CCTA) are associated with inflammation and cardiovascular risk. As PCAT is vascularly connected with coronary vasculature, the presence of iodine is a potential confounding factor on PCAT HU and textures that has not been adequately investigated. Use dynamic cardiac CT perfusion (CCTP) to inform… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: 13 pages, 8 figures

  27. arXiv:2305.12717  [pdf, other

    cs.CL cs.LG

    TADA: Efficient Task-Agnostic Domain Adaptation for Transformers

    Authors: Chia-Chien Hung, Lukas Lange, Jannik Strötgen

    Abstract: Intermediate training of pre-trained transformer-based language models on domain-specific data leads to substantial gains for downstream tasks. To increase efficiency and prevent catastrophic forgetting alleviated from full domain-adaptive pre-training, approaches such as adapters have been developed. However, these require additional parameters for each layer, and are criticized for their limited… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: ACL-Findings 2023

  28. arXiv:2305.05139   

    cs.SD cs.MM eess.AS

    Temporal Convolution Network Based Onset Detection and Query by Humming System Design

    Authors: Yu Cheng Hung, Jian-Jiun Ding

    Abstract: Onsets are a key factor to split audio into several notes. In this paper, we ensemble multiple temporal convolution network (TCN) based model and utilize a restricted frequency range spectrogram to achieve more robust onset detection. Different from the present onset detection of QBH system which is only available in a clean scenario, our proposal of onset detection and speech enhancement can prev… ▽ More

    Submitted 7 June, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: This paper has been withdrawn by the author due to a crucial definition of probability threshold and several grammer and vocabulary mistakes

  29. arXiv:2305.03982  [pdf

    cs.SD cs.MM eess.AS

    Pitch Estimation by Denoising Preprocessor and Hybrid Estimation Model

    Authors: Yu Cheng Hung, Ping Hung Chen, Jian Jiun Ding

    Abstract: Pitch estimation is to estimate the fundamental frequency and the midi number and plays a critical role in music signal analysis and vocal signal processing. In this work, we proposed a new architecture based on a learning-based enhancement preprocessor and a combination of several traditional and deep learning pitch estimation methods to achieve better pitch estimation performance in both noisy a… ▽ More

    Submitted 6 May, 2023; originally announced May 2023.

    Comments: From ICCE-Taiwan

  30. arXiv:2303.17388  [pdf, other

    cs.SE

    BPCE: A Prototype for Co-Evolution between Business Process Variants through Configurable Process Model

    Authors: Linyue Liu, Xi Guo, Chun Ouyang, Patrick C. K. Hung, Hong-Yu Zhang, Keqing He, Chen Mo, Zaiwen Feng

    Abstract: With the continuous development of business process management technology, the increasing business process models are usually owned by large enterprises. In large enterprises, different stakeholders may modify the same business process model. In order to better manage the changeability of processes, they adopt configurable business process models to manage process variants. However, the process va… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: 18 pages , 11 figures

    MSC Class: 68N99 ACM Class: D.2.2

  31. arXiv:2303.03364  [pdf, other

    cs.RO cs.CV cs.LG

    Leveraging Scene Embeddings for Gradient-Based Motion Planning in Latent Space

    Authors: Jun Yamada, Chia-Man Hung, Jack Collins, Ioannis Havoutis, Ingmar Posner

    Abstract: Motion planning framed as optimisation in structured latent spaces has recently emerged as competitive with traditional methods in terms of planning success while significantly outperforming them in terms of computational speed. However, the real-world applicability of recent work in this domain remains limited by the need to express obstacle information directly in state-space, involving simple g… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Project website: https://amp-ls.github.io/

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), 2023

  32. arXiv:2211.14986  [pdf

    eess.IV cs.CV

    An Unpaired Cross-modality Segmentation Framework Using Data Augmentation and Hybrid Convolutional Networks for Segmenting Vestibular Schwannoma and Cochlea

    Authors: Yuzhou Zhuang, Hong Liu, Enmin Song, Coskun Cetinkaya, Chih-Cheng Hung

    Abstract: The crossMoDA challenge aims to automatically segment the vestibular schwannoma (VS) tumor and cochlea regions of unlabeled high-resolution T2 scans by leveraging labeled contrast-enhanced T1 scans. The 2022 edition extends the segmentation task by including multi-institutional scans. In this work, we proposed an unpaired cross-modality segmentation framework using data augmentation and hybrid con… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: Accepted by BrainLes MICCAI proceedings

  33. Reaching Through Latent Space: From Joint Statistics to Path Planning in Manipulation

    Authors: Chia-Man Hung, Shaohong Zhong, Walter Goodwin, Oiwi Parker Jones, Martin Engelcke, Ioannis Havoutis, Ingmar Posner

    Abstract: We present a novel approach to path planning for robotic manipulators, in which paths are produced via iterative optimisation in the latent space of a generative model of robot poses. Constraints are incorporated through the use of constraint satisfaction classifiers operating on the same space. Optimisation leverages gradients through our learned models that provide a simple way to combine goal r… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 10 pages, 6 figures, 4 tables

    ACM Class: I.2.6; I.2.9; I.2.10

    Journal ref: IEEE Robotics and Automation Letters 7.2 (2022): 5334-5341

  34. arXiv:2210.07362  [pdf, other

    cs.CL

    Can Demographic Factors Improve Text Classification? Revisiting Demographic Adaptation in the Age of Transformers

    Authors: Chia-Chien Hung, Anne Lauscher, Dirk Hovy, Simone Paolo Ponzetto, Goran Glavaš

    Abstract: Demographic factors (e.g., gender or age) shape our language. Previous work showed that incorporating demographic factors can consistently improve performance for various NLP tasks with traditional NLP models. In this work, we investigate whether these previous findings still hold with state-of-the-art pretrained Transformer-based language models (PLMs). We use three common specialization methods… ▽ More

    Submitted 9 May, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Findings of EACL 2023. arXiv admin note: text overlap with arXiv:2208.01029

  35. arXiv:2208.01029  [pdf, other

    cs.CL

    On the Limitations of Sociodemographic Adaptation with Transformers

    Authors: Chia-Chien Hung, Anne Lauscher, Dirk Hovy, Simone Paolo Ponzetto, Goran Glavaš

    Abstract: Sociodemographic factors (e.g., gender or age) shape our language. Previous work showed that incorporating specific sociodemographic factors can consistently improve performance for various NLP tasks in traditional NLP models. We investigate whether these previous findings still hold with state-of-the-art pretrained Transformers. We use three common specialization methods proven effective for inco… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

  36. arXiv:2206.03139  [pdf, other

    cs.LG cs.AI cs.CL

    Intra-agent speech permits zero-shot task acquisition

    Authors: Chen Yan, Federico Carnevale, Petko Georgiev, Adam Santoro, Aurelia Guy, Alistair Muldal, Chia-Chun Hung, Josh Abramson, Timothy Lillicrap, Gregory Wayne

    Abstract: Human language learners are exposed to a trickle of informative, context-sensitive language, but a flood of raw sensory data. Through both social language use and internal processes of rehearsal and practice, language learners are able to build high-level, semantic representations that explain their perceptions. Here, we take inspiration from such processes of "inner speech" in humans (Vygotsky, 1… ▽ More

    Submitted 7 June, 2022; originally announced June 2022.

  37. arXiv:2205.14981  [pdf, other

    cs.CL

    ZusammenQA: Data Augmentation with Specialized Models for Cross-lingual Open-retrieval Question Answering System

    Authors: Chia-Chien Hung, Tommaso Green, Robert Litschko, Tornike Tsereteli, Sotaro Takeshita, Marco Bombieri, Goran Glavaš, Simone Paolo Ponzetto

    Abstract: This paper introduces our proposed system for the MIA Shared Task on Cross-lingual Open-retrieval Question Answering (COQA). In this challenging scenario, given an input question the system has to gather evidence documents from a multilingual pool and generate from them an answer in the language of the question. We devised several approaches combining different model variants for three main compon… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

  38. arXiv:2205.10400  [pdf, other

    cs.CL

    Multi2WOZ: A Robust Multilingual Dataset and Conversational Pretraining for Task-Oriented Dialog

    Authors: Chia-Chien Hung, Anne Lauscher, Ivan Vulić, Simone Paolo Ponzetto, Goran Glavaš

    Abstract: Research on (multi-domain) task-oriented dialog (TOD) has predominantly focused on the English language, primarily due to the shortage of robust TOD datasets in other languages, preventing the systematic investigation of cross-lingual transfer for this crucial NLP application area. In this work, we introduce Multi2WOZ, a new multilingual multi-domain TOD dataset, derived from the well-established… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

    Comments: NAACL 2022

  39. arXiv:2205.07446  [pdf, other

    cs.CL cs.AI cs.LG

    Miutsu: NTU's TaskBot for the Alexa Prize

    Authors: Yen-Ting Lin, Hui-Chi Kuo, Ze-Song Xu, Ssu Chiu, Chieh-Chi Hung, Yi-Cheng Chen, Chao-Wei Huang, Yun-Nung Chen

    Abstract: This paper introduces Miutsu, National Taiwan University's Alexa Prize TaskBot, which is designed to assist users in completing tasks requiring multiple steps and decisions in two different domains -- home improvement and cooking. We overview our system design and architectural goals, and detail the proposed core elements, including question answering, task retrieval, social chatting, and various… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

  40. arXiv:2201.00008  [pdf, other

    cs.LG cs.AI

    A Lightweight and Accurate Spatial-Temporal Transformer for Traffic Forecasting

    Authors: Guanyao Li, Shuhan Zhong, S. -H. Gary Chan, Ruiyuan Li, Chih-Chieh Hung, Wen-Chih Peng

    Abstract: We study the forecasting problem for traffic with dynamic, possibly periodical, and joint spatial-temporal dependency between regions. Given the aggregated inflow and outflow traffic of regions in a city from time slots 0 to t-1, we predict the traffic at time t at any region. Prior arts in the area often consider the spatial and temporal dependencies in a decoupled manner or are rather computatio… ▽ More

    Submitted 3 May, 2022; v1 submitted 30 December, 2021; originally announced January 2022.

  41. arXiv:2110.08395  [pdf, other

    cs.CL

    DS-TOD: Efficient Domain Specialization for Task Oriented Dialog

    Authors: Chia-Chien Hung, Anne Lauscher, Simone Paolo Ponzetto, Goran Glavaš

    Abstract: Recent work has shown that self-supervised dialog-specific pretraining on large conversational datasets yields substantial gains over traditional language modeling (LM) pretraining in downstream task-oriented dialog (TOD). These approaches, however, exploit general dialogic corpora (e.g., Reddit) and thus presumably fail to reliably embed domain-specific knowledge useful for concrete downstream TO… ▽ More

    Submitted 20 May, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: Findings of ACL 2022

  42. arXiv:2107.05455  [pdf, ps, other

    cs.DM

    A Local Diagnosis Algorithm for Hypercube-like Networks under the BGM Diagnosis Model

    Authors: Cheng-Kuan Lin, Tzu-Liang Kung, Chun-Nan Hung, Yuan-Hsiang Teng

    Abstract: System diagnosis is process of identifying faulty nodes in a system. An efficient diagnosis is crucial for a multiprocessor system. The BGM diagnosis model is a modification of the PMC diagnosis model, which is a test-based diagnosis. In this paper, we present a specific structure and propose an algorithm for diagnosing a node in a system under the BGM model. We also give a polynomial-time algorit… ▽ More

    Submitted 8 June, 2022; v1 submitted 30 June, 2021; originally announced July 2021.

    Journal ref: Fundamenta Informaticae, Volume 185, Issue 4 (July 7, 2022) fi:7674

  43. arXiv:2104.06274  [pdf, other

    cs.DC

    Optimal Data Placement for Data-Sharing Scientific Workflows in Heterogeneous Edge-Cloud Computing Environments

    Authors: Xin Du, Songtao Tang, Zhihui Lu, Keke Gai, Jie Wu, Patrick C. K. Hung

    Abstract: The heterogeneous edge-cloud computing paradigm can provide a more optimal direction to deploy scientific workflows than traditional distributed computing or cloud computing environments. Due to the different sizes of scientific datasets and some of these datasets must keep private, it is still a difficult problem to finding an data placement strategy that can minimize data transmission as well as… ▽ More

    Submitted 13 April, 2021; originally announced April 2021.

  44. arXiv:2103.11881  [pdf, other

    cs.RO cs.LG

    Introspective Visuomotor Control: Exploiting Uncertainty in Deep Visuomotor Control for Failure Recovery

    Authors: Chia-Man Hung, Li Sun, Yizhe Wu, Ioannis Havoutis, Ingmar Posner

    Abstract: End-to-end visuomotor control is emerging as a compelling solution for robot manipulation tasks. However, imitation learning-based visuomotor control approaches tend to suffer from a common limitation, lacking the ability to recover from an out-of-distribution state caused by compounding errors. In this paper, instead of using tactile feedback or explicitly detecting the failure through vision, we… ▽ More

    Submitted 22 March, 2021; originally announced March 2021.

    Comments: 7 pages, 5 figures, 1 table

    ACM Class: I.2.9; I.2.10

  45. arXiv:2102.02446  [pdf, other

    cs.LG math.GT

    The Analysis from Nonlinear Distance Metric to Kernel-based Drug Prescription Prediction System

    Authors: Der-Chen Chang, Ophir Frieder, Chi-Feng Hung, Hao-Ren Yao

    Abstract: Distance metrics and their nonlinear variant play a crucial role in machine learning based real-world problem solving. We demonstrated how Euclidean and cosine distance measures differ not only theoretically but also in real-world medical application, namely, outcome prediction of drug prescription. Euclidean distance exhibits favorable properties in the local geometry problem. To this regard, Euc… ▽ More

    Submitted 23 February, 2021; v1 submitted 4 February, 2021; originally announced February 2021.

    Comments: Accepted to Journal of Nonlinear and Variational Analysis, JNVA 2021

  46. arXiv:2011.05755  [pdf, other

    q-bio.QM cs.DC eess.IV

    Cryo-RALib -- a modular library for accelerating alignment in cryo-EM

    Authors: Szu-Chi Chung, Cheng-Yu Hung, Huei-Lun Siao, Hung-Yi Wu, Wei-Hau Chang, I-Ping Tu

    Abstract: Thanks to automated cryo-EM and GPU-accelerated processing, single-particle cryo-EM has become a rapid structure determination method that permits capture of dynamical structures of molecules in solution, which has been recently demonstrated by the determination of COVID-19 spike protein in March, shortly after its breakout in late January 2020. This rapidity is critical for vaccine development in… ▽ More

    Submitted 25 February, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

  47. Cross-Global Attention Graph Kernel Network Prediction of Drug Prescription

    Authors: Hao-Ren Yao, Der-Chen Chang, Ophir Frieder, Wendy Huang, I-Chia Liang, Chi-Feng Hung

    Abstract: We present an end-to-end, interpretable, deep-learning architecture to learn a graph kernel that predicts the outcome of chronic disease drug prescription. This is achieved through a deep metric learning collaborative with a Support Vector Machine objective using a graphical representation of Electronic Health Records. We formulate the predictive model as a binary graph classification problem with… ▽ More

    Submitted 4 August, 2020; originally announced August 2020.

    Comments: ACM-BCB 2020 (Full paper)

    Journal ref: Proceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics (BCB '20), September 21-24, 2020, Virtual Event, USA

  48. arXiv:2005.02220  [pdf

    cs.CV

    Learning of Art Style Using AI and Its Evaluation Based on Psychological Experiments

    Authors: Mai Cong Hung, Ryohei Nakatsu, Naoko Tosa, Takashi Kusumi, Koji Koyamada

    Abstract: GANs (Generative adversarial networks) is a new AI technology that can perform deep learning with less training data and has the capability of achieving transformation between two image sets. Using GAN we have carried out a comparison between several art sets with different art style. We have prepared several image sets; a flower photo set (A), an art image set (B1) of Impressionism drawings, an a… ▽ More

    Submitted 4 May, 2020; originally announced May 2020.

  49. arXiv:2005.01492  [pdf, other

    physics.soc-ph cs.LG stat.ML

    TRIPDECODER: Study Travel Time Attributes and Route Preferences of Metro Systems from Smart Card Data

    Authors: Xiancai Tian, Baihua Zheng, Yazhe Wang, Hsiao-Ting Huang, Chih-Chieh Hung

    Abstract: In this paper, we target at recovering the exact routes taken by commuters inside a metro system that arenot captured by an Automated Fare Collection (AFC) system and hence remain unknown. We strategicallypropose two inference tasks to handle the recovering, one to infer the travel time of each travel link thatcontributes to the total duration of any trip inside a metro network and the other to in… ▽ More

    Submitted 1 May, 2020; originally announced May 2020.

    ACM Class: I.5.1

  50. arXiv:2003.12175  [pdf, other

    cs.LG cs.SD eess.AS

    Incremental Learning Algorithm for Sound Event Detection

    Authors: Eunjeong Koh, Fatemeh Saki, Yinyi Guo, Cheng-Yu Hung, Erik Visser

    Abstract: This paper presents a new learning strategy for the Sound Event Detection (SED) system to tackle the issues of i) knowledge migration from a pre-trained model to a new target model and ii) learning new sound events without forgetting the previously learned ones without re-training from scratch. In order to migrate the previously learned knowledge from the source model to the target one, a neural a… ▽ More

    Submitted 26 March, 2020; originally announced March 2020.

    Comments: IEEE ICME 2020 Camera Ready Version

    Journal ref: IEEE ICME 2020