Showing 1–9 of 9 results for author: Liou, Y

Searching in archive cs.
  1. arXiv:2504.15013

    cs.CL

    Stay Hungry, Stay Foolish: On the Extended Reading Articles Generation with LLMs

    Authors: Yow-Fu Liou, Yu-Chien Tang, An-Zi Yen

    Abstract: The process of creating educational materials is both time-consuming and demanding for educators. This research explores the potential of Large Language Models (LLMs) to streamline this task by automating the generation of extended reading materials and relevant course suggestions. Using the TED-Ed Dig Deeper sections as an initial exploration, we investigate how supplementary articles can be enri…

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: Accepted by iRAISE@AAAI2025

  2. arXiv:2410.12330

    cs.LG

    MAX: Masked Autoencoder for X-ray Fluorescence in Geological Investigation

    Authors: An-Sheng Lee, Yu-Wen Pao, Hsuan-Tien Lin, Sofia Ya Hsuan Liou

    Abstract: Pre-training foundation models has become the de facto procedure for deep learning approaches, yet its application remains limited in geological studies, where model transferability is needed to break the shackles of data scarcity. Here we target X-ray fluorescence (XRF) scanning data, a standard high-resolution measurement in extensive scientific drilling projects. We propose a sc…

    Submitted 16 October, 2024; originally announced October 2024.

  3. arXiv:2303.16637

    cs.CV

    MuRAL: Multi-Scale Region-based Active Learning for Object Detection

    Authors: Yi-Syuan Liou, Tsung-Han Wu, Jia-Fong Yeh, Wen-Chin Chen, Winston H. Hsu

    Abstract: Obtaining a large-scale labeled object detection dataset can be costly and time-consuming, as it involves annotating images with bounding boxes and class labels. Thus, some specialized active learning methods have been proposed to reduce the cost by selecting either coarse-grained samples or fine-grained instances from unlabeled data for labeling. However, the former approaches suffer from redundant…

    Submitted 29 March, 2023; originally announced March 2023.

  4. arXiv:2202.06484

    cs.CV cs.LG

    D2ADA: Dynamic Density-aware Active Domain Adaptation for Semantic Segmentation

    Authors: Tsung-Han Wu, Yi-Syuan Liou, Shao-Ji Yuan, Hsin-Ying Lee, Tung-I Chen, Kuan-Chih Huang, Winston H. Hsu

    Abstract: In the field of domain adaptation, a trade-off exists between the model performance and the number of target domain annotations. Active learning, maximizing model performance with few informative labeled data, comes in handy for such a scenario. In this work, we present D2ADA, a general active domain adaptation framework for semantic segmentation. To adapt the model to the target domain with minim…

    Submitted 18 July, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Accepted by ECCV 2022. The code is available at https://github.com/tsunghan-wu/D2ADA

  5. arXiv:2112.01348

    cs.CV

    3rd Place Solution for NeurIPS 2021 Shifts Challenge: Vehicle Motion Prediction

    Authors: Ching-Yu Tseng, Po-Shao Lin, Yu-Jia Liou, Kuan-Chih Huang, Winston H. Hsu

    Abstract: Shifts Challenge: Robustness and Uncertainty under Real-World Distributional Shift is a competition held at NeurIPS 2021. The objective of this competition is to find methods that solve the motion prediction problem across domains. In the real-world dataset, there is variance between the input data distribution and the ground-truth data distribution, which is called the domain shift problem. In this…

    Submitted 2 December, 2021; originally announced December 2021.

    Journal ref: Bayesian Deep Learning Workshop, NeurIPS 2021

  6. arXiv:2110.05221

    cs.CL cs.AI

    Multi-Task Learning for Situated Multi-Domain End-to-End Dialogue Systems

    Authors: Po-Nien Kung, Chung-Cheng Chang, Tse-Hsuan Yang, Hsin-Kai Hsu, Yu-Jia Liou, Yun-Nung Chen

    Abstract: Task-oriented dialogue systems have been a promising area in the NLP field. Previous work showed the effectiveness of using a single GPT-2 based model to predict belief states and responses via causal language modeling. In this paper, we leverage multi-task learning techniques to train a GPT-2 based model on a more challenging dataset with multiple domains, multiple modalities, and more diversity…

    Submitted 11 October, 2021; originally announced October 2021.

  7. arXiv:2109.03551

    cs.SD cs.CL cs.CV eess.AS

    Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion

    Authors: Yi-Syuan Liou, Wen-Chin Huang, Ming-Chi Yen, Shu-Wei Tsai, Yu-Huai Peng, Tomoki Toda, Yu Tsao, Hsin-Min Wang

    Abstract: Voice conversion (VC) is an effective approach to electrolaryngeal (EL) speech enhancement, a task that aims to improve the quality of the artificial voice from an electrolarynx device. In frame-based VC methods, time alignment needs to be performed prior to model training, and the dynamic time warping (DTW) algorithm is widely adopted to compute the best time alignment between each utterance pair…

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted to APSIPA ASC 2021

  8. arXiv:1802.07934

    cs.CV

    Adversarial Learning for Semi-Supervised Semantic Segmentation

    Authors: Wei-Chih Hung, Yi-Hsuan Tsai, Yan-Ting Liou, Yen-Yu Lin, Ming-Hsuan Yang

    Abstract: We propose a method for semi-supervised semantic segmentation using an adversarial network. While most existing discriminators are trained to classify input images as real or fake on the image level, we design a discriminator in a fully convolutional manner to differentiate the predicted probability maps from the ground truth segmentation distribution with the consideration of the spatial resoluti…

    Submitted 24 July, 2018; v1 submitted 22 February, 2018; originally announced February 2018.

    Comments: Accepted in BMVC 2018. Code and models available at https://github.com/hfslyc/AdvSemiSeg

  9. arXiv:1506.02327

    cs.CL cs.LG cs.NE

    A Multi-layered Acoustic Tokenizing Deep Neural Network (MAT-DNN) for Unsupervised Discovery of Linguistic Units and Generation of High Quality Features

    Authors: Cheng-Tao Chung, Cheng-Yu Tsai, Hsiang-Hung Lu, Yuan-ming Liou, Yen-Chen Wu, Yen-Ju Lu, Hung-yi Lee, Lin-shan Lee

    Abstract: This paper summarizes the work done by the authors for the Zero Resource Speech Challenge organized in the technical program of Interspeech 2015. The goal of the challenge is to discover linguistic units directly from unlabeled speech data. The Multi-layered Acoustic Tokenizer (MAT) proposed in this work automatically discovers multiple sets of acoustic tokens from the given corpus. Each acoustic…

    Submitted 7 June, 2015; originally announced June 2015.

    Comments: Submitted to Interspeech 2015