Skip to main content

Showing 1–14 of 14 results for author: Kao, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.00761  [pdf, other

    cs.DC cs.NE cs.NI

    Towards a Decentralised Application-Centric Orchestration Framework in the Cloud-Edge Continuum

    Authors: Amjad Ullah, Andras Markus, Hacı İsmail Aslan, Tamas Kiss, Jozsef Kovacs, James Deslauriers, Amy L. Murphy, Yiming Wang Odej Kao

    Abstract: The efficient management of complex distributed applications in the Cloud-Edge continuum, including their deployment on heterogeneous computing resources and run-time operations, presents significant challenges. Resource management solutions -- also called orchestrators -- play a pivotal role by automating and managing tasks such as resource discovery, optimisation, application deployment, and lif… ▽ More

    Submitted 1 April, 2025; originally announced April 2025.

    Comments: Accepted for publication in the 9th IEEE International Conference on Fog and Edge Computing 2025

  2. arXiv:2401.15312  [pdf, other

    cs.CL

    How We Refute Claims: Automatic Fact-Checking through Flaw Identification and Explanation

    Authors: Wei-Yu Kao, An-Zi Yen

    Abstract: Automated fact-checking is a crucial task in the governance of internet content. Although various studies utilize advanced models to tackle this issue, a significant gap persists in addressing complex real-world rumors and deceptive claims. To address this challenge, this paper explores the novel task of flaw-oriented fact-checking, including aspect generation and flaw identification. We also intr… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

  3. arXiv:2401.12019  [pdf, other

    cs.CV

    Stereo-Matching Knowledge Distilled Monocular Depth Estimation Filtered by Multiple Disparity Consistency

    Authors: Woonghyun Ka, Jae Young Lee, Jaehyun Choi, Junmo Kim

    Abstract: In stereo-matching knowledge distillation methods of the self-supervised monocular depth estimation, the stereo-matching network's knowledge is distilled into a monocular depth network through pseudo-depth maps. In these methods, the learning-based stereo-confidence network is generally utilized to identify errors in the pseudo-depth maps to prevent transferring the errors. However, the learning-b… ▽ More

    Submitted 22 January, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: ICASSP 2024. The first two authors are equally contributed

  4. arXiv:2401.12001  [pdf, other

    cs.CV

    Modeling Stereo-Confidence Out of the End-to-End Stereo-Matching Network via Disparity Plane Sweep

    Authors: Jae Young Lee, Woonghyun Ka, Jaehyun Choi, Junmo Kim

    Abstract: We propose a novel stereo-confidence that can be measured externally to various stereo-matching networks, offering an alternative input modality choice of the cost volume for learning-based approaches, especially in safety-critical systems. Grounded in the foundational concepts of disparity definition and the disparity plane sweep, the proposed stereo-confidence method is built upon the idea that… ▽ More

    Submitted 22 January, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: AAAI 2024. The first two authors contributed equally

  5. arXiv:2210.07496  [pdf, ps, other

    math.CO cs.IT

    On the size of maximal binary codes with 2, 3, and 4 distances

    Authors: Alexander Barg, Alexey Glazyrin, Wei-Jiun Kao, Ching-Yi Lai, Pin-Chieh Tseng, Wei-Hsuan Yu

    Abstract: We address the maximum size of binary codes and binary constant weight codes with few distances. Previous works established a number of bounds for these quantities as well as the exact values for a range of small code lengths. As our main results, we determine the exact size of maximal binary codes with two distances for all lengths $n\ge 6$ as well as the exact size of maximal binary constant wei… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: Main text 23 pp. and Appendix 17pp

  6. arXiv:2204.03219  [pdf, other

    eess.AS cs.LG cs.SD

    DDOS: A MOS Prediction Framework utilizing Domain Adaptive Pre-training and Distribution of Opinion Scores

    Authors: Wei-Cheng Tseng, Wei-Tsung Kao, Hung-yi Lee

    Abstract: Mean opinion score (MOS) is a typical subjective evaluation metric for speech synthesis systems. Since collecting MOS is time-consuming, it would be desirable if there are accurate MOS prediction models for automatic evaluation. In this work, we propose DDOS, a novel MOS prediction model. DDOS utilizes domain adaptive pre-training to further pre-train self-supervised learning models on synthetic s… ▽ More

    Submitted 15 August, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: Accepted to Interspeech 2022. Code will be available in the future

  7. arXiv:2204.00352  [pdf, other

    cs.LG eess.AS

    On the Efficiency of Integrating Self-supervised Learning and Meta-learning for User-defined Few-shot Keyword Spotting

    Authors: Wei-Tsung Kao, Yuan-Kuei Wu, Chia-Ping Chen, Zhi-Sheng Chen, Yu-Pao Tsai, Hung-Yi Lee

    Abstract: User-defined keyword spotting is a task to detect new spoken terms defined by users. This can be viewed as a few-shot learning problem since it is unreasonable for users to define their desired keywords by providing many examples. To solve this problem, previous works try to incorporate self-supervised learning models or apply meta-learning algorithms. But it is unclear whether self-supervised lea… ▽ More

    Submitted 5 October, 2022; v1 submitted 1 April, 2022; originally announced April 2022.

    Comments: Accepted by SLT 2022

  8. arXiv:2202.03822  [pdf, other

    cs.CV cs.AI

    A Novel Plug-in Module for Fine-Grained Visual Classification

    Authors: Po-Yung Chou, Cheng-Hung Lin, Wen-Chung Kao

    Abstract: Visual classification can be divided into coarse-grained and fine-grained classification. Coarse-grained classification represents categories with a large degree of dissimilarity, such as the classification of cats and dogs, while fine-grained classification represents classifications with a large degree of similarity, such as cat species, bird species, and the makes or models of vehicles. Unlike… ▽ More

    Submitted 8 February, 2022; originally announced February 2022.

  9. arXiv:2201.07436  [pdf, other

    cs.CV

    Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth

    Authors: Doyeon Kim, Woonghyun Ka, Pyungwhan Ahn, Donggyu Joo, Sehwan Chun, Junmo Kim

    Abstract: Depth estimation from a single image is an important task that can be applied to various fields in computer vision, and has grown rapidly with the development of convolutional neural networks. In this paper, we propose a novel structure and training strategy for monocular depth estimation to further improve the prediction accuracy of the network. We deploy a hierarchical transformer encoder to cap… ▽ More

    Submitted 29 October, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

    Comments: 11pages, 5 figures

  10. arXiv:2111.05113  [pdf, other

    cs.CR cs.LG cs.SD eess.AS

    Membership Inference Attacks Against Self-supervised Speech Models

    Authors: Wei-Cheng Tseng, Wei-Tsung Kao, Hung-yi Lee

    Abstract: Recently, adapting the idea of self-supervised learning (SSL) on continuous speech has started gaining attention. SSL models pre-trained on a huge amount of unlabeled audio can generate general-purpose representations that benefit a wide variety of speech processing tasks. Despite their ubiquitous deployment, however, the potential privacy risks of these models have not been well investigated. In… ▽ More

    Submitted 15 August, 2022; v1 submitted 9 November, 2021; originally announced November 2021.

    Comments: Accepted to Interspeech 2022. Code will be available in the future

  11. arXiv:2104.03017  [pdf, other

    eess.AS cs.LG cs.SD

    Utilizing Self-supervised Representations for MOS Prediction

    Authors: Wei-Cheng Tseng, Chien-yu Huang, Wei-Tsung Kao, Yist Y. Lin, Hung-yi Lee

    Abstract: Speech quality assessment has been a critical issue in speech processing for decades. Existing automatic evaluations usually require clean references or parallel ground truth data, which is infeasible when the amount of data soars. Subjective tests, on the other hand, do not need any additional clean or parallel data and correlates better to human perception. However, such a test is expensive and… ▽ More

    Submitted 20 September, 2021; v1 submitted 7 April, 2021; originally announced April 2021.

    Comments: In Proceedings of Interspeech 2021. We acknowledge the support of AWS Machine Learning Research Awards program. Source code available at https://github.com/s3prl/s3prl/tree/master/s3prl/downstream/mos_prediction

  12. arXiv:2103.07162  [pdf, other

    cs.CL cs.LG

    Is BERT a Cross-Disciplinary Knowledge Learner? A Surprising Finding of Pre-trained Models' Transferability

    Authors: Wei-Tsung Kao, Hung-Yi Lee

    Abstract: This paper investigates whether the power of the models pre-trained on text data, such as BERT, can be transferred to general token sequence classification applications. To verify pre-trained models' transferability, we test the pre-trained models on text classification tasks with meanings of tokens mismatches, and real-world non-text token sequence classification data, including amino acid, DNA,… ▽ More

    Submitted 19 April, 2022; v1 submitted 12 March, 2021; originally announced March 2021.

    Comments: Findings of EMNLP 2021

  13. arXiv:2001.09309  [pdf, other

    cs.CL cs.LG

    BERT's output layer recognizes all hidden layers? Some Intriguing Phenomena and a simple way to boost BERT

    Authors: Wei-Tsung Kao, Tsung-Han Wu, Po-Han Chi, Chun-Cheng Hsieh, Hung-Yi Lee

    Abstract: Although Bidirectional Encoder Representations from Transformers (BERT) have achieved tremendous success in many natural language processing (NLP) tasks, it remains a black box. A variety of previous works have tried to lift the veil of BERT and understand each layer's functionality. In this paper, we found that surprisingly the output layer of BERT can reconstruct the input sentence by directly t… ▽ More

    Submitted 15 February, 2021; v1 submitted 25 January, 2020; originally announced January 2020.

    Comments: 7 pages, 8 figures, 3 tables

  14. arXiv:1903.12258  [pdf, other

    q-fin.GN cs.LG q-fin.ST stat.ML

    Using Deep Learning Neural Networks and Candlestick Chart Representation to Predict Stock Market

    Authors: Rosdyana Mangir Irawan Kusuma, Trang-Thi Ho, Wei-Chun Kao, Yu-Yen Ou, Kai-Lung Hua

    Abstract: Stock market prediction is still a challenging problem because there are many factors effect to the stock market price such as company news and performance, industry performance, investor sentiment, social media sentiment and economic factors. This work explores the predictability in the stock market using Deep Convolutional Network and candlestick charts. The outcome is utilized to design a decis… ▽ More

    Submitted 25 February, 2019; originally announced March 2019.

    Comments: conference,13 pages,3 figures