Skip to main content

Showing 1–9 of 9 results for author: Chao, K

Searching in archive eess. Search in all archives.
.
  1. arXiv:2412.05951  [pdf, other

    cs.SD cs.CV cs.LG eess.AS

    When Vision Models Meet Parameter Efficient Look-Aside Adapters Without Large-Scale Audio Pretraining

    Authors: Juan Yeo, Jinkwan Jang, Kyubyung Chae, Seongkyu Mun, Taesup Kim

    Abstract: Recent studies show that pretrained vision models can boost performance in audio downstream tasks. To enhance the performance further, an additional pretraining stage with large scale audio data is typically required to infuse audio specific knowledge into the vision model. However, such approaches require extensive audio data and a carefully designed objective function. In this work, we propose b… ▽ More

    Submitted 8 December, 2024; originally announced December 2024.

    Comments: 5 pages, 3 figures

  2. arXiv:2411.18075  [pdf, other

    cs.SD eess.AS

    Music2Fail: Transfer Music to Failed Recorder Style

    Authors: Chon In Leong, I-Ling Chung, Kin-Fong Chao, Jun-You Wang, Yi-Hsuan Yang, Jyh-Shing Roger Jang

    Abstract: The goal of music style transfer is to convert a music performance by one instrument into another while keeping the musical contents unchanged. In this paper, we investigate another style transfer scenario called ``failed-music style transfer''. Unlike the usual music style transfer where the content remains the same and only the instrumental characteristics are changed, this scenario seeks to tra… ▽ More

    Submitted 27 November, 2024; originally announced November 2024.

    Comments: Accepted by APSIPA 2024

  3. arXiv:2304.14496  [pdf, ps, other

    physics.ins-det cs.LG eess.SP nucl-ex

    Restoring Original Signal From Pile-up Signal using Deep Learning

    Authors: C. H. Kim, S. Ahn, K. Y. Chae, J. Hooker, G. V. Rogachev

    Abstract: Pile-up signals are frequently produced in experimental physics. They create inaccurate physics data with high uncertainty and cause various problems. Therefore, the correction to pile-up signals is crucially required. In this study, we implemented a deep learning method to restore the original signals from the pile-up signals. We showed that a deep learning model could accurately reconstruct the… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

  4. arXiv:2206.06730  [pdf, other

    eess.IV cs.CV

    Automated Precision Localization of Peripherally Inserted Central Catheter Tip through Model-Agnostic Multi-Stage Networks

    Authors: Subin Park, Yoon Ki Cha, Soyoung Park, Kyung-Su Kim, Myung Jin Chung

    Abstract: Peripherally inserted central catheters (PICCs) have been widely used as one of the representative central venous lines (CVCs) due to their long-term intravascular access with low infectivity. However, PICCs have a fatal drawback of a high frequency of tip mispositions, increasing the risk of puncture, embolism, and complications such as cardiac arrhythmias. To automatically and precisely detect i… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: Subin Park and Yoon Ki Cha have contributed equally to this work as the co-first author. Kyung-Su Kim ([email protected]) and Myung Jin Chung ([email protected]) have contributed equally to this work as the co-corresponding author

  5. arXiv:2007.09610  [pdf, other

    eess.IV cs.CV cs.LG

    Self-similarity Student for Partial Label Histopathology Image Segmentation

    Authors: Hsien-Tzu Cheng, Chun-Fu Yeh, Po-Chen Kuo, Andy Wei, Keng-Chi Liu, Mong-Chi Ko, Kuan-Hua Chao, Yu-Ching Peng, Tyng-Luh Liu

    Abstract: Delineation of cancerous regions in gigapixel whole slide images (WSIs) is a crucial diagnostic procedure in digital pathology. This process is time-consuming because of the large search space in the gigapixel WSIs, causing chances of omission and misinterpretation at indistinct tumor lesions. To tackle this, the development of an automated cancerous region segmentation method is imperative. We fr… ▽ More

    Submitted 19 July, 2020; originally announced July 2020.

    Comments: ECCV 2020

  6. arXiv:2004.12786  [pdf, other

    eess.IV cs.CV cs.LG

    A Cascaded Learning Strategy for Robust COVID-19 Pneumonia Chest X-Ray Screening

    Authors: Chun-Fu Yeh, Hsien-Tzu Cheng, Andy Wei, Hsin-Ming Chen, Po-Chen Kuo, Keng-Chi Liu, Mong-Chi Ko, Ray-Jade Chen, Po-Chang Lee, Jen-Hsiang Chuang, Chi-Mai Chen, Yi-Chang Chen, Wen-Jeng Lee, Ning Chien, Jo-Yu Chen, Yu-Sen Huang, Yu-Chien Chang, Yu-Cheng Huang, Nai-Kuan Chou, Kuan-Hua Chao, Yi-Chin Tu, Yeun-Chung Chang, Tyng-Luh Liu

    Abstract: We introduce a comprehensive screening platform for the COVID-19 (a.k.a., SARS-CoV-2) pneumonia. The proposed AI-based system works on chest x-ray (CXR) images to predict whether a patient is infected with the COVID-19 disease. Although the recent international joint effort on making the availability of all sorts of open data, the public collection of CXR images is still relatively small for relia… ▽ More

    Submitted 30 April, 2020; v1 submitted 24 April, 2020; originally announced April 2020.

    Comments: 14 pages, 6 figures

  7. arXiv:2003.02632  [pdf, other

    eess.SP

    STAR: Spatio-Temporal Prediction of Air Quality Using A Multimodal Approach

    Authors: Tien-Cuong Bui, Joonyoung Kim, Taewoo Kang, Donghyeon Lee, Junyoung Choi, Insoon Yang, Kyomin Jung, Sang Kyun Cha

    Abstract: With the increase of global economic activities and high energy demand, many countries have raised concerns about air pollution. However, air quality prediction is a challenging issue due to the complex interaction of many factors. In this paper, we propose a multimodal approach for spatio-temporal air quality prediction. Our model learns the multimodal fusion of critical factors to predict future… ▽ More

    Submitted 28 February, 2020; originally announced March 2020.

    Comments: 18 pages, 9 figures, Intelligent System Conference (Intellisys 2020 - Accepted)

  8. arXiv:1912.00649  [pdf, other

    cs.MM cs.CV eess.AS

    An Attention-Based Speaker Naming Method for Online Adaptation in Non-Fixed Scenarios

    Authors: Jungwoo Pyo, Joohyun Lee, Youngjune Park, Tien-Cuong Bui, Sang Kyun Cha

    Abstract: A speaker naming task, which finds and identifies the active speaker in a certain movie or drama scene, is crucial for dealing with high-level video analysis applications such as automatic subtitle labeling and video summarization. Modern approaches have usually exploited biometric features with a gradient-based method instead of rule-based algorithms. In a certain situation, however, a naive grad… ▽ More

    Submitted 2 December, 2019; originally announced December 2019.

    Comments: AAAI 2020 Workshop on Interactive and Conversational Recommendation Systems(WICRS)

  9. arXiv:1911.12919  [pdf, other

    cs.LG eess.SP stat.ML

    Spatiotemporal deep learning model for citywide air pollution interpolation and prediction

    Authors: Van-Duc Le, Tien-Cuong Bui, Sang Kyun Cha

    Abstract: Recently, air pollution is one of the most concerns for big cities. Predicting air quality for any regions and at any time is a critical requirement of urban citizens. However, air pollution prediction for the whole city is a challenging problem. The reason is, there are many spatiotemporal factors affecting air pollution throughout the city. Collecting as many of them could help us to forecast ai… ▽ More

    Submitted 28 November, 2019; originally announced November 2019.

    Comments: Accepted at BigComp2020