Skip to main content

Showing 1–25 of 25 results for author: Chun, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2507.08422  [pdf, ps, other

    cs.CV eess.IV

    Upsample What Matters: Region-Adaptive Latent Sampling for Accelerated Diffusion Transformers

    Authors: Wongi Jeong, Kyungryeol Lee, Hoigi Seo, Se Young Chun

    Abstract: Diffusion transformers have emerged as an alternative to U-net-based diffusion models for high-fidelity image and video generation, offering superior scalability. However, their heavy computation remains a major obstacle to real-world deployment. Existing acceleration methods primarily exploit the temporal dimension such as reusing cached features across diffusion timesteps. Here, we propose Regio… ▽ More

    Submitted 11 July, 2025; originally announced July 2025.

  2. arXiv:2505.23085  [pdf, ps, other

    cs.CV cs.AI eess.IV

    GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion

    Authors: Gwanghyun Kim, Xueting Li, Ye Yuan, Koki Nagano, Tianye Li, Jan Kautz, Se Young Chun, Umar Iqbal

    Abstract: Estimating accurate and temporally consistent 3D human geometry from videos is a challenging problem in computer vision. Existing methods, primarily optimized for single images, often suffer from temporal inconsistencies and fail to capture fine-grained dynamic details. To address these limitations, we present GeoMan, a novel architecture designed to produce accurate and temporally consistent dept… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: Project page: https://research.nvidia.com/labs/dair/geoman

  3. arXiv:2505.04820  [pdf, other

    eess.IV math.NA math.OC

    Convergent Complex Quasi-Newton Proximal Methods for Gradient-Driven Denoisers in Compressed Sensing MRI Reconstruction

    Authors: Tao Hong, Zhaoyi Xu, Se Young Chun, Luis Hernandez-Garcia, Jeffrey A. Fessler

    Abstract: In compressed sensing (CS) MRI, model-based methods are pivotal to achieving accurate reconstruction. One of the main challenges in model-based methods is finding an effective prior to describe the statistical distribution of the target image. Plug-and-Play (PnP) and REgularization by Denoising (RED) are two general frameworks that use denoisers as the prior. While PnP/RED methods with convolution… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

    Comments: 12 pages, 10 figures, https://hongtao-argmin.github.io/CQNPM-GD-CSMRI/

  4. arXiv:2505.00133  [pdf, other

    eess.IV cs.CV

    Efficient and robust 3D blind harmonization for large domain gaps

    Authors: Hwihun Jeong, Hayeon Lee, Se Young Chun, Jongho Lee

    Abstract: Blind harmonization has emerged as a promising technique for MR image harmonization to achieve scale-invariant representations, requiring only target domain data (i.e., no source domain data necessary). However, existing methods face limitations such as inter-slice heterogeneity in 3D, moderate image quality, and limited performance for a large domain gap. To address these challenges, we introduce… ▽ More

    Submitted 30 April, 2025; originally announced May 2025.

  5. Finding Reproducible and Prognostic Radiomic Features in Variable Slice Thickness Contrast Enhanced CT of Colorectal Liver Metastases

    Authors: Jacob J. Peoples, Mohammad Hamghalam, Imani James, Maida Wasim, Natalie Gangai, Hyunseon Christine Kang, X. John Rong, Yun Shin Chun, Richard K. G. Do, Amber L. Simpson

    Abstract: Establishing the reproducibility of radiomic signatures is a critical step in the path to clinical adoption of quantitative imaging biomarkers; however, radiomic signatures must also be meaningfully related to an outcome of clinical importance to be of value for personalized medicine. In this study, we analyze both the reproducibility and prognostic value of radiomic features extracted from the li… ▽ More

    Submitted 19 January, 2025; originally announced January 2025.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) https://melba-journal.org/2024:032

    Journal ref: Machine.Learning.for.Biomedical.Imaging. 2 (2025)

  6. arXiv:2409.11738  [pdf, other

    eess.IV cs.CV

    Adaptive Selection of Sampling-Reconstruction in Fourier Compressed Sensing

    Authors: Seongmin Hong, Jaehyeok Bae, Jongho Lee, Se Young Chun

    Abstract: Compressed sensing (CS) has emerged to overcome the inefficiency of Nyquist sampling. However, traditional optimization-based reconstruction is slow and can not yield an exact image in practice. Deep learning-based reconstruction has been a promising alternative to optimization-based reconstruction, outperforming it in accuracy and computation speed. Finding an efficient sampling method with deep… ▽ More

    Submitted 18 September, 2024; v1 submitted 18 September, 2024; originally announced September 2024.

    Comments: 30 pages, 9.8 MB, Accepted to ECCV 2024

  7. arXiv:2409.10394  [pdf, ps, other

    eess.IV cs.AI

    MOST: MR reconstruction Optimization for multiple downStream Tasks via continual learning

    Authors: Hwihun Jeong, Se Young Chun, Jongho Lee

    Abstract: Deep learning-based Magnetic Resonance (MR) reconstruction methods have focused on generating high-quality images but often overlook the impact on downstream tasks (e.g., segmentation) that utilize the reconstructed images. Cascading separately trained reconstruction network and downstream task network has been shown to introduce performance degradation due to error propagation and domain gaps bet… ▽ More

    Submitted 24 June, 2025; v1 submitted 16 September, 2024; originally announced September 2024.

  8. arXiv:2407.05551  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    Read, Watch and Scream! Sound Generation from Text and Video

    Authors: Yujin Jeong, Yunji Kim, Sanghyuk Chun, Jiyoung Lee

    Abstract: Despite the impressive progress of multimodal generative models, video-to-audio generation still suffers from limited performance and limits the flexibility to prioritize sound synthesis for specific objects within the scene. Conversely, text-to-audio generation methods generate high-quality audio but pose challenges in ensuring comprehensive scene depiction and time-varying control. To tackle the… ▽ More

    Submitted 26 December, 2024; v1 submitted 7 July, 2024; originally announced July 2024.

    Comments: AAAI2025, Project page: https://naver-ai.github.io/rewas

  9. arXiv:2312.07425  [pdf, other

    cs.LG cs.CV eess.IV eess.SP

    Deep Internal Learning: Deep Learning from a Single Input

    Authors: Tom Tirer, Raja Giryes, Se Young Chun, Yonina C. Eldar

    Abstract: Deep learning, in general, focuses on training a neural network from large labeled datasets. Yet, in many cases there is value in training a network just from the input at hand. This is particularly relevant in many signal and image processing problems where training data is scarce and diversity is large on the one hand, and on the other, there is a lot of structure in the data that can be exploit… ▽ More

    Submitted 8 April, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Accepted to IEEE Signal Processing Magazine

  10. arXiv:2312.01689  [pdf, other

    eess.IV cs.CV

    Fast and accurate sparse-view CBCT reconstruction using meta-learned neural attenuation field and hash-encoding regularization

    Authors: Heejun Shin, Taehee Kim, Jongho Lee, Se Young Chun, Seungryung Cho, Dongmyung Shin

    Abstract: Cone beam computed tomography (CBCT) is an emerging medical imaging technique to visualize the internal anatomical structures of patients. During a CBCT scan, several projection images of different angles or views are collectively utilized to reconstruct a tomographic image. However, reducing the number of projections in a CBCT scan while preserving the quality of a reconstructed image is challeng… ▽ More

    Submitted 16 January, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

  11. arXiv:2307.10667  [pdf, other

    eess.IV cs.CV

    Efficient Unified Demosaicing for Bayer and Non-Bayer Patterned Image Sensors

    Authors: Haechang Lee, Dongwon Park, Wongi Jeong, Kijeong Kim, Hyunwoo Je, Dongil Ryu, Se Young Chun

    Abstract: As the physical size of recent CMOS image sensors (CIS) gets smaller, the latest mobile cameras are adopting unique non-Bayer color filter array (CFA) patterns (e.g., Quad, Nona, QxQ), which consist of homogeneous color units with adjacent pixels. These non-Bayer sensors are superior to conventional Bayer CFA thanks to their changeable pixel-bin sizes for different light conditions but may introdu… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

  12. arXiv:2211.05910  [pdf, other

    eess.IV cs.CV

    Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Maurizio Denna, Abdel Younes, Ganzorig Gankhuyag, Jingang Huh, Myeong Kyun Kim, Kihwan Yoon, Hyeon-Cheol Moon, Seungho Lee, Yoonsik Choe, Jinwoo Jeong, Sungjei Kim, Maciej Smyl, Tomasz Latkowski, Pawel Kubik, Michal Sokolski, Yujie Ma, Jiahao Chao, Zhou Zhou, Hongfan Gao, Zhengfeng Yang, Zhenbing Zeng, Zhengyang Zhuge, Chenghua Li , et al. (71 additional authors not shown)

    Abstract: Image super-resolution is a common task on mobile and IoT devices, where one often needs to upscale and enhance low-resolution images and video frames. While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints. In this Mobile AI challenge, we address this problem and propose… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2105.07825, arXiv:2105.08826, arXiv:2211.04470, arXiv:2211.03885, arXiv:2211.05256

  13. arXiv:2211.04470  [pdf, other

    cs.CV eess.IV

    Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: Report

    Authors: Andrey Ignatov, Grigory Malivenko, Radu Timofte, Lukasz Treszczotko, Xin Chang, Piotr Ksiazek, Michal Lopuszynski, Maciej Pioro, Rafal Rudnicki, Maciej Smyl, Yujie Ma, Zhenyu Li, Zehui Chen, Jialei Xu, Xianming Liu, Junjun Jiang, XueChao Shi, Difan Xu, Yanan Li, Xiaotao Wang, Lei Lei, Ziyu Zhang, Yicheng Wang, Zilong Huang, Guozhong Luo , et al. (14 additional authors not shown)

    Abstract: Various depth estimation models are now widely used on many mobile and IoT devices for image segmentation, bokeh effect rendering, object tracking and many other mobile tasks. Thus, it is very crucial to have efficient and accurate depth estimation models that can run fast on low-power mobile chipsets. In this Mobile AI challenge, the target was to develop deep learning-based single image depth es… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2105.08630, arXiv:2211.03885; text overlap with arXiv:2105.08819, arXiv:2105.08826, arXiv:2105.08629, arXiv:2105.07809, arXiv:2105.07825

  14. arXiv:2208.07552  [pdf, ps, other

    eess.IV cs.CV cs.LG

    Self-supervised training of deep denoisers in multi-coil MRI considering noise correlations

    Authors: Juhyung Park, Dongwon Park, Sooyeon Ji, Hyeong-Geol Shin, Se Young Chun, Jongho Lee

    Abstract: Deep learning-based denoising methods have shown powerful results for improving the signal-to-noise ratio of magnetic resonance (MR) images, mostly by leveraging supervised learning with clean ground truth. However, acquiring clean ground truth images is often expensive and time-consuming. Self supervised methods have been widely investigated to mitigate the dependency on clean images, but mostly… ▽ More

    Submitted 12 June, 2025; v1 submitted 16 August, 2022; originally announced August 2022.

    Comments: 9 pages, 5figures

  15. arXiv:2207.01520  [pdf, other

    eess.IV cs.CV

    Adaptive GLCM sampling for transformer-based COVID-19 detection on CT

    Authors: Okchul Jung, Dong Un Kang, Gwanghyun Kim, Se Young Chun

    Abstract: The world has suffered from COVID-19 (SARS-CoV-2) for the last two years, causing much damage and change in people's daily lives. Thus, automated detection of COVID-19 utilizing deep learning on chest computed tomography (CT) scans became promising, which helps correct diagnosis efficiently. Recently, transformer-based COVID-19 detection method on CT is proposed to utilize 3D information in CT vol… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: 6 pages

  16. arXiv:2205.04821  [pdf, other

    eess.IV cs.CV

    Self-supervised regression learning using domain knowledge: Applications to improving self-supervised denoising in imaging

    Authors: Il Yong Chun, Dongwon Park, Xuehang Zheng, Se Young Chun, Yong Long

    Abstract: Regression that predicts continuous quantity is a central part of applications using computational imaging and computer vision technologies. Yet, studying and understanding self-supervised learning for regression tasks - except for a particular regression task, image denoising - have lagged behind. This paper proposes a general self-supervised regression learning (SSRL) framework that enables lear… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

    Comments: 17 pages, 16 figures, 2 tables, submitted to IEEE T-IP

  17. arXiv:2108.12841  [pdf, other

    eess.IV cs.CV

    Rethinking Deep Image Prior for Denoising

    Authors: Yeonsik Jo, Se Young Chun, Jonghyun Choi

    Abstract: Deep image prior (DIP) serves as a good inductive bias for diverse inverse problems. Among them, denoising is known to be particularly challenging for the DIP due to noise fitting with the requirement of an early stopping. To address the issue, we first analyze the DIP by the notion of effective degrees of freedom (DF) to monitor the optimization progress and propose a principled stopping criterio… ▽ More

    Submitted 29 August, 2021; originally announced August 2021.

    Comments: ICCV 2021

  18. arXiv:2106.04165  [pdf, other

    cs.LG cs.NE eess.SY math.DS

    Neural Hybrid Automata: Learning Dynamics with Multiple Modes and Stochastic Transitions

    Authors: Michael Poli, Stefano Massaroli, Luca Scimeca, Seong Joon Oh, Sanghyuk Chun, Atsushi Yamashita, Hajime Asama, Jinkyoo Park, Animesh Garg

    Abstract: Effective control and prediction of dynamical systems often require appropriate handling of continuous-time and discrete, event-triggered processes. Stochastic hybrid systems (SHSs), common across engineering domains, provide a formalism for dynamical systems subject to discrete, possibly stochastic, state jumps and multi-modal continuous-time flows. Despite the versatility and importance of SHSs… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

  19. arXiv:2105.00543  [pdf, other

    cs.HC eess.SY

    MagSurface: Wireless 2D Finger Tracking Leveraging Magnetic Fields

    Authors: Sarnab Bhattacharya, Keum San Chun, Edison Thomaz

    Abstract: With the ubiquity of touchscreens, touch input modality has become a popular way of interaction. However, current touchscreen technology is limiting in its design as it restricts touch interactions to specially instrumented touch surfaces. Surface contaminants like water can also hinder proper interactions. In this paper, we propose the use of magnetic field sensing to enable finger tracking on a… ▽ More

    Submitted 2 May, 2021; originally announced May 2021.

  20. arXiv:2102.02485  [pdf, other

    cs.CV eess.IV

    Image Restoration by Deep Projected GSURE

    Authors: Shady Abu-Hussein, Tom Tirer, Se Young Chun, Yonina C. Eldar, Raja Giryes

    Abstract: Ill-posed inverse problems appear in many image processing applications, such as deblurring and super-resolution. In recent years, solutions that are based on deep Convolutional Neural Networks (CNNs) have shown great promise. Yet, most of these techniques, which train CNNs using external data, are restricted to the observation models that have been used in the training phase. A recent alternative… ▽ More

    Submitted 4 February, 2021; originally announced February 2021.

  21. arXiv:2005.01996  [pdf, other

    eess.IV cs.CV

    NTIRE 2020 Challenge on Real-World Image Super-Resolution: Methods and Results

    Authors: Andreas Lugmayr, Martin Danelljan, Radu Timofte, Namhyuk Ahn, Dongwoon Bai, Jie Cai, Yun Cao, Junyang Chen, Kaihua Cheng, SeYoung Chun, Wei Deng, Mostafa El-Khamy, Chiu Man Ho, Xiaozhong Ji, Amin Kheradmand, Gwantae Kim, Hanseok Ko, Kanghyu Lee, Jungwon Lee, Hao Li, Ziluan Liu, Zhi-Song Liu, Shuai Liu, Yunhua Lu, Zibo Meng , et al. (21 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2020 challenge on real world super-resolution. It focuses on the participating methods and final results. The challenge addresses the real world setting, where paired true high and low-resolution images are unavailable. For training, only one set of source input images is therefore provided along with a set of unpaired high-quality target images. In Track 1: Image Proc… ▽ More

    Submitted 5 May, 2020; originally announced May 2020.

  22. arXiv:1911.07410  [pdf, other

    eess.IV cs.CV

    Multi-Temporal Recurrent Neural Networks For Progressive Non-Uniform Single Image Deblurring With Incremental Temporal Training

    Authors: Dongwon Park, Dong Un Kang, Jisoo Kim, Se Young Chun

    Abstract: Multi-scale (MS) approaches have been widely investigated for blind single image / video deblurring that sequentially recovers deblurred images in low spatial scale first and then in high spatial scale later with the output of lower scales. MS approaches have been effective especially for severe blurs induced by large motions in high spatial scale since those can be seen as small blurs in low spat… ▽ More

    Submitted 17 November, 2019; originally announced November 2019.

    Comments: 10 pages, 8 figures, 6 tables, work in progress

  23. arXiv:1911.04385  [pdf, other

    cs.SD eess.AS

    Visualizing and Understanding Self-attention based Music Tagging

    Authors: Minz Won, Sanghyuk Chun, Xavier Serra

    Abstract: Recently, we proposed a self-attention based music tagging model. Different from most of the conventional deep architectures in music information retrieval, which use stacked 3x3 filters by treating music spectrograms as images, the proposed self-attention based model attempted to regard music as a temporal sequence of individual audio events. Not only the performance, but it could also facilitate… ▽ More

    Submitted 11 November, 2019; originally announced November 2019.

    Comments: Machine Learning for Music Discovery Workshop (ML4MD) at ICML 2019

  24. arXiv:1906.04972  [pdf, other

    cs.SD eess.AS

    Toward Interpretable Music Tagging with Self-Attention

    Authors: Minz Won, Sanghyuk Chun, Xavier Serra

    Abstract: Self-attention is an attention mechanism that learns a representation by relating different positions in the sequence. The transformer, which is a sequence model solely based on self-attention, and its variants achieved state-of-the-art results in many natural language processing tasks. Since music composes its semantics based on the relations between components in sparse positions, adopting the s… ▽ More

    Submitted 12 June, 2019; originally announced June 2019.

    Comments: 13 pages, 12 figures; code: https://github.com/minzwon/self-attention-music-tagging

  25. arXiv:1812.08914  [pdf, other

    eess.AS cs.SD

    Multi-Domain Processing via Hybrid Denoising Networks for Speech Enhancement

    Authors: Jang-Hyun Kim, Jaejun Yoo, Sanghyuk Chun, Adrian Kim, Jung-Woo Ha

    Abstract: We present a hybrid framework that leverages the trade-off between temporal and frequency precision in audio representations to improve the performance of speech enhancement task. We first show that conventional approaches using specific representations such as raw-audio and spectrograms are each effective at targeting different types of noise. By integrating both approaches, our model can learn m… ▽ More

    Submitted 20 December, 2018; originally announced December 2018.

    Comments: 7pages, 6 figures, 2 tables