Skip to main content

Showing 1–3 of 3 results for author: Yang, D H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.00970  [pdf, other

    cs.CL cs.AI cs.LG

    SentenceKV: Efficient LLM Inference via Sentence-Level Semantic KV Caching

    Authors: Yuxuan Zhu, Ali Falahati, David H. Yang, Mohammad Mohammadi Amiri

    Abstract: Large language models face significant computational and memory challenges when processing long contexts. During inference, efficient management of the key-value (KV) cache, which stores intermediate activations for autoregressive generation, is critical to reducing memory overhead and improving computational efficiency. Traditional token-level efficient KV caching methods overlook semantic inform… ▽ More

    Submitted 1 April, 2025; originally announced April 2025.

  2. arXiv:2502.00311  [pdf, other

    cs.LG

    Sparse Gradient Compression for Fine-Tuning Large Language Models

    Authors: David H. Yang, Mohammad Mohammadi Amiri, Tejaswini Pedapati, Subhajit Chaudhury, Pin-Yu Chen

    Abstract: Fine-tuning large language models (LLMs) for downstream tasks has become increasingly crucial due to their widespread use and the growing availability of open-source models. However, the high memory costs associated with fine-tuning remain a significant challenge, especially as models increase in size. To address this, parameter efficient fine-tuning (PEFT) methods have been proposed to minimize t… ▽ More

    Submitted 31 January, 2025; originally announced February 2025.

  3. arXiv:1806.09748  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Cycle Consistent Adversarial Denoising Network for Multiphase Coronary CT Angiography

    Authors: Eunhee Kang, Hyun Jung Koo, Dong Hyun Yang, Joon Bum Seo, Jong Chul Ye

    Abstract: In coronary CT angiography, a series of CT images are taken at different levels of radiation dose during the examination. Although this reduces the total radiation dose, the image quality during the low-dose phases is significantly degraded. To address this problem, here we propose a novel semi-supervised learning technique that can remove the noises of the CT images obtained in the low-dose phase… ▽ More

    Submitted 7 November, 2018; v1 submitted 25 June, 2018; originally announced June 2018.

    Comments: This work is accepted in Medical Physics