Skip to main content

Showing 1–50 of 69 results for author: Son, H

Searching in archive cs. Search in all archives.
.
  1. Compensating Spatiotemporally Inconsistent Observations for Online Dynamic 3D Gaussian Splatting

    Authors: Youngsik Yun, Jeongmin Bae, Hyunseung Son, Seoha Kim, Hahyun Lee, Gun Bang, Youngjung Uh

    Abstract: Online reconstruction of dynamic scenes is significant as it enables learning scenes from live-streaming video inputs, while existing offline dynamic reconstruction methods rely on recorded video inputs. However, previous online reconstruction approaches have primarily focused on efficiency and rendering quality, overlooking the temporal consistency of their results, which often contain noticeable… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

    Comments: SIGGRAPH 2025, Project page: https://bbangsik13.github.io/OR2

  2. arXiv:2504.02812  [pdf, other

    cs.CV

    BOP Challenge 2024 on Model-Based and Model-Free 6D Object Pose Estimation

    Authors: Van Nguyen Nguyen, Stephen Tyree, Andrew Guo, Mederic Fourmy, Anas Gouda, Taeyeop Lee, Sungphill Moon, Hyeontae Son, Lukas Ranftl, Jonathan Tremblay, Eric Brachmann, Bertram Drost, Vincent Lepetit, Carsten Rother, Stan Birchfield, Jiri Matas, Yann Labbe, Martin Sundermeyer, Tomas Hodan

    Abstract: We present the evaluation methodology, datasets and results of the BOP Challenge 2024, the 6th in a series of public competitions organized to capture the state of the art in 6D object pose estimation and related tasks. In 2024, our goal was to transition BOP from lab-like setups to real-world scenarios. First, we introduced new model-free tasks, where no 3D object models are available and methods… ▽ More

    Submitted 23 April, 2025; v1 submitted 3 April, 2025; originally announced April 2025.

    Comments: arXiv admin note: text overlap with arXiv:2403.09799

  3. arXiv:2503.17731  [pdf, other

    cs.CV

    Co-op: Correspondence-based Novel Object Pose Estimation

    Authors: Sungphill Moon, Hyeontae Son, Dongcheol Hur, Sangwook Kim

    Abstract: We propose Co-op, a novel method for accurately and robustly estimating the 6DoF pose of objects unseen during training from a single RGB image. Our method requires only the CAD model of the target object and can precisely estimate its pose without any additional fine-tuning. While existing model-based methods suffer from inefficiency due to using a large number of templates, our method enables fa… ▽ More

    Submitted 22 March, 2025; originally announced March 2025.

    Comments: Accepted at CVPR 2025

  4. arXiv:2503.10695  [pdf, other

    cs.LG cs.AI cs.CL

    Introducing Verification Task of Set Consistency with Set-Consistency Energy Networks

    Authors: Mooho Song, Hyeryung Son, Jay-Yoon Lee

    Abstract: Examining logical inconsistencies among multiple statements (such as collections of sentences or question-answer pairs) is a crucial challenge in machine learning, particularly for ensuring the safety and reliability of models. Traditional methods that rely on pairwise comparisons often fail to capture inconsistencies that only emerge when more than two statements are evaluated collectively. To ad… ▽ More

    Submitted 19 March, 2025; v1 submitted 12 March, 2025; originally announced March 2025.

  5. arXiv:2503.08092  [pdf, other

    cs.CV

    SparseVoxFormer: Sparse Voxel-based Transformer for Multi-modal 3D Object Detection

    Authors: Hyeongseok Son, Jia He, Seung-In Park, Ying Min, Yunhao Zhang, ByungIn Yoo

    Abstract: Most previous 3D object detection methods that leverage the multi-modality of LiDAR and cameras utilize the Bird's Eye View (BEV) space for intermediate feature representation. However, this space uses a low x, y-resolution and sacrifices z-axis information to reduce the overall feature resolution, which may result in declined accuracy. To tackle the problem of using low-resolution features, this… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

  6. arXiv:2501.09395  [pdf, other

    cs.LG cs.AI math.NA

    ELM-DeepONets: Backpropagation-Free Training of Deep Operator Networks via Extreme Learning Machines

    Authors: Hwijae Son

    Abstract: Deep Operator Networks (DeepONets) are among the most prominent frameworks for operator learning, grounded in the universal approximation theorem for operators. However, training DeepONets typically requires significant computational resources. To address this limitation, we propose ELM-DeepONets, an Extreme Learning Machine (ELM) framework for DeepONets that leverages the backpropagation-free nat… ▽ More

    Submitted 16 January, 2025; originally announced January 2025.

  7. arXiv:2412.18571  [pdf, other

    quant-ph cs.DM cs.LG

    Scalable Quantum-Inspired Optimization through Dynamic Qubit Compression

    Authors: Co Tran, Quoc-Bao Tran, Hy Truong Son, Thang N Dinh

    Abstract: Hard combinatorial optimization problems, often mapped to Ising models, promise potential solutions with quantum advantage but are constrained by limited qubit counts in near-term devices. We present an innovative quantum-inspired framework that dynamically compresses large Ising models to fit available quantum hardware of different sizes. Thus, we aim to bridge the gap between large-scale optimiz… ▽ More

    Submitted 24 December, 2024; originally announced December 2024.

    Comments: Accepted to AAAI'25

  8. arXiv:2412.03587  [pdf, other

    cs.CL cs.AI cs.LG

    Not All Adapters Matter: Selective Adapter Freezing for Memory-Efficient Fine-Tuning of Language Models

    Authors: Hyegang Son, Yonglak Son, Changhoon Kim, Young Geun Kim

    Abstract: Transformer-based large-scale pre-trained models achieve great success. Fine-tuning is the standard practice for leveraging these models in downstream tasks. Among the fine-tuning methods, adapter-tuning provides a parameter-efficient fine-tuning by introducing lightweight trainable modules while keeping most pre-trained parameters frozen. However, existing adapter-tuning methods still impose subs… ▽ More

    Submitted 15 May, 2025; v1 submitted 26 November, 2024; originally announced December 2024.

    Comments: URL: https://aclanthology.org/2025.naacl-long.480/ Volume: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers) Year: 2025 Address: Albuquerque, New Mexico

  9. arXiv:2412.03161  [pdf, other

    math.NA cs.AI

    Physics-Informed Deep Inverse Operator Networks for Solving PDE Inverse Problems

    Authors: Sung Woong Cho, Hwijae Son

    Abstract: Inverse problems involving partial differential equations (PDEs) can be seen as discovering a mapping from measurement data to unknown quantities, often framed within an operator learning approach. However, existing methods typically rely on large amounts of labeled training data, which is impractical for most real-world applications. Moreover, these supervised models may fail to capture the under… ▽ More

    Submitted 7 February, 2025; v1 submitted 4 December, 2024; originally announced December 2024.

    MSC Class: 65M32; 68T99 ACM Class: G.1.8; G.1.10

  10. arXiv:2411.12525  [pdf, other

    cs.CV cs.AI

    Rethinking Top Probability from Multi-view for Distracted Driver Behaviour Localization

    Authors: Quang Vinh Nguyen, Vo Hoang Thanh Son, Chau Truong Vinh Hoang, Duc Duy Nguyen, Nhat Huy Nguyen Minh, Soo-Hyung Kim

    Abstract: Naturalistic driving action localization task aims to recognize and comprehend human behaviors and actions from video data captured during real-world driving scenarios. Previous studies have shown great action localization performance by applying a recognition model followed by probability-based post-processing. Nevertheless, the probabilities provided by the recognition model frequently contain c… ▽ More

    Submitted 19 November, 2024; originally announced November 2024.

    Comments: Computer Vision and Pattern Recognition Workshop 2024

  11. arXiv:2410.20110  [pdf

    eess.SP cs.LG

    ISDNN: A Deep Neural Network for Channel Estimation in Massive MIMO systems

    Authors: Do Hai Son, Vu Tung Lam, Tran Thi Thuy Quynh

    Abstract: Massive Multiple-Input Multiple-Output (massive MIMO) technology stands as a cornerstone in 5G and beyonds. Despite the remarkable advancements offered by massive MIMO technology, the extreme number of antennas introduces challenges during the channel estimation (CE) phase. In this paper, we propose a single-step Deep Neural Network (DNN) for CE, termed Iterative Sequential DNN (ISDNN), inspired b… ▽ More

    Submitted 26 October, 2024; originally announced October 2024.

  12. arXiv:2409.14679  [pdf, other

    cs.CV cs.AI cs.RO

    Quantifying Context Bias in Domain Adaptation for Object Detection

    Authors: Hojun Son, Arpan Kusari

    Abstract: Domain adaptation for object detection (DAOD) aims to transfer a trained model from a source to a target domain. Various DAOD methods exist, some of which minimize context bias between foreground-background associations in various domains. However, no prior work has studied context bias in DAOD by analyzing changes in background features during adaptation and how context bias is represented in dif… ▽ More

    Submitted 22 September, 2024; originally announced September 2024.

    Comments: Under review

  13. arXiv:2408.12894  [pdf, other

    cs.CV

    FLoD: Integrating Flexible Level of Detail into 3D Gaussian Splatting for Customizable Rendering

    Authors: Yunji Seo, Young Sun Choi, Hyun Seung Son, Youngjung Uh

    Abstract: 3D Gaussian Splatting (3DGS) achieves fast and high-quality renderings by using numerous small Gaussians, which leads to significant memory consumption. This reliance on a large number of Gaussians restricts the application of 3DGS-based models on low-cost devices due to memory limitations. However, simply reducing the number of Gaussians to accommodate devices with less memory capacity leads to i… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

    Comments: Project page: https://3dgs-flod.github.io/flod.github.io/

  14. arXiv:2407.15603  [pdf, other

    cs.CR

    Semi-Supervised Learning for Anomaly Detection in Blockchain-based Supply Chains

    Authors: Do Hai Son, Bui Duc Manh, Tran Viet Khoa, Nguyen Linh Trung, Dinh Thai Hoang, Hoang Trong Minh, Yibeltal Alem, Le Quang Minh

    Abstract: Blockchain-based supply chain (BSC) systems have tremendously been developed recently and can play an important role in our society in the future. In this study, we develop an anomaly detection model for BSC systems. Our proposed model can detect cyber-attacks at various levels, including the network layer, consensus layer, and beyond, by analyzing only the traffic data at the network layer. To do… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  15. Real-time Cyberattack Detection with Collaborative Learning for Blockchain Networks

    Authors: Tran Viet Khoa, Do Hai Son, Dinh Thai Hoang, Nguyen Linh Trung, Tran Thi Thuy Quynh, Diep N. Nguyen, Nguyen Viet Ha, Eryk Dutkiewicz

    Abstract: With the ever-increasing popularity of blockchain applications, securing blockchain networks plays a critical role in these cyber systems. In this paper, we first study cyberattacks (e.g., flooding of transactions, brute pass) in blockchain networks and then propose an efficient collaborative cyberattack detection model to protect blockchain networks. Specifically, we deploy a blockchain network i… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  16. arXiv:2407.00740  [pdf, other

    cs.CL cs.LG

    Locate&Edit: Energy-based Text Editing for Efficient, Flexible, and Faithful Controlled Text Generation

    Authors: Hye Ryung Son, Jay-Yoon Lee

    Abstract: Recent approaches to controlled text generation (CTG) often involve manipulating the weights or logits of base language models (LMs) at decoding time. However, these methods are inapplicable to latest black-box LMs and ineffective at preserving the core semantics of the base LM's original generations. In this work, we propose Locate&Edit(L&E), an efficient and flexible energy-based approach to CTG… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 18 pages, 2 figures

  17. arXiv:2406.12244  [pdf, other

    cs.SE

    W2E (Workout to Earn): A Low Cost DApp based on ERC-20 and ERC-721 standards

    Authors: Do Hai Son, Nguyen Danh Hao, Tran Thi Thuy Quynh, Le Quang Minh

    Abstract: Decentralized applications (DApps) have gained prominence with the advent of blockchain technology, particularly Ethereum, providing trust, transparency, and traceability. However, challenges such as rising transaction costs and block confirmation delays hinder their widespread adoption. In this paper, we present our DApp named W2E - Workout to Earn, a mobile DApp incentivizing exercise through to… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  18. arXiv:2406.00552  [pdf, other

    cs.LG cs.DC

    Graph Neural Network Training Systems: A Performance Comparison of Full-Graph and Mini-Batch

    Authors: Saurabh Bajaj, Hojae Son, Juelin Liu, Hui Guan, Marco Serafini

    Abstract: Graph Neural Networks (GNNs) have gained significant attention in recent years due to their ability to learn representations of graph-structured data. Two common methods for training GNNs are mini-batch training and full-graph training. Since these two methods require different training pipelines and systems optimizations, two separate classes of GNN training systems emerged, each tailored for one… ▽ More

    Submitted 20 December, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

    Comments: 12 pages, 9 Figures, 8 Tables, 1 appendix, Graph Neural Network, Graph Neural Networks, Full-graph training, Mini-batch training, full-batch training, distributed training, performance, epoch time, time to accuracy, accuracy

  19. arXiv:2405.02367  [pdf, other

    cs.LG cs.CV

    Enhancing Social Media Post Popularity Prediction with Visual Content

    Authors: Dahyun Jeong, Hyelim Son, Yunjin Choi, Keunwoo Kim

    Abstract: Our study presents a framework for predicting image-based social media content popularity that focuses on addressing complex image information and a hierarchical data structure. We utilize the Google Cloud Vision API to effectively extract key image and color information from users' postings, achieving 6.8% higher accuracy compared to using non-image covariates alone. For prediction, we explore a… ▽ More

    Submitted 8 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Report number: Report-no: JKSS-D-23-00299R1

  20. arXiv:2404.16065  [pdf, other

    cs.HC eess.SP

    mmWave Wearable Antenna for Interaction with VR Devices

    Authors: Haksun Son, Song Min Kim

    Abstract: The VR industry is one of the most promising industries for the near future, as it can provide a more immersive connection between people and the virtual world. Currently, VR devices interact with people using inconvenient controllers or cameras that perform poorly in dark environments. Interaction through millimeter-wave wearable devices has the potential to conveniently track human behavior rega… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  21. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  22. arXiv:2403.12892  [pdf, other

    cs.RO eess.SP

    Uoc luong kenh truyen trong he thong da robot su dung SDR

    Authors: Do Hai Son, Nguyen Huu Hung, Pham Duy Hung, Tran Thi Thuy Quynh

    Abstract: This study focuses on developing an experimental system for estimating communication channels in a multi-robot mobile system using software-defined radio (SDR) devices. The system consists of two mobile robots programmed for two scenarios: one where the robot remains stationary and another where it follows a predefined trajectory. Communication within the system is conducted through orthogonal fre… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: in Vietnamese language

  23. arXiv:2403.11510  [pdf, other

    cs.CV

    GenFlow: Generalizable Recurrent Flow for 6D Pose Refinement of Novel Objects

    Authors: Sungphill Moon, Hyeontae Son, Dongcheol Hur, Sangwook Kim

    Abstract: Despite the progress of learning-based methods for 6D object pose estimation, the trade-off between accuracy and scalability for novel objects still exists. Specifically, previous methods for novel objects do not make good use of the target object's 3D shape information since they focus on generalization by processing the shape indirectly, making them less effective. We present GenFlow, an approac… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  24. arXiv:2403.09541  [pdf, other

    cs.CR

    RANDAO-based RNG: Last Revealer Attacks in Ethereum 2.0 Randomness and a Potential Solution

    Authors: Do Hai Son, Tran Thi Thuy Quynh, Le Quang Minh

    Abstract: Ethereum 2.0 is a major upgrade to improve its scalability, throughput, and security. In this version, RANDAO is the scheme to randomly select the users who propose, confirm blocks, and get rewards. However, a vulnerability, referred to as the `Last Revealer Attack' (LRA), compromises the randomness of this scheme by introducing bias to the Random Number Generator (RNG) process. This vulnerability… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  25. arXiv:2403.01616  [pdf, other

    cs.CL

    Towards Comprehensive Vietnamese Retrieval-Augmented Generation and Large Language Models

    Authors: Nguyen Quang Duc, Le Hai Son, Nguyen Duc Nhan, Nguyen Dich Nhat Minh, Le Thanh Huong, Dinh Viet Sang

    Abstract: This paper presents our contributions towards advancing the state of Vietnamese language understanding and generation through the development and dissemination of open datasets and pre-trained models for Vietnamese Retrieval-Augmented Generation (RAG) and Large Language Models (LLMs).

    Submitted 5 March, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

  26. arXiv:2402.18372  [pdf, other

    cs.LG cs.AI cs.DC

    FedUV: Uniformity and Variance for Heterogeneous Federated Learning

    Authors: Ha Min Son, Moon-Hyun Kim, Tai-Myoung Chung, Chao Huang, Xin Liu

    Abstract: Federated learning is a promising framework to train neural networks with widely distributed data. However, performance degrades heavily with heterogeneously distributed data. Recent work has shown this is due to the final layer of the network being most prone to local bias, some finding success freezing the final layer as an orthogonal classifier. We investigate the training dynamics of the class… ▽ More

    Submitted 1 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 11 pages, 4 figures, 5 tables, to appear at CVPR 2024

  27. arXiv:2312.00404  [pdf, other

    cs.LG cs.DB

    A Causality-Aware Pattern Mining Scheme for Group Activity Recognition in a Pervasive Sensor Space

    Authors: Hyunju Kim, Heesuk Son, Dongman Lee

    Abstract: Human activity recognition (HAR) is a key challenge in pervasive computing and its solutions have been presented based on various disciplines. Specifically, for HAR in a smart space without privacy and accessibility issues, data streams generated by deployed pervasive sensors are leveraged. In this paper, we focus on a group activity by which a group of users perform a collaborative task without u… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  28. arXiv:2310.02570  [pdf, other

    cs.SD eess.AS

    Improving severity preservation of healthy-to-pathological voice conversion with global style tokens

    Authors: Bence Mark Halpern, Wen-Chin Huang, Lester Phillip Violeta, R. J. J. H. van Son, Tomoki Toda

    Abstract: In healthy-to-pathological voice conversion (H2P-VC), healthy speech is converted into pathological while preserving the identity. The paper improves on previous two-stage approach to H2P-VC where (1) speech is created first with the appropriate severity, (2) then the speaker identity of the voice is converted while preserving the severity of the voice. Specifically, we propose improvements to (2)… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 7 pages, 3 figures, 5 tables. Accepted to IEEE Automatic Speech Recognition and Understanding Workshop 2023

    ACM Class: I.2.7

  29. arXiv:2309.08485  [pdf, other

    cs.CR cs.AI

    XFedHunter: An Explainable Federated Learning Framework for Advanced Persistent Threat Detection in SDN

    Authors: Huynh Thai Thi, Ngo Duc Hoang Son, Phan The Duy, Nghi Hoang Khoa, Khoa Ngo-Khanh, Van-Hau Pham

    Abstract: Advanced Persistent Threat (APT) attacks are highly sophisticated and employ a multitude of advanced methods and techniques to target organizations and steal sensitive and confidential information. APT attacks consist of multiple stages and have a defined strategy, utilizing new and innovative techniques and technologies developed by hackers to evade security software monitoring. To effectively pr… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

  30. arXiv:2308.15804  [pdf, other

    cs.CR cs.DC

    Collaborative Learning Framework to Detect Attacks in Transactions and Smart Contracts

    Authors: Tran Viet Khoa, Do Hai Son, Chi-Hieu Nguyen, Dinh Thai Hoang, Diep N. Nguyen, Tran Thi Thuy Quynh, Trong-Minh Hoang, Nguyen Viet Ha, Eryk Dutkiewicz, Abu Alsheikh, Nguyen Linh Trung

    Abstract: With the escalating prevalence of malicious activities exploiting vulnerabilities in blockchain systems, there is an urgent requirement for robust attack detection mechanisms. To address this challenge, this paper presents a novel collaborative learning framework designed to detect attacks in blockchain transactions and smart contracts by analyzing transaction features. Our framework exhibits the… ▽ More

    Submitted 10 August, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

  31. Impact Analysis of Antenna Array Geometry on Performance of Semi-blind Structured Channel Estimation for massive MIMO-OFDM systems

    Authors: Do Hai Son, Tran Thi Thuy Quynh

    Abstract: Channel estimation is always implemented in communication systems to overcome the effect of interference and noise. Especially, in wireless communications, this task is more challenging to improve system performance while saving resources. This paper focuses on investigating the impact of geometries of antenna arrays on the performance of structured channel estimation in massive MIMO-OFDM systems.… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  32. arXiv:2303.06800  [pdf, other

    cs.CV

    Object-Centric Multi-Task Learning for Human Instances

    Authors: Hyeongseok Son, Sangil Jung, Solae Lee, Seongeun Kim, Seung-In Park, ByungIn Yoo

    Abstract: Human is one of the most essential classes in visual recognition tasks such as detection, segmentation, and pose estimation. Although much effort has been put into individual tasks, multi-task learning for these three tasks has been rarely studied. In this paper, we explore a compact multi-task network architecture that maximally shares the parameters of the multiple tasks via object-centric learn… ▽ More

    Submitted 12 March, 2023; originally announced March 2023.

  33. InFusionSurf: Refining Neural RGB-D Surface Reconstruction Using Per-Frame Intrinsic Refinement and TSDF Fusion Prior Learning

    Authors: Seunghwan Lee, Gwanmo Park, Hyewon Son, Jiwon Ryu, Han Joo Chae

    Abstract: We introduce InFusionSurf, an innovative enhancement for neural radiance field (NeRF) frameworks in 3D surface reconstruction using RGB-D video frames. Building upon previous methods that have employed feature encoding to improve optimization speed, we further improve the reconstruction quality with minimal impact on optimization time by refining depth information. InFusionSurf addresses camera mo… ▽ More

    Submitted 6 October, 2024; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: ICME'24 (Oral), Project page: https://rokit-healthcare.github.io/InFusionSurf/

  34. arXiv:2212.01976  [pdf, other

    cs.CR cs.AI

    FedCC: Robust Federated Learning against Model Poisoning Attacks

    Authors: Hyejun Jeong, Hamin Son, Seohu Lee, Jayun Hyun, Tai-Myoung Chung

    Abstract: Federated learning is a distributed framework designed to address privacy concerns. However, it introduces new attack surfaces, which are especially prone when data is non-Independently and Identically Distributed. Existing approaches fail to effectively mitigate the malicious influence in this setting; previous approaches often tackle non-IID data and poisoning attacks separately. To address both… ▽ More

    Submitted 19 February, 2025; v1 submitted 4 December, 2022; originally announced December 2022.

  35. arXiv:2209.06521  [pdf

    physics.med-ph cs.CL

    Preregistered protocol for: Articulatory changes in speech following treatment for oral or oropharyngeal cancer: a systematic review

    Authors: Thomas B. Tienkamp, Teja Rebernik, Defne Abur, Rob J. J. H. van Son, Sebastiaan A. H. J. de Visscher, Max J. H. Witjes, Martijn Wieling

    Abstract: This document outlines a PROSPERO pre-registered protocol for a systematic review regarding articulatory changes in speech following oral or orophayrngeal cancer treatment. Treatment of tumours in the oral cavity may result in physiological changes that could lead to articulatory difficulties. The tongue becomes less mobile due to scar tissue and/or potential (postoperative) radiation therapy. Mor… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

  36. arXiv:2205.13434  [pdf, other

    cs.CL cs.AI

    Jointly Learning Span Extraction and Sequence Labeling for Information Extraction from Business Documents

    Authors: Nguyen Hong Son, Hieu M. Vu, Tuan-Anh D. Nguyen, Minh-Tien Nguyen

    Abstract: This paper introduces a new information extraction model for business documents. Different from prior studies which only base on span extraction or sequence labeling, the model takes into account advantage of both span extraction and sequence labeling. The combination allows the model to deal with long documents with sparse information (the small amount of extracted information). The model is trai… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: Accepted to IJCNN 2022

  37. Real-Time Video Deblurring via Lightweight Motion Compensation

    Authors: Hyeongseok Son, Junyong Lee, Sunghyun Cho, Seungyong Lee

    Abstract: While motion compensation greatly improves video deblurring quality, separately performing motion compensation and video deblurring demands huge computational overhead. This paper proposes a real-time video deblurring framework consisting of a lightweight multi-task unit that supports both video deblurring and motion compensation in an efficient way. The multi-task unit is specifically designed to… ▽ More

    Submitted 13 September, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Computer Graphics Forum (special issue on Pacific Graphics 2022), 2022; Equal contribution from the first two authors

    Journal ref: Computer Graphics Forum (special issue on PG 2022), Vol. 41, No. 7, 2022

  38. arXiv:2205.01059  [pdf, other

    cs.LG cs.AI math.NA math.OC

    Enhanced Physics-Informed Neural Networks with Augmented Lagrangian Relaxation Method (AL-PINNs)

    Authors: Hwijae Son, Sung Woong Cho, Hyung Ju Hwang

    Abstract: Physics-Informed Neural Networks (PINNs) have become a prominent application of deep learning in scientific computation, as they are powerful approximators of solutions to nonlinear partial differential equations (PDEs). There have been numerous attempts to facilitate the training process of PINNs by adjusting the weight of each component of the loss function, called adaptive loss-balancing algori… ▽ More

    Submitted 30 May, 2023; v1 submitted 29 April, 2022; originally announced May 2022.

  39. arXiv:2204.03958  [pdf, other

    cs.CL cs.AI

    Enhance Incomplete Utterance Restoration by Joint Learning Token Extraction and Text Generation

    Authors: Shumpei Inoue, Tsungwei Liu, Nguyen Hong Son, Minh-Tien Nguyen

    Abstract: This paper introduces a model for incomplete utterance restoration (IUR) called JET (\textbf{J}oint learning token \textbf{E}xtraction and \textbf{T}ext generation). Different from prior studies that only work on extraction or abstraction datasets, we design a simple but effective model, working for both scenarios of IUR. Our design simulates the nature of IUR, where omitted tokens from the contex… ▽ More

    Submitted 28 July, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: This paper was accepted by 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2022). It includes 10 pages, 2 figures

  40. Teaching for large-scale Reproducibility Verification

    Authors: Lars Vilhuber, Hyuk Harry Son, Meredith Welch, David N. Wasser, Michael Darisse

    Abstract: We describe a unique environment in which undergraduate students from various STEM and social science disciplines are trained in data provenance and reproducible methods, and then apply that knowledge to real, conditionally accepted manuscripts and associated replication packages. We describe in detail the recruitment, training, and regular activities. While the activity is not part of a regular c… ▽ More

    Submitted 31 March, 2022; originally announced April 2022.

  41. An Effective Framework of Private Ethereum Blockchain Networks for Smart Grid

    Authors: Do Hai Son, Tran Thi Thuy Quynh, Tran Viet Khoa, Dinh Thai Hoang, Nguyen Linh Trung, Nguyen Viet Ha, Dusit Niyato, Nguyen N. Diep, Eryk Dutkiewicz

    Abstract: A smart grid is an important application in Industry 4.0 with a lot of new technologies and equipment working together. Hence, sensitive data stored in the smart grid is vulnerable to malicious modification and theft. This paper proposes a framework to build a smart grid based on a highly effective private Ethereum network. Our framework provides a real smart grid that includes modern hardware and… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: 6 pages, conference

  42. Collaborative Learning for Cyberattack Detection in Blockchain Networks

    Authors: Tran Viet Khoa, Do Hai Son, Dinh Thai Hoang, Nguyen Linh Trung, Tran Thi Thuy Quynh, Diep N. Nguyen, Nguyen Viet Ha, Eryk Dutkiewicz

    Abstract: This article aims to study intrusion attacks and then develop a novel cyberattack detection framework to detect cyberattacks at the network layer (e.g., Brute Password and Flooding of Transactions) of blockchain networks. Specifically, we first design and implement a blockchain network in our laboratory. This blockchain network will serve two purposes, i.e., to generate the real traffic data (incl… ▽ More

    Submitted 6 May, 2024; v1 submitted 21 March, 2022; originally announced March 2022.

    Journal ref: IEEE Transactions on Systems, Man, and Cybernetics: Systems (2024)

  43. arXiv:2203.03814  [pdf, other

    eess.IV cs.CV cs.HC cs.LG

    Generating 3D Bio-Printable Patches Using Wound Segmentation and Reconstruction to Treat Diabetic Foot Ulcers

    Authors: Han Joo Chae, Seunghwan Lee, Hyewon Son, Seungyeob Han, Taebin Lim

    Abstract: We introduce AiD Regen, a novel system that generates 3D wound models combining 2D semantic segmentation with 3D reconstruction so that they can be printed via 3D bio-printers during the surgery to treat diabetic foot ulcers (DFUs). AiD Regen seamlessly binds the full pipeline, which includes RGB-D image capturing, semantic segmentation, boundary-guided point-cloud processing, 3D model reconstruct… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022

  44. arXiv:2202.07291  [pdf, other

    cs.CV

    Exploring Discontinuity for Video Frame Interpolation

    Authors: Sangjin Lee, Hyeongmin Lee, Chajin Shin, Hanbin Son, Sangyoun Lee

    Abstract: Video frame interpolation (VFI) is the task that synthesizes the intermediate frame given two consecutive frames. Most of the previous studies have focused on appropriate frame warping operations and refinement modules for the warped frames. These studies have been conducted on natural videos containing only continuous motions. However, many practical videos contain various unnatural objects with… ▽ More

    Submitted 23 March, 2023; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: highlight at CVPR 2023 (10% of accepted papers)

  45. arXiv:2112.00407  [pdf, other

    cs.LG cs.AI cs.DC

    Compare Where It Matters: Using Layer-Wise Regularization To Improve Federated Learning on Heterogeneous Data

    Authors: Ha Min Son, Moon Hyun Kim, Tai-Myoung Chung

    Abstract: Federated Learning is a widely adopted method to train neural networks over distributed data. One main limitation is the performance degradation that occurs when data is heterogeneously distributed. While many works have attempted to address this problem, these methods under-perform because they are founded on a limited understanding of neural networks. In this work, we verify that only certain im… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

    Comments: 8 pages, 5 figures, 4 tables

  46. Iterative Filter Adaptive Network for Single Image Defocus Deblurring

    Authors: Junyong Lee, Hyeongseok Son, Jaesung Rim, Sunghyun Cho, Seungyong Lee

    Abstract: We propose a novel end-to-end learning-based approach for single image defocus deblurring. The proposed approach is equipped with a novel Iterative Filter Adaptive Network (IFAN) that is specifically designed to handle spatially-varying and large defocus blur. For adaptively handling spatially-varying blur, IFAN predicts pixel-wise deblurring filters, which are applied to defocused features of an… ▽ More

    Submitted 28 March, 2022; v1 submitted 31 August, 2021; originally announced August 2021.

    Comments: CVPR 2021

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 2034-2042

  47. Recurrent Video Deblurring with Blur-Invariant Motion Estimation and Pixel Volumes

    Authors: Hyeongseok Son, Junyong Lee, Jonghyeop Lee, Sunghyun Cho, Seungyong Lee

    Abstract: For the success of video deblurring, it is essential to utilize information from neighboring frames. Most state-of-the-art video deblurring methods adopt motion compensation between video frames to aggregate information from multiple frames that can help deblur a target frame. However, the motion compensation methods adopted by previous deblurring methods are not blur-invariant, and consequently,… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: 17 pages, Camera-ready version for ACM Transactions on Graphics (TOG) 2021

    Journal ref: ACM Transactions on Graphics, Vol. 40, No. 5, Article 185, 2021

  48. arXiv:2108.09108  [pdf, other

    cs.CV

    Single Image Defocus Deblurring Using Kernel-Sharing Parallel Atrous Convolutions

    Authors: Hyeongseok Son, Junyong Lee, Sunghyun Cho, Seungyong Lee

    Abstract: This paper proposes a novel deep learning approach for single image defocus deblurring based on inverse kernels. In a defocused image, the blur shapes are similar among pixels although the blur sizes can spatially vary. To utilize the property with inverse kernels, we exploit the observation that when only the size of a defocus blur changes while keeping the shape, the shape of the corresponding i… ▽ More

    Submitted 20 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021

  49. arXiv:2108.01903  [pdf, other

    cs.LG cs.AI

    Personalized Federated Learning with Clustering: Non-IID Heart Rate Variability Data Application

    Authors: Joo Hun Yoo, Ha Min Son, Hyejun Jeong, Eun-Hye Jang, Ah Young Kim, Han Young Yu, Hong Jin Jeon, Tai-Myoung Chung

    Abstract: While machine learning techniques are being applied to various fields for their exceptional ability to find complex relations in large datasets, the strengthening of regulations on data ownership and privacy is causing increasing difficulty in its application to medical data. In light of this, Federated Learning has recently been proposed as a solution to train on private data without breach of co… ▽ More

    Submitted 10 August, 2021; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: 6 pages with two columns, 4 figures, 3 tables

  50. arXiv:2106.12147  [pdf, other

    math.NA cs.LG

    Lagrangian dual framework for conservative neural network solutions of kinetic equations

    Authors: Hyung Ju Hwang, Hwijae Son

    Abstract: In this paper, we propose a novel conservative formulation for solving kinetic equations via neural networks. More precisely, we formulate the learning problem as a constrained optimization problem with constraints that represent the physical conservation laws. The constraints are relaxed toward the residual loss function by the Lagrangian duality. By imposing physical conservation properties of t… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.