Skip to main content

Showing 1–26 of 26 results for author: Jo, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.12790  [pdf, ps, other

    cs.LG math.NA physics.comp-ph

    PDEfuncta: Spectrally-Aware Neural Representation for PDE Solution Modeling

    Authors: Minju Jo, Woojin Cho, Uvini Balasuriya Mudiyanselage, Seungjun Lee, Noseong Park, Kookjin Lee

    Abstract: Scientific machine learning often involves representing complex solution fields that exhibit high-frequency features such as sharp transitions, fine-scale oscillations, and localized structures. While implicit neural representations (INRs) have shown promise for continuous function modeling, capturing such high-frequency behavior remains a challenge-especially when modeling multiple solution field… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.

  2. arXiv:2506.09526  [pdf, ps, other

    cs.LG cs.AI

    Neural Functions for Learning Periodic Signal

    Authors: Woojin Cho, Minju Jo, Kookjin Lee, Noseong Park

    Abstract: As function approximators, deep neural networks have served as an effective tool to represent various signal types. Recent approaches utilize multi-layer perceptrons (MLPs) to learn a nonlinear mapping from a coordinate to its corresponding signal, facilitating the learning of continuous neural representations from discrete data points. Despite notable successes in learning diverse signal types, c… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

  3. arXiv:2503.21166  [pdf, other

    cs.LG

    Unveiling the Potential of Superexpressive Networks in Implicit Neural Representations

    Authors: Uvini Balasuriya Mudiyanselage, Woojin Cho, Minju Jo, Noseong Park, Kookjin Lee

    Abstract: In this study, we examine the potential of one of the ``superexpressive'' networks in the context of learning neural functions for representing complex signals and performing machine learning downstream tasks. Our focus is on evaluating their performance on computer vision and scientific machine learning tasks including signal representation/inverse problems and solutions of partial differential e… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

    Comments: Accepted at ICLR 2025 Workshop on Neural Network Weights as a New Data Modality

  4. arXiv:2501.03879  [pdf, other

    cs.CV cs.AI

    CL3DOR: Contrastive Learning for 3D Large Multimodal Models via Odds Ratio on High-Resolution Point Clouds

    Authors: Keonwoo Kim, Yeongjae Cho, Taebaek Hwang, Minsoo Jo, Sangdo Han

    Abstract: Recent research has demonstrated that Large Language Models (LLMs) are not limited to text-only tasks but can also function as multimodal models across various modalities, including audio, images, and videos. In particular, research on 3D Large Multimodal Models (3D LMMs) is making notable strides, driven by the potential of processing higher-dimensional data like point clouds. However, upon close… ▽ More

    Submitted 7 January, 2025; originally announced January 2025.

  5. arXiv:2411.10945  [pdf, other

    cs.CV

    Anomaly Detection for People with Visual Impairments Using an Egocentric 360-Degree Camera

    Authors: Inpyo Song, Sanghyeon Lee, Minjun Joo, Jangwon Lee

    Abstract: Recent advancements in computer vision have led to a renewed interest in developing assistive technologies for individuals with visual impairments. Although extensive research has been conducted in the field of computer vision-based assistive technologies, most of the focus has been on understanding contexts in images, rather than addressing their physical safety and security concerns. To address… ▽ More

    Submitted 16 November, 2024; originally announced November 2024.

    Comments: WACV2025

  6. arXiv:2410.09350  [pdf, other

    cs.CL cs.AI

    Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation

    Authors: Jinyoung Park, Minseok Joo, Joo-Kyung Kim, Hyunwoo J. Kim

    Abstract: Knowledge graph-grounded dialog generation requires retrieving a dialog-relevant subgraph from the given knowledge base graph and integrating it with the dialog history. Previous works typically represent the graph using an external encoder, such as graph neural networks, and retrieve relevant triplets based on the similarity between single-vector representations of triplets and the dialog history… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: EMNLP (main)

  7. arXiv:2410.01273  [pdf, other

    cs.RO cs.CV cs.LG

    CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction

    Authors: Suhwan Choi, Yongjun Cho, Minchan Kim, Jaeyoon Jung, Myunchul Joe, Yubeen Park, Minseo Kim, Sungwoong Kim, Sungjae Lee, Hwiseong Park, Jiwan Chung, Youngjae Yu

    Abstract: Real-life robot navigation involves more than just reaching a destination; it requires optimizing movements while addressing scenario-specific goals. An intuitive way for humans to express these goals is through abstract cues like verbal commands or rough sketches. Such human guidance may lack details or be noisy. Nonetheless, we expect robots to navigate as intended. For robots to interpret and e… ▽ More

    Submitted 18 March, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

    Comments: Accepted to ICRA 2025, project page https://worv-ai.github.io/canvas

  8. arXiv:2408.09446  [pdf, other

    cs.LG math.NA physics.comp-ph

    Parameterized Physics-informed Neural Networks for Parameterized PDEs

    Authors: Woojin Cho, Minju Jo, Haksoo Lim, Kookjin Lee, Dongeun Lee, Sanghyun Hong, Noseong Park

    Abstract: Complex physical systems are often described by partial differential equations (PDEs) that depend on parameters such as the Reynolds number in fluid mechanics. In applications such as design optimization or uncertainty quantification, solutions of those PDEs need to be evaluated at numerous points in the parameter space. While physics-informed neural networks (PINNs) have emerged as a new strong c… ▽ More

    Submitted 18 August, 2024; originally announced August 2024.

  9. arXiv:2407.19156  [pdf, other

    cs.CV

    Robust Multimodal 3D Object Detection via Modality-Agnostic Decoding and Proximity-based Modality Ensemble

    Authors: Juhan Cha, Minseok Joo, Jihwan Park, Sanghyeok Lee, Injae Kim, Hyunwoo J. Kim

    Abstract: Recent advancements in 3D object detection have benefited from multi-modal information from the multi-view cameras and LiDAR sensors. However, the inherent disparities between the modalities pose substantial challenges. We observe that existing multi-modal 3D object detection methods heavily rely on the LiDAR sensor, treating the camera as an auxiliary modality for augmenting semantic details. Thi… ▽ More

    Submitted 19 August, 2024; v1 submitted 26 July, 2024; originally announced July 2024.

  10. Is GPT-4 Alone Sufficient for Automated Essay Scoring?: A Comparative Judgment Approach Based on Rater Cognition

    Authors: Seungju Kim, Meounggun Jo

    Abstract: Large Language Models (LLMs) have shown promise in Automated Essay Scoring (AES), but their zero-shot and few-shot performance often falls short compared to state-of-the-art models and human raters. However, fine-tuning LLMs for each specific task is impractical due to the variety of essay prompts and rubrics used in real-world educational contexts. This study proposes a novel approach combining L… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 16 pages, 3 figures, Learning @ Scale 2024

  11. arXiv:2405.19794  [pdf, other

    cs.CV

    Video Question Answering for People with Visual Impairments Using an Egocentric 360-Degree Camera

    Authors: Inpyo Song, Minjun Joo, Joonhyung Kwon, Jangwon Lee

    Abstract: This paper addresses the daily challenges encountered by visually impaired individuals, such as limited access to information, navigation difficulties, and barriers to social interaction. To alleviate these challenges, we introduce a novel visual question answering dataset. Our dataset offers two significant advancements over previous datasets: Firstly, it features videos captured using a 360-degr… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: CVPR2024 EgoVis Workshop

  12. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  13. arXiv:2402.17812  [pdf, other

    cs.LG cs.CL

    DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation

    Authors: Sunghyeon Woo, Baeseong Park, Byeongwook Kim, Minjung Jo, Se Jung Kwon, Dongsuk Jeon, Dongsoo Lee

    Abstract: Large language models (LLMs) have achieved significant success across various domains. However, training these LLMs typically involves substantial memory and computational costs during both forward and backward propagation. While parameter-efficient fine-tuning (PEFT) considerably reduces the training memory associated with parameters, it does not address the significant computational costs and ac… ▽ More

    Submitted 28 February, 2025; v1 submitted 27 February, 2024; originally announced February 2024.

  14. arXiv:2308.11916  [pdf, other

    cs.CV

    Semantic-Aware Implicit Template Learning via Part Deformation Consistency

    Authors: Sihyeon Kim, Minseok Joo, Jaewon Lee, Juyeon Ko, Juhan Cha, Hyunwoo J. Kim

    Abstract: Learning implicit templates as neural fields has recently shown impressive performance in unsupervised shape correspondence. Despite the success, we observe current approaches, which solely rely on geometric information, often learn suboptimal deformation across generic object shapes, which have high structural variability. In this paper, we highlight the importance of part deformation consistency… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: ICCV camera-ready version

  15. arXiv:2305.07031  [pdf, other

    cs.LG cs.AI

    Hawkes Process Based on Controlled Differential Equations

    Authors: Minju Jo, Seungji Kook, Noseong Park

    Abstract: Hawkes processes are a popular framework to model the occurrence of sequential events, i.e., occurrence dynamics, in several fields such as social diffusion. In real-world scenarios, the inter-arrival time among events is irregular. However, existing neural network-based Hawkes process models not only i) fail to capture such complicated irregular dynamics, but also ii) resort to heuristics to calc… ▽ More

    Submitted 18 May, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

  16. arXiv:2301.04333  [pdf, other

    cs.LG cs.AI

    Learnable Path in Neural Controlled Differential Equations

    Authors: Sheo Yon Jhin, Minju Jo, Seungji Kook, Noseong Park, Sungpil Woo, Sunhwan Lim

    Abstract: Neural controlled differential equations (NCDEs), which are continuous analogues to recurrent neural networks (RNNs), are a specialized model in (irregular) time-series processing. In comparison with similar models, e.g., neural ordinary differential equations (NODEs), the key distinctive characteristics of NCDEs are i) the adoption of the continuous path created by an interpolation algorithm from… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: Accepted by AAAI 2023

  17. arXiv:2211.04266  [pdf, ps, other

    cs.IR cs.AI cs.LG

    TimeKit: A Time-series Forecasting-based Upgrade Kit for Collaborative Filtering

    Authors: Seoyoung Hong, Minju Jo, Seungji Kook, Jaeeun Jung, Hyowon Wi, Noseong Park, Sung-Bae Cho

    Abstract: Recommender systems are a long-standing research problem in data mining and machine learning. They are incremental in nature, as new user-item interaction logs arrive. In real-world applications, we need to periodically train a collaborative filtering algorithm to extract user/item embedding vectors and therefore, a time-series of embedding vectors can be naturally defined. We present a time-serie… ▽ More

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: Accepted at IEEE BigData 2022

  18. arXiv:2204.08781  [pdf, other

    cs.LG

    LORD: Lower-Dimensional Embedding of Log-Signature in Neural Rough Differential Equations

    Authors: Jaehoon Lee, Jinsung Jeon, Sheo yon Jhin, Jihyeon Hyeong, Jayoung Kim, Minju Jo, Kook Seungji, Noseong Park

    Abstract: The problem of processing very long time-series data (e.g., a length of more than 10,000) is a long-standing research problem in machine learning. Recently, one breakthrough, called neural rough differential equations (NRDEs), has been proposed and has shown that it is able to process such data. Their main concept is to use the log-signature transform, which is known to be more efficient than the… ▽ More

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: main 9 pages

  19. arXiv:2204.08771  [pdf, other

    cs.LG

    EXIT: Extrapolation and Interpolation-based Neural Controlled Differential Equations for Time-series Classification and Forecasting

    Authors: Sheo Yon Jhin, Jaehoon Lee, Minju Jo, Seungji Kook, Jinsung Jeon, Jihyeon Hyeong, Jayoung Kim, Noseong Park

    Abstract: Deep learning inspired by differential equations is a recent research trend and has marked the state of the art performance for many machine learning tasks. Among them, time-series modeling with neural controlled differential equations (NCDEs) is considered as a breakthrough. In many cases, NCDE-based models not only provide better accuracy than recurrent neural networks (RNNs) but also make it po… ▽ More

    Submitted 21 September, 2022; v1 submitted 19 April, 2022; originally announced April 2022.

    Comments: main 8 pages

  20. arXiv:2108.04993  [pdf, other

    cs.LG

    LightMove: A Lightweight Next-POI Recommendation for Taxicab Rooftop Advertising

    Authors: Jinsung Jeon, Soyoung Kang, Minju Jo, Seunghyeon Cho, Noseong Park, Seonghoon Kim, Chiyoung Song

    Abstract: Mobile digital billboards are an effective way to augment brand-awareness. Among various such mobile billboards, taxicab rooftop devices are emerging in the market as a brand new media. Motov is a leading company in South Korea in the taxicab rooftop advertising market. In this work, we present a lightweight yet accurate deep learning-based method to predict taxicabs' next locations to better prep… ▽ More

    Submitted 18 August, 2021; v1 submitted 10 August, 2021; originally announced August 2021.

    Comments: Accepted in CIKM 2021

  21. arXiv:2105.14953  [pdf, other

    cs.LG

    ACE-NODE: Attentive Co-Evolving Neural Ordinary Differential Equations

    Authors: Sheo Yon Jhin, Minju Jo, Taeyong Kong, Jinsung Jeon, Noseong Park

    Abstract: Neural ordinary differential equations (NODEs) presented a new paradigm to construct (continuous-time) neural networks. While showing several good characteristics in terms of the number of parameters and the flexibility in constructing neural networks, they also have a couple of well-known limitations: i) theoretically NODEs learn homeomorphic mapping functions only, and ii) sometimes NODEs show n… ▽ More

    Submitted 31 May, 2021; originally announced May 2021.

    Comments: Accepted by KDD 2021

  22. arXiv:2104.00931  [pdf, other

    eess.AS cs.LG cs.SD

    Assem-VC: Realistic Voice Conversion by Assembling Modern Speech Synthesis Techniques

    Authors: Kang-wook Kim, Seung-won Park, Junhyeok Lee, Myun-chul Joe

    Abstract: Recent works on voice conversion (VC) focus on preserving the rhythm and the intonation as well as the linguistic content. To preserve these features from the source, we decompose current non-parallel VC systems into two encoders and one decoder. We analyze each module with several experiments and reassemble the best components to propose Assem-VC, a new state-of-the-art any-to-many non-parallel V… ▽ More

    Submitted 11 October, 2021; v1 submitted 2 April, 2021; originally announced April 2021.

  23. arXiv:2005.03295  [pdf, other

    eess.AS cs.LG cs.SD

    Cotatron: Transcription-Guided Speech Encoder for Any-to-Many Voice Conversion without Parallel Data

    Authors: Seung-won Park, Doo-young Kim, Myun-chul Joe

    Abstract: We propose Cotatron, a transcription-guided speech encoder for speaker-independent linguistic representation. Cotatron is based on the multispeaker TTS architecture and can be trained with conventional TTS datasets. We train a voice conversion system to reconstruct speech with Cotatron features, which is similar to the previous methods based on Phonetic Posteriorgram (PPG). By training and evaluat… ▽ More

    Submitted 14 August, 2020; v1 submitted 7 May, 2020; originally announced May 2020.

    Comments: To appear in INTERSPEECH 2020

  24. arXiv:1811.11280  [pdf, ps, other

    cs.IT

    Improved upper bound on root number of linearized polynomials and its application to nonlinearity estimation of Boolean functions

    Authors: Sihem Mesnager, Kwang Ho Kim, Myong Song Jo

    Abstract: To determine the dimension of null space of any given linearized polynomial is one of vital problems in finite field theory, with concern to design of modern symmetric cryptosystems. But, the known general theory for this task is much far from giving the exact dimension when applied to a specific linearized polynomial. The first contribution of this paper is to give a better general method to get… ▽ More

    Submitted 27 November, 2018; originally announced November 2018.

  25. arXiv:1704.07217  [pdf, ps, other

    cs.NI

    Multi-hop Links Quality Analysis of 5G Enabled Vehicular Networks

    Authors: Shikuan Li, Zipeng Li, Xiaohu Ge, Jing Zhang, Minho Jo

    Abstract: With the emerging of the fifth generation (5G) mobile communication systems, millimeter wave transmissions are believed to be a promising solution for vehicular networks, especially in vehicle to vehicle (V2V) communications. In millimeter wave V2V communications, different vehicular networking services have different quality requirements for V2V multi-hop links. To evaluate the quality of differe… ▽ More

    Submitted 1 August, 2017; v1 submitted 24 April, 2017; originally announced April 2017.

    Comments: 6 pages, 5 figures, reject by IEEE Globecom2017, re-submitted to WCSP2017

  26. arXiv:1604.04968  [pdf, ps, other

    cs.IT cs.NI

    Multi-user Massive MIMO Communication Systems Based on Irregular Antenna Arrays

    Authors: Xiaohu Ge, Ran Zi, Haichao Wang, Jing Zhang, Minho Jo

    Abstract: In practical mobile communication engineering applications, surfaces of antenna array deployment regions are usually uneven. Therefore, massive multi-input-multi-output (MIMO) communication systems usually transmit wireless signals by irregular antenna arrays. To evaluate the performance of irregular antenna arrays, the matrix correlation coefficient and ergodic received gain are defined for massi… ▽ More

    Submitted 17 April, 2016; originally announced April 2016.

    Comments: 15 pages, 8 figures