-
CHILL at SemEval-2025 Task 2: You Can't Just Throw Entities and Hope -- Make Your LLM to Get Them Right
Authors:
Jaebok Lee,
Yonghyun Ryu,
Seongmin Park,
Yoonjung Choi
Abstract:
In this paper, we describe our approach for the SemEval 2025 Task 2 on Entity-Aware Machine Translation (EA-MT). Our system aims to improve the accuracy of translating named entities by combining two key approaches: Retrieval Augmented Generation (RAG) and iterative self-refinement techniques using Large Language Models (LLMs). A distinctive feature of our system is its self-evaluation mechanism,…
▽ More
In this paper, we describe our approach for the SemEval 2025 Task 2 on Entity-Aware Machine Translation (EA-MT). Our system aims to improve the accuracy of translating named entities by combining two key approaches: Retrieval Augmented Generation (RAG) and iterative self-refinement techniques using Large Language Models (LLMs). A distinctive feature of our system is its self-evaluation mechanism, where the LLM assesses its own translations based on two key criteria: the accuracy of entity translations and overall translation quality. We demonstrate how these methods work together and effectively improve entity handling while maintaining high-quality translations.
△ Less
Submitted 15 June, 2025;
originally announced June 2025.
-
LensNet: Enhancing Real-time Microlensing Event Discovery with Recurrent Neural Networks in the Korea Microlensing Telescope Network
Authors:
Javier Viaña,
Kyu-Ha Hwang,
Zoë de Beurs,
Jennifer C. Yee,
Andrew Vanderburg,
Michael D. Albrow,
Sun-Ju Chung,
Andrew Gould,
Cheongho Han,
Youn Kil Jung,
Yoon-Hyun Ryu,
In-Gu Shin,
Yossi Shvartzvald,
Hongjing Yang,
Weicheng Zang,
Sang-Mok Cha,
Dong-Jin Kim,
Seung-Lee Kim,
Chung-Uk Lee,
Dong-Joo Lee,
Yongseok Lee,
Byeong-Gon Park,
Richard W. Pogge
Abstract:
Traditional microlensing event vetting methods require highly trained human experts, and the process is both complex and time-consuming. This reliance on manual inspection often leads to inefficiencies and constrains the ability to scale for widespread exoplanet detection, ultimately hindering discovery rates. To address the limits of traditional microlensing event vetting, we have developed LensN…
▽ More
Traditional microlensing event vetting methods require highly trained human experts, and the process is both complex and time-consuming. This reliance on manual inspection often leads to inefficiencies and constrains the ability to scale for widespread exoplanet detection, ultimately hindering discovery rates. To address the limits of traditional microlensing event vetting, we have developed LensNet, a machine learning pipeline specifically designed to distinguish legitimate microlensing events from false positives caused by instrumental artifacts, such as pixel bleed trails and diffraction spikes. Our system operates in conjunction with a preliminary algorithm that detects increasing trends in flux. These flagged instances are then passed to LensNet for further classification, allowing for timely alerts and follow-up observations. Tailored for the multi-observatory setup of the Korea Microlensing Telescope Network (KMTNet) and trained on a rich dataset of manually classified events, LensNet is optimized for early detection and warning of microlensing occurrences, enabling astronomers to organize follow-up observations promptly. The internal model of the pipeline employs a multi-branch Recurrent Neural Network (RNN) architecture that evaluates time-series flux data with contextual information, including sky background, the full width at half maximum of the target star, flux errors, PSF quality flags, and air mass for each observation. We demonstrate a classification accuracy above 87.5%, and anticipate further improvements as we expand our training set and continue to refine the algorithm.
△ Less
Submitted 10 January, 2025;
originally announced January 2025.
-
3d human motion generation from the text via gesture action classification and the autoregressive model
Authors:
Gwantae Kim,
Youngsuk Ryu,
Junyeop Lee,
David K. Han,
Jeongmin Bae,
Hanseok Ko
Abstract:
In this paper, a deep learning-based model for 3D human motion generation from the text is proposed via gesture action classification and an autoregressive model. The model focuses on generating special gestures that express human thinking, such as waving and nodding. To achieve the goal, the proposed method predicts expression from the sentences using a text classification model based on a pretra…
▽ More
In this paper, a deep learning-based model for 3D human motion generation from the text is proposed via gesture action classification and an autoregressive model. The model focuses on generating special gestures that express human thinking, such as waving and nodding. To achieve the goal, the proposed method predicts expression from the sentences using a text classification model based on a pretrained language model and generates gestures using the gate recurrent unit-based autoregressive model. Especially, we proposed the loss for the embedding space for restoring raw motions and generating intermediate motions well. Moreover, the novel data augmentation method and stop token are proposed to generate variable length motions. To evaluate the text classification model and 3D human motion generation model, a gesture action classification dataset and action-based gesture dataset are collected. With several experiments, the proposed method successfully generates perceptually natural and realistic 3D human motion from the text. Moreover, we verified the effectiveness of the proposed method using a public-available action recognition dataset to evaluate cross-dataset generalization performance.
△ Less
Submitted 17 November, 2022;
originally announced November 2022.
-
Julia Cloud Matrix Machine: Dynamic Matrix Language Acceleration on Multicore Clusters in the Cloud
Authors:
Jay Hwan Lee,
Yeonsoo Kim,
Younghyun Ryu,
Wasuwee Sodsong,
Hyunjun Jeon,
Jinsik Park,
Bernd Burgstaller,
Bernhard Scholz
Abstract:
In emerging scientific computing environments, matrix computations of increasing size and complexity are increasingly becoming prevalent. However, contemporary matrix language implementations are insufficient in their support for efficient utilization of cloud computing resources, particularly on the user side. We thus developed an extension of the Julia high-performance computation language such…
▽ More
In emerging scientific computing environments, matrix computations of increasing size and complexity are increasingly becoming prevalent. However, contemporary matrix language implementations are insufficient in their support for efficient utilization of cloud computing resources, particularly on the user side. We thus developed an extension of the Julia high-performance computation language such that matrix computations are automatically parallelized in the cloud, where users are separated from directly interacting with complex explicitly-parallel computations. We implement lazy evaluation semantics combined with directed graphs to optimize matrix operations on the fly while dynamic simulation finds the optimal tile size and schedule for a given cluster of cloud nodes. A time model prediction of the cluster's performance capacity is constructed to enable simulations. Automatic configuration of communication and worker processes on the cloud networks allow for the framework to automatically scale up for clusters of heterogeneous nodes. Our framework's experimental evaluation comprises eleven benchmarks on an fourteen node (564 CPUs) cluster in the AWS public cloud, revealing speedups of up to a factor of 5.1, with an average 74.39% of the upper bound for speedups.
△ Less
Submitted 8 December, 2023; v1 submitted 15 May, 2022;
originally announced May 2022.
-
Google street view and deep learning: a new ground truthing approach for crop mapping
Authors:
Yulin Yan,
Youngryel Ryu
Abstract:
Ground referencing is essential for supervised crop mapping. However, conventional ground truthing involves extensive field surveys and post processing, which is costly in terms of time and labor. In this study, we applied a convolutional neural network (CNN) model to explore the efficacy of automatic ground truthing via Google street view (GSV) images in two distinct farming regions: central Illi…
▽ More
Ground referencing is essential for supervised crop mapping. However, conventional ground truthing involves extensive field surveys and post processing, which is costly in terms of time and labor. In this study, we applied a convolutional neural network (CNN) model to explore the efficacy of automatic ground truthing via Google street view (GSV) images in two distinct farming regions: central Illinois and southern California. We demonstrated the feasibility and reliability of the new ground referencing technique further by performing pixel-based crop mapping with vegetation indices as the model input. The results were evaluated using the United States Department of Agriculture (USDA) crop data layer (CDL) products. From 8,514 GSV images, the CNN model screened out 2,645 target crop images. These images were well classified into crop types, including alfalfa, almond, corn, cotton, grape, soybean, and pistachio. The overall GSV image classification accuracy reached 93% in California and 97% in Illinois. We then shifted the image geographic coordinates using fixed empirical coefficients to produce 8,173 crop reference points including 1,764 in Illinois and 6,409 in California. Evaluation of these new reference points with CDL products showed satisfactory coherence, with 94 to 97% agreement. CNN-based mapping also captured the general pattern of crop type distributions. The overall differences between CDL products and our mapping results were 4% in California and 5% in Illinois. Thus, using these deep learning and GSV image techniques, we have provided an efficient and cost-effective alternative method for ground referencing and crop mapping.
△ Less
Submitted 2 December, 2019;
originally announced December 2019.
-
Exploiting Trust Degree for Multiple-Antenna User Cooperation
Authors:
Mingxiong Zhao,
Jong Yeol Ryu,
Jemin Lee,
Tony Q. S. Quek,
Suili Feng
Abstract:
For a user cooperation system with multiple antennas, we consider a trust degree based cooperation techniques to explore the influence of the trustworthiness between users on the communication systems. For the system with two communication pairs, when one communication pair achieves its quality of service (QoS) requirement, they can help the transmission of the other communication pair according t…
▽ More
For a user cooperation system with multiple antennas, we consider a trust degree based cooperation techniques to explore the influence of the trustworthiness between users on the communication systems. For the system with two communication pairs, when one communication pair achieves its quality of service (QoS) requirement, they can help the transmission of the other communication pair according to the trust degree, which quantifies the trustworthiness between users in the cooperation. For given trust degree, we investigate the user cooperation strategies, which include the power allocation and precoder design for various antenna configurations. For SISO and MISO cases, we provide the optimal power allocation and beamformer design that maximize the expected achievable rates while guaranteeing the QoS requirement. For a SIMO case, we resort to semidefinite relaxation (SDR) technique and block coordinate update (BCU) method to solve the corresponding problem, and guarantee the rank-one solutions at each step. For a MIMO case, as MIMO is the generalization of MISO and SIMO, the similarities among their problem structures inspire us to combine the methods from MISO and SIMO together to efficiently tackle MIMO case. Simulation results show that the trust degree information has a great effect on the performance of the user cooperation in terms of the expected achievable rate, and the proposed user cooperation strategies achieve high achievable rates for given trust degree.
△ Less
Submitted 10 May, 2017;
originally announced May 2017.