-
Physics-informed neural network framework for solving forward and inverse flexoelectric problems
Authors:
Hyeonbin Moon,
Donggeun Park,
Jinwook Yeo,
Seunghwa Ryu
Abstract:
Flexoelectricity, the coupling between strain gradients and electric polarization, poses significant computational challenges due to its governing fourth-order partial differential equations that require C1-continuous solutions. To address these issues, we propose a physics-informed neural network (PINN) framework grounded in an energy-based formulation that treats both forward and inverse problem…
▽ More
Flexoelectricity, the coupling between strain gradients and electric polarization, poses significant computational challenges due to its governing fourth-order partial differential equations that require C1-continuous solutions. To address these issues, we propose a physics-informed neural network (PINN) framework grounded in an energy-based formulation that treats both forward and inverse problems within a unified architecture. The forward problem is recast as a saddle-point optimization of the total potential energy, solved via the deep energy method (DEM), which circumvents the direct computation of high-order derivatives. For the inverse problem of identifying unknown flexoelectric coefficients from sparse measurements, we introduce an additional variational loss that enforces stationarity with respect to the electric potential, ensuring robust and stable parameter inference. The framework integrates finite element-based numerical quadrature for stable energy evaluation and employs hard constraints to rigorously enforce boundary conditions. Numerical results for both direct and converse flexoelectric effects show excellent agreement with mixed-FEM solutions, and the inverse model accurately recovers material parameters from limited data. This study establishes a unified, mesh-compatible, and scalable PINN approach for high-order electromechanical problems, offering a promising alternative to traditional simulation techniques.
△ Less
Submitted 12 June, 2025;
originally announced June 2025.
-
Search for neutron decay into an antineutrino and a neutral kaon in 0.401 megaton-years exposure of Super-Kamiokande
Authors:
Super-Kamiokande Collaboration,
:,
K. Yamauchi,
K. Abe,
S. Abe,
Y. Asaoka,
M. Harada,
Y. Hayato,
K. Hiraide,
K. Hosokawa,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
Y. Kataoka,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
M. Nakahata,
S. Nakayama,
Y. Noguchi,
G. Pronost,
K. Sato,
H. Sekiya
, et al. (240 additional authors not shown)
Abstract:
We searched for bound neutron decay via $n\to\barν+K^0$ predicted by the Grand Unified Theories in 0.401 Mton$\cdot$years exposure of all pure water phases in the Super-Kamiokande detector. About 4.4 times more data than in the previous search have been analyzed by a new method including a spectrum fit to kaon invariant mass distributions. No significant data excess has been observed in the signal…
▽ More
We searched for bound neutron decay via $n\to\barν+K^0$ predicted by the Grand Unified Theories in 0.401 Mton$\cdot$years exposure of all pure water phases in the Super-Kamiokande detector. About 4.4 times more data than in the previous search have been analyzed by a new method including a spectrum fit to kaon invariant mass distributions. No significant data excess has been observed in the signal regions. As a result of this analysis, we set a lower limit of $7.8\times10^{32}$ years on the neutron lifetime at a 90% confidence level.
△ Less
Submitted 17 June, 2025;
originally announced June 2025.
-
Physics-Informed Neural Operators for Generalizable and Label-Free Inference of Temperature-Dependent Thermoelectric Properties
Authors:
Hyeonbin Moon,
Songho Lee,
Wabi Demeke,
Byungki Ryu,
Seunghwa Ryu
Abstract:
Accurate characterization of temperature-dependent thermoelectric properties (TEPs), such as thermal conductivity and the Seebeck coefficient, is essential for reliable modeling and efficient design of thermoelectric devices. However, their nonlinear temperature dependence and coupled transport behavior make both forward simulation and inverse identification difficult, particularly under sparse me…
▽ More
Accurate characterization of temperature-dependent thermoelectric properties (TEPs), such as thermal conductivity and the Seebeck coefficient, is essential for reliable modeling and efficient design of thermoelectric devices. However, their nonlinear temperature dependence and coupled transport behavior make both forward simulation and inverse identification difficult, particularly under sparse measurement conditions. In this study, we develop a physics-informed machine learning approach that employs physics-informed neural networks (PINN) for solving forward and inverse problems in thermoelectric systems, and neural operators (PINO) to enable generalization across diverse material systems. The PINN enables field reconstruction and material property inference by embedding governing transport equations into the loss function, while the PINO generalizes this inference capability across diverse materials without retraining. Trained on simulated data for 20 p-type materials and evaluated on 60 unseen materials, the PINO model demonstrates accurate and label-free inference of TEPs using only sparse field data. The proposed framework offers a scalable, generalizable, and data-efficient approach for thermoelectric property identification, paving the way for high-throughput screening and inverse design of advanced thermoelectric materials.
△ Less
Submitted 9 June, 2025;
originally announced June 2025.
-
Toward Knowledge-Guided AI for Inverse Design in Manufacturing: A Perspective on Domain, Physics, and Human-AI Synergy
Authors:
Hugon Lee,
Hyeonbin Moon,
Junhyeong Lee,
Seunghwa RYu
Abstract:
Artificial intelligence (AI) is reshaping inverse design across manufacturing domain, enabling high-performance discovery in materials, products, and processes. However, purely data-driven approaches often struggle in realistic settings characterized by sparse data, high-dimensional design spaces, and nontrivial physical constraints. This perspective argues for a new generation of design systems t…
▽ More
Artificial intelligence (AI) is reshaping inverse design across manufacturing domain, enabling high-performance discovery in materials, products, and processes. However, purely data-driven approaches often struggle in realistic settings characterized by sparse data, high-dimensional design spaces, and nontrivial physical constraints. This perspective argues for a new generation of design systems that transcend black-box modeling by integrating domain knowledge, physics-informed learning, and intuitive human-AI interfaces. We first demonstrate how expert-guided sampling strategies enhance data efficiency and model generalization. Next, we discuss how physics-informed machine learning enables physically consistent modeling in data-scarce regimes. Finally, we explore how large language models emerge as interactive design agents connecting user intent with simulation tools, optimization pipelines, and collaborative workflows. Through illustrative examples and conceptual frameworks, we advocate that inverse design in manufacturing should evolve into a unified ecosystem, where domain knowledge, physical priors, and adaptive reasoning collectively enable scalable, interpretable, and accessible AI-driven design systems.
△ Less
Submitted 29 May, 2025;
originally announced June 2025.
-
Lunguage: A Benchmark for Structured and Sequential Chest X-ray Interpretation
Authors:
Jong Hak Moon,
Geon Choi,
Paloma Rabaey,
Min Gwan Kim,
Hyuk Gi Hong,
Jung-Oh Lee,
Hangyul Yoon,
Eun Woo Doe,
Jiyoun Kim,
Harshita Sharma,
Daniel C. Castro,
Javier Alvarez-Valle,
Edward Choi
Abstract:
Radiology reports convey detailed clinical observations and capture diagnostic reasoning that evolves over time. However, existing evaluation methods are limited to single-report settings and rely on coarse metrics that fail to capture fine-grained clinical semantics and temporal dependencies. We introduce LUNGUAGE,a benchmark dataset for structured radiology report generation that supports both s…
▽ More
Radiology reports convey detailed clinical observations and capture diagnostic reasoning that evolves over time. However, existing evaluation methods are limited to single-report settings and rely on coarse metrics that fail to capture fine-grained clinical semantics and temporal dependencies. We introduce LUNGUAGE,a benchmark dataset for structured radiology report generation that supports both single-report evaluation and longitudinal patient-level assessment across multiple studies. It contains 1,473 annotated chest X-ray reports, each reviewed by experts, and 80 of them contain longitudinal annotations to capture disease progression and inter-study intervals, also reviewed by experts. Using this benchmark, we develop a two-stage framework that transforms generated reports into fine-grained, schema-aligned structured representations, enabling longitudinal interpretation. We also propose LUNGUAGESCORE, an interpretable metric that compares structured outputs at the entity, relation, and attribute level while modeling temporal consistency across patient timelines. These contributions establish the first benchmark dataset, structuring framework, and evaluation metric for sequential radiology reporting, with empirical results demonstrating that LUNGUAGESCORE effectively supports structured report evaluation. The code is available at: https://github.com/SuperSupermoon/Lunguage
△ Less
Submitted 27 May, 2025;
originally announced May 2025.
-
Cross-Lingual Optimization for Language Transfer in Large Language Models
Authors:
Jungseob Lee,
Seongtae Hong,
Hyeonseok Moon,
Heuiseok Lim
Abstract:
Adapting large language models to other languages typically employs supervised fine-tuning (SFT) as a standard approach. However, it often suffers from an overemphasis on English performance, a phenomenon that is especially pronounced in data-constrained environments. To overcome these challenges, we propose \textbf{Cross-Lingual Optimization (CLO)} that efficiently transfers an English-centric LL…
▽ More
Adapting large language models to other languages typically employs supervised fine-tuning (SFT) as a standard approach. However, it often suffers from an overemphasis on English performance, a phenomenon that is especially pronounced in data-constrained environments. To overcome these challenges, we propose \textbf{Cross-Lingual Optimization (CLO)} that efficiently transfers an English-centric LLM to a target language while preserving its English capabilities. CLO utilizes publicly available English SFT data and a translation model to enable cross-lingual transfer. We conduct experiments using five models on six languages, each possessing varying levels of resource. Our results show that CLO consistently outperforms SFT in both acquiring target language proficiency and maintaining English performance. Remarkably, in low-resource languages, CLO with only 3,200 samples surpasses SFT with 6,400 samples, demonstrating that CLO can achieve better performance with less data. Furthermore, we find that SFT is particularly sensitive to data quantity in medium and low-resource languages, whereas CLO remains robust. Our comprehensive analysis emphasizes the limitations of SFT and incorporates additional training strategies in CLO to enhance efficiency.
△ Less
Submitted 20 May, 2025;
originally announced May 2025.
-
Semantic Aware Linear Transfer by Recycling Pre-trained Language Models for Cross-lingual Transfer
Authors:
Seungyoon Lee,
Seongtae Hong,
Hyeonseok Moon,
Heuiseok Lim
Abstract:
Large Language Models (LLMs) increasingly incorporate multilingual capabilities, fueling the demand to transfer them into target language-specific models. However, most approaches, which blend the source model's embedding by replacing the source vocabulary with the target language-specific vocabulary, may constrain expressive capacity in the target language since the source model is predominantly…
▽ More
Large Language Models (LLMs) increasingly incorporate multilingual capabilities, fueling the demand to transfer them into target language-specific models. However, most approaches, which blend the source model's embedding by replacing the source vocabulary with the target language-specific vocabulary, may constrain expressive capacity in the target language since the source model is predominantly trained on English data. In this paper, we propose Semantic Aware Linear Transfer (SALT), a novel cross-lingual transfer technique that recycles embeddings from target language Pre-trained Language Models (PLMs) to transmit the deep representational strengths of PLM-derived embedding to LLMs. SALT derives unique regression lines based on the similarity in the overlap of the source and target vocabularies, to handle each non-overlapping token's embedding space. Our extensive experiments show that SALT significantly outperforms other transfer methods and achieves lower loss with accelerating faster convergence during language adaptation. Notably, SALT obtains remarkable performance in cross-lingual understanding setups compared to other methods. Furthermore, we highlight the scalable use of PLMs to enhance the functionality of contemporary LLMs by conducting experiments with varying architectures.
△ Less
Submitted 22 May, 2025; v1 submitted 16 May, 2025;
originally announced May 2025.
-
Measurement of neutron production in atmospheric neutrino interactions at Super-Kamiokande
Authors:
Super-Kamiokande collaboration,
:,
S. Han,
K. Abe,
S. Abe,
Y. Asaoka,
C. Bronner,
M. Harada,
Y. Hayato,
K. Hiraide,
K. Hosokawa,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
R. Kaneshima,
Y. Kashiwagi,
Y. Kataoka,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
M. Nakahata,
S. Nakayama,
Y. Noguchi
, et al. (260 additional authors not shown)
Abstract:
We present measurements of total neutron production from atmospheric neutrino interactions in water, analyzed as a function of electron-equivalent visible energy over a range of 30 MeV to 10 GeV. These results are based on 4,270 days of data collected by Super-Kamiokande, including 564 days with 0.011 wt\% gadolinium added to enhance neutron detection. Neutron signal selection is based on a neural…
▽ More
We present measurements of total neutron production from atmospheric neutrino interactions in water, analyzed as a function of electron-equivalent visible energy over a range of 30 MeV to 10 GeV. These results are based on 4,270 days of data collected by Super-Kamiokande, including 564 days with 0.011 wt\% gadolinium added to enhance neutron detection. Neutron signal selection is based on a neural network trained on simulation, with its performance validated using an Am/Be neutron point source. The measurements are compared to predictions from neutrino event generators combined with various hadron-nucleus interaction models, which include an intranuclear cascade model and a nuclear de-excitation model. We observe significant variations in the predictions depending on the choice of hadron-nucleus interaction model. We discuss key factors that contribute to describing our data, such as in-medium effects in the intranuclear cascade and the accuracy of statistical evaporation modeling.
△ Less
Submitted 20 June, 2025; v1 submitted 7 May, 2025;
originally announced May 2025.
-
Physics-Informed Neural Network-Based Discovery of Hyperelastic Constitutive Models from Extremely Scarce Data
Authors:
Hyeonbin Moon,
Donggeun Park,
Hanbin Cho,
Hong-Kyun Noh,
Jae hyuk Lim,
Seunghwa Ryu
Abstract:
The discovery of constitutive models for hyperelastic materials is essential yet challenging due to their nonlinear behavior and the limited availability of experimental data. Traditional methods typically require extensive stress-strain or full-field measurements, which are often difficult to obtain in practical settings. To overcome these challenges, we propose a physics-informed neural network…
▽ More
The discovery of constitutive models for hyperelastic materials is essential yet challenging due to their nonlinear behavior and the limited availability of experimental data. Traditional methods typically require extensive stress-strain or full-field measurements, which are often difficult to obtain in practical settings. To overcome these challenges, we propose a physics-informed neural network (PINN)-based framework that enables the discovery of constitutive models using only sparse measurement data - such as displacement and reaction force - that can be acquired from a single material test. By integrating PINNs with finite element discretization, the framework reconstructs full-field displacement and identifies the underlying strain energy density from predefined candidates, while ensuring consistency with physical laws. A two-stage training process is employed: the Adam optimizer jointly updates neural network parameters and model coefficients to obtain an initial solution, followed by L-BFGS refinement and sparse regression with l_p regularization to extract a parsimonious constitutive model. Validation on benchmark hyperelastic models demonstrates that the proposed method can accurately recover constitutive laws and displacement fields, even when the input data are limited and noisy. These findings highlight the applicability of the proposed framework to experimental scenarios where measurement data are both scarce and noisy.
△ Less
Submitted 28 April, 2025;
originally announced April 2025.
-
MIRAGE: A Metric-Intensive Benchmark for Retrieval-Augmented Generation Evaluation
Authors:
Chanhee Park,
Hyeonseok Moon,
Chanjun Park,
Heuiseok Lim
Abstract:
Retrieval-Augmented Generation (RAG) has gained prominence as an effective method for enhancing the generative capabilities of Large Language Models (LLMs) through the incorporation of external knowledge. However, the evaluation of RAG systems remains a challenge, due to the intricate interplay between retrieval and generation components. This limitation has resulted in a scarcity of benchmarks th…
▽ More
Retrieval-Augmented Generation (RAG) has gained prominence as an effective method for enhancing the generative capabilities of Large Language Models (LLMs) through the incorporation of external knowledge. However, the evaluation of RAG systems remains a challenge, due to the intricate interplay between retrieval and generation components. This limitation has resulted in a scarcity of benchmarks that facilitate a detailed, component-specific assessment. In this work, we present MIRAGE, a Question Answering dataset specifically designed for RAG evaluation. MIRAGE consists of 7,560 curated instances mapped to a retrieval pool of 37,800 entries, enabling an efficient and precise evaluation of both retrieval and generation tasks. We also introduce novel evaluation metrics aimed at measuring RAG adaptability, encompassing dimensions such as noise vulnerability, context acceptability, context insensitivity, and context misinterpretation. Through comprehensive experiments across various retriever-LLM configurations, we provide new insights into the optimal alignment of model pairs and the nuanced dynamics within RAG systems. The dataset and evaluation code are publicly available, allowing for seamless integration and customization in diverse research settings\footnote{The MIRAGE code and data are available at https://github.com/nlpai-lab/MIRAGE.
△ Less
Submitted 23 April, 2025;
originally announced April 2025.
-
Benchmarking Large Language Models for Calculus Problem-Solving: A Comparative Analysis
Authors:
In Hak Moon
Abstract:
This study presents a comprehensive evaluation of five leading large language models (LLMs) - Chat GPT 4o, Copilot Pro, Gemini Advanced, Claude Pro, and Meta AI - on their performance in solving calculus differentiation problems. The investigation assessed these models across 13 fundamental problem types, employing a systematic cross-evaluation framework where each model solved problems generated…
▽ More
This study presents a comprehensive evaluation of five leading large language models (LLMs) - Chat GPT 4o, Copilot Pro, Gemini Advanced, Claude Pro, and Meta AI - on their performance in solving calculus differentiation problems. The investigation assessed these models across 13 fundamental problem types, employing a systematic cross-evaluation framework where each model solved problems generated by all models. Results revealed significant performance disparities, with Chat GPT 4o achieving the highest success rate (94.71%), followed by Claude Pro (85.74%), Gemini Advanced (84.42%), Copilot Pro (76.30%), and Meta AI (56.75%). All models excelled at procedural differentiation tasks but showed varying limitations with conceptual understanding and algebraic manipulation. Notably, problems involving increasing/decreasing intervals and optimization word problems proved most challenging across all models. The cross-evaluation matrix revealed that Claude Pro generated the most difficult problems, suggesting distinct capabilities between problem generation and problem-solving. These findings have significant implications for educational applications, highlighting both the potential and limitations of LLMs as calculus learning tools. While they demonstrate impressive procedural capabilities, their conceptual understanding remains limited compared to human mathematical reasoning, emphasizing the continued importance of human instruction for developing deeper mathematical comprehension.
△ Less
Submitted 30 March, 2025;
originally announced April 2025.
-
Debate Only When Necessary: Adaptive Multiagent Collaboration for Efficient LLM Reasoning
Authors:
Sugyeong Eo,
Hyeonseok Moon,
Evelyn Hayoon Zi,
Chanjun Park,
Heuiseok Lim
Abstract:
Multiagent collaboration has emerged as a promising framework for enhancing the reasoning capabilities of large language models (LLMs). Despite improvements in reasoning, the approach introduces substantial computational overhead resulting from iterative agent interactions. Furthermore, engaging in unnecessary debates increases the risk of generating erroneous responses. To address these challenge…
▽ More
Multiagent collaboration has emerged as a promising framework for enhancing the reasoning capabilities of large language models (LLMs). Despite improvements in reasoning, the approach introduces substantial computational overhead resulting from iterative agent interactions. Furthermore, engaging in unnecessary debates increases the risk of generating erroneous responses. To address these challenges, we propose Debate Only When Necessary (DOWN), an adaptive multiagent debate framework that selectively activates debate based on the confidence score of the agent's initial response. Debate is activated only for queries requiring further deliberation, during which agents refine their outputs by referencing peer responses and associated confidence scores. Evaluations on benchmarks show that DOWN improves efficiency by up to six times while preserving or even outperforming the performance of existing methods. Further analysis indicates that DOWN effectively mitigates the risk of error propagation stemming from the unnecessary debate process. These findings demonstrate the effectiveness of our approach in delivering high-performance LLM solutions at a lower computational cost.
△ Less
Submitted 20 May, 2025; v1 submitted 7 April, 2025;
originally announced April 2025.
-
Enhancing Biologically Inspired Hierarchical Temporal Memory with Hardware-Accelerated Reflex Memory
Authors:
Pavia Bera,
Sabrina Hassan Moon,
Jennifer Adorno,
Dayane Alfenas Reis,
Sanjukta Bhanja
Abstract:
The rapid expansion of the Internet of Things (IoT) generates zettabytes of data that demand efficient unsupervised learning systems. Hierarchical Temporal Memory (HTM), a third-generation unsupervised AI algorithm, models the neocortex of the human brain by simulating columns of neurons to process and predict sequences. These neuron columns can memorize and infer sequences across multiple orders.…
▽ More
The rapid expansion of the Internet of Things (IoT) generates zettabytes of data that demand efficient unsupervised learning systems. Hierarchical Temporal Memory (HTM), a third-generation unsupervised AI algorithm, models the neocortex of the human brain by simulating columns of neurons to process and predict sequences. These neuron columns can memorize and infer sequences across multiple orders. While multiorder inferences offer robust predictive capabilities, they often come with significant computational overhead. The Sequence Memory (SM) component of HTM, which manages these inferences, encounters bottlenecks primarily due to its extensive programmable interconnects. In many cases, it has been observed that first-order temporal relationships have proven to be sufficient without any significant loss in efficiency. This paper introduces a Reflex Memory (RM) block, inspired by the Spinal Cord's working mechanisms, designed to accelerate the processing of first-order inferences. The RM block performs these inferences significantly faster than the SM. The integration of RM with HTM forms a system called the Accelerated Hierarchical Temporal Memory (AHTM), which processes repetitive information more efficiently than the original HTM while still supporting multiorder inferences. The experimental results demonstrate that the HTM predicts an event in 0.945 s, whereas the AHTM module does so in 0.125 s. Additionally, the hardware implementation of RM in a content-addressable memory (CAM) block, known as Hardware-Accelerated Hierarchical Temporal Memory (H-AHTM), predicts an event in just 0.094 s, significantly improving inference speed. Compared to the original algorithm \cite{bautista2020matlabhtm}, AHTM accelerates inference by up to 7.55x, while H-AHTM further enhances performance with a 10.10x speedup.
△ Less
Submitted 1 April, 2025;
originally announced April 2025.
-
Simple yet Effective Node Property Prediction on Edge Streams under Distribution Shifts
Authors:
Jongha Lee,
Taehyung Kwon,
Heechan Moon,
Kijung Shin
Abstract:
The problem of predicting node properties (e.g., node classes) in graphs has received significant attention due to its broad range of applications. Graphs from real-world datasets often evolve over time, with newly emerging edges and dynamically changing node properties, posing a significant challenge for this problem. In response, temporal graph neural networks (TGNNs) have been developed to pred…
▽ More
The problem of predicting node properties (e.g., node classes) in graphs has received significant attention due to its broad range of applications. Graphs from real-world datasets often evolve over time, with newly emerging edges and dynamically changing node properties, posing a significant challenge for this problem. In response, temporal graph neural networks (TGNNs) have been developed to predict dynamic node properties from a stream of emerging edges. However, our analysis reveals that most TGNN-based methods are (a) far less effective without proper node features and, due to their complex model architectures, (b) vulnerable to distribution shifts. In this paper, we propose SPLASH, a simple yet powerful method for predicting node properties on edge streams under distribution shifts. Our key contributions are as follows: (1) we propose feature augmentation methods and an automatic feature selection method for edge streams, which improve the effectiveness of TGNNs, (2) we propose a lightweight MLP-based TGNN architecture that is highly efficient and robust under distribution shifts, and (3) we conduct extensive experiments to evaluate the accuracy, efficiency, generalization, and qualitative performance of the proposed method and its competitors on dynamic node classification, dynamic anomaly detection, and node affinity prediction tasks across seven real-world datasets.
△ Less
Submitted 31 March, 2025;
originally announced April 2025.
-
FLEX: A Benchmark for Evaluating Robustness of Fairness in Large Language Models
Authors:
Dahyun Jung,
Seungyoon Lee,
Hyeonseok Moon,
Chanjun Park,
Heuiseok Lim
Abstract:
Recent advancements in Large Language Models (LLMs) have significantly enhanced interactions between users and models. These advancements concurrently underscore the need for rigorous safety evaluations due to the manifestation of social biases, which can lead to harmful societal impacts. Despite these concerns, existing benchmarks may overlook the intrinsic weaknesses of LLMs, which can generate…
▽ More
Recent advancements in Large Language Models (LLMs) have significantly enhanced interactions between users and models. These advancements concurrently underscore the need for rigorous safety evaluations due to the manifestation of social biases, which can lead to harmful societal impacts. Despite these concerns, existing benchmarks may overlook the intrinsic weaknesses of LLMs, which can generate biased responses even with simple adversarial instructions. To address this critical gap, we introduce a new benchmark, Fairness Benchmark in LLM under Extreme Scenarios (FLEX), designed to test whether LLMs can sustain fairness even when exposed to prompts constructed to induce bias. To thoroughly evaluate the robustness of LLMs, we integrate prompts that amplify potential biases into the fairness assessment. Comparative experiments between FLEX and existing benchmarks demonstrate that traditional evaluations may underestimate the inherent risks in models. This highlights the need for more stringent LLM evaluation benchmarks to guarantee safety and fairness.
△ Less
Submitted 25 March, 2025;
originally announced March 2025.
-
Cluster algebras and skein algebras for surfaces
Authors:
Hiroaki Karuo,
Han-Bom Moon,
Helen Wong
Abstract:
We consider two algebras of curves associated to an oriented surface of finite type - the cluster algebra from combinatorial algebra, and the skein algebra from quantum topology. We focus on generalizations of cluster algebras and generalizations of skein algebras that include arcs whose endpoints are marked points on the boundary or in the interior of the surface. We show that the generalizations…
▽ More
We consider two algebras of curves associated to an oriented surface of finite type - the cluster algebra from combinatorial algebra, and the skein algebra from quantum topology. We focus on generalizations of cluster algebras and generalizations of skein algebras that include arcs whose endpoints are marked points on the boundary or in the interior of the surface. We show that the generalizations are closely related by maps that can be explicitly defined, and we explore the structural implications, including (non-)finite generation. We also discuss open questions about the algebraic structure of the algebras.
△ Less
Submitted 15 March, 2025;
originally announced March 2025.
-
PMT calibration for the JSNS2-II far detector with an embedded LED system
Authors:
Jisu Park,
M. K. Cheoun,
J. H. Choi,
J. Y. Choi,
T. Dodo,
J. Goh,
M. Harada,
S. Hasegawa,
W. Hwang,
T. Iida,
H. I. Jang,
J. S. Jang,
K. K. Joo,
D. E. Jung,
S. K. Kang,
Y. Kasugai,
T. Kawasaki,
E. M. Kim,
S. B. Kim,
S. Y. Kim,
H. Kinoshita,
T. Konno,
D. H. Lee,
C. Little,
T. Maruyama
, et al. (31 additional authors not shown)
Abstract:
The JSNS2-II (the second phase of JSNS2, J-PARC Sterile Neutrino Search at J-PARC Spallation Neutron Source) is an experiment aimed at searching for sterile neutrinos. This experiment has entered its second phase, employing two liquid scintillator detectors located at near and far positions from the neutrino source. Recently, the far detector of the experiment has been completed and is currently i…
▽ More
The JSNS2-II (the second phase of JSNS2, J-PARC Sterile Neutrino Search at J-PARC Spallation Neutron Source) is an experiment aimed at searching for sterile neutrinos. This experiment has entered its second phase, employing two liquid scintillator detectors located at near and far positions from the neutrino source. Recently, the far detector of the experiment has been completed and is currently in the calibration phase. This paper presents a detailed description of the calibration process utilizing the LED system. The LED system of the far detector uses two Ultra-Violet (UV) LEDs, which are effective in calibrating all of PMTs at once. The UV light is converted into the visible light wavelengths inside liquid scintillator via the wavelength shifters, providing pseudo-isotropic light. The properties of all functioning Photo-Multiplier-Tubes (PMTs) to detect the neutrino events in the far detector, such as gain, its dependence of supplied High Voltage (HV), and Peak-to-Valley (PV) were calibrated. To achieve a good energy resolution for physics events, up to 10% of the relative gain adjustment is required for all functioning PMTs. This will be achieved using the measured HV curves and the LED calibration. The Peak-to-Valley (PV) ratio values are the similar to those from the production company, which distinguish the single photo-electron signal from the pedestal. Additionally, the precision of PMT signal timing is measured to be 2.1 ns, meeting the event reconstruction requirement of 10 ns.
△ Less
Submitted 11 March, 2025;
originally announced March 2025.
-
Call for Rigor in Reporting Quality of Instruction Tuning Data
Authors:
Hyeonseok Moon,
Jaehyung Seo,
Heuiseok Lim
Abstract:
Instruction tuning is crucial for adapting large language models (LLMs) to align with user intentions. Numerous studies emphasize the significance of the quality of instruction tuning (IT) data, revealing a strong correlation between IT data quality and the alignment performance of LLMs. In these studies, the quality of IT data is typically assessed by evaluating the performance of LLMs trained wi…
▽ More
Instruction tuning is crucial for adapting large language models (LLMs) to align with user intentions. Numerous studies emphasize the significance of the quality of instruction tuning (IT) data, revealing a strong correlation between IT data quality and the alignment performance of LLMs. In these studies, the quality of IT data is typically assessed by evaluating the performance of LLMs trained with that data. However, we identified a prevalent issue in such practice: hyperparameters for training models are often selected arbitrarily without adequate justification. We observed significant variations in hyperparameters applied across different studies, even when training the same model with the same data. In this study, we demonstrate the potential problems arising from this practice and emphasize the need for careful consideration in verifying data quality. Through our experiments on the quality of LIMA data and a selected set of 1,000 Alpaca data points, we demonstrate that arbitrary hyperparameter decisions can make any arbitrary conclusion.
△ Less
Submitted 15 May, 2025; v1 submitted 3 March, 2025;
originally announced March 2025.
-
Performance Comparison of Large Language Models on Advanced Calculus Problems
Authors:
In Hak Moon
Abstract:
This paper presents an in-depth analysis of the performance of seven different Large Language Models (LLMs) in solving a diverse set of math advanced calculus problems. The study aims to evaluate these models' accuracy, reliability, and problem-solving capabilities, including ChatGPT 4o, Gemini Advanced with 1.5 Pro, Copilot Pro, Claude 3.5 Sonnet, Meta AI, Mistral AI, and Perplexity. The assessme…
▽ More
This paper presents an in-depth analysis of the performance of seven different Large Language Models (LLMs) in solving a diverse set of math advanced calculus problems. The study aims to evaluate these models' accuracy, reliability, and problem-solving capabilities, including ChatGPT 4o, Gemini Advanced with 1.5 Pro, Copilot Pro, Claude 3.5 Sonnet, Meta AI, Mistral AI, and Perplexity. The assessment was conducted through a series of thirty-two test problems, encompassing a total of 320 points. The problems covered various topics, from vector calculations and geometric interpretations to integral evaluations and optimization tasks. The results highlight significant trends and patterns in the models' performance, revealing both their strengths and weaknesses - for instance, models like ChatGPT 4o and Mistral AI demonstrated consistent accuracy across various problem types, indicating their robustness and reliability in mathematical problem-solving, while models such as Gemini Advanced with 1.5 Pro and Meta AI exhibited specific weaknesses, particularly in complex problems involving integrals and optimization, suggesting areas for targeted improvements. The study also underscores the importance of re-prompting in achieving accurate solutions, as seen in several instances where models initially provided incorrect answers but corrected them upon re-prompting. Overall, this research provides valuable insights into the current capabilities and limitations of LLMs in the domain of math calculus, with the detailed analysis of each model's performance on specific problems offering a comprehensive understanding of their strengths and areas for improvement, contributing to the ongoing development and refinement of LLM technology. The findings are particularly relevant for educators, researchers, and developers seeking to leverage LLMs for educational and practical applications in mathematics.
△ Less
Submitted 5 March, 2025;
originally announced March 2025.
-
Design of the Global Reconstruction Logic in the Belle II Level-1 Trigger system
Authors:
Y. -T. Lai,
T. Koga,
Y. Iwasaki,
Y. Ahn,
H. Bae,
M. Campajola,
B. G. Cheon,
H. -E. Cho,
T. Ferber,
I. Haide,
G. Heine,
C. -L. Hsu,
C. Kiesling,
C. -H. Kim,
J. B. Kim,
K. Kim,
S. H. Kim,
I. S. Lee,
M. J. Lee,
Y. P. Liao,
J. Lin,
A. Little,
H. K. Moon,
H. Nakazawa,
M. Neu
, et al. (10 additional authors not shown)
Abstract:
The Belle~II experiment is designed to search for physics beyond the Standard Model by investigating rare decays at the SuperKEKB \(e^{+}e^{-}\) collider. Owing to the significant beam background at high luminosity, the data acquisition system employs a hardware-based Level-1~Trigger to reduce the readout data throughput by selecting collision events of interest in real time. The Belle~II Level-1~…
▽ More
The Belle~II experiment is designed to search for physics beyond the Standard Model by investigating rare decays at the SuperKEKB \(e^{+}e^{-}\) collider. Owing to the significant beam background at high luminosity, the data acquisition system employs a hardware-based Level-1~Trigger to reduce the readout data throughput by selecting collision events of interest in real time. The Belle~II Level-1~Trigger system utilizes FPGAs to reconstruct various detector observables from the raw data for trigger decision-making. The Global Reconstruction Logic receives these processed observables from four sub-trigger systems and provides a global summary for the final trigger decision. Its logic encompasses charged particle tracking, matching between sub-triggers, and the identification of special event topologies associated with low-multiplicity decays. This article discusses the hardware devices, FPGA firmware, integration with peripheral systems, and the design and performance of the trigger algorithms implemented within the Global Reconstruction Logic.
△ Less
Submitted 3 March, 2025;
originally announced March 2025.
-
Neutron multiplicity measurement in muon capture on oxygen nuclei in the Gd-loaded Super-Kamiokande detector
Authors:
The Super-Kamiokande Collaboration,
:,
S. Miki,
K. Abe,
S. Abe,
Y. Asaoka,
C. Bronner,
M. Harada,
Y. Hayato,
K. Hiraide,
K. Hosokawa,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
R. Kaneshima,
Y. Kashiwagi,
Y. Kataoka,
S. Mine,
M. Miura,
S. Moriyama,
M. Nakahata,
S. Nakayama,
Y. Noguchi,
K. Okamoto
, et al. (265 additional authors not shown)
Abstract:
In recent neutrino detectors, neutrons produced in neutrino reactions play an important role. Muon capture on oxygen nuclei is one of the processes that produce neutrons in water Cherenkov detectors. We measured neutron multiplicity in the process using cosmic ray muons that stop in the gadolinium-loaded Super-Kamiokande detector. For this measurement, neutron detection efficiency is obtained with…
▽ More
In recent neutrino detectors, neutrons produced in neutrino reactions play an important role. Muon capture on oxygen nuclei is one of the processes that produce neutrons in water Cherenkov detectors. We measured neutron multiplicity in the process using cosmic ray muons that stop in the gadolinium-loaded Super-Kamiokande detector. For this measurement, neutron detection efficiency is obtained with the muon capture events followed by gamma rays to be $50.2^{+2.0}_{-2.1}\%$. By fitting the observed multiplicity considering the detection efficiency, we measure neutron multiplicity in muon capture as $P(0)=24\pm3\%$, $P(1)=70^{+3}_{-2}\%$, $P(2)=6.1\pm0.5\%$, $P(3)=0.38\pm0.09\%$. This is the first measurement of the multiplicity of neutrons associated with muon capture without neutron energy threshold.
△ Less
Submitted 24 February, 2025;
originally announced February 2025.
-
A muon tagging with Flash ADC waveform baselines
Authors:
D. H. Lee,
M. K. Cheoun,
J. H. Choi,
J. Y. Choi,
T. Dodo,
J. Goh,
K. Haga,
M. Harada,
S. Hasegawa,
W. Hwang,
T. Iida,
H. I. Jang,
J. S. Jang,
K. K. Joo,
D. E. Jung,
S. K. Kang,
Y. Kasugai,
T. Kawasaki,
E. M. Kim,
S. B. Kim,
S. Y. Kim,
H. Kinoshita,
T. Konno,
C. Little,
T. Maruyama
, et al. (32 additional authors not shown)
Abstract:
This manuscript describes an innovative method to tag the muons using the baseline information of the Flash ADC (FADC) waveform of PMTs in the JSNS1 (J-PARC Sterile Neutrino Search at J-PARC Spallation Neutron Source) experiment. This experiment is designed for the search for sterile neutrinos, and a muon tagging is an essential key component for the background rejection since the detector of the…
▽ More
This manuscript describes an innovative method to tag the muons using the baseline information of the Flash ADC (FADC) waveform of PMTs in the JSNS1 (J-PARC Sterile Neutrino Search at J-PARC Spallation Neutron Source) experiment. This experiment is designed for the search for sterile neutrinos, and a muon tagging is an essential key component for the background rejection since the detector of the experiment is located over-ground, where is the 3rd floor of the J-PARC Material and Life experimental facility (MLF). Especially, stopping muons inside the detector create the Michel electrons, and they are important background to be rejected. Utilizing this innovative method, more than 99.8% of Michel electrons can be rejected even without a detector veto region. This technique can be employed for any experiments which uses the similar detector configurations.
△ Less
Submitted 22 February, 2025;
originally announced February 2025.
-
Real-Field Hong-Ou-Mandel Interference of Indistinguishable Coherent Photons via Long Optical Injection-Locking over 50 km Fiber
Authors:
Seoyeon Yang,
Danbi Kim,
Hansol Jeong,
Han Seb Moon
Abstract:
Measurement-device-independent quantum key distribution (MDI-QKD) has garnered significant attention for its potential to enable security-loophole-free quantum communication. Successful MDI-QKD protocols rely on performing a two-photon Bell-state measurement at an intermediate node, with a high-visibility Hong-Ou-Mandel (HOM) interference pattern between two independent coherent photons being cruc…
▽ More
Measurement-device-independent quantum key distribution (MDI-QKD) has garnered significant attention for its potential to enable security-loophole-free quantum communication. Successful MDI-QKD protocols rely on performing a two-photon Bell-state measurement at an intermediate node, with a high-visibility Hong-Ou-Mandel (HOM) interference pattern between two independent coherent photons being crucial. In this study, we present a novel approach for developing indistinguishable coherent photon sources over 50 km of optical fiber in a real-world setting. We introduce the long optical injection-locking (long-OIL) technique, which enables frequency locking between two long-distance coherent photons beyond the coherence length of the master laser. Using the long-OIL technique, we achieved time-resolved HOM interference with a visibility of 48(2)%, approaching the theoretical 50% limit for two independent continuous-wave coherent photons. Our results demonstrate that the long-OIL platform is a promising solution for MDI-QKD with repeaterless secret key capacity.
△ Less
Submitted 7 February, 2025;
originally announced February 2025.
-
Kotlarski's lemma for dyadic models
Authors:
Grigory Franguridi,
Hyungsik Roger Moon
Abstract:
We show how to identify the distributions of the error components in the two-way dyadic model $y_{ij}=c+α_i+η_j+\varepsilon_{ij}$. To this end, we extend the lemma of Kotlarski (1967), mimicking the arguments of Evdokimov and White (2012). We allow the characteristic functions of the error components to have real zeros, as long as they do not overlap with zeros of their first derivatives.
We show how to identify the distributions of the error components in the two-way dyadic model $y_{ij}=c+α_i+η_j+\varepsilon_{ij}$. To this end, we extend the lemma of Kotlarski (1967), mimicking the arguments of Evdokimov and White (2012). We allow the characteristic functions of the error components to have real zeros, as long as they do not overlap with zeros of their first derivatives.
△ Less
Submitted 4 February, 2025;
originally announced February 2025.
-
Center of generalized skein algebras
Authors:
Hiroaki Karuo,
Han-Bom Moon,
Helen Wong
Abstract:
We consider a generalization of the Kauffman bracket skein algebra of a surface that is generated by loops and arcs between marked points on the interior or boundary, up to skein relations defined by Muller and Roger-Yang. We compute the center of this Muller-Roger-Yang skein algebra and show that it is almost Azumaya when the quantum parameter $q$ is a primitive $n$-th root of unity with odd $n$.…
▽ More
We consider a generalization of the Kauffman bracket skein algebra of a surface that is generated by loops and arcs between marked points on the interior or boundary, up to skein relations defined by Muller and Roger-Yang. We compute the center of this Muller-Roger-Yang skein algebra and show that it is almost Azumaya when the quantum parameter $q$ is a primitive $n$-th root of unity with odd $n$. We also discuss the implications on the representation theory of the Muller-Roger-Yang generalized skein algebra.
△ Less
Submitted 18 January, 2025;
originally announced January 2025.
-
Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language Models
Authors:
Hyeonseok Moon,
Jaehyung Seo,
Seungyoon Lee,
Chanjun Park,
Heuiseok Lim
Abstract:
One of the key strengths of Large Language Models (LLMs) is their ability to interact with humans by generating appropriate responses to given instructions. This ability, known as instruction-following capability, has established a foundation for the use of LLMs across various fields and serves as a crucial metric for evaluating their performance. While numerous evaluation benchmarks have been dev…
▽ More
One of the key strengths of Large Language Models (LLMs) is their ability to interact with humans by generating appropriate responses to given instructions. This ability, known as instruction-following capability, has established a foundation for the use of LLMs across various fields and serves as a crucial metric for evaluating their performance. While numerous evaluation benchmarks have been developed, most focus solely on clear and coherent instructions. However, we have noted that LLMs can become easily distracted by instruction-formatted statements, which may lead to an oversight of their instruction comprehension skills. To address this issue, we introduce the Intention of Instruction (IoInst) benchmark. This benchmark evaluates LLMs' capacity to remain focused and understand instructions without being misled by extraneous instructions. The primary objective of this benchmark is to identify the appropriate instruction that accurately guides the generation of a given context. Our findings suggest that even recently introduced state-of-the-art models still lack instruction understanding capability. Along with the proposition of IoInst in this study, we also present broad analyses of the several strategies potentially applicable to IoInst.
△ Less
Submitted 22 January, 2025; v1 submitted 26 December, 2024;
originally announced December 2024.
-
Measurement of reactor antineutrino oscillation amplitude and frequency using 3800 days of complete data sample of the RENO experiment
Authors:
S. Jeon,
H. I. Kim,
J. H. Choi,
H. I. Jang,
J. S. Jang,
K. K. Joo,
D. E. Jung,
J. G. Kim,
J. H. Kim,
J. Y. Kim,
S. B. Kim,
S. Y. Kim,
W. Kim,
E. Kwon,
D. H. Lee,
H. G. Lee,
W. J. Lee,
I. T. Lim,
D. H. Moon,
M. Y. Pac,
J. S. Park,
R. G. Park,
H. Seo,
J. W. Seo,
C. D. Shin
, et al. (5 additional authors not shown)
Abstract:
We report an updated neutrino mixing angle of $θ_{13}$ obtained from a complete data sample of the RENO experiment. The experiment has measured the amplitude and frequency of reactor anti-electron-neutrinos ($\barν_{e}$) oscillations at the Hanbit nuclear power plant, Younggwang, Korea, since August 2011. As of March 2023, the data acquisition was completed after a total of 3800 live days of detec…
▽ More
We report an updated neutrino mixing angle of $θ_{13}$ obtained from a complete data sample of the RENO experiment. The experiment has measured the amplitude and frequency of reactor anti-electron-neutrinos ($\barν_{e}$) oscillations at the Hanbit nuclear power plant, Younggwang, Korea, since August 2011. As of March 2023, the data acquisition was completed after a total of 3800 live days of detector operation. The observed candidates via inverse beta decay (IBD) are 1,211,995 (144,667) in the near (far) detector. Based on an observed energy-dependent reactor neutrino disappearance, neutrino oscillation parameters of $θ_{13}$ and $\lvertΔm_{ee}^2\rvert$ are precisely determined as $\sin^{2}2θ_{13}=0.0920_{-0.0042}^{+0.0044}(\text{stat.})_{-0.0041}^{+0.0041}(\text{syst.})$ and $\lvertΔm_{ee}^2\rvert=\left[2.57_{-0.11}^{+0.10}(\text{stat.})_{-0.05}^{+0.05}(\text{syst.})\right]\times10^{-3}~\text{eV}^{2}$. Compared to the previous RENO results published in Ref.~\cite{PhysRevLett.121.201801}, the precision is improved from 7.5\% to 6.4\% for $\sin^{2}2θ_{13}$ and from 5.2\% to 4.5\% for $\lvertΔm_{ee}^2\rvert$. The statistical error of the measurement has reached our goal and is hardly improved with additional data-taking.
△ Less
Submitted 24 December, 2024;
originally announced December 2024.
-
The first JSNS$^2$ measurement of electron neutrino flux using $^{12}C(ν_{e},e^{-}) ^{12}N_{g.s.}$ reaction
Authors:
T. Dodo,
M. K. Cheoun,
J. H. Choi,
J. Y. Choi,
J. Goh,
K. Haga,
M. Harada,
S. Hasegawa,
W. Hwang,
H. I. Jang,
J. S. Jang,
K. K. Joo,
D. E. Jung,
S. K. Kang,
Y. Kasugai,
T. Kawasaki,
E. M. Kim,
S. Y. Kim,
S. B. Kim,
H. Kinoshita,
T. Konno,
D. H. Lee,
C. Little,
T. Maruyama,
E. Marzec
, et al. (26 additional authors not shown)
Abstract:
JSNS$^2$ (J-PARC Sterile Neutrino Search at J-PARC Spallation Neutron Source) is an experiment searching for sterile neutrinos through the observation of $\barν_μ \rightarrow \barν_e$ appearance oscillations, using neutrinos produced by muon decay-at-rest. A key aspect of the experiment involves accurately understanding the neutrino flux and the quantities of pions and muons, which are progenitors…
▽ More
JSNS$^2$ (J-PARC Sterile Neutrino Search at J-PARC Spallation Neutron Source) is an experiment searching for sterile neutrinos through the observation of $\barν_μ \rightarrow \barν_e$ appearance oscillations, using neutrinos produced by muon decay-at-rest. A key aspect of the experiment involves accurately understanding the neutrino flux and the quantities of pions and muons, which are progenitors of (anti-)neutrinos, given that their production rates have yet to be measured. We present the first electron-neutrino flux measurement using $^{12}\mathrm{C}(ν_{e},e^{-}) ^{12}\mathrm{N}_{g.s.}$ reaction in JSNS$^2$, yielding a flux of (6.7 $\pm$ 1.6 (stat.) $\pm$ 1.7 (syst.)) $\times$ 10$^{-9}$ cm$^{-2}$ proton$^{-1}$ at the JSNS$^2$ detector location, located at 24 meters distance from the mercury target. This flux measurement is consistent with predictions from simulations based on hadron models.
△ Less
Submitted 24 December, 2024;
originally announced December 2024.
-
WigglyEyes: Inferring Eye Movements from Keypress Data
Authors:
Yujun Zhu,
Danqing Shi,
Hee-Seung Moon,
Antti Oulasvirta
Abstract:
We present a model for inferring where users look during interaction based on keypress data only. Given a key log, it outputs a scanpath that tells, moment-by-moment, how the user had moved eyes while entering those keys. The model can be used as a proxy for human data in cases where collecting real eye tracking data is expensive or impossible. Our technical insight is three-fold: first, we presen…
▽ More
We present a model for inferring where users look during interaction based on keypress data only. Given a key log, it outputs a scanpath that tells, moment-by-moment, how the user had moved eyes while entering those keys. The model can be used as a proxy for human data in cases where collecting real eye tracking data is expensive or impossible. Our technical insight is three-fold: first, we present an inference architecture that considers the individual characteristics of the user, inferred as a low-dimensional parameter vector; second, we present a novel loss function for synchronizing inferred eye movements with the keypresses; third, we train the model using a hybrid approach with both human data and synthetically generated data. The approach can be applied in interactive systems where predictive models of user behavior are available. We report results from evaluation in the challenging case of touchscreen typing, where the model accurately inferred real eye movements.
△ Less
Submitted 13 February, 2025; v1 submitted 20 December, 2024;
originally announced December 2024.
-
Towards modeling evolving longitudinal health trajectories with a transformer-based deep learning model
Authors:
Hans Moen,
Vishnu Raj,
Andrius Vabalas,
Markus Perola,
Samuel Kaski,
Andrea Ganna,
Pekka Marttinen
Abstract:
Health registers contain rich information about individuals' health histories. Here our interest lies in understanding how individuals' health trajectories evolve in a nationwide longitudinal dataset with coded features, such as clinical codes, procedures, and drug purchases. We introduce a straightforward approach for training a Transformer-based deep learning model in a way that lets us analyze…
▽ More
Health registers contain rich information about individuals' health histories. Here our interest lies in understanding how individuals' health trajectories evolve in a nationwide longitudinal dataset with coded features, such as clinical codes, procedures, and drug purchases. We introduce a straightforward approach for training a Transformer-based deep learning model in a way that lets us analyze how individuals' trajectories change over time. This is achieved by modifying the training objective and by applying a causal attention mask. We focus here on a general task of predicting the onset of a range of common diseases in a given future forecast interval. However, instead of providing a single prediction about diagnoses that could occur in this forecast interval, our approach enable the model to provide continuous predictions at every time point up until, and conditioned on, the time of the forecast period. We find that this model performs comparably to other models, including a bi-directional transformer model, in terms of basic prediction performance while at the same time offering promising trajectory modeling properties. We explore a couple of ways to use this model for analyzing health trajectories and aiding in early detection of events that forecast possible later disease onsets. We hypothesize that this method may be helpful in continuous monitoring of peoples' health trajectories and enabling interventions in ongoing health trajectories, as well as being useful in retrospective analyses.
△ Less
Submitted 11 December, 2024;
originally announced December 2024.
-
Performance of the prototype beam drift chamber for LAMPS at RAON with proton and Carbon-12 beams
Authors:
H. Kim,
Y. Bae,
C. Heo,
J. Seo,
J. Hwang,
D. H. Moon,
D. S. Ahn,
J. K. Ahn,
J. Bae,
J. Bok,
Y. Cheon,
S. W. Choi,
S. Do,
B. Hong,
S. -W. Hong,
J. Huh,
S. Hwang,
Y. Jang,
B. Kang,
A. Kim,
B. Kim,
C. Kim,
E. -J. Kim,
G. Kim,
G. Kim
, et al. (23 additional authors not shown)
Abstract:
Beam Drift Chamber (BDC) is designed to reconstruct the trajectories of incident rare isotope beams provided by RAON (Rare isotope Accelerator complex for ON-line experiments) into the experimental target of LAMPS (Large Acceptance Multi-Purpose Spectrometer). To conduct the performance test of the BDC, the prototype BDC (pBDC) is manufactured and evaluated with the high energy ion beams from HIMA…
▽ More
Beam Drift Chamber (BDC) is designed to reconstruct the trajectories of incident rare isotope beams provided by RAON (Rare isotope Accelerator complex for ON-line experiments) into the experimental target of LAMPS (Large Acceptance Multi-Purpose Spectrometer). To conduct the performance test of the BDC, the prototype BDC (pBDC) is manufactured and evaluated with the high energy ion beams from HIMAC (Heavy Ion Medical Accelerator in Chiba) facility in Japan. Two kinds of ion beams, 100 MeV proton, and 200 MeV/u $^{12}$C, have been utilized for this evaluation, and the track reconstruction efficiency and position resolution have been measured as the function of applied high voltage. This paper introduces the construction details and presents the track reconstruction efficiency and position resolution of pBDC.
△ Less
Submitted 6 December, 2024;
originally announced December 2024.
-
On the rank index of projective curves of almost minimal degree
Authors:
Jaewoo Jung,
Hyunsuk Moon,
Euisung Park
Abstract:
In this article, we investigate the rank index of projective curves $\mathscr{C} \subset \mathbb{P}^r$ of degree $r+1$ when $\mathscr{C} = π_p (\tilde{\mathscr{C}})$ for the standard rational normal curve $\tilde{\mathscr{C}} \subset \mathbb{P}^{r+1}$ and a point $p \in \mathbb{P}^{r+1} \setminus \tilde{\mathscr{C}}^3$. Here, the rank index of a closed subscheme $X \subset \mathbb{P}^r$ is defined…
▽ More
In this article, we investigate the rank index of projective curves $\mathscr{C} \subset \mathbb{P}^r$ of degree $r+1$ when $\mathscr{C} = π_p (\tilde{\mathscr{C}})$ for the standard rational normal curve $\tilde{\mathscr{C}} \subset \mathbb{P}^{r+1}$ and a point $p \in \mathbb{P}^{r+1} \setminus \tilde{\mathscr{C}}^3$. Here, the rank index of a closed subscheme $X \subset \mathbb{P}^r$ is defined to be the least integer $k$ such that its homogeneous ideal can be generated by quadratic polynomials of rank $\leq k$. Our results show that the rank index of $\mathscr{C}$ is at most $4$, and it is exactly equal to $3$ when the projection center $p$ is a coordinate point of $\mathbb{P}^{r+1}$. We also investigate the case where $p \in \tilde{\mathscr{C}}^3 \setminus \tilde{\mathscr{C}}^2$.
△ Less
Submitted 26 November, 2024;
originally announced November 2024.
-
Tuning the shear and extensional rheology of semi-flexible polyelectrolyte solutions
Authors:
H. Moon,
S. Yamani,
G. H. McKinley,
J. Lee
Abstract:
Semi-flexible polyelectrolytes are a group of biopolymers with a wide range of applications from drag reducing agents in turbulent flows to thickening agents in food and cosmetics. In this study, we investigate the rheology of aqueous solutions of xanthan gum as a canonical semi-flexible polyelectrolyte in steady shear and transient extensional flows via torsional rheometry and dripping-onto-subst…
▽ More
Semi-flexible polyelectrolytes are a group of biopolymers with a wide range of applications from drag reducing agents in turbulent flows to thickening agents in food and cosmetics. In this study, we investigate the rheology of aqueous solutions of xanthan gum as a canonical semi-flexible polyelectrolyte in steady shear and transient extensional flows via torsional rheometry and dripping-onto-substrate (DoS), respectively. The high molecular weight of the xanthan gum and the numerous charged groups on the side branches attached to the backbone allow the shear and extensional rheology of the xanthan gum solutions to be tuned over a wide range by changing the ionic strength of the solvent. In steady shear flow, increasing the xanthan gum concentration increases both the zero shear viscosity and the extent of shear-thinning of the solution. Conversely, increasing the ionic strength of the solvent by addition of sodium chloride (NaCl) decreases both the zero shear viscosity and the level of shear-thinning. In transient extensional flow, increasing the xanthan gum concentration changes the dynamics of the capillary thinning from an inelastic power-law (IP) response to an elastocapillary (EC) balance, from which an extensional relaxation time can be measured based on the rate of filament thinning. Increasing the NaCl concentration decreases the extensional relaxation time and the transient extensional viscosity of the viscoelastic solution. Based on the dynamics of capillary thinning observed in the DoS experiments, we provide a relationship for the smallest extensional relaxation time that can be measured using DoS. We suggest that the change in the dynamics of capillary thinning from an IP response to an EC response can be used as an easy and robust experimental method for identifying the rheologically effective overlap concentration of a semi-flexible polyelectrolyte solution, i.e., the critical concentration at which polymer molecules start to interact with each other to produce a viscoelastic strain-stiffening response (often perceived as "stringiness") in transient extensional flows such as those involved in dripping, dispensing and filling operations.
△ Less
Submitted 19 October, 2024;
originally announced October 2024.
-
Using Large Multimodal Models to Extract Knowledge Components for Knowledge Tracing from Multimedia Question Information
Authors:
Hyeongdon Moon,
Richard Davis,
Seyed Parsa Neshaei,
Pierre Dillenbourg
Abstract:
Knowledge tracing models have enabled a range of intelligent tutoring systems to provide feedback to students. However, existing methods for knowledge tracing in learning sciences are predominantly reliant on statistical data and instructor-defined knowledge components, making it challenging to integrate AI-generated educational content with traditional established methods. We propose a method for…
▽ More
Knowledge tracing models have enabled a range of intelligent tutoring systems to provide feedback to students. However, existing methods for knowledge tracing in learning sciences are predominantly reliant on statistical data and instructor-defined knowledge components, making it challenging to integrate AI-generated educational content with traditional established methods. We propose a method for automatically extracting knowledge components from educational content using instruction-tuned large multimodal models. We validate this approach by comprehensively evaluating it against knowledge tracing benchmarks in five domains. Our results indicate that the automatically extracted knowledge components can effectively replace human-tagged labels, offering a promising direction for enhancing intelligent tutoring systems in limited-data scenarios, achieving more explainable assessments in educational settings, and laying the groundwork for automated assessment.
△ Less
Submitted 30 September, 2024;
originally announced September 2024.
-
Illustrious: an Open Advanced Illustration Model
Authors:
Sang Hyun Park,
Jun Young Koh,
Junha Lee,
Joy Song,
Dongha Kim,
Hoyeon Moon,
Hyunju Lee,
Min Song
Abstract:
In this work, we share the insights for achieving state-of-the-art quality in our text-to-image anime image generative model, called Illustrious. To achieve high resolution, dynamic color range images, and high restoration ability, we focus on three critical approaches for model improvement. First, we delve into the significance of the batch size and dropout control, which enables faster learning…
▽ More
In this work, we share the insights for achieving state-of-the-art quality in our text-to-image anime image generative model, called Illustrious. To achieve high resolution, dynamic color range images, and high restoration ability, we focus on three critical approaches for model improvement. First, we delve into the significance of the batch size and dropout control, which enables faster learning of controllable token based concept activations. Second, we increase the training resolution of images, affecting the accurate depiction of character anatomy in much higher resolution, extending its generation capability over 20MP with proper methods. Finally, we propose the refined multi-level captions, covering all tags and various natural language captions as a critical factor for model development. Through extensive analysis and experiments, Illustrious demonstrates state-of-the-art performance in terms of animation style, outperforming widely-used models in illustration domains, propelling easier customization and personalization with nature of open source. We plan to publicly release updated Illustrious model series sequentially as well as sustainable plans for improvements.
△ Less
Submitted 30 September, 2024;
originally announced September 2024.
-
Search for proton decay via $p\rightarrow{e^+η}$ and $p\rightarrow{μ^+η}$ with a 0.37 Mton-year exposure of Super-Kamiokande
Authors:
Super-Kamiokande Collaboration,
:,
N. Taniuchi,
K. Abe,
S. Abe,
Y. Asaoka,
C. Bronner,
M. Harada,
Y. Hayato,
K. Hiraide,
K. Hosokawa,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
R. Kaneshima,
Y. Kashiwagi,
Y. Kataoka,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
M. Nakahata,
S. Nakayama,
Y. Noguchi
, et al. (267 additional authors not shown)
Abstract:
A search for proton decay into $e^+/μ^+$ and a $η$ meson has been performed using data from a 0.373 Mton$\cdot$year exposure (6050.3 live days) of Super-Kamiokande. Compared to previous searches this work introduces an improved model of the intranuclear $η$ interaction cross section, resulting in a factor of two reduction in uncertainties from this source and $\sim$10\% increase in signal efficien…
▽ More
A search for proton decay into $e^+/μ^+$ and a $η$ meson has been performed using data from a 0.373 Mton$\cdot$year exposure (6050.3 live days) of Super-Kamiokande. Compared to previous searches this work introduces an improved model of the intranuclear $η$ interaction cross section, resulting in a factor of two reduction in uncertainties from this source and $\sim$10\% increase in signal efficiency. No significant data excess was found above the expected number of atmospheric neutrino background events resulting in no indication of proton decay into either mode. Lower limits on the proton partial lifetime of $1.4\times\mathrm{10^{34}~years}$ for $p\rightarrow e^+η$ and $7.3\times\mathrm{10^{33}~years}$ for $p\rightarrow μ^+η$ at the 90$\%$ C.L. were set. These limits are around 1.5 times longer than our previous study and are the most stringent to date.
△ Less
Submitted 29 September, 2024;
originally announced September 2024.
-
DM: Dual-path Magnitude Network for General Speech Restoration
Authors:
Da-Hee Yang,
Dail Kim,
Joon-Hyuk Chang,
Jeonghwan Choi,
Han-gil Moon
Abstract:
In this paper, we introduce a novel general speech restoration model: the Dual-path Magnitude (DM) network, designed to address multiple distortions including noise, reverberation, and bandwidth degradation effectively. The DM network employs dual parallel magnitude decoders that share parameters: one uses a masking-based algorithm for distortion removal and the other employs a mapping-based appro…
▽ More
In this paper, we introduce a novel general speech restoration model: the Dual-path Magnitude (DM) network, designed to address multiple distortions including noise, reverberation, and bandwidth degradation effectively. The DM network employs dual parallel magnitude decoders that share parameters: one uses a masking-based algorithm for distortion removal and the other employs a mapping-based approach for speech restoration. A novel aspect of the DM network is the integration of the magnitude spectrogram output from the masking decoder into the mapping decoder through a skip connection, enhancing the overall restoration capability. This integrated approach overcomes the inherent limitations observed in previous models, as detailed in a step-by-step analysis. The experimental results demonstrate that the DM network outperforms other baseline models in the comprehensive aspect of general speech restoration, achieving substantial restoration with fewer parameters.
△ Less
Submitted 13 September, 2024;
originally announced September 2024.
-
First Measurement of Missing Energy Due to Nuclear Effects in Monoenergetic Neutrino Charged Current Interactions
Authors:
E. Marzec,
S. Ajimura,
A. Antonakis,
M. Botran,
M. K. Cheoun,
J. H. Choi,
J. W. Choi,
J. Y. Choi,
T. Dodo,
H. Furuta,
J. H. Goh,
K. Haga,
M. Harada,
S. Hasegawa,
Y. Hino,
T. Hiraiwa,
W. Hwang,
T. Iida,
E. Iwai,
S. Iwata,
H. I. Jang,
J. S. Jang,
M. C. Jang,
H. K. Jeon,
S. H. Jeon
, et al. (59 additional authors not shown)
Abstract:
We present the first measurement of the missing energy due to nuclear effects in monoenergetic, muon neutrino charged-current interactions on carbon, originating from $K^+ \rightarrow μ^+ ν_μ$ decay at rest ($E_{ν_μ}=235.5$ MeV), performed with the J-PARC Sterile Neutrino Search at the J-PARC Spallation Neutron Source liquid scintillator based experiment. Toward characterizing the neutrino interac…
▽ More
We present the first measurement of the missing energy due to nuclear effects in monoenergetic, muon neutrino charged-current interactions on carbon, originating from $K^+ \rightarrow μ^+ ν_μ$ decay at rest ($E_{ν_μ}=235.5$ MeV), performed with the J-PARC Sterile Neutrino Search at the J-PARC Spallation Neutron Source liquid scintillator based experiment. Toward characterizing the neutrino interaction, ostensibly $ν_μn \rightarrow μ^- p$ or $ν_μ$$^{12}\mathrm{C}$ $\rightarrow μ^-$$^{12}\mathrm{N}$, we define the missing energy as the energy transferred to the nucleus ($ω$) minus the kinetic energy of the outgoing proton(s), $E_{m} \equivω-\sum T_p$, and relate this to visible energy in the detector, $E_{m}=E_{ν_μ} (235.5 \mathrm{MeV})-m_μ(105.7 \mathrm{MeV}) + [m_n-m_p (1.3 \mathrm{MeV})] - E_{\mathrm{vis}}$. The missing energy, which is naively expected to be zero in the absence of nuclear effects (e.g. nucleon separation energy, Fermi momenta, and final-state interactions), is uniquely sensitive to many aspects of the interaction, and has previously been inaccessible with neutrinos. The shape-only, differential cross section measurement reported, based on a $(77\pm3)$% pure double-coincidence kaon decay-at-rest signal (621 total events), provides detailed insight into neutrino-nucleus interactions, allowing even the nuclear orbital shell of the struck nucleon to be inferred. The measurement provides an important benchmark for models and event generators at hundreds of MeV neutrino energies, characterized by the difficult-to-model transition region between neutrino-nucleus and neutrino-nucleon scattering, and relevant for applications in nuclear physics, neutrino oscillation measurements,and Type-II supernova studies.
△ Less
Submitted 26 February, 2025; v1 submitted 2 September, 2024;
originally announced September 2024.
-
Robust Estimation of Regression Models with Potentially Endogenous Outliers via a Modern Optimization Lens
Authors:
Zhan Gao,
Hyungsik Roger Moon
Abstract:
This paper addresses the robust estimation of linear regression models in the presence of potentially endogenous outliers. Through Monte Carlo simulations, we demonstrate that existing $L_1$-regularized estimation methods, including the Huber estimator and the least absolute deviation (LAD) estimator, exhibit significant bias when outliers are endogenous. Motivated by this finding, we investigate…
▽ More
This paper addresses the robust estimation of linear regression models in the presence of potentially endogenous outliers. Through Monte Carlo simulations, we demonstrate that existing $L_1$-regularized estimation methods, including the Huber estimator and the least absolute deviation (LAD) estimator, exhibit significant bias when outliers are endogenous. Motivated by this finding, we investigate $L_0$-regularized estimation methods. We propose systematic heuristic algorithms, notably an iterative hard-thresholding algorithm and a local combinatorial search refinement, to solve the combinatorial optimization problem of the \(L_0\)-regularized estimation efficiently. Our Monte Carlo simulations yield two key results: (i) The local combinatorial search algorithm substantially improves solution quality compared to the initial projection-based hard-thresholding algorithm while offering greater computational efficiency than directly solving the mixed integer optimization problem. (ii) The $L_0$-regularized estimator demonstrates superior performance in terms of bias reduction, estimation accuracy, and out-of-sample prediction errors compared to $L_1$-regularized alternatives. We illustrate the practical value of our method through an empirical application to stock return forecasting.
△ Less
Submitted 7 August, 2024;
originally announced August 2024.
-
HELPS for Emergency Location Service: Hyper-Enhanced Local Positioning System
Authors:
Hichan Moon,
Hyosoon Park,
Jiwon Seo
Abstract:
In this study, we propose a novel positioning and searching system for emergency location services, namely the hyper-enhanced local positioning system (HELPS), which is applicable to all mobile phone users, including legacy feature phone users. In the case of an emergency, rescuers are dispatched with portable signal measurement equipment around the estimated location of the emergency caller. Each…
▽ More
In this study, we propose a novel positioning and searching system for emergency location services, namely the hyper-enhanced local positioning system (HELPS), which is applicable to all mobile phone users, including legacy feature phone users. In the case of an emergency, rescuers are dispatched with portable signal measurement equipment around the estimated location of the emergency caller. Each signal measurement device measures the uplink signal from the mobile phone of the caller. After calculating the rough location of the caller's mobile phone based on these measurements, rescuers can efficiently search for the caller using the received uplink signal strength. Thus, the positioning accuracy in a conventional sense is not a limitation for rescuers in finding the caller. HELPS is not a traditional positioning system but rather a system with humans in the loop designed to reduce search time in emergencies. HELPS can provide emergency location information even in environments where the GPS or Wi-Fi is not functional. Furthermore, for HELPS operation, no hardware changes or software installations are required on the caller's mobile phone.
△ Less
Submitted 7 August, 2024;
originally announced August 2024.
-
A NuSTAR Census of the X-ray Binary Population of the M31 Disk
Authors:
Hannah Moon,
Daniel R. Wik,
V. Antoniou,
M. Eracleous,
Ann E. Hornschemeier,
Margaret Lazzarini,
Bret D. Lehmer,
Neven Vulic,
Benjamin F. Williams,
T. J. Maccarone,
K. Pottschmidt,
Andrew Ptak,
Mihoko Yukita,
Andreas Zezas
Abstract:
Using hard (E>10 keV) X-ray observations with NuSTAR, we are able to differentiate between accretion states, and thus compact object types, of neutron stars and black holes in X-ray binaries (XRBs) in M31, our nearest Milky Way-type neighbor. Using ten moderate-depth (20-50 ks) observations of the disk of M31 covering a total of ~0.45 deg$^{2}$, we detect 20 sources at 2$σ$ in the 4-25 keV band pa…
▽ More
Using hard (E>10 keV) X-ray observations with NuSTAR, we are able to differentiate between accretion states, and thus compact object types, of neutron stars and black holes in X-ray binaries (XRBs) in M31, our nearest Milky Way-type neighbor. Using ten moderate-depth (20-50 ks) observations of the disk of M31 covering a total of ~0.45 deg$^{2}$, we detect 20 sources at 2$σ$ in the 4-25 keV band pass, 14 of which we consider to be XRB candidates. This complements an existing deeper (100-400 ks) survey covering ~0.2 deg$^{2}$ of the bulge and the northeastern disk. We make tentative classifications of 9 of these sources with the use of diagnostic color-intensity and color-color diagrams, which separate sources into various neutron star and black hole regimes, identifying 3 black holes and 6 neutron stars. In addition, we create X-ray luminosity functions for both the full (4-25 keV) and hard (12-25 keV) band, as well as sub-populations of the full band based on compact object type and association with globular clusters. Our best fit globular cluster XLF is shallower than the field XLF, and preliminary BH and NS XLFs suggest a difference in shape based on compact object type. We find that the cumulative disk XLFs in the full and hard band are best fit by power laws with indices of 1.32 and 1.28 respectively. This is consistent with models of the Milky Way XLF from Grimm et al. (2002), Voss & Ajello (2010), and Doroshenko et al. (2014).
△ Less
Submitted 5 August, 2024;
originally announced August 2024.
-
Bayesian Active Learning for Semantic Segmentation
Authors:
Sima Didari,
Wenjun Hu,
Jae Oh Woo,
Heng Hao,
Hankyu Moon,
Seungjai Min
Abstract:
Fully supervised training of semantic segmentation models is costly and challenging because each pixel within an image needs to be labeled. Therefore, the sparse pixel-level annotation methods have been introduced to train models with a subset of pixels within each image. We introduce a Bayesian active learning framework based on sparse pixel-level annotation that utilizes a pixel-level Bayesian u…
▽ More
Fully supervised training of semantic segmentation models is costly and challenging because each pixel within an image needs to be labeled. Therefore, the sparse pixel-level annotation methods have been introduced to train models with a subset of pixels within each image. We introduce a Bayesian active learning framework based on sparse pixel-level annotation that utilizes a pixel-level Bayesian uncertainty measure based on Balanced Entropy (BalEnt) [84]. BalEnt captures the information between the models' predicted marginalized probability distribution and the pixel labels. BalEnt has linear scalability with a closed analytical form and can be calculated independently per pixel without relational computations with other pixels. We train our proposed active learning framework for Cityscapes, Camvid, ADE20K and VOC2012 benchmark datasets and show that it reaches supervised levels of mIoU using only a fraction of labeled pixels while outperforming the previous state-of-the-art active learning models with a large margin.
△ Less
Submitted 3 August, 2024;
originally announced August 2024.
-
Centrality dependence of Lévy-stable two-pion Bose-Einstein correlations in $\sqrt{s_{_{NN}}}=200$ GeV Au$+$Au collisions
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
H. Al-Ta'ani,
J. Alexander,
A. Angerami,
K. Aoki,
N. Apadula,
Y. Aramaki,
H. Asano,
E. C. Aschenauer,
E. T. Atomssa,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
B. Bannier,
K. N. Barish,
B. Bassalleck,
S. Bathe
, et al. (377 additional authors not shown)
Abstract:
The PHENIX experiment measured the centrality dependence of two-pion Bose-Einstein correlation functions in $\sqrt{s_{_{NN}}}=200$~GeV Au$+$Au collisions at the Relativistic Heavy Ion Collider at Brookhaven National Laboratory. The data are well represented by Lévy-stable source distributions. The extracted source parameters are the correlation-strength parameter $λ$, the Lévy index of stability…
▽ More
The PHENIX experiment measured the centrality dependence of two-pion Bose-Einstein correlation functions in $\sqrt{s_{_{NN}}}=200$~GeV Au$+$Au collisions at the Relativistic Heavy Ion Collider at Brookhaven National Laboratory. The data are well represented by Lévy-stable source distributions. The extracted source parameters are the correlation-strength parameter $λ$, the Lévy index of stability $α$, and the Lévy-scale parameter $R$ as a function of transverse mass $m_T$ and centrality. The $λ(m_T)$ parameter is constant at larger values of $m_T$, but decreases as $m_T$ decreases. The Lévy scale parameter $R(m_T)$ decreases with $m_T$ and exhibits proportionality to the length scale of the nuclear overlap region. The Lévy exponent $α(m_T)$ is independent of $m_T$ within uncertainties in each investigated centrality bin, but shows a clear centrality dependence. At all centralities, the Lévy exponent $α$ is significantly different from that of Gaussian ($α=2$) or Cauchy ($α=1$) source distributions. Comparisons to the predictions of Monte-Carlo simulations of resonance-decay chains show that in all but the most peripheral centrality class (50%-60%), the obtained results are inconsistent with the measurements, unless a significant reduction of the in-medium mass of the $η'$ meson is included. In each centrality class, the best value of the in-medium $η'$ mass is compared to the mass of the $η$ meson, as well as to several theoretical predictions that consider restoration of $U_A(1)$ symmetry in hot hadronic matter.
△ Less
Submitted 20 December, 2024; v1 submitted 11 July, 2024;
originally announced July 2024.
-
LANSCE-mQ: Dedicated search for milli/fractionally charged particles at LANL
Authors:
Yu-Dai Tsai,
Insung Hwang,
Ryan Schmitz,
Matthew Citron,
Kranti Gunthoti,
Jacob Steenis,
Hoyong Jeong,
Hyunki Moon,
Jae Hyeok Yoo,
Ming Xiong Liu
Abstract:
In this paper, we propose an experiment, LANSCE-mQ, aiming to detect fractionally charged and millicharged particles (mCP) using an 800 MeV proton beam fixed target at the Los Alamos Neutron Science Center (LANSCE) facility. This search can shed new light on numerous fundamental questions, including charge quantization, the predictions of string theories and grand unification theories, the gauge s…
▽ More
In this paper, we propose an experiment, LANSCE-mQ, aiming to detect fractionally charged and millicharged particles (mCP) using an 800 MeV proton beam fixed target at the Los Alamos Neutron Science Center (LANSCE) facility. This search can shed new light on numerous fundamental questions, including charge quantization, the predictions of string theories and grand unification theories, the gauge symmetry of the Standard Model, dark sector models, and the tests of cosmic reheating. We propose to install two-layer scintillation detectors made of plastic (such as EJ-200) or CeBr3 to search for mCPs. Dedicated Geant4 detector simulations and in situ measurements have been conducted to obtain a preliminary determination of the background rate. The dominant backgrounds are beam-induced neutrons and coincident dark current signals from the photomultiplier tubes, while beam-induced gammas and cosmic muons are subdominant. We determined that LANSCE-mQ, the dedicated mCP experiment, has the leading mCP sensitivity for mass between ~ 1 MeV to 300 MeV.
△ Less
Submitted 9 July, 2024;
originally announced July 2024.
-
Query-Guided Self-Supervised Summarization of Nursing Notes
Authors:
Ya Gao,
Hans Moen,
Saila Koivusalo,
Miika Koskinen,
Pekka Marttinen
Abstract:
Nursing notes, an important part of Electronic Health Records (EHRs), track a patient's health during a care episode. Summarizing key information in nursing notes can help clinicians quickly understand patients' conditions. However, existing summarization methods in the clinical setting, especially abstractive methods, have overlooked nursing notes and require reference summaries for training. We…
▽ More
Nursing notes, an important part of Electronic Health Records (EHRs), track a patient's health during a care episode. Summarizing key information in nursing notes can help clinicians quickly understand patients' conditions. However, existing summarization methods in the clinical setting, especially abstractive methods, have overlooked nursing notes and require reference summaries for training. We introduce QGSumm, a novel query-guided self-supervised domain adaptation approach for abstractive nursing note summarization. The method uses patient-related clinical queries for guidance, and hence does not need reference summaries for training. Through automatic experiments and manual evaluation by an expert clinician, we study our approach and other state-of-the-art Large Language Models (LLMs) for nursing note summarization. Our experiments show: 1) GPT-4 is competitive in maintaining information in the original nursing notes, 2) QGSumm can generate high-quality summaries with a good balance between recall of the original content and hallucination rate lower than other top methods. Ultimately, our work offers a new perspective on conditional text summarization, tailored to clinical applications.
△ Less
Submitted 2 December, 2024; v1 submitted 4 July, 2024;
originally announced July 2024.
-
Scalp Diagnostic System With Label-Free Segmentation and Training-Free Image Translation
Authors:
Youngmin Kim,
Saejin Kim,
Hoyeon Moon,
Youngjae Yu,
Junhyug Noh
Abstract:
Scalp diseases and alopecia affect millions of people around the world, underscoring the urgent need for early diagnosis and management of the disease. However, the development of a comprehensive AI-based diagnosis system encompassing these conditions remains an underexplored domain due to the challenges associated with data imbalance and the costly nature of labeling. To address these issues, we…
▽ More
Scalp diseases and alopecia affect millions of people around the world, underscoring the urgent need for early diagnosis and management of the disease. However, the development of a comprehensive AI-based diagnosis system encompassing these conditions remains an underexplored domain due to the challenges associated with data imbalance and the costly nature of labeling. To address these issues, we propose ScalpVision, an AI-driven system for the holistic diagnosis of scalp diseases and alopecia. In ScalpVision, effective hair segmentation is achieved using pseudo image-label pairs and an innovative prompting method in the absence of traditional hair masking labels. This approach is crucial for extracting key features such as hair thickness and count, which are then used to assess alopecia severity. Additionally, ScalpVision introduces DiffuseIT-M, a generative model adept at dataset augmentation while maintaining hair information, facilitating improved predictions of scalp disease severity. Our experimental results affirm ScalpVision's efficiency in diagnosing a variety of scalp conditions and alopecia, showcasing its potential as a valuable tool in dermatological care.
△ Less
Submitted 25 June, 2024; v1 submitted 24 June, 2024;
originally announced June 2024.
-
Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded Conversations
Authors:
Yoonna Jang,
Suhyune Son,
Jeongwoo Lee,
Junyoung Son,
Yuna Hur,
Jungwoo Lim,
Hyeonseok Moon,
Kisu Yang,
Heuiseok Lim
Abstract:
Despite the striking advances in recent language generation performance, model-generated responses have suffered from the chronic problem of hallucinations that are either untrue or unfaithful to a given source. Especially in the task of knowledge grounded conversation, the models are required to generate informative responses, but hallucinated utterances lead to miscommunication. In particular, e…
▽ More
Despite the striking advances in recent language generation performance, model-generated responses have suffered from the chronic problem of hallucinations that are either untrue or unfaithful to a given source. Especially in the task of knowledge grounded conversation, the models are required to generate informative responses, but hallucinated utterances lead to miscommunication. In particular, entity-level hallucination that causes critical misinformation and undesirable conversation is one of the major concerns. To address this issue, we propose a post-hoc refinement method called REM. It aims to enhance the quality and faithfulness of hallucinated utterances by refining them based on the source knowledge. If the generated utterance has a low source-faithfulness score with the given knowledge, REM mines the key entities in the knowledge and implicitly uses them for refining the utterances. We verify that our method reduces entity hallucination in the utterance. Also, we show the adaptability and efficacy of REM with extensive experiments and generative results. Our code is available at https://github.com/YOONNAJANG/REM.
△ Less
Submitted 16 June, 2024;
originally announced June 2024.
-
A Generalized Pointing Error Model for FSO Links with Fixed-Wing UAVs for 6G: Analysis and Trajectory Optimization
Authors:
Hyung-Joo Moon,
Chan-Byoung Chae,
Kai-Kit Wong,
Mohamed-Slim Alouini
Abstract:
Free-space optical (FSO) communication is a promising solution to support wireless backhaul links in emerging 6G non-terrestrial networks. At the link level, pointing errors in FSO links can significantly impact capacity, making accurate modeling of these errors essential for both assessing and enhancing communication performance. In this paper, we introduce a novel model for FSO pointing errors i…
▽ More
Free-space optical (FSO) communication is a promising solution to support wireless backhaul links in emerging 6G non-terrestrial networks. At the link level, pointing errors in FSO links can significantly impact capacity, making accurate modeling of these errors essential for both assessing and enhancing communication performance. In this paper, we introduce a novel model for FSO pointing errors in unmanned aerial vehicles (UAVs) that incorporates three-dimensional (3D) jitter, including roll, pitch, and yaw angle jittering. We derive a probability density function for the pointing error angle based on the relative position and posture of the UAV to the ground station. This model is then integrated into a trajectory optimization problem designed to maximize energy efficiency while meeting constraints on speed, acceleration, and elevation angle. Our proposed optimization method significantly improves energy efficiency by adjusting the UAV's flight trajectory to minimize exposure to directions highly affected by jitter. The simulation results emphasize the importance of using UAV-specific 3D jitter models in achieving accurate performance measurements and effective system optimization in FSO communication networks. Utilizing our generalized model, the optimized trajectories achieve up to 11.8 percent higher energy efficiency compared to those derived from conventional Gaussian pointing error models.
△ Less
Submitted 8 June, 2024;
originally announced June 2024.
-
First joint oscillation analysis of Super-Kamiokande atmospheric and T2K accelerator neutrino data
Authors:
Super-Kamiokande,
T2K collaborations,
:,
S. Abe,
K. Abe,
N. Akhlaq,
R. Akutsu,
H. Alarakia-Charles,
A. Ali,
Y. I. Alj Hakim,
S. Alonso Monsalve,
S. Amanai,
C. Andreopoulos,
L. H. V. Anthony,
M. Antonova,
S. Aoki,
K. A. Apte,
T. Arai,
T. Arihara,
S. Arimoto,
Y. Asada,
R. Asaka,
Y. Ashida,
E. T. Atkin,
N. Babu
, et al. (524 additional authors not shown)
Abstract:
The Super-Kamiokande and T2K collaborations present a joint measurement of neutrino oscillation parameters from their atmospheric and beam neutrino data. It uses a common interaction model for events overlapping in neutrino energy and correlated detector systematic uncertainties between the two datasets, which are found to be compatible. Using 3244.4 days of atmospheric data and a beam exposure of…
▽ More
The Super-Kamiokande and T2K collaborations present a joint measurement of neutrino oscillation parameters from their atmospheric and beam neutrino data. It uses a common interaction model for events overlapping in neutrino energy and correlated detector systematic uncertainties between the two datasets, which are found to be compatible. Using 3244.4 days of atmospheric data and a beam exposure of $19.7(16.3) \times 10^{20}$ protons on target in (anti)neutrino mode, the analysis finds a 1.9$σ$ exclusion of CP-conservation (defined as $J_{CP}=0$) and a preference for the normal mass ordering.
△ Less
Submitted 15 October, 2024; v1 submitted 21 May, 2024;
originally announced May 2024.
-
Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation
Authors:
JoonHo Lee,
Jae Oh Woo,
Juree Seok,
Parisa Hassanzadeh,
Wooseok Jang,
JuYoun Son,
Sima Didari,
Baruch Gutow,
Heng Hao,
Hankyu Moon,
Wenjun Hu,
Yeong-Dae Kwon,
Taehee Lee,
Seungjai Min
Abstract:
Assessing response quality to instructions in language models is vital but challenging due to the complexity of human language across different contexts. This complexity often results in ambiguous or inconsistent interpretations, making accurate assessment difficult. To address this issue, we propose a novel Uncertainty-aware Reward Model (URM) that introduces a robust uncertainty estimation for t…
▽ More
Assessing response quality to instructions in language models is vital but challenging due to the complexity of human language across different contexts. This complexity often results in ambiguous or inconsistent interpretations, making accurate assessment difficult. To address this issue, we propose a novel Uncertainty-aware Reward Model (URM) that introduces a robust uncertainty estimation for the quality of paired responses based on Bayesian approximation. Trained with preference datasets, our uncertainty-enabled proxy not only scores rewards for responses but also evaluates their inherent uncertainty. Empirical results demonstrate significant benefits of incorporating the proposed proxy into language model training. Our method boosts the instruction following capability of language models by refining data curation for training and improving policy optimization objectives, thereby surpassing existing methods by a large margin on benchmarks such as Vicuna and MT-bench. These findings highlight that our proposed approach substantially advances language model training and paves a new way of harnessing uncertainty within language models.
△ Less
Submitted 31 January, 2025; v1 submitted 10 May, 2024;
originally announced May 2024.