-
GIFARC: Synthetic Dataset for Leveraging Human-Intuitive Analogies to Elevate AI Reasoning
Authors:
Woochang Sim,
Hyunseok Ryu,
Kyungmin Choi,
Sungwon Han,
Sundong Kim
Abstract:
The Abstraction and Reasoning Corpus (ARC) poses a stringent test of general AI capabilities, requiring solvers to infer abstract patterns from only a handful of examples. Despite substantial progress in deep learning, state-of-the-art models still achieve accuracy rates of merely 40-55% on 2024 ARC Competition, indicative of a significant gap between their performance and human-level reasoning. I…
▽ More
The Abstraction and Reasoning Corpus (ARC) poses a stringent test of general AI capabilities, requiring solvers to infer abstract patterns from only a handful of examples. Despite substantial progress in deep learning, state-of-the-art models still achieve accuracy rates of merely 40-55% on 2024 ARC Competition, indicative of a significant gap between their performance and human-level reasoning. In this work, we seek to bridge that gap by introducing an analogy-inspired ARC dataset, GIFARC. Leveraging large language models (LLMs) and vision-language models (VLMs), we synthesize new ARC-style tasks from a variety of GIF images that include analogies. Each new task is paired with ground-truth analogy, providing an explicit mapping between visual transformations and everyday concepts. By embedding robust human-intuitive analogies into ARC-style tasks, GIFARC guides AI agents to evaluate the task analogically before engaging in brute-force pattern search, thus efficiently reducing problem complexity and build a more concise and human-understandable solution. We empirically validate that guiding LLM with analogic approach with GIFARC affects task-solving approaches of LLMs to align with analogic approach of human.
△ Less
Submitted 26 May, 2025;
originally announced May 2025.
-
Decoding Covert Speech from EEG Using a Functional Areas Spatio-Temporal Transformer
Authors:
Muyun Jiang,
Yi Ding,
Wei Zhang,
Kok Ann Colin Teo,
LaiGuan Fong,
Shuailei Zhang,
Zhiwei Guo,
Chenyu Liu,
Raghavan Bhuvanakantham,
Wei Khang Jeremy Sim,
Chuan Huat Vince Foo,
Rong Hui Jonathan Chua,
Parasuraman Padmanabhan,
Victoria Leong,
Jia Lu,
Balazs Gulyas,
Cuntai Guan
Abstract:
Covert speech involves imagining speaking without audible sound or any movements. Decoding covert speech from electroencephalogram (EEG) is challenging due to a limited understanding of neural pronunciation mapping and the low signal-to-noise ratio of the signal. In this study, we developed a large-scale multi-utterance speech EEG dataset from 57 right-handed native English-speaking subjects, each…
▽ More
Covert speech involves imagining speaking without audible sound or any movements. Decoding covert speech from electroencephalogram (EEG) is challenging due to a limited understanding of neural pronunciation mapping and the low signal-to-noise ratio of the signal. In this study, we developed a large-scale multi-utterance speech EEG dataset from 57 right-handed native English-speaking subjects, each performing covert and overt speech tasks by repeating the same word in five utterances within a ten-second duration. Given the spatio-temporal nature of the neural activation process during speech pronunciation, we developed a Functional Areas Spatio-temporal Transformer (FAST), an effective framework for converting EEG signals into tokens and utilizing transformer architecture for sequence encoding. Our results reveal distinct and interpretable speech neural features by the visualization of FAST-generated activation maps across frontal and temporal brain regions with each word being covertly spoken, providing new insights into the discriminative features of the neural representation of covert speech. This is the first report of such a study, which provides interpretable evidence for speech decoding from EEG. The code for this work has been made public at https://github.com/Jiang-Muyun/FAST
△ Less
Submitted 2 April, 2025;
originally announced April 2025.
-
Enhancing Analogical Reasoning in the Abstraction and Reasoning Corpus via Model-Based RL
Authors:
Jihwan Lee,
Woochang Sim,
Sejin Kim,
Sundong Kim
Abstract:
This paper demonstrates that model-based reinforcement learning (model-based RL) is a suitable approach for the task of analogical reasoning. We hypothesize that model-based RL can solve analogical reasoning tasks more efficiently through the creation of internal models. To test this, we compared DreamerV3, a model-based RL method, with Proximal Policy Optimization, a model-free RL method, on the…
▽ More
This paper demonstrates that model-based reinforcement learning (model-based RL) is a suitable approach for the task of analogical reasoning. We hypothesize that model-based RL can solve analogical reasoning tasks more efficiently through the creation of internal models. To test this, we compared DreamerV3, a model-based RL method, with Proximal Policy Optimization, a model-free RL method, on the Abstraction and Reasoning Corpus (ARC) tasks. Our results indicate that model-based RL not only outperforms model-free RL in learning and generalizing from single tasks but also shows significant advantages in reasoning across similar tasks.
△ Less
Submitted 27 August, 2024;
originally announced August 2024.
-
Metasurfaces for infrared multi-modal microscopy: phase contrast and bright field
Authors:
Shaban B. Sulejman,
Lukas Wesemann,
Mikkaela McCormack,
Jiajun Meng,
James A. Hutchison,
Niken Priscilla,
Gawain McColl,
Katrina Read,
Wilson Sim,
Andrey A. Sukhorukov,
Kenneth B. Crozier,
Ann Roberts
Abstract:
Different imaging modalities are used to extract the diverse information carried in an optical field. Two prominent modalities include bright field and phase contrast microscopy that can visualize the amplitude and phase features of a sample, respectively. However, capturing both of these images on the same camera typically requires interchanging optical components. Metasurfaces are ultra-thin nan…
▽ More
Different imaging modalities are used to extract the diverse information carried in an optical field. Two prominent modalities include bright field and phase contrast microscopy that can visualize the amplitude and phase features of a sample, respectively. However, capturing both of these images on the same camera typically requires interchanging optical components. Metasurfaces are ultra-thin nanostructures that can merge both of these operations into a single miniaturized device. Here, a silicon-based metasurface that supports a Mie resonance is demonstrated to perform near-infrared phase contrast and bright field multi-modal microscopy that can be tuned by changing the polarization of the illumination. We performed experiments using optical fields with phase variations synthesized by a spatial light modulator and introduced by propagation through semi-transparent samples, including C. elegans, unstained human prostate cancer cells and breast tissue. The results demonstrate the potential of metasurfaces for label-free point-of-care testing.
△ Less
Submitted 9 June, 2024; v1 submitted 6 June, 2024;
originally announced June 2024.
-
HyperCLOVA X Technical Report
Authors:
Kang Min Yoo,
Jaegeun Han,
Sookyo In,
Heewon Jeon,
Jisu Jeong,
Jaewook Kang,
Hyunwook Kim,
Kyung-Min Kim,
Munhyong Kim,
Sungju Kim,
Donghyun Kwak,
Hanock Kwak,
Se Jung Kwon,
Bado Lee,
Dongsoo Lee,
Gichang Lee,
Jooho Lee,
Baeseong Park,
Seongjin Shin,
Joonsang Yu,
Seolki Baek,
Sumin Byeon,
Eungsup Cho,
Dooseok Choe,
Jeesung Han
, et al. (371 additional authors not shown)
Abstract:
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…
▽ More
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment to responsible AI. The model is evaluated across various benchmarks, including comprehensive reasoning, knowledge, commonsense, factuality, coding, math, chatting, instruction-following, and harmlessness, in both Korean and English. HyperCLOVA X exhibits strong reasoning capabilities in Korean backed by a deep understanding of the language and cultural nuances. Further analysis of the inherent bilingual nature and its extension to multilingualism highlights the model's cross-lingual proficiency and strong generalization ability to untargeted languages, including machine translation between several language pairs and cross-lingual inference tasks. We believe that HyperCLOVA X can provide helpful guidance for regions or countries in developing their sovereign LLMs.
△ Less
Submitted 13 April, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
Reasoning Abilities of Large Language Models: In-Depth Analysis on the Abstraction and Reasoning Corpus
Authors:
Seungpil Lee,
Woochang Sim,
Donghyeon Shin,
Wongyu Seo,
Jiwon Park,
Seokki Lee,
Sanha Hwang,
Sejin Kim,
Sundong Kim
Abstract:
The existing methods for evaluating the inference abilities of Large Language Models (LLMs) have been predominantly results-centric, making it challenging to assess the inference process comprehensively. We introduce a novel approach using the Abstraction and Reasoning Corpus (ARC) benchmark to evaluate the inference and contextual understanding abilities of LLMs in a process-centric manner, focus…
▽ More
The existing methods for evaluating the inference abilities of Large Language Models (LLMs) have been predominantly results-centric, making it challenging to assess the inference process comprehensively. We introduce a novel approach using the Abstraction and Reasoning Corpus (ARC) benchmark to evaluate the inference and contextual understanding abilities of LLMs in a process-centric manner, focusing on three key components from the Language of Thought Hypothesis (LoTH): Logical Coherence, Compositionality, and Productivity. Our carefully designed experiments reveal that while LLMs demonstrate some inference capabilities, they still significantly lag behind human-level reasoning in these three aspects. The main contribution of this paper lies in introducing the LoTH perspective, which provides a method for evaluating the reasoning process that conventional results-oriented approaches fail to capture, thereby offering new insights into the development of human-level reasoning in artificial intelligence systems.
△ Less
Submitted 22 November, 2024; v1 submitted 18 March, 2024;
originally announced March 2024.
-
Contrastive Graph Pooling for Explainable Classification of Brain Networks
Authors:
Jiaxing Xu,
Qingtian Bian,
Xinhang Li,
Aihu Zhang,
Yiping Ke,
Miao Qiao,
Wei Zhang,
Wei Khang Jeremy Sim,
Balázs Gulyás
Abstract:
Functional magnetic resonance imaging (fMRI) is a commonly used technique to measure neural activation. Its application has been particularly important in identifying underlying neurodegenerative conditions such as Parkinson's, Alzheimer's, and Autism. Recent analysis of fMRI data models the brain as a graph and extracts features by graph neural networks (GNNs). However, the unique characteristics…
▽ More
Functional magnetic resonance imaging (fMRI) is a commonly used technique to measure neural activation. Its application has been particularly important in identifying underlying neurodegenerative conditions such as Parkinson's, Alzheimer's, and Autism. Recent analysis of fMRI data models the brain as a graph and extracts features by graph neural networks (GNNs). However, the unique characteristics of fMRI data require a special design of GNN. Tailoring GNN to generate effective and domain-explainable features remains challenging. In this paper, we propose a contrastive dual-attention block and a differentiable graph pooling method called ContrastPool to better utilize GNN for brain networks, meeting fMRI-specific requirements. We apply our method to 5 resting-state fMRI brain network datasets of 3 diseases and demonstrate its superiority over state-of-the-art baselines. Our case study confirms that the patterns extracted by our method match the domain knowledge in neuroscience literature, and disclose direct and interesting insights. Our contributions underscore the potential of ContrastPool for advancing the understanding of brain networks and neurodegenerative conditions. The source code is available at https://github.com/AngusMonroe/ContrastPool.
△ Less
Submitted 6 September, 2024; v1 submitted 7 July, 2023;
originally announced July 2023.
-
Permanence based Hidden Community and Graph Recovery in Social Networks
Authors:
Jaeyoung Choi,
Wooseok Sim
Abstract:
Due to the recent development of data analysis techniques, technologies for detecting communities through information expressed in social networks have been developed. Although it has several advantages, including the ability to effectively share recommended items through an estimated community, it may cause personal privacy issues. Therefore, recently, the problem of hiding the real community wel…
▽ More
Due to the recent development of data analysis techniques, technologies for detecting communities through information expressed in social networks have been developed. Although it has several advantages, including the ability to effectively share recommended items through an estimated community, it may cause personal privacy issues. Therefore, recently, the problem of hiding the real community well is being studied at the same time. As an example, Mittal \etal, proposed an algorithm called NEURAL that can hide this community well based on a metric called permanence. Based on this, in this study, we propose a Reverse NEURAL (R-NEURAL) algorithm that restores the community as well as the original graph structure using permanence. The proposed algorithm includes a method for well restoring not only the community hidden by the NEURAL algorithm but also the original graph structure modified by recovered edges. We conduct experiments on real-world graphs and found that the proposed algorithm recovers well the hidden community as well as the graph structure.
△ Less
Submitted 25 April, 2023; v1 submitted 29 January, 2023;
originally announced January 2023.
-
Implementing partisan symmetry: Problems and paradoxes
Authors:
Daryl DeFord,
Natasha Dhamankar,
Moon Duchin,
Varun Gupta,
Mackenzie McPike,
Gabe Schoenbach,
Ki Wan Sim
Abstract:
We consider the measures of partisan symmetry proposed for practical use in the political science literature, as clarified and developed in Katz, King, and Rosenblatt (2020). Elementary mathematical manipulation shows the symmetry metrics to have surprising properties that call their meaningfulness into question. To accompany the general analysis, we study measures of partisan symmetry with respec…
▽ More
We consider the measures of partisan symmetry proposed for practical use in the political science literature, as clarified and developed in Katz, King, and Rosenblatt (2020). Elementary mathematical manipulation shows the symmetry metrics to have surprising properties that call their meaningfulness into question. To accompany the general analysis, we study measures of partisan symmetry with respect to recent voting patterns in Utah, Texas, and North Carolina, flagging problems in each case. Taken together, these observations should raise major concerns about the available techniques for quantitative scores of partisan symmetry -- including the mean-median score, the partisan bias score, and the more general "partisan symmetry standard" -- as the decennial redistricting begins.
△ Less
Submitted 3 March, 2021; v1 submitted 16 August, 2020;
originally announced August 2020.
-
Dilatometric Study of LiHoF4 In a Transverse Magnetic Field
Authors:
J. L. Dunn,
C. Stahl,
Y. Reshitnyk,
W. Sim,
R. W. Hill
Abstract:
Theoretical and experimental work have not provided a consistent picture of the phase diagram of the nearly ideal Ising ferromagnet LiHoF4 in a transverse magnetic field. Using a newly fabricated capacitive dilatometer, we have investigated the thermal expansion and magnetostriction of LiHoF4 in magnetic fields applied perpendicular to the Ising direction. Critical points for the ferromagnetic pha…
▽ More
Theoretical and experimental work have not provided a consistent picture of the phase diagram of the nearly ideal Ising ferromagnet LiHoF4 in a transverse magnetic field. Using a newly fabricated capacitive dilatometer, we have investigated the thermal expansion and magnetostriction of LiHoF4 in magnetic fields applied perpendicular to the Ising direction. Critical points for the ferromagnetic phase transition have been determined from both methods in the classical paramagnetic to ferromagnetic regime. Excellent agreement has been found with existing experimental data suggesting that, in this regime, the current theoretical calculations have not entirely captured the physics of this interesting model system.
△ Less
Submitted 11 May, 2010;
originally announced May 2010.
-
Quark Number Susceptibility with Finite Chemical Potential in Holographic QCD
Authors:
Youngman Kim,
Yoshinori Matsuo,
Woojoo Sim,
Shingo Takeuchi,
Takuya Tsukioka
Abstract:
We study the quark number susceptibility in holographic QCD with a finite chemical potential or under an external magnetic field at finite temperature. We first consider the quark number susceptibility with the chemical potential. We observe that approaching the critical temperature from high temperature regime, the quark number susceptibility divided by temperature square develops a peak as we in…
▽ More
We study the quark number susceptibility in holographic QCD with a finite chemical potential or under an external magnetic field at finite temperature. We first consider the quark number susceptibility with the chemical potential. We observe that approaching the critical temperature from high temperature regime, the quark number susceptibility divided by temperature square develops a peak as we increase the chemical potential, which confirms recent lattice QCD results. We discuss this behavior in connection with the existence of the critical end point in the QCD phase diagram. We also consider the quark number susceptibility under the external magnetic field. We predict that the quark number susceptibility exhibits a blow-up behavior at low temperature as we raise the value of the magnetic field. We finally spell out some limitations of our study.
△ Less
Submitted 23 May, 2010; v1 submitted 29 January, 2010;
originally announced January 2010.
-
Heterotic Action in SUGRA-SYM Background
Authors:
Jaemo Park,
Cheol Ryou,
Woojoo Sim
Abstract:
We consider the generalization of the heterotic action considered by Cherkis and Schwarz where the chiral bosons are introduced in a manifestly covariant way using an auxiliary field. In particular, we construct the kappa-symmetric heterotic action in ten-dimensional supergravity background coupled to super Yang-Mills theory and prove its kappa-symmetry. The usual Bianchi identity of Type I supe…
▽ More
We consider the generalization of the heterotic action considered by Cherkis and Schwarz where the chiral bosons are introduced in a manifestly covariant way using an auxiliary field. In particular, we construct the kappa-symmetric heterotic action in ten-dimensional supergravity background coupled to super Yang-Mills theory and prove its kappa-symmetry. The usual Bianchi identity of Type I supergravity with super Yang-Mills $dH_3= -\tr F\wedge F$ is crucially used. For technical reason, the Yang-Mills field is restricted to be abelian.
△ Less
Submitted 30 December, 2009; v1 submitted 24 November, 2009;
originally announced November 2009.
-
Supersymmetric Heterotic Action out of M5 Brane
Authors:
Jaemo Park,
Woojoo Sim
Abstract:
Generalizing the work by Cherkis and Schwarz [1], we carry out the double dimensional reduction of supersymmetric M5 brane on K3 to obtain the supersymmetric action of heterotic string in 7-dimensional flat space-time. Motivated by this result, we propose the supersymmetric heterotic action in 10-dimensional flat space-time where the current algebra is realized in a novel way. We explicitly veri…
▽ More
Generalizing the work by Cherkis and Schwarz [1], we carry out the double dimensional reduction of supersymmetric M5 brane on K3 to obtain the supersymmetric action of heterotic string in 7-dimensional flat space-time. Motivated by this result, we propose the supersymmetric heterotic action in 10-dimensional flat space-time where the current algebra is realized in a novel way. We explicitly verify the kappa-symmetry of the proposed action.
△ Less
Submitted 14 May, 2009;
originally announced May 2009.
-
Recursive relations for a quiver gauge theory
Authors:
Jaemo Park,
Woojoo Sim
Abstract:
We study the recursive relations for a quiver gauge theory with the gauge group $SU(N_1)\times SU(N_2)$ with bifundamental fermions transforming as $(N_1,\bar{N_2})$. We work out the recursive relation for the amplitudes involving a pair of quark and antiquark and gluons of each gauge group. We realize directly in the recursive relations the invariance under the order preserving permutations of…
▽ More
We study the recursive relations for a quiver gauge theory with the gauge group $SU(N_1)\times SU(N_2)$ with bifundamental fermions transforming as $(N_1,\bar{N_2})$. We work out the recursive relation for the amplitudes involving a pair of quark and antiquark and gluons of each gauge group. We realize directly in the recursive relations the invariance under the order preserving permutations of the gluons of the first and the second gauge group. We check the proposed relations for MHV, 6-point and 7-point amplitudes and find the agreements with the known results and the known relations with the single gauge group amplitudes. The proposed recursive relation is much more efficient in calculating the amplitudes than using the known relations with the amplitudes of the single gauge group.
△ Less
Submitted 24 August, 2006; v1 submitted 12 July, 2006;
originally announced July 2006.