-
Does Rationale Quality Matter? Enhancing Mental Disorder Detection via Selective Reasoning Distillation
Authors:
Hoyun Song,
Huije Lee,
Jisu Shin,
Sukmin Cho,
Changgeon Ko,
Jong C. Park
Abstract:
The detection of mental health problems from social media and the interpretation of these results have been extensively explored. Research has shown that incorporating clinical symptom information into a model enhances domain expertise, improving its detection and interpretation performance. While large language models (LLMs) are shown to be effective for generating explanatory rationales in mental health detection, their substantially large parameter size and high computational cost limit their practicality. Reasoning distillation transfers this ability to smaller language models (SLMs), but inconsistencies in the relevance and domain alignment of LLM-generated rationales pose a challenge. This paper investigates how rationale quality impacts SLM performance in mental health detection and explanation generation. We hypothesize that ensuring high-quality and domain-relevant rationales enhances the distillation. To this end, we propose a framework that selects rationales based on their alignment with expert clinical reasoning. Experiments show that our quality-focused approach significantly enhances SLM performance in both mental disorder detection and rationale generation. This work highlights the importance of rationale quality and offers an insightful framework for knowledge transfer in mental health applications.
Submitted 26 May, 2025;
originally announced May 2025.
-
A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings
Authors:
Fitsum Gaim,
Hoyun Song,
Huije Lee,
Changgeon Ko,
Eui Jun Hwang,
Jong C. Park
Abstract:
Content moderation research has recently made significant advances, but still fails to serve the majority of the world's languages due to the lack of resources, leaving millions of vulnerable users to online hostility. This work presents a large-scale human-annotated multi-task benchmark dataset for abusive language detection in Tigrinya social media with joint annotations for three tasks: abusiveness, sentiment, and topic classification. The dataset comprises 13,717 YouTube comments annotated by nine native speakers, collected from 7,373 videos with a total of over 1.2 billion views across 51 channels. We developed an iterative term clustering approach for effective data selection. Recognizing that around 64% of Tigrinya social media content uses Romanized transliterations rather than native Ge'ez script, our dataset accommodates both writing systems to reflect actual language use. We establish strong baselines across the tasks in the benchmark, while leaving significant challenges for future contributions. Our experiments reveal that small, specialized multi-task models outperform the current frontier models in the low-resource setting, achieving up to 86% accuracy (+7 points) in abusiveness detection. We make the resources publicly available to promote research on online safety.
Submitted 17 May, 2025;
originally announced May 2025.
-
The Relevance of Non-axiality and Low-lying Excited States for Slow Magnetic Relaxation in Pentagonal-bipyramidal Erbium(III) Complexes Probed by High-frequency EPR
Authors:
J. Arneth,
L. Spillecke,
C. Koo,
T. A. Bazhenova,
E. B. Yagubskii,
R. Klingeler
Abstract:
High-frequency/high-field electron paramagnetic resonance studies on a series of seven-coordinate pentagonal-bipyramidal (PBP) erbium(III) complexes Er(DAPMBH/H$_2$DAPS)X (H$_2$DAPMBH = 2,6-diacetylpyridine bis-4-methoxy benzoylhydrazone, H$_4$DAPS = 2,6-diacetylpyridine bis-(salicylhydrazone)) demonstrate the effects of different apical ligands (X = (H$_2$O)Cl (1), (CH$_3$OH)N$_3$ (2), Cl$_2$ (3)) on the local magnetic anisotropy of the central Er(III) ions. In particular, we report direct experimental determination of the effective $g$-values and zero field splittings of the energetically low-lying Kramers doublets. Our quantitative determination of the magnetic anisotropy highlights the relevance of an axial $g$-tensor for SMM behaviour and suggests that fast magnetic relaxation is mainly driven by a thermally assisted quantum tunnelling process via low-lying excited states.
Submitted 29 April, 2025;
originally announced April 2025.
-
CHILES IX: Observational and Simulated HI Content and Star Formation of Blue Galaxies in Different Cosmic Web Environments
Authors:
Nicholas Luber,
Farhanul Hasan,
J. H. van Gorkom,
D. J. Pisano,
Joseph N. Burchett,
Julia Blue Bird,
Hansung B. Him,
Kelley M. Hess,
Lucas R. Hunt,
David C. Koo,
Sushma Kurapati,
Danielle Lucero,
Nir Mandelker,
Martin Meyer,
Emmanuel Momjian,
Daisuke Nagai,
Joel R. Primack,
Min S. Yun
Abstract:
We examine the redshift evolution of the relationship between the neutral atomic hydrogen (HI) content and star-formation properties of blue galaxies, along with their location in the cosmic web. Using the COSMOS HI Large Extragalactic Survey (CHILES), the IllustrisTNG (TNG100) cosmological simulation, and the DisPerSE algorithm, we identify the filamentary structure in both observations and simulations, measure the distance of galaxies to the nearest filament spine $d_{\mathrm{fil}}$, and calculate the mean HI gas fraction and the relative specific star formation rate (sSFR) of blue galaxies in three different cosmic web environments -- $0<d_{\mathrm{fil}}/\mathrm{Mpc}<2$ (filament cores), $2<d_{\mathrm{fil}}/\mathrm{Mpc}<4$ (filament outskirts), and $4<d_{\mathrm{fil}}/\mathrm{Mpc}<20$ (voids). We find that, although there are some similarities between CHILES and TNG, there exist significant discrepancies in the dependence of HI and star formation on the cosmic web and on redshift. TNG overpredicts the observed HI fraction and relative sSFR at $z=0-0.5$, with the tension being strongest in the voids. CHILES observes a decline in the HI fraction from filament cores to voids, exactly the opposite of the trend predicted by TNG. CHILES observes an increase in HI fraction at $z=0.5\rightarrow0$ in the voids, while TNG predicts an increase in this time in all environments. Further dividing the sample into stellar mass bins, we find that the HI in $\log(M_\ast/M_\odot)>10$ galaxies is better reproduced by TNG than HI in $\log(M_\ast/M_\odot)=9-10$ galaxies.
Submitted 4 April, 2025;
originally announced April 2025.
-
Two-stage evolution of magnetic correlations in spiral spin liquid material, Ca$_{10}$Cr$_{7}$O$_{28}$
Authors:
Changhyun Koo,
Jaena Park,
Johannes Werner,
Suheon Lee,
Christian Balz,
A. T. M. Nazmul Islam,
Yugo Oshima,
Bella Lake,
Kwang-Yong Choi,
Rüdiger Klingeler
Abstract:
We present an X-band and tunable high-frequency/high-field electron spin resonance (HF-ESR) study of single-crystalline Ca$_{10}$Cr$_{7}$O$_{28}$, which consists of alternating antiferromagnetic and ferromagnetic kagome bilayers. At high temperatures, a phonon-assisted relaxation process is invoked to account for the pronounced increase of the linewidth in an exchange-narrowing regime ($k_{\rm B}T\gg J$). In contrast, at low temperatures ($k_{\rm B}T\lesssim J$), a power-law behavior in line narrowing is observed. Our data reveal two distinct power-law regimes for the linewidth, which cross over at $T^*\approx 7.5$~K. Notably, the intriguing evolution of the ESR linewidth in this alternating kagome bilayer system with opposite signs of exchange interactions highlights distinct spin dynamics compared to those in a uniform kagome antiferromagnet.
Submitted 3 April, 2025;
originally announced April 2025.
-
The 2D Materials Roadmap
Authors:
Wencai Ren,
Peter Bøggild,
Joan Redwing,
Kostya Novoselov,
Luzhao Sun,
Yue Qi,
Kaicheng Jia,
Zhongfan Liu,
Oliver Burton,
Jack Alexander-Webber,
Stephan Hofmann,
Yang Cao,
Yu Long,
Quan-Hong Yang,
Dan Li,
Soo Ho Choi,
Ki Kang Kim,
Young Hee Lee,
Mian Li,
Qing Huang,
Yury Gogotsi,
Nicholas Clark,
Amy Carl,
Roman Gorbachev,
Thomas Olsen
, et al. (48 additional authors not shown)
Abstract:
Over the past two decades, 2D materials have rapidly evolved into a diverse and expanding family of material platforms. Many members of this materials class have demonstrated their potential to deliver transformative impact on fundamental research and technological applications across different fields. In this roadmap, we provide an overview of the key aspects of 2D material research and development, spanning synthesis, properties and commercial applications. We specifically present roadmaps for high-impact 2D materials, including graphene and its derivatives, transition metal dichalcogenides, and MXenes, as well as their heterostructures and moiré systems. The discussions are organized into thematic sections covering emerging research areas (e.g., twisted electronics, moiré nano-optoelectronics, polaritronics, quantum photonics, and neuromorphic computing), breakthrough applications in key technologies (e.g., 2D transistors, energy storage, electrocatalysis, filtration and separation, thermal management, flexible electronics, sensing, electromagnetic interference shielding, and composites) and other important topics (computational discovery of novel materials, commercialization and standardization). This roadmap focuses on the current research landscape, future challenges, and the scientific and technological advances required to address them, with the intent to provide useful references for promoting the development of 2D materials.
Submitted 28 April, 2025; v1 submitted 28 March, 2025;
originally announced March 2025.
-
Can Memory-Augmented Language Models Generalize on Reasoning-in-a-Haystack Tasks?
Authors:
Payel Das,
Ching-Yun Ko,
Sihui Dai,
Georgios Kollias,
Subhajit Chaudhury,
Aurelie Lozano
Abstract:
Large language models often expose their brittleness in reasoning tasks, especially while executing long chains of reasoning over context. We propose MemReasoner, a new and simple memory-augmented LLM architecture, in which the memory learns the relative order of facts in context and enables hopping over them, while the decoder selectively attends to the memory. MemReasoner is trained end-to-end, with optional supporting fact supervision of varying degrees. We train MemReasoner, along with existing memory-augmented transformer models and a state-space model, on two distinct synthetic multi-hop reasoning tasks. Experiments performed under a variety of challenging scenarios, including the presence of long distractor text or target answer changes in the test set, show strong generalization of MemReasoner on both single- and two-hop tasks. This generalization of MemReasoner is achieved using none-to-weak supporting fact supervision (using none and 1\% of supporting facts for one- and two-hop tasks, respectively). In contrast, baseline models overall struggle to generalize and benefit far less from using full supporting fact supervision. The results highlight the importance of explicit memory mechanisms, combined with additional weak supervision, for improving large language models' context processing ability toward reasoning tasks.
Submitted 10 March, 2025;
originally announced March 2025.
-
A temperate super-Jupiter imaged with JWST in the mid-infrared
Authors:
E. C. Matthews,
A. L. Carter,
P. Pathak,
C. V. Morley,
M. W. Phillips,
S. Krishanth P. M.,
F. Feng,
M. J. Bonse,
L. A. Boogaard,
J. A. Burt,
I. J. M. Crossfield,
E. S. Douglas,
Th. Henning,
J. Hom,
C. -L. Ko,
M. Kasper,
A. -M. Lagrange,
D. Petit dit de la Roche,
F. Philipot
Abstract:
Of the ~25 directly imaged planets to date, all are younger than 500Myr and all but 6 are younger than 100Myr. Eps Ind A (HD209100, HIP108870) is a K5V star of roughly solar age (recently derived as 3.7-5.7Gyr and 3.5$^{+0.8}_{-1.3}$Gyr). A long-term radial velocity trend as well as an astrometric acceleration led to claims of a giant planet orbiting the nearby star (3.6384$\pm$0.0013pc). Here we report JWST coronagraphic images that reveal a giant exoplanet which is consistent with these radial and astrometric measurements, but inconsistent with the previously claimed planet properties. The new planet has temperature ~275K, and is remarkably bright at 10.65um and 15.50um. Non-detections between 3.5-5um indicate an unknown opacity source in the atmosphere, possibly suggesting a high metallicity, high carbon-to-oxygen ratio planet. The best-fit temperature of the planet is consistent with theoretical thermal evolution models, which were previously untested in this temperature range. The data indicate that this is likely the only giant planet in the system and we therefore refer to it as "b", despite it having significantly different orbital properties than the previously claimed planet "b".
Submitted 3 March, 2025;
originally announced March 2025.
-
STAR: Spectral Truncation and Rescale for Model Merging
Authors:
Yu-Ang Lee,
Ching-Yun Ko,
Tejaswini Pedapati,
I-Hsin Chung,
Mi-Yen Yeh,
Pin-Yu Chen
Abstract:
Model merging is an efficient way of obtaining a multi-task model from several pretrained models without further fine-tuning, and it has gained attention in various domains, including natural language processing (NLP). Despite the efficiency, a key challenge in model merging is the seemingly inevitable decrease in task performance as the number of models increases. In this paper, we propose $\mathbf{S}$pectral $\mathbf{T}$runcation $\mathbf{A}$nd $\mathbf{R}$escale (STAR) that aims at mitigating ``merging conflicts'' by truncating small components in the respective spectral spaces, which is followed by an automatic parameter rescaling scheme to retain the nuclear norm of the original matrix. STAR requires no additional inference on original training data and is robust to hyperparameter choice. We demonstrate the effectiveness of STAR through extensive model merging cases on diverse NLP tasks. Specifically, STAR works robustly across varying model sizes, and can outperform baselines by 4.2$\%$ when merging 12 models on Flan-T5. Our code is publicly available at https://github.com/IBM/STAR.
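The truncate-then-rescale idea described in this abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (see the linked repository for that): the function name `truncate_and_rescale`, the `keep_ratio` cutoff rule, and the choice to operate on a single 2D matrix are all assumptions made for illustration. It only shows one way to drop small singular components and then rescale the remainder so that the nuclear norm (the sum of singular values) matches the original matrix.

```python
import numpy as np


def truncate_and_rescale(matrix, keep_ratio=0.5):
    """Hypothetical helper: drop small singular components, then rescale.

    Keeps the leading singular components whose cumulative share of the
    nuclear norm first reaches `keep_ratio` (an assumed cutoff rule), and
    scales them so the result has the same nuclear norm as `matrix`.
    """
    u, s, vt = np.linalg.svd(matrix, full_matrices=False)
    nuclear = s.sum()
    if nuclear == 0:
        # An all-zero matrix has nothing to truncate or rescale.
        return matrix.copy()

    # Smallest rank whose cumulative singular-value share reaches keep_ratio.
    cumulative = np.cumsum(s) / nuclear
    rank = min(int(np.searchsorted(cumulative, keep_ratio)) + 1, len(s))

    s_kept = s[:rank]
    scale = nuclear / s_kept.sum()  # restores the original nuclear norm

    # Rebuild the low-rank matrix from the rescaled leading components.
    return (u[:, :rank] * (s_kept * scale)) @ vt[:rank]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    m = rng.normal(size=(6, 4))
    out = truncate_and_rescale(m, keep_ratio=0.5)
    print(np.linalg.norm(m, "nuc"), np.linalg.norm(out, "nuc"))
```

The rescaling step is the part the abstract emphasizes: truncation alone shrinks the matrix, so the kept singular values are multiplied by a common factor to compensate.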
Submitted 14 February, 2025;
originally announced February 2025.
-
Artificial Intelligence to Assess Dental Findings from Panoramic Radiographs -- A Multinational Study
Authors:
Yin-Chih Chelsea Wang,
Tsao-Lun Chen,
Shankeeth Vinayahalingam,
Tai-Hsien Wu,
Chu Wei Chang,
Hsuan Hao Chang,
Hung-Jen Wei,
Mu-Hsiung Chen,
Ching-Chang Ko,
David Anssari Moin,
Bram van Ginneken,
Tong Xi,
Hsiao-Cheng Tsai,
Min-Huey Chen,
Tzu-Ming Harry Hsu,
Hye Chou
Abstract:
Dental panoramic radiographs (DPRs) are widely used in clinical practice for comprehensive oral assessment but present challenges due to overlapping structures and time constraints in interpretation.
This study aimed to establish a solid baseline for the AI-automated assessment of findings in DPRs by developing and evaluating an AI system and comparing its performance with that of human readers across multinational data sets.
We analyzed 6,669 DPRs from three data sets (the Netherlands, Brazil, and Taiwan), focusing on 8 types of dental findings. The AI system combined object detection and semantic segmentation techniques for per-tooth finding identification. Performance metrics included sensitivity, specificity, and area under the receiver operating characteristic curve (AUC-ROC). AI generalizability was tested across data sets, and performance was compared with human dental practitioners.
The AI system demonstrated comparable or superior performance to human readers, particularly +67.9% (95% CI: 54.0%-81.9%; p < .001) sensitivity for identifying periapical radiolucencies and +4.7% (95% CI: 1.4%-8.0%; p = .008) sensitivity for identifying missing teeth. The AI achieved a macro-averaged AUC-ROC of 96.2% (95% CI: 94.6%-97.8%) across 8 findings. AI agreements with the reference were comparable to inter-human agreements in 7 of 8 findings except for caries (p = .024). The AI system demonstrated robust generalization across diverse imaging and demographic settings and processed images 79 times faster (95% CI: 75-82) than human readers.
The AI system effectively assessed findings in DPRs, achieving performance on par with or better than human experts while significantly reducing interpretation time. These results highlight the potential for integrating AI into clinical workflows to improve diagnostic efficiency and accuracy, and patient management.
Submitted 14 February, 2025;
originally announced February 2025.
-
Effects of chiral symmetry restoration on dilepton production in heavy ion collisions
Authors:
Wen-Hao Zhou,
Che Ming Ko,
Kai-Jia Sun
Abstract:
Because of their weak interactions with the strongly interacting matter produced in relativistic heavy-ion collisions, dileptons provide an ideal probe of the early dynamics of these collisions. Here, we study dilepton production using a partonic transport model that is based on an extended Nambu-Jona-Lasinio (NJL) model. In this model, the in-medium quark masses decrease with increasing temperature as a result of the restoration of chiral symmetry. We find that the extracted temperature from dileptons of intermediate masses agrees well with the temperature of the partonic matter, suggesting that dilepton production can be used as a thermometer for the produced partonic matter. Our results also indicate that the extracted in-medium quark masses decrease with increasing dilepton temperature, implying that dilepton production can further serve as a probe of chiral symmetry restoration in high energy heavy-ion collisions.
Submitted 25 December, 2024;
originally announced December 2024.
-
Understanding and Improving Training-Free AI-Generated Image Detections with Vision Foundation Models
Authors:
Chung-Ting Tsai,
Ching-Yun Ko,
I-Hsin Chung,
Yu-Chiang Frank Wang,
Pin-Yu Chen
Abstract:
The rapid advancement of generative models has introduced serious risks, including deepfake techniques for facial synthesis and editing. Traditional approaches rely on training classifiers and enhancing generalizability through various feature extraction techniques. Meanwhile, training-free detection methods address issues like limited data and overfitting by directly leveraging statistical properties from vision foundation models to distinguish between real and fake images. The current leading training-free approach, RIGID, utilizes DINOv2 sensitivity to perturbations in image space for detecting fake images, with fake image embeddings exhibiting greater sensitivity than those of real images. This observation prompts us to investigate how detection performance varies across model backbones, perturbation types, and datasets. Our experiments reveal that detection performance is closely linked to model robustness, with self-supervised (SSL) models providing more reliable representations. While Gaussian noise effectively detects general objects, it performs worse on facial images, whereas Gaussian blur is more effective due to potential frequency artifacts. To further improve detection, we introduce Contrastive Blur, which enhances performance on facial images, and MINDER (MINimum distance DetEctoR), which addresses noise type bias, balancing performance across domains. Beyond performance gains, our work offers valuable insights for both the generative and detection communities, contributing to a deeper understanding of model robustness property utilized for deepfake detection.
Submitted 28 November, 2024;
originally announced November 2024.
-
Different Bias Under Different Criteria: Assessing Bias in LLMs with a Fact-Based Approach
Authors:
Changgeon Ko,
Jisu Shin,
Hoyun Song,
Jeongyeon Seo,
Jong C. Park
Abstract:
Large language models (LLMs) often reflect real-world biases, leading to efforts to mitigate these effects and make the models unbiased. Achieving this goal requires defining clear criteria for an unbiased state, with any deviation from these criteria considered biased. Some studies define an unbiased state as equal treatment across diverse demographic groups, aiming for balanced outputs from LLMs. However, differing perspectives on equality and the importance of pluralism make it challenging to establish a universal standard. Alternatively, other approaches propose using fact-based criteria for more consistent and objective evaluations, though these methods have not yet been fully applied to LLM bias assessments. Thus, there is a need for a metric with objective criteria that offers a distinct perspective from equality-based approaches. Motivated by this need, we introduce a novel metric to assess bias using fact-based criteria and real-world statistics. In this paper, we conducted a human survey demonstrating that humans tend to perceive LLM outputs more positively when they align closely with real-world demographic distributions. Evaluating various LLMs with our proposed metric reveals that model bias varies depending on the criteria used, highlighting the need for multi-perspective assessment.
Submitted 26 November, 2024;
originally announced November 2024.
-
Sources and Radiations of the Fermi Bubbles
Authors:
Vladimir A. Dogiel,
Chung-Ming Ko
Abstract:
Two enigmatic gamma-ray features in the Galactic central region, known as the Fermi Bubbles (FBs), were found in Fermi-LAT data. An energy release (e.g., by tidal disruption events in the Galactic center, GC) generates a cavity with a shock that expands into the local ambient medium of the Galactic halo. A decade or so ago, a phenomenological model was suggested in which the FBs result from routine star disruptions by the supermassive black hole in the GC, which might provide enough energy for large-scale structures like the FBs. In 2020, analytical and numerical models of the FBs as a process of routine tidal disruption of stars near the GC were developed, which can provide enough cumulative energy to form and maintain large-scale structures like the FBs. The disruption events are expected to occur at a rate of ten to a hundred per million years, providing an average power of energy release from the GC into the halo of 3E41 erg/s, which is needed to support the FBs. Analysis of the evolution of superbubbles in exponentially stratified disks concluded that the FB envelope would be destroyed by Rayleigh-Taylor (RT) instabilities at late stages. The shell is composed of swept-up gas of the bubble, and its thickness is much smaller than the size of the envelope. We assume that hydrodynamic turbulence is excited in the FB envelope by the RT instability. In this case, the universal energy spectrum of turbulence may develop in the inertial range of wavenumbers of fluctuations (the Kolmogorov-Obukhov spectrum). In our model, we suppose that the power of the FBs is transformed partly into the energy of hydrodynamic turbulence in the envelope. If so, hydrodynamic turbulence may generate MHD fluctuations, which accelerate cosmic rays there and generate gamma-ray and radio emission from the FBs. We hope that this model may explain the observed nonthermal emission from the bubbles.
Submitted 22 November, 2024;
originally announced November 2024.
-
Attention Tracker: Detecting Prompt Injection Attacks in LLMs
Authors:
Kuo-Han Hung,
Ching-Yun Ko,
Ambrish Rawat,
I-Hsin Chung,
Winston H. Hsu,
Pin-Yu Chen
Abstract:
Large Language Models (LLMs) have revolutionized various domains but remain vulnerable to prompt injection attacks, where malicious inputs manipulate the model into ignoring original instructions and executing a designated action. In this paper, we investigate the underlying mechanisms of these attacks by analyzing the attention patterns within LLMs. We introduce the concept of the distraction effect, where specific attention heads, termed important heads, shift focus from the original instruction to the injected instruction. Building on this discovery, we propose Attention Tracker, a training-free detection method that tracks attention patterns on the instruction to detect prompt injection attacks without the need for additional LLM inference. Our method generalizes effectively across diverse models, datasets, and attack types, showing an AUROC improvement of up to 10.0% over existing methods, and performs well even on small LLMs. We demonstrate the robustness of our approach through extensive evaluations and provide insights into safeguarding LLM-integrated systems from prompt injection vulnerabilities.
Submitted 22 April, 2025; v1 submitted 1 November, 2024;
originally announced November 2024.
-
Medical Imaging Complexity and its Effects on GAN Performance
Authors:
William Cagas,
Chan Ko,
Blake Hsiao,
Shryuk Grandhi,
Rishi Bhattacharya,
Kevin Zhu,
Michael Lam
Abstract:
The proliferation of machine learning models in diverse clinical applications has led to a growing need for high-fidelity, medical image training data. Such data is often scarce due to cost constraints and privacy concerns. Alleviating this burden, medical image synthesis via generative adversarial networks (GANs) emerged as a powerful method for synthetically generating photo-realistic images based on existing sets of real medical images. However, the exact image set size required to efficiently train such a GAN is unclear. In this work, we experimentally establish benchmarks that measure the relationship between a sample dataset size and the fidelity of the generated images, given the dataset's distribution of image complexities. We analyze statistical metrics based on delentropy, an image complexity measure rooted in Shannon's entropy in information theory. For our pipeline, we conduct experiments with two state-of-the-art GANs, StyleGAN 3 and SPADE-GAN, trained on multiple medical imaging datasets with variable sample sizes. Across both GANs, general performance improved with increasing training set size but suffered with increasing complexity.
Submitted 23 October, 2024;
originally announced October 2024.
-
Large Language Models can be Strong Self-Detoxifiers
Authors:
Ching-Yun Ko,
Pin-Yu Chen,
Payel Das,
Youssef Mroueh,
Soham Dan,
Georgios Kollias,
Subhajit Chaudhury,
Tejaswini Pedapati,
Luca Daniel
Abstract:
Reducing the likelihood of generating harmful and toxic output is an essential task when aligning large language models (LLMs). Existing methods mainly rely on training an external reward model (i.e., another language model) or fine-tuning the LLM using self-generated data to influence the outcome. In this paper, we show that LLMs have the capability of self-detoxification without the use of an additional reward model or re-training. We propose Self-disciplined Autoregressive Sampling (SASA), a lightweight controlled decoding algorithm for toxicity reduction of LLMs. SASA leverages the contextual representations from an LLM to learn linear subspaces characterizing toxic vs. non-toxic output in analytical forms. When auto-completing a response token-by-token, SASA dynamically tracks the margin of the current output to steer the generation away from the toxic subspace by adjusting the autoregressive sampling strategy. Evaluated on LLMs of different scale and nature, namely Llama-3.1-Instruct (8B), Llama-2 (7B), and GPT2-L models with the RealToxicityPrompts, BOLD, and AttaQ benchmarks, SASA markedly enhances the quality of the generated sentences relative to the original models and attains comparable performance to state-of-the-art detoxification techniques, significantly reducing the toxicity level by only using the LLM's internal representations.
Submitted 4 October, 2024;
originally announced October 2024.
-
A PyTorch Benchmark for High-Contrast Imaging Post Processing
Authors:
Chia-Lin Ko,
Ewan S. Douglas,
Justin Hom
Abstract:
Direct imaging of exoplanets is a challenging task that involves distinguishing faint planetary signals from the overpowering glare of their host stars, often obscured by time-varying stellar noise known as "speckles". The predominant algorithms for speckle noise subtraction employ principal-based point spread function (PSF) fitting techniques to discern planetary signals from stellar speckle noise. We introduce torchKLIP, a benchmark package developed within the machine learning (ML) framework PyTorch. This work enables ML techniques to utilize extensive PSF libraries to enhance direct imaging post-processing. Such advancements promise to improve the post-processing of high-contrast images from leading-edge astronomical instruments like the James Webb Space Telescope and extreme adaptive optics systems.
Submitted 24 September, 2024;
originally announced September 2024.
-
EQ-CBM: A Probabilistic Concept Bottleneck with Energy-based Models and Quantized Vectors
Authors:
Sangwon Kim,
Dasom Ahn,
Byoung Chul Ko,
In-su Jang,
Kwang-Ju Kim
Abstract:
The demand for reliable AI systems has intensified the need for interpretable deep neural networks. Concept bottleneck models (CBMs) have gained attention as an effective approach by leveraging human-understandable concepts to enhance interpretability. However, existing CBMs face challenges due to deterministic concept encoding and reliance on inconsistent concepts, leading to inaccuracies. We propose EQ-CBM, a novel framework that enhances CBMs through probabilistic concept encoding using energy-based models (EBMs) with quantized concept activation vectors (qCAVs). EQ-CBM effectively captures uncertainties, thereby improving prediction reliability and accuracy. By employing qCAVs, our method selects homogeneous vectors during concept encoding, enabling more decisive task performance and facilitating higher levels of human intervention. Empirical results using benchmark datasets demonstrate that our approach outperforms the state-of-the-art in both concept and task accuracy.
Submitted 22 September, 2024;
originally announced September 2024.
-
SDSS-IV MaNGA: Stellar rotational support in disk galaxies vs. central surface density and stellar population age
Authors:
Xiaohan Wang,
Yifei Luo,
S. M. Faber,
David C. Koo,
Shude Mao,
Kyle B. Westfall,
Shengdong Lu,
Weichen Wang,
Kevin Bundy,
N. Boardman,
Vladimir Avila-Reese,
José G. Fernández-Trincado,
Richard R. Lane
Abstract:
We investigate how the stellar rotational support changes as a function of spatially resolved stellar population age ($\rm D_n4000$) and relative central stellar surface density ($ΔΣ_1$) for MaNGA isolated/central disk galaxies. We find that the galaxy rotational support $λ_{R_\mathrm{e}}$ varies smoothly as a function of $ΔΣ_1$ and $\rm D_n4000$. $\rm D_n4000$ vs. $ΔΣ_1$ follows a "J-shape", with $λ_{R_\mathrm{e}}$ contributing to the scatter. In this "J-shaped" pattern, rotational support increases with central $\rm D_n4000$ when $ΔΣ_1$ is low but decreases with $ΔΣ_1$ when $ΔΣ_1$ is high. Restricting attention to low-$ΔΣ_1$ (i.e., large-radius) galaxies, we suggest that the trend of increasing rotational support with $\rm D_n4000$ for these objects is produced by a mix of two different processes, a primary trend characterized by growth in $λ_{R_\mathrm{e}}$ along with mass through gas accretion, on top of which disturbance episodes are overlaid, which reduce rotational support and trigger increased star formation. An additional finding is that star forming galaxies with low $ΔΣ_1$ have relatively larger radii than galaxies with higher $ΔΣ_1$ at fixed stellar mass. Assuming that these relative radii rankings are preserved while galaxies are star forming then implies clear evolutionary paths in central $\rm D_n4000$ vs. $ΔΣ_1$. The paper closes with comments on the implications that these paths have for the evolution of pseudo-bulges vs. classical-bulges. The utility of using $\rm D_n4000$-$ΔΣ_1$ to study $λ_{R_\mathrm{e}}$ reinforces the notion that galaxy kinematics correlate both with structure and with stellar-population state, and indicates the importance of a multi-dimensional description for understanding bulge and galaxy evolution.
Submitted 5 August, 2024;
originally announced August 2024.
-
Jet-Induced Enhancement of Deuteron Production in $pp$ and $p$-Pb Collisions at the LHC
Authors:
Yi-Heng Feng,
Che Ming Ko,
Yu-Gang Ma,
Kai-Jia Sun,
Xin-Nian Wang,
Zhong Yang,
Song Zhang
Abstract:
Jet-associated deuteron production in $pp$ collisions at $\sqrt{s}=13$ TeV and $p$-Pb collisions at $\sqrt{s_{NN}}=5.02$ TeV is studied in the coalescence model by using the phase-space information of proton and neutron pairs from a multiphase transport (AMPT) model at the kinetic freezeout. In the low transverse momentum ($p_T$) region $p_T/A < 1.5$ GeV/$c$, where $A$ is the mass number of a nucleus, the in-jet coalescence factor $B_2^\text{In-jet}$ for deuteron production, given by the ratio of the in-jet deuteron number to the square of the in-jet proton number, is found to be larger than the coalescence factor $B_2$ in the medium perpendicular to the jet by a factor of about 10 in $pp$ collisions and of 25 in $p$-Pb collisions, which is consistent with the ALICE measurements at the LHC. Such large low-momentum enhancements mainly come from coalescence of nucleons inside the jet with the medium nucleons. Coalescence of nucleons inside the jet dominates deuteron production only in the higher $p_T$ region of $p_T/A\gtrsim 4$ GeV/$c$, where both the yield ratio $d/p$ of deuteron to proton numbers and the $B_2$ are also significantly larger in the jet direction than in the direction perpendicular to the jet due to the strong collinear correlation among particles produced from jet fragmentation. Studying jet-associated deuteron production in relativistic nuclear collisions thus opens up a new window to probe the phase-space structure of nucleons inside jets.
Submitted 2 August, 2024;
originally announced August 2024.
-
SignBLEU: Automatic Evaluation of Multi-channel Sign Language Translation
Authors:
Jung-Ho Kim,
Mathew Huerta-Enochian,
Changyong Ko,
Du Hui Lee
Abstract:
Sign languages are multi-channel languages that communicate information through not just the hands (manual signals) but also facial expressions and upper body movements (non-manual signals). However, since automatic sign language translation is usually performed by generating a single sequence of glosses, researchers eschew non-manual and co-occurring manual signals in favor of a simplified list of manual glosses. This can lead to significant information loss and ambiguity. In this paper, we introduce a new task named multi-channel sign language translation (MCSLT) and present a novel metric, SignBLEU, designed to capture multiple signal channels. We validated SignBLEU on a system-level task using three sign language corpora with varied linguistic structures and transcription methodologies and examined its correlation with human judgment through two segment-level tasks. We found that SignBLEU consistently correlates better with human judgment than competing metrics. To facilitate further MCSLT research, we report benchmark scores for the three sign language corpora and release the source code for SignBLEU at https://github.com/eq4all-projects/SignBLEU.
Submitted 10 June, 2024;
originally announced June 2024.
-
Scene Graph Generation Strategy with Co-occurrence Knowledge and Learnable Term Frequency
Authors:
Hyeongjin Kim,
Sangwon Kim,
Dasom Ahn,
Jong Taek Lee,
Byoung Chul Ko
Abstract:
Scene graph generation (SGG) is an important task in image understanding because it represents the relationships between objects in an image as a graph structure, making it possible to understand the semantic relationships between objects intuitively. Previous SGG studies used message-passing neural networks (MPNNs) to update features, which can effectively reflect information about surrounding objects. However, these studies have failed to reflect the co-occurrence of objects during SGG. In addition, they only addressed the long-tail problem of the training dataset from the perspectives of sampling and learning methods. To address these two problems, we propose CooK, which reflects the Co-occurrence Knowledge between objects, and the learnable term frequency-inverse document frequency (TF-l-IDF) to solve the long-tail problem. We applied the proposed model to the SGG benchmark dataset, and the results showed a performance improvement of up to 3.8% compared with existing state-of-the-art models in the SGGen subtask. The results also indicate that the proposed method generalizes, showing uniform performance improvement for all MPNN models.
Submitted 21 May, 2024;
originally announced May 2024.
-
Deciphering Hypertriton and Antihypertriton Spins from Their Global Polarizations in Heavy-Ion Collisions
Authors:
Kai-Jia Sun,
Dai-Neng Liu,
Yun-Peng Zheng,
Jin-Hui Chen,
Che Ming Ko,
Yu-Gang Ma
Abstract:
Understanding the properties of hypernuclei is crucial for constraining the nature of hyperon-nucleon ($Y\text{-}N$) interactions, which play a key role in determining the inner structure of compact stars. The lightest hypernuclei and antihypernuclei are the hypertriton ($^3_Λ\text{H}$), which consists of a pair of nucleons and a $Λ$ hyperon, and its antinucleus (${^3_{\barΛ}}\overline{\rm H}$). Significant knowledge has recently been acquired regarding the mass, lifetime, and binding energy of $^3_Λ\text{H}$. However, its exact spin, whether $\frac{1}{2}$ or $\frac{3}{2}$, remains undetermined in both experimental and theoretical studies. Here, we present a novel method of using the hypertriton global polarization in heavy-ion collisions to decipher not only its total spin but also its internal spin structure. This method is based on the finding that its three different spin structures exhibit distinct beam energy dependence of its global polarization when it is produced in these collisions from the coalescence of proton, neutron and $Λ$. Future observations of the hypertriton and antihypertriton global polarizations thus provide the opportunity to unveil the spin structures of hypertriton and antihypertriton and their production mechanisms in heavy-ion collisions.
Submitted 14 January, 2025; v1 submitted 20 May, 2024;
originally announced May 2024.
-
Hadronic scattering effects on $Λ$ polarization in relativistic heavy ion collisions
Authors:
Haesom Sung,
Che Ming Ko,
Su Houng Lee
Abstract:
The $Λ$ hyperon spin flip and non-flip cross sections are calculated in a simple hadronic model by including both the $s$-channel process involving the spin 3/2, positive parity $Σ^*(1358)$ resonance and the $t$-channel process via the exchange of a scalar $σ$ meson. Because of its large mass, the ratio of the $Λ$ spin flip to non-flip cross sections is negligibly small in the $t$-channel process compared to the constant value of 1/3.5 in the $s$-channel process. With the $s$-channel $Λ$-$π$ spin-dependent cross sections included in a schematic kinetic model, the effects of hadronic scatterings on the $Λ$ spin polarization in Au-Au collisions at $\sqrt{s_{NN}}=7.7$ GeV are studied. It is found that the $Λ$ spin polarization only decreases by 7-12% during the hadronic stage of these collisions, which justifies the assumption in theoretical studies that compare the $Λ$ polarization calculated at the chemical freezeout to the measured one at the kinetic freezeout.
Submitted 30 July, 2024; v1 submitted 24 April, 2024;
originally announced April 2024.
-
Non-Monotonic Relations of Galaxy Star Formation, Radius, and Structure at Fixed Stellar Mass
Authors:
Jimena Stephenson,
Aldo Rodriguez-Puebla,
S. M. Faber,
Joel R. Primack,
Vladimir Avila-Reese,
A. R. Calette,
Carlo Cannarozzo,
James Kakos,
Mariana Cano-Díaz,
David C. Koo,
Francesco Shankar,
D. F. Morell
Abstract:
We investigate the relation between galaxy structure and star formation rate (SFR) in a sample of $\sim2.9\times10^{4}$ central galaxies with $z<0.0674$ and axial ratios $b/a>0.5$. The star-forming main sequence (SFMS) shows a bend around the stellar mass of $M_\ast\leq{}M_c=2\times10^{10}{}M_{\odot}$. At $M_\ast\leq{}M_c$ the SFMS follows a power-law $\text{SFR}\propto{}M_\ast^{0.85}$, while at higher masses it flattens. $M_c$ corresponds to a dark matter halo mass of $M_\text{vir}\sim{}10^{11.8}M_{\odot}$ where virial shocks occur. Some galaxy structure (e.g., half-light radius, $R_e$) exhibits a non-monotonic dependence across the SFMS at a fixed $M_\ast$. We find $\text{SFR}\propto{R_e^{-0.28}}$ at fixed $M_\ast$, consistent with the global Kennicutt-Schmidt (KS) law. This finding suggests that galaxy sizes contribute to the scatter of the SFMS. However, at $M_\ast>M_c$ the relationship between SFR and $R_e$ diminishes. Low-mass galaxies above the mean of the SFMS have smaller radii, exhibit compact and centrally concentrated profiles resembling green valley (GV) and quiescent galaxies at the same mass, and have higher $M_{\text{H}_2}/M_\text{HI}$. Conversely, those below the SFMS exhibit larger radii, lower densities, have no GV or quiescent counterparts at their mass and have lower $M_{\text{H}_2}/M_\text{HI}$. The above data suggest two pathways for quenching low-mass galaxies, $M_\ast\leq{}M_c$: a fast one that changes the morphology on the SFMS and a slow one that does not. Above $M_c$, galaxies below the SFMS resemble GV and quiescent galaxies structurally, implying that they undergo a structural transformation already within the SFMS. For these massive galaxies, color gradients (CG) are strongly bimodal, with SFMS galaxies exhibiting negative color gradients, suggesting most star formation occurs in their outskirts, maintaining them within the SFMS.
Submitted 15 April, 2024;
originally announced April 2024.
-
Softening of the Hypertriton Transverse Momentum Spectrum in Heavy-Ion Collisions
Authors:
Dai-Neng Liu,
Che Ming Ko,
Yu-Gang Ma,
Francesco Mazzaschi,
Maximiliano Puccio,
Qi-Ye Shou,
Kai-Jia Sun,
Yuan-Zhe Wang
Abstract:
Understanding the properties of hypernuclei helps to constrain the interaction between hyperon and nucleon, which is known to play an essential role in determining the properties of neutron stars. Experimental measurements have suggested that the hypertriton ($^3_Λ\text{H}$), the lightest hypernucleus, exhibits a halo structure with a deuteron core encircled by a $Λ$ hyperon at a distance of about 10 fm. This large $Λ-d$ distance in $^3_Λ\text{H}$ wave function is found to cause a suppressed $^3_Λ\text{H}$ yield and a softening of its transverse momentum ($p_T$) spectrum in relativistic heavy-ion collisions. Within the coalescence model based on nucleons and $Λ$ hyperons from a microscopic hybrid hydro model with a hadronic afterburner for nuclear cluster production in Pb-Pb collisions at $\sqrt{s_{NN}}$= 5.02 TeV, we show how this softening of the hypertriton $p_T$ spectrum appears and leads to a smaller mean $p_T$ for $^3_Λ\text{H}$ than for helium-3 ($^3$He). The latter is opposite to the predictions from the blast-wave model which assumes that $^3_Λ\text{H}$ and $^3$He are thermally produced at the kinetic freeze-out of heavy-ion collisions. The discovered quantum mechanical softening of the (anti-)hypertriton spectrum can be experimentally tested in relativistic heavy-ion collisions at different collision energies and centralities and used to obtain valuable insights into the mechanisms for light (hyper-)nuclei production in these collisions.
Submitted 16 July, 2024; v1 submitted 3 April, 2024;
originally announced April 2024.
-
Martingales associated with strongly quasi-invariant states
Authors:
Ameur Dhahri,
Chul Ki Ko,
Hyun Jae Yoo
Abstract:
We discuss martingales associated with $G$-strongly quasi-invariant states on a $C^*$-algebra $\mathcal A$, where $G$ is a separable locally compact group of $*$-automorphisms of $\mathcal A$. In the von Neumann algebra $\mathfrak A$ of the GNS representation, we define a unitary representation of the group and define a group $\hat G$ of $*$-automorphisms of $\mathfrak A$, which is homomorphic to $G$. For the case of compact $G$, under some mild condition, we find a $\hat G$-invariant state on $\mathfrak A$ and define a conditional expectation with range the $\hat G$-fixed subalgebra. Moving to the separable locally compact group $G=\cup_NG_N$, which is the union of increasing compact groups, we construct a sequence of conditional expectations and thereby construct (decreasing) martingales, which have limits by the martingale convergence theorem. We provide an example for the group of finite permutations on the set of nonnegative integers acting on a $C^*$-algebra of infinite tensor product.
Submitted 4 February, 2025; v1 submitted 5 March, 2024;
originally announced March 2024.
-
Star-forming and Quiescent Central Galaxies Cluster Similarly: Implications for the Galaxy-Halo Connection
Authors:
James Kakos,
Aldo Rodriguez-Puebla,
Joel R. Primack,
Sandra M. Faber,
David C. Koo,
Peter Behroozi,
Vladimir Avila-Reese
Abstract:
We measure the clustering of low-redshift SDSS galaxies as a function of stellar mass ($10.0<\log(M_*/M_\odot)<11.5$) and specific star formation rate (sSFR) and compare the results to models of the galaxy--halo connection. We find that the auto-correlation functions of central galaxies exhibit little dependence on sSFR, with the well-known stronger clustering of quiescent galaxies mainly attributable to satellites. Because halo assembly history is known to affect distinct halo clustering, this result implies that there is little net correlation between halo assembly history and central galaxy sSFR. However, cross-correlations with satellites are stronger for quiescent centrals than star-forming centrals, consistent with quiescent centrals having more satellites in their haloes at fixed $M_*$, as found in SDSS group catalogues. We model the galaxy--halo connection in an $N$-body simulation by assigning sSFRs to central galaxies in three different ways. Two of the models depend on halo assembly history (being based on halo accretion rate or concentration), while the third is independent of halo assembly history (being based on peak halo circular velocity, $V_\text{peak}$, a proxy for halo mass). All three models replicate the observed auto-correlations of central galaxies, while only the $V_\text{peak}$ model reproduces the observed cross-correlations with satellites. This further suggests that the effects of halo assembly history may not be easily seen in auto-correlations of centrals and implies that a more complete understanding of central galaxy clustering may require more than auto-correlations of centrals alone. Additionally, the good agreement with the $V_\text{peak}$ model supports the idea that quiescent galaxies reside in more massive haloes than star-forming galaxies at fixed $M_*$.
Submitted 12 August, 2024; v1 submitted 2 March, 2024;
originally announced March 2024.
-
A Distinct Radial Acceleration Relation across Brightest Cluster Galaxies and Galaxy Clusters
Authors:
Yong Tian,
Chung-Ming Ko,
Pengfei Li,
Stacy McGaugh,
Shemile L. Poblete
Abstract:
Recent studies reveal a radial acceleration relation (RAR) in galaxies, which illustrates a tight empirical correlation connecting the observational acceleration and the baryonic acceleration with a characteristic acceleration scale. However, a distinct RAR has been revealed on BCG-cluster scales with a seventeen times larger acceleration scale by the gravitational lensing effect. In this work, we systematically explored the acceleration and mass correlations between dynamical and baryonic components in 50 Brightest Cluster Galaxies (BCGs). To investigate the dynamical RAR in BCGs, we derived their dynamical accelerations from the stellar kinematics using the Jeans equation through Abel inversion and adopted the baryonic mass from the SDSS photometry. We explored the spatially resolved kinematic profiles using the largest integral field spectroscopy (IFS) data set, from the Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey. Our results demonstrate that the dynamical RAR in BCGs is consistent with the lensing RAR on BCG-cluster scales, including its larger acceleration scale. This finding may imply that BCGs and galaxy clusters have fundamental differences from field galaxies. We also find a mass correlation, but it is less tight than the acceleration correlation.
Submitted 19 February, 2024;
originally announced February 2024.
-
Exploring the dust grain size and polarization mechanism in the hot and massive Class 0 disk IRAS 16293-2422 B
Authors:
Joaquin Zamponi,
María José Maureira,
Hauyu Baobab Liu,
Bo Zhao,
Dominique Segura-Cox,
Chia-Lin Ko,
Paola Caselli
Abstract:
Multiwavelength dust continuum and polarization observations arising from self-scattering have been used to investigate grain sizes in young disks. However, the polarization by self-scattering is low in face-on optically thick disks and puts some of the size constraints from polarization on hold, particularly for the younger and more massive disks. The 1.3 mm emission detected toward the hot ($\gtrsim$400 K) Class 0 disk IRAS 16293-2422 B has been attributed to self-scattering, predicting grain sizes between 200-2000 $μ$m. We investigate the effects of grain size in the resultant flux and polarization fractions from self-scattering using a hot and massive Class 0 disk model and compare with observations. We compared new and archival high-resolution observations between 1.3 and 18 mm to a set of synthetic models. We have developed a new public tool to automate this process called Synthesizer. This is an easy-to-use program to generate synthetic observations from numerical simulations. Optical depths are in the range of 130 to 2 from 1.3 to 18 mm, respectively. Predictions from significant grain growth populations, including millimetric grains are comparable to the observations at all wavelengths. The polarization fraction produced by self-scattering reaches a maximum of $\sim$0.1% at 1.3 mm for a maximum grain size of 100 $μ$m, being an order of magnitude lower than that observed with ALMA. From the comparison of Stokes I fluxes, we conclude that significant grain growth could be present in the young Class 0 disk IRAS 16293 B, particularly in the inner hot region ($<10$ au, $T>$ 300 K) where refractory organics evaporate. The polarization produced by self-scattering in our model is not high enough to explain the observations at 1.3 and 7 mm, and effects like dichroic extinction or polarization reversal of elongated aligned grains remain other possible but untested scenarios.
Submitted 4 November, 2023;
originally announced November 2023.
-
Group of automorphisms for strongly quasi invariant states
Authors:
Ameur Dhahri,
Chul Ki Ko,
Hyun Jae Yoo
Abstract:
For a $*$-automorphism group $G$ on a $C^*$- or von Neumann algebra, we study the $G$-quasi invariant states and their properties. The $G$-quasi invariance and $G$-strongly quasi invariance are weaker than the $G$-invariance and have wide applications. We develop several properties for $G$-strongly quasi invariant states. Many of them are extensions of the already developed theories for $G$-invariant states. Among others, we consider the relationship between the group $G$ and modular automorphism group, invariant subalgebras, ergodicity, modular theory, and abelian subalgebras. We provide some examples to support the results.
Submitted 4 February, 2025; v1 submitted 2 November, 2023;
originally announced November 2023.
-
Filaments of The Slime Mold Cosmic Web And How They Affect Galaxy Evolution
Authors:
Farhanul Hasan,
Joseph N. Burchett,
Douglas Hellinger,
Oskar Elek,
Daisuke Nagai,
S. M. Faber,
Joel R. Primack,
David C. Koo,
Nir Mandelker,
Joanna Woo
Abstract:
We present a novel method for identifying cosmic web filaments using the IllustrisTNG (TNG100) cosmological simulations and investigate the impact of filaments on galaxies. We compare the use of cosmic density field estimates from the Delaunay Tessellation Field Estimator (DTFE) and the Monte Carlo Physarum Machine (MCPM), which is inspired by the slime mold organism, in the DisPerSE structure identification framework. The MCPM-based reconstruction identifies filaments with higher fidelity, finding more low-prominence/diffuse filaments and better tracing the true underlying matter distribution than the DTFE-based reconstruction. Using our new filament catalogs, we find that most galaxies are located within 1.5-2.5 Mpc of a filamentary spine, with little change in the median specific star formation rate and the median galactic gas fraction with distance to the nearest filament. Instead, we introduce the filament line density, Sigma_fil(MCPM), as the total MCPM overdensity per unit length of a local filament segment, and find that this parameter is a superior predictor of galactic gas supply and quenching. Our results indicate that most galaxies are quenched and gas-poor near high-line density filaments at z<=1. At z=0, quenching in log(M*/Msun)>10.5 galaxies is mainly driven by mass, while lower-mass galaxies are significantly affected by the filament line density. In high-line density filaments, satellites are strongly quenched, whereas centrals have reduced star formation, but not gas fraction, at z<=0.5. We discuss the prospect of applying our new filament identification method to galaxy surveys with SDSS, DESI, Subaru PFS, etc. to elucidate the effect of large-scale structure on galaxy formation.
Submitted 13 May, 2024; v1 submitted 2 November, 2023;
originally announced November 2023.
-
Semantic Scene Graph Generation Based on an Edge Dual Scene Graph and Message Passing Neural Network
Authors:
Hyeongjin Kim,
Sangwon Kim,
Jong Taek Lee,
Byoung Chul Ko
Abstract:
Along with generative AI, interest in scene graph generation (SGG), which comprehensively captures the relationships and interactions between objects in an image and creates a structured graph-based representation, has significantly increased in recent years. However, relying on object-centric and dichotomous relationships, existing SGG methods have a limited ability to accurately predict detailed relationships. To solve these problems, a new approach to modeling multiobject relationships, called edge dual scene graph generation (EdgeSGG), is proposed herein. EdgeSGG is based on an edge dual scene graph and a Dual Message Passing Neural Network (DualMPNN), which can capture rich contextual interactions between unconstrained objects. To facilitate the learning of edge dual scene graphs with a symmetric graph structure, the proposed DualMPNN learns both object- and relation-centric features for more accurately predicting relation-aware contexts and allows fine-grained relational updates between objects. A comparative experiment with state-of-the-art (SoTA) methods was conducted using two public datasets for SGG operations and six metrics for three subtasks. Compared with SoTA approaches, the proposed model exhibited substantial performance improvements across all SGG subtasks. Furthermore, experiments on long-tail distributions revealed that incorporating the relationships between objects effectively mitigates existing long-tail problems.
Submitted 2 November, 2023;
originally announced November 2023.
-
Dwarf galaxies show little ISM evolution from $z\sim1$ to $z\sim0$: a spectroscopic study of metallicity, star formation, and electron density
Authors:
John Pharo,
Yicheng Guo,
Guillermo Barro Calvo,
Teja Teppala,
Fuyan Bian,
Timothy Carleton,
Sandra Faber,
Puragra Guhathakurta,
David C. Koo
Abstract:
We present gas-phase metallicity measurements for 583 emission line galaxies at $0.3<z<0.85$, including 388 dwarf galaxies with $\log(M_{\star}/M_{\odot}) < 9.5$, and explore the dependence of the metallicity on the stellar mass and star formation properties of the galaxies. Metallicities are determined through the measurement of emission lines in very deep ($\sim$7 hr exposure) Keck/DEIMOS spectra taken primarily from the HALO7D survey. We measure metallicity with three strong-line calibrations (O3H$β$, R23, and O3O2) for the overall sample, as well as with the faint [Ne III]$λ$3869 and [O III]$λ$4363 emission lines for 112 and 17 galaxies, respectively, where robust detections were possible. We construct mass-metallicity relations (MZR) for each calibration method, finding MZRs consistent with other strong-line results at comparable redshift, as well as with $z\sim0$ galaxies. We quantify the intrinsic scatter in the MZR as a function of mass, finding it increases with lower stellar mass. We also measure a weak but significant correlation between increased MZR scatter and higher specific star formation rate. We find a weak influence of SFR in the fundamental metallicity relation as well, with an SFR coefficient of $α=0.21$. Finally, we use the flux ratios of the [O II]$λλ$3727,3729 doublet to calculate gas electron density in $\sim$1000 galaxies with $\log(M_{\star}/M_{\odot}) < 10.5$ as a function of redshift. We measure low electron densities ($n_e\sim25$ cm$^{-3}$) for $z<1$ galaxies, again consistent with $z\approx0$ conditions, but measure higher densities ($n_e\sim100$ cm$^{-3}$) at $z>1$. These results all suggest that there is little evolution in star-forming interstellar medium conditions from $z\sim1$ to $z=0$, confirmed with a more complete sample of low-mass galaxies than has previously been available in this redshift range.
Submitted 25 October, 2023;
originally announced October 2023.
-
Galaxies Going Bananas: Inferring the 3D Geometry of High-Redshift Galaxies with JWST-CEERS
Authors:
Viraj Pandya,
Haowen Zhang,
Marc Huertas-Company,
Kartheik G. Iyer,
Elizabeth McGrath,
Guillermo Barro,
Steven L. Finkelstein,
Martin Kuemmel,
William G. Hartley,
Henry C. Ferguson,
Jeyhan S. Kartaltepe,
Joel Primack,
Avishai Dekel,
Sandra M. Faber,
David C. Koo,
Greg L. Bryan,
Rachel S. Somerville,
Ricardo O. Amorin,
Pablo Arrabal Haro,
Micaela B. Bagley,
Eric F. Bell,
Emmanuel Bertin,
Luca Costantin,
Romeel Dave,
Mark Dickinson
, et al. (31 additional authors not shown)
Abstract:
The 3D geometry of high-redshift galaxies remains poorly understood. We build a differentiable Bayesian model and use Hamiltonian Monte Carlo to efficiently and robustly infer the 3D shapes of star-forming galaxies in JWST-CEERS observations with $\log M_*/M_{\odot}=9.0-10.5$ at $z=0.5-8.0$. We reproduce previous results from HST-CANDELS in a fraction of the computing time and constrain the mean ellipticity, triaxiality, size and covariances with samples as small as $\sim50$ galaxies. We find high 3D ellipticities for all mass-redshift bins suggesting oblate (disky) or prolate (elongated) geometries. We break that degeneracy by constraining the mean triaxiality to be $\sim1$ for $\log M_*/M_{\odot}=9.0-9.5$ dwarfs at $z>1$ (favoring the prolate scenario), with significantly lower triaxialities for higher masses and lower redshifts indicating the emergence of disks. The prolate population traces out a ``banana'' in the projected $b/a-\log a$ diagram with an excess of low $b/a$, large $\log a$ galaxies. The dwarf prolate fraction rises from $\sim25\%$ at $z=0.5-1.0$ to $\sim50-80\%$ at $z=3-8$. If these are disks, they cannot be axisymmetric but instead must be unusually oval (triaxial) unlike local circular disks. We simultaneously constrain the 3D size-mass relation and its dependence on 3D geometry. High-probability prolate and oblate candidates show remarkably similar Sérsic indices ($n\sim1$), non-parametric morphological properties and specific star formation rates. Both tend to be visually classified as disks or irregular but edge-on oblate candidates show more dust attenuation. We discuss selection effects, follow-up prospects and theoretical implications.
Submitted 15 January, 2024; v1 submitted 23 October, 2023;
originally announced October 2023.
-
Is $K_{1}/K^{*}$ enhancement in heavy ion collisions a signature of chiral symmetry restoration?
Authors:
Haesom Sung,
Sungtae Cho,
Che Ming Ko,
Su Houng Lee,
Sanghoon Lim
Abstract:
We extend the recent study of $K_{1}/K^{*}$ enhancement as a signature of chiral symmetry restoration in heavy ion collisions at the Large Hadron Collider (LHC) via the kinetic approach to include the effects due to non-unity hadron fugacities during the evolution of produced hadronic matter and the temperature-dependent $K_1$ mass. Although the effect of non-unity fugacity only slightly reduces the $K_1/K^*$ enhancement due to chiral symmetry restoration, the inclusion of the temperature-dependent $K_1$ mass leads to a substantial reduction in the $K_1/K^*$ enhancement. However, the final $K_1/K^*$ ratio in peripheral collisions still shows an enhancement of more than a factor of two compared to the case without chiral symmetry restoration and thus remains a good signature for chiral symmetry restoration in the hot dense matter produced in relativistic heavy ion collisions.
Submitted 8 November, 2023; v1 submitted 17 October, 2023;
originally announced October 2023.
-
Excited Hadron Channels in Hadronization
Authors:
Rainer J. Fries,
Jacob Purcell,
Michael Kordell II,
Che-Ming Ko
Abstract:
The proper treatment of hadronic resonances plays an important role in many aspects of heavy ion collisions. This is expected to be the case also for hadronization, due to the large degeneracies of excited states, and the abundant production of hadrons from their decays. We first show how a comprehensive treatment of excited meson states can be incorporated into quark recombination, and by extension, into Hybrid Hadronization. We then discuss the quantum mechanics of forming excited states, utilizing the Wigner distribution functions of angular momentum eigenstates of isotropic 3-D harmonic oscillators. We further describe how resonance decays can be handled, based on a set of minimal assumptions, by creating an extension of hadron decays in PYTHIA 8. Finally, we present first results by simulating $e^+e^-$ collisions using PYTHIA and Hybrid Hadronization with excited mesons up to orbital angular momentum $L=4$ and radial quantum number 2. We find that states up to $L=2$ are produced profusely by quark recombination.
Submitted 29 August, 2023;
originally announced August 2023.
-
Comparing pion production in transport simulations of heavy-ion collisions at $270A$ MeV under controlled conditions
Authors:
Jun Xu,
Hermann Wolter,
Maria Colonna,
Mircea Dan Cozma,
Pawel Danielewicz,
Che Ming Ko,
Akira Ono,
ManYee Betty Tsang,
Ying-Xun Zhang,
Hui-Gan Cheng,
Natsumi Ikeno,
Rohit Kumar,
Jun Su,
Hua Zheng,
Zhen Zhang,
Lie-Wen Chen,
Zhao-Qing Feng,
Christoph Hartnack,
Arnaud Le Fèvre,
Bao-An Li,
Yasushi Nara,
Akira Ohnishi,
Feng-Shou Zhang
Abstract:
Within the TMEP, we present a detailed study of the performance of different transport models in Sn+Sn collisions at $270A$ MeV, and put particular emphasis on the production of pions and $Δ$ resonances, which have been used as probes of the nuclear symmetry energy. We prescribe a common and rather simple physics model, and follow in detail the results of 4 BUU models and 6 QMD models. The nucleonic evolution of the collision and the nucleonic observables in these codes do not completely converge, but the differences among the codes can be understood as being due to several reasons: the basic differences between BUU and QMD models in the representation of the phase-space distributions, computational differences in the mean-field evaluation, and differences in the adopted strategies for the Pauli blocking in the collision integrals. For pionic observables, we find that a higher maximum density leads to an enhanced pion yield and a reduced $π^-/π^+$ yield ratio, while a more effective Pauli blocking generally leads to a slightly suppressed pion yield and an enhanced $π^-/π^+$ yield ratio. We specifically investigate the effect of the Coulomb force, and find that it increases the total $π^-/π^+$ yield ratio but reduces the ratio at high pion energies, although differences in its implementations do not have a dominating role in the differences among the codes. Taking into account only the results of codes that strictly follow the homework specifications, we find a convergence of the codes in the final charged pion yield ratio to a $1σ$ deviation of about $5\%$. However, the uncertainty is expected to be reduced to about $1.6\%$ if the same or similar strategies and ingredients, i.e., an improved Pauli blocking and calculation of the non-linear term in the mean-field potential, are similarly used in all codes.
Submitted 14 March, 2024; v1 submitted 10 August, 2023;
originally announced August 2023.
-
UV-Bright Star-Forming Clumps and Their Host Galaxies in UVCANDELS at 0.5 $\leq$ z $\leq$ 1
Authors:
Alec Martin,
Yicheng Guo,
Xin Wang,
Anton M. Koekemoer,
Marc Rafelski,
Harry I. Teplitz,
Rogier A. Windhorst,
Anahita Alavi,
Norman A. Grogin,
Laura Prichard,
Ben Sunnquist,
Daniel Ceverino,
Nima Chartab,
Christopher J. Conselice,
Y. Sophia Dai,
Avishai Dekel,
Johnathan P. Gardner,
Eric Gawiser,
Nimish P. Hathi,
Matthew J. Hayes,
Rolf A. Jansen,
Zhiyuan Ji,
David C. Koo,
Ray A. Lucas,
Nir Mandelker
, et al. (10 additional authors not shown)
Abstract:
Giant star-forming clumps are a prominent feature of star-forming galaxies (SFGs) and contain important clues on galaxy formation and evolution. However, basic demographics of clumps and their host galaxies remain uncertain. Using the HST/WFC3 F275W images from the Ultraviolet Imaging of the Cosmic Assembly Near-infrared Deep Extragalactic Legacy Survey (UVCANDELS), we detect and analyze giant star-forming clumps in galaxies at 0.5 $\leq$ z $\leq$ 1, connecting two epochs when clumps are common (at cosmic high-noon, z $\sim$ 2) and rare (in the local universe). We construct a clump sample whose rest-frame 1600 Å luminosity is 3 times higher than the most luminous local HII regions (M$_{UV} \leq -$16 AB). In our sample, 35 $\pm$ 3$\%$ of low-mass galaxies (log[M$_{*}$/M$_{\odot}$] $<$ 10) are clumpy (i.e., containing at least one off-center clump). This fraction changes to 22 $\pm$ 3$\%$ and 22 $\pm$ 4$\%$ for intermediate (10 $\leq$ log[M$_{*}$/M$_{\odot}$] $\leq$ 10.5) and high-mass (log[M$_{*}$/M$_{\odot}$] $>$ 10.5) galaxies in agreement with previous studies. When compared to similar-mass non-clumpy SFGs, low- and intermediate-mass clumpy SFGs tend to have higher SFRs and bluer rest-frame U-V colors, while high-mass clumpy SFGs tend to be larger than non-clumpy SFGs. However, clumpy and non-clumpy SFGs have similar Sérsic index, indicating a similar underlying density profile. Furthermore, we investigate how UV luminosity of star-forming regions correlates with the physical properties of host galaxies. On average, more luminous star-forming regions reside in more luminous, smaller, and/or higher-specific SFR galaxies and are found closer to their hosts' galactic center.
Submitted 2 October, 2023; v1 submitted 31 July, 2023;
originally announced August 2023.
-
Kinetic approach of light-nuclei production in intermediate-energy heavy-ion collisions
Authors:
Rui Wang,
Yu-Gang Ma,
Lie-Wen Chen,
Che Ming Ko,
Kai-Jia Sun,
Zhen Zhang
Abstract:
We develop a kinetic approach to the production of light nuclei up to mass number $A$ $\leqslant$ $4$ in intermediate-energy heavy-ion collisions by including them as dynamic degrees of freedom. The conversions between nucleons and light nuclei during the collisions are incorporated dynamically via the breakup of light nuclei by a nucleon and their inverse reactions. We also include the Mott effect on light nuclei, i.e., a light nucleus would no longer be bound if the phase-space density of its surrounding nucleons is too large. With this kinetic approach, we obtain a reasonable description of the measured yields of light nuclei in central Au+Au collisions at energies of $0.25$ - $1.0A~\rm GeV$ by the FOPI collaboration. Our study also indicates that the observed enhancement of the $α$-particle yield at low incident energies can be attributed to a weaker Mott effect on the $α$-particle, which makes it more difficult to dissolve in nuclear medium, as a result of its much larger binding energy.
Submitted 4 May, 2023;
originally announced May 2023.
-
Sample-Specific Debiasing for Better Image-Text Models
Authors:
Peiqi Wang,
Yingcheng Liu,
Ching-Yun Ko,
William M. Wells,
Seth Berkowitz,
Steven Horng,
Polina Golland
Abstract:
Self-supervised representation learning on image-text data facilitates crucial medical applications, such as image classification, visual grounding, and cross-modal retrieval. One common approach involves contrasting semantically similar (positive) and dissimilar (negative) pairs of data points. Drawing negative samples uniformly from the training data set introduces false negatives, i.e., samples that are treated as dissimilar but belong to the same class. In healthcare data, the underlying class distribution is nonuniform, implying that false negatives occur at a highly variable rate. To improve the quality of learned representations, we develop a novel approach that corrects for false negatives. Our method can be viewed as a variant of debiased contrastive learning that uses estimated sample-specific class probabilities. We provide theoretical analysis of the objective function and demonstrate the proposed approach on both image and paired image-text data sets. Our experiments illustrate empirical advantages of sample-specific debiasing.
Submitted 12 August, 2023; v1 submitted 25 April, 2023;
originally announced April 2023.
-
Measuring galaxy cluster mass profiles into the low acceleration regime with galaxy kinematics
Authors:
Pengfei Li,
Yong Tian,
Mariana P. Júlio,
Marcel S. Pawlowski,
Federico Lelli,
Stacy S. McGaugh,
James M. Schombert,
Justin I. Read,
Po-Chieh Yu,
Chung-Ming Ko
Abstract:
We probe the dynamical mass profiles of 10 galaxy clusters from the HIghest X-ray FLUx Galaxy Cluster Sample (HIFLUGCS) using galaxy kinematics. We numerically solve the spherical Jeans equation, and parameterize the dynamical mass profile and the galaxy velocity anisotropy profile using two general functions to ensure that our results are not biased towards any specific model. The mass-velocity anisotropy degeneracy is ameliorated by using two "virial shape parameters" that depend on the fourth moment of velocity distribution. The resulting velocity anisotropy estimates consistently show a nearly isotropic distribution in the inner regions, with an increasing radial anisotropy towards large radii. We compare our derived dynamical masses with those calculated from X-ray gas data assuming hydrostatic equilibrium, finding that massive and rich relaxed clusters generally present consistent mass measurements, while unrelaxed or low-richness clusters have systematically larger total mass than hydrostatic mass by an average of 50\%. This might help alleviate current tensions in the measurement of $σ_8$, but it also leads to cluster baryon fractions below the cosmic value. Finally, our approach probes accelerations as low as $10^{-11}$ m s$^{-2}$, comparable to the outskirts of individual late-type galaxies. We confirm that galaxy clusters deviate from the radial acceleration relation defined by galaxies.
Submitted 14 June, 2023; v1 submitted 16 March, 2023;
originally announced March 2023.
-
The Evolving Effect Of Cosmic Web Environment On Galaxy Quenching
Authors:
Farhanul Hasan,
Joseph N. Burchett,
Alyssa Abeyta,
Douglas Hellinger,
Nir Mandelker,
Joel R. Primack,
S. M. Faber,
David C. Koo,
Oskar Elek,
Daisuke Nagai
Abstract:
We investigate how cosmic web structures affect galaxy quenching in the IllustrisTNG (TNG100) cosmological simulations by reconstructing the cosmic web within each snapshot using the DisPerSE framework. We measure the comoving distance from each galaxy with stellar mass $\log(M_{\ast}/\mathrm{M}_{\odot}) \geq 8$ to the nearest node ($d_{\mathrm{node}}$) and the nearest filament spine ($d_{\mathrm{fil}}$) to study the dependence of both median specific star formation rate (<sSFR>) and median gas fraction (<$f_{\mathrm{gas}}$>) on these distances. We find that the <sSFR> of galaxies is only dependent on cosmic web environment at $z<2$, with the dependence increasing with time. At $z\leq0.5$, $8 \leq \log(M_{\ast}/\mathrm{M}_{\odot}) < 9$ galaxies are quenched at $d_{\mathrm{node}}\lesssim1$~Mpc, and have significantly-suppressed star formation at $d_{\mathrm{fil}}\lesssim1$~Mpc, trends driven mostly by satellite galaxies. At $z\leq1$, in contrast to the monotonic drop in <sSFR> of $\log(M_{\ast}/\mathrm{M}_{\odot}) <10$ galaxies with decreasing $d_{\mathrm{node}}$ and $d_{\mathrm{fil}}$, $\log(M_{\ast}/\mathrm{M}_{\odot}) \geq 10$ galaxies - both centrals and satellites - experience an upturn in <sSFR> at $d_{\mathrm{node}}\lesssim0.2$~Mpc. Much of this cosmic web dependence of star formation activity can be explained by an evolution in $<f_{\mathrm{gas}}>$. Our results suggest that in the past $\sim$10 Gyr, low-mass satellites are quenched by rapid gas stripping in dense environments near nodes and gradual gas starvation in intermediate-density environments near filaments, while at earlier times cosmic web structures efficiently channeled cold gas into most galaxies. State-of-the-art ongoing spectroscopic surveys such as SDSS and DESI, as well as those planned with the Subaru Prime Focus Spectrograph, JWST and Roman, are required to test our predictions against observations.
Submitted 24 April, 2023; v1 submitted 14 March, 2023;
originally announced March 2023.
-
Dense Nuclear Matter Equation of State from Heavy-Ion Collisions
Authors:
Agnieszka Sorensen,
Kshitij Agarwal,
Kyle W. Brown,
Zbigniew Chajęcki,
Paweł Danielewicz,
Christian Drischler,
Stefano Gandolfi,
Jeremy W. Holt,
Matthias Kaminski,
Che-Ming Ko,
Rohit Kumar,
Bao-An Li,
William G. Lynch,
Alan B. McIntosh,
William G. Newton,
Scott Pratt,
Oleh Savchuk,
Maria Stefaniak,
Ingo Tews,
ManYee Betty Tsang,
Ramona Vogt,
Hermann Wolter,
Hanna Zbroszczyk,
Navid Abbasi,
Jörg Aichelin
, et al. (111 additional authors not shown)
Abstract:
The nuclear equation of state (EOS) is at the center of numerous theoretical and experimental efforts in nuclear physics. With advances in microscopic theories for nuclear interactions, the availability of experiments probing nuclear matter under conditions not reached before, endeavors to develop sophisticated and reliable transport simulations to interpret these experiments, and the advent of multi-messenger astronomy, the next decade will bring new opportunities for determining the nuclear matter EOS, elucidating its dependence on density, temperature, and isospin asymmetry. Among controlled terrestrial experiments, collisions of heavy nuclei at intermediate beam energies (from a few tens of MeV/nucleon to about 25 GeV/nucleon in the fixed-target frame) probe the widest ranges of baryon density and temperature, enabling studies of nuclear matter from a few tenths to about 5 times the nuclear saturation density and for temperatures from a few to well above a hundred MeV, respectively. Collisions of neutron-rich isotopes further bring the opportunity to probe effects due to the isospin asymmetry. However, capitalizing on the enormous scientific effort aimed at uncovering the dense nuclear matter EOS, both at RHIC and at FRIB as well as at other international facilities, depends on the continued development of state-of-the-art hadronic transport simulations. This white paper highlights the essential role that heavy-ion collision experiments and hadronic transport simulations play in understanding strong interactions in dense nuclear matter, with an emphasis on how these efforts can be used together with microscopic approaches and neutron star studies to uncover the nuclear EOS.
Submitted 25 January, 2024; v1 submitted 30 January, 2023;
originally announced January 2023.
-
The Neon Gap: Probing Ionization with Dwarf Galaxies at z~1
Authors:
John Pharo,
Yicheng Guo,
David C. Koo,
John C. Forbes,
Puragra Guhathakurta
Abstract:
We present measurements of [NeIII]λ3869 emission in z~1 low-mass galaxies taken from the Keck/DEIMOS spectroscopic surveys HALO7D and DEEPWinds. We identify 167 individual galaxies with significant [NeIII] emission lines, including 112 "dwarf" galaxies with log(M_{\star}/M_{\odot}) < 9.5, with 0.3 < z < 1.4. We also measure [NeIII] emission from composite spectra derived from all [OII]λλ3727,3729 line emitters in this range. This provides a unique sample of [NeIII]-emitters in the gap between well-studied emitters at z = 0 and 2 < z < 3. To study evolution in ionization conditions in the ISM over this time, we analyze the log([NeIII]λ3869/[OII]λλ3727,3729) ratio (Ne3O2) as a function of the stellar mass and of the log([OIII]λλ4959,5007/[OII]λλ3727,3729) ratio (O32). We find that the typical star-forming dwarf galaxy at this redshift, as measured from the composite spectra, shares the Ne3O2-M_{\star} relation with local galaxies, but has higher O32 at given Ne3O2. This finding implies that the ionization and metallicity characteristics of the z~1 dwarf population do not evolve substantially from z~1 to z=0, suggesting that the known evolution in those parameters from z~2 has largely taken place by z~1. Individual [NeIII]-detected galaxies have emission characteristics situated between local and z~2 galaxies, with elevated Ne3O2 and O32 emission potentially explained by variations in stellar and nebular metallicity. We also compare our dwarf sample to similarly low-mass z > 7 galaxies identified in JWST Early Release Observations, finding four HALO7D dwarfs with similar size, metallicity, and star formation properties.
Submitted 18 January, 2023;
originally announced January 2023.
-
Cross-Modal Learning with 3D Deformable Attention for Action Recognition
Authors:
Sangwon Kim,
Dasom Ahn,
Byoung Chul Ko
Abstract:
An important challenge in vision-based action recognition is the embedding of spatiotemporal features with two or more heterogeneous modalities into a single feature. In this study, we propose a new 3D deformable transformer for action recognition with adaptive spatiotemporal receptive fields and a cross-modal learning scheme. The 3D deformable transformer consists of three attention modules: 3D deformability, local joint stride, and temporal stride attention. The two cross-modal tokens are input into the 3D deformable attention module to create a cross-attention token with a reflected spatiotemporal correlation. Local joint stride attention is applied to spatially combine attention and pose tokens. Temporal stride attention temporally reduces the number of input tokens in the attention module and supports temporal expression learning without the simultaneous use of all tokens. The deformable transformer iterates L times and combines the last cross-modal token for classification. The proposed 3D deformable transformer was tested on the NTU60, NTU120, FineGYM, and PennAction datasets, and showed results better than or similar to pre-trained state-of-the-art methods even without a pre-training process. In addition, by visualizing important joints and correlations during action recognition through spatial joint and temporal stride attention, we show the potential for explainable action recognition.
Submitted 17 August, 2023; v1 submitted 11 December, 2022;
originally announced December 2022.
-
STAR-Transformer: A Spatio-temporal Cross Attention Transformer for Human Action Recognition
Authors:
Dasom Ahn,
Sangwon Kim,
Hyunsu Hong,
Byoung Chul Ko
Abstract:
In action recognition, although the combination of spatio-temporal videos and skeleton features can improve the recognition performance, a separate model and balancing feature representation for cross-modal data are required. To solve these problems, we propose Spatio-TemporAl cRoss (STAR)-transformer, which can effectively represent two cross-modal features as a recognizable vector. First, from the input video and skeleton sequence, video frames are output as global grid tokens and skeletons are output as joint map tokens, respectively. These tokens are then aggregated into multi-class tokens and input into STAR-transformer. The STAR-transformer encoder layer consists of a full self-attention (FAttn) module and a proposed zigzag spatio-temporal attention (ZAttn) module. Similarly, the continuous decoder consists of a FAttn module and a proposed binary spatio-temporal attention (BAttn) module. STAR-transformer learns an efficient multi-feature representation of the spatio-temporal features by properly arranging pairings of the FAttn, ZAttn, and BAttn modules. Experimental results on the Penn-Action, NTU RGB+D 60, and NTU RGB+D 120 datasets show that the proposed method achieves a promising improvement in performance in comparison to previous state-of-the-art methods.
Submitted 14 October, 2022;
originally announced October 2022.
-
Demand Layering for Real-Time DNN Inference with Minimized Memory Usage
Authors:
Mingoo Ji,
Saehanseul Yi,
Changjin Koo,
Sol Ahn,
Dongjoo Seo,
Nikil Dutt,
Jong-Chan Kim
Abstract:
When executing a deep neural network (DNN), its model parameters are loaded into GPU memory before execution, incurring a significant GPU memory burden. There are studies that reduce GPU memory usage by exploiting CPU memory as a swap device. However, this approach is not applicable in most embedded systems with integrated GPUs where CPU and GPU share a common memory. In this regard, we present Demand Layering, which employs a fast solid-state drive (SSD) as a co-running partner of a GPU and exploits the layer-by-layer execution of DNNs. In our approach, a DNN is loaded and executed in a layer-by-layer manner, minimizing the memory usage to the order of a single layer. Also, we developed a pipeline architecture that hides most additional delays caused by the interleaved parameter loadings alongside layer executions. Our implementation shows a 96.5% memory reduction with just 14.8% delay overhead on average for representative DNNs. Furthermore, by exploiting the memory-delay tradeoff, near-zero delay overhead (under 1 ms) can be achieved with a slightly increased memory usage (still an 88.4% reduction), showing the great potential of Demand Layering.
Submitted 8 October, 2022;
originally announced October 2022.
-
SynBench: Task-Agnostic Benchmarking of Pretrained Representations using Synthetic Data
Authors:
Ching-Yun Ko,
Pin-Yu Chen,
Jeet Mohapatra,
Payel Das,
Luca Daniel
Abstract:
Recent success in fine-tuning large models that are pretrained on broad data at scale on downstream tasks has led to a significant paradigm shift in deep learning, from task-centric model design to task-agnostic representation learning and task-specific fine-tuning. As the representations of pretrained models are used as a foundation for different downstream tasks, this paper proposes a new task-agnostic framework, SynBench, to measure the quality of pretrained representations using synthetic data. We set up a reference by a theoretically-derived robustness-accuracy tradeoff of the class conditional Gaussian mixture. Given a pretrained model, the representations of data synthesized from the Gaussian mixture are compared with our reference to infer the quality. By comparing the ratio of area-under-curve between the raw data and their representations, SynBench offers a quantifiable score for robustness-accuracy performance benchmarking. Our framework applies to a wide range of pretrained models taking continuous data inputs and is independent of the downstream tasks and datasets. Evaluated with several pretrained vision transformer models, the experimental results show that our SynBench score matches well the actual linear probing performance of the pretrained model when fine-tuned on downstream tasks. Moreover, our framework can be used to inform the design of robust linear probing on pretrained representations to mitigate the robustness-accuracy tradeoff in downstream tasks.
Submitted 7 October, 2022; v1 submitted 6 October, 2022;
originally announced October 2022.