Search | arXiv e-print repository

Divergence metrics in the study of Markov and hidden Markov processes

Authors: Jin Won Kim, Amirhossein Taghvaei, Prashant G. Mehta

Abstract: This paper is divided into two parts. The first part reviews the formulae for f-divergences in the study of continuous-time Markov processes and explores their applications in areas such as stochastic stability, the second law of thermodynamics, and its non-equilibrium extensions. This sets the foundation for the second part, which focuses on f-divergence in the study of hidden Markov processes. I… ▽ More This paper is divided into two parts. The first part reviews the formulae for f-divergences in the study of continuous-time Markov processes and explores their applications in areas such as stochastic stability, the second law of thermodynamics, and its non-equilibrium extensions. This sets the foundation for the second part, which focuses on f-divergence in the study of hidden Markov processes. In this context, we present analyses of filter stability and stochastic thermodynamics, with the latter being used to illustrate the concept of a Maxwell demon in an over-damped Langevin model with white noise observations. The paper's expository style and unified formalism for both Markov and hidden Markov processes aim to serve as a valuable resource for researchers working across related fields. △ Less

Submitted 2 October, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

arXiv:2404.09920 [pdf, other]

doi 10.3847/1538-4357/ad5fee

Combined Pre-Supernova Alert System with Kamland and Super-Kamiokande

Authors: KamLAND, Super-Kamiokande Collaborations, :, Seisho Abe, Minori Eizuka, Sawako Futagi, Azusa Gando, Yoshihito Gando, Shun Goto, Takahiko Hachiya, Kazumi Hata, Koichi Ichimura, Sei Ieki, Haruo Ikeda, Kunio Inoue, Koji Ishidoshiro, Yuto Kamei, Nanami Kawada, Yasuhiro Kishimoto, Masayuki Koga, Maho Kurasawa, Tadao Mitsui, Haruhiko Miyake, Daisuke Morita, Takeshi Nakahata , et al. (290 additional authors not shown)

Abstract: Preceding a core-collapse supernova, various processes produce an increasing amount of neutrinos of all flavors characterized by mounting energies from the interior of massive stars. Among them, the electron antineutrinos are potentially detectable by terrestrial neutrino experiments such as KamLAND and Super-Kamiokande via inverse beta decay interactions. Once these pre-supernova neutrinos are ob… ▽ More Preceding a core-collapse supernova, various processes produce an increasing amount of neutrinos of all flavors characterized by mounting energies from the interior of massive stars. Among them, the electron antineutrinos are potentially detectable by terrestrial neutrino experiments such as KamLAND and Super-Kamiokande via inverse beta decay interactions. Once these pre-supernova neutrinos are observed, an early warning of the upcoming core-collapse supernova can be provided. In light of this, KamLAND and Super-Kamiokande, both located in the Kamioka mine in Japan, have been monitoring pre-supernova neutrinos since 2015 and 2021, respectively. Recently, we performed a joint study between KamLAND and Super-Kamiokande on pre-supernova neutrino detection. A pre-supernova alert system combining the KamLAND detector and the Super-Kamiokande detector was developed and put into operation, which can provide a supernova alert to the astrophysics community. Fully leveraging the complementary properties of these two detectors, the combined alert is expected to resolve a pre-supernova neutrino signal from a 15 M$_{\odot}$ star within 510 pc of the Earth, at a significance level corresponding to a false alarm rate of no more than 1 per century. For a Betelgeuse-like model with optimistic parameters, it can provide early warnings up to 12 hours in advance. △ Less

Submitted 1 July, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

Comments: Resubmitted to ApJ. 22 pages, 16 figures, for more information about the combined pre-supernova alert system, see https://www.lowbg.org/presnalarm/

arXiv:2404.08725 [pdf, other]

doi 10.1093/ptep/ptae128

Development of a data overflow protection system for Super-Kamiokande to maximize data from nearby supernovae

Authors: M. Mori, K. Abe, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, S. Miki, S. Mine, M. Miura, S. Moriyama, Y. Nakano, M. Nakahata, S. Nakayama, Y. Noguchi, K. Okamoto, K. Sato, H. Sekiya, H. Shiba, K. Shimizu , et al. (230 additional authors not shown)

Abstract: Neutrinos from very nearby supernovae, such as Betelgeuse, are expected to generate more than ten million events over 10\,s in Super-Kamokande (SK). At such large event rates, the buffers of the SK analog-to-digital conversion board (QBEE) will overflow, causing random loss of data that is critical for understanding the dynamics of the supernova explosion mechanism. In order to solve this problem,… ▽ More Neutrinos from very nearby supernovae, such as Betelgeuse, are expected to generate more than ten million events over 10\,s in Super-Kamokande (SK). At such large event rates, the buffers of the SK analog-to-digital conversion board (QBEE) will overflow, causing random loss of data that is critical for understanding the dynamics of the supernova explosion mechanism. In order to solve this problem, two new DAQ modules were developed to aid in the observation of very nearby supernovae. The first of these, the SN module, is designed to save only the number of hit PMTs during a supernova burst and the second, the Veto module, prescales the high rate neutrino events to prevent the QBEE from overflowing based on information from the SN module. In the event of a very nearby supernova, these modules allow SK to reconstruct the time evolution of the neutrino event rate from beginning to end using both QBEE and SN module data. This paper presents the development and testing of these modules together with an analysis of supernova-like data generated with a flashing laser diode. We demonstrate that the Veto module successfully prevents DAQ overflows for Betelgeuse-like supernovae as well as the long-term stability of the new modules. During normal running the Veto module is found to issue DAQ vetos a few times per month resulting in a total dead time less than 1\,ms, and does not influence ordinary operations. Additionally, using simulation data we find that supernovae closer than 800~pc will trigger Veto module resulting in a prescaling of the observed neutrino data. △ Less

Submitted 13 August, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

Comments: 28 pages, 18 figures. Submitted to PTEP

arXiv:2404.06696 [pdf, other]

Dual Ensemble Kalman Filter for Stochastic Optimal Control

Authors: Anant A. Joshi, Amirhossein Taghvaei, Prashant G. Mehta, Sean P. Meyn

Abstract: In this paper, stochastic optimal control problems in continuous time and space are considered. In recent years, such problems have received renewed attention from the lens of reinforcement learning (RL) which is also one of our motivation. The main contribution is a simulation-based algorithm -- dual ensemble Kalman filter (EnKF) -- to numerically approximate the solution of these problems. The p… ▽ More In this paper, stochastic optimal control problems in continuous time and space are considered. In recent years, such problems have received renewed attention from the lens of reinforcement learning (RL) which is also one of our motivation. The main contribution is a simulation-based algorithm -- dual ensemble Kalman filter (EnKF) -- to numerically approximate the solution of these problems. The paper extends our previous work where the dual EnKF was applied in deterministic settings of the problem. The theoretical results and algorithms are illustrated with numerical experiments. △ Less

Submitted 26 October, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

Comments: Accepted to IEEE Conference on Decision and Control, 2024

arXiv:2403.16258 [pdf, other]

Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis

Authors: Atefeh Khoshkhahtinat, Ali Zafari, Piyush M. Mehta, Nasser M. Nasrabadi

Abstract: While replacing Gaussian decoders with a conditional diffusion model enhances the perceptual quality of reconstructions in neural image compression, their lack of inductive bias for image data restricts their ability to achieve state-of-the-art perceptual levels. To address this limitation, we adopt a non-isotropic diffusion model at the decoder side. This model imposes an inductive bias aimed at… ▽ More While replacing Gaussian decoders with a conditional diffusion model enhances the perceptual quality of reconstructions in neural image compression, their lack of inductive bias for image data restricts their ability to achieve state-of-the-art perceptual levels. To address this limitation, we adopt a non-isotropic diffusion model at the decoder side. This model imposes an inductive bias aimed at distinguishing between frequency contents, thereby facilitating the generation of high-quality images. Moreover, our framework is equipped with a novel entropy model that accurately models the probability distribution of latent representation by exploiting spatio-channel correlations in latent space, while accelerating the entropy decoding step. This channel-wise entropy model leverages both local and global spatial contexts within each channel chunk. The global spatial context is built upon the Transformer, which is specifically designed for image compression tasks. The designed Transformer employs a Laplacian-shaped positional encoding, the learnable parameters of which are adaptively adjusted for each channel cluster. Our experiments demonstrate that our proposed framework yields better perceptual quality compared to cutting-edge generative-based codecs, and the proposed entropy model contributes to notable bitrate savings. △ Less

Submitted 24 March, 2024; originally announced March 2024.

Comments: Accepted by CVPR2024

arXiv:2403.08619 [pdf, other]

doi 10.1103/PhysRevD.110.082008

Measurements of the charge ratio and polarization of cosmic-ray muons with the Super-Kamiokande detector

Authors: H. Kitagawa, T. Tada, K. Abe, C. Bronner, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, S. Miki, S. Mine, M. Miura, S. Moriyama, Y. Nakano, M. Nakahata, S. Nakayama, Y. Noguchi, K. Okamoto, K. Sato, H. Sekiya , et al. (231 additional authors not shown)

Abstract: We present the results of the charge ratio ($R$) and polarization ($P^μ_{0}$) measurements using the decay electron events collected from 2008 September to 2022 June by the Super-Kamiokande detector. Because of its underground location and long operation, we performed high precision measurements by accumulating cosmic-ray muons. We measured the muon charge ratio to be $R=1.32 \pm 0.02$… ▽ More We present the results of the charge ratio ($R$) and polarization ($P^μ_{0}$) measurements using the decay electron events collected from 2008 September to 2022 June by the Super-Kamiokande detector. Because of its underground location and long operation, we performed high precision measurements by accumulating cosmic-ray muons. We measured the muon charge ratio to be $R=1.32 \pm 0.02$ $(\mathrm{stat.}{+}\mathrm{syst.})$ at $E_μ\cos θ_{\mathrm{Zenith}}=0.7^{+0.3}_{-0.2}$ $\mathrm{TeV}$, where $E_μ$ is the muon energy and $θ_{\mathrm{Zenith}}$ is the zenith angle of incoming cosmic-ray muons. This result is consistent with the Honda flux model while this suggests a tension with the $πK$ model of $1.9σ$. We also measured the muon polarization at the production location to be $P^μ_{0}=0.52 \pm 0.02$ $(\mathrm{stat.}{+}\mathrm{syst.})$ at the muon momentum of $0.9^{+0.6}_{-0.1}$ $\mathrm{TeV}/c$ at the surface of the mountain; this also suggests a tension with the Honda flux model of $1.5σ$. This is the most precise measurement ever to experimentally determine the cosmic-ray muon polarization near $1~\mathrm{TeV}/c$. These measurement results are useful to improve the atmospheric neutrino simulations. △ Less

Submitted 4 November, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

Comments: 29 pages, 45 figures

Journal ref: Phys. Rev. D 110, 082008 (2024)

arXiv:2403.07796 [pdf, other]

doi 10.1016/j.nima.2024.169480

Second gadolinium loading to Super-Kamiokande

Authors: K. Abe, C. Bronner, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, S. Miki, S. Mine, M. Miura, S. Moriyama, Y. Nakano, M. Nakahata, S. Nakayama, Y. Noguchi, K. Sato, H. Sekiya, H. Shiba, K. Shimizu, M. Shiozawa , et al. (225 additional authors not shown)

Abstract: The first loading of gadolinium (Gd) into Super-Kamiokande in 2020 was successful, and the neutron capture efficiency on Gd reached 50\%. To further increase the Gd neutron capture efficiency to 75\%, 26.1 tons of $\rm Gd_2(\rm SO_4)_3\cdot \rm 8H_2O$ was additionally loaded into Super-Kamiokande (SK) from May 31 to July 4, 2022. As the amount of loaded $\rm Gd_2(\rm SO_4)_3\cdot \rm 8H_2O$ was do… ▽ More The first loading of gadolinium (Gd) into Super-Kamiokande in 2020 was successful, and the neutron capture efficiency on Gd reached 50\%. To further increase the Gd neutron capture efficiency to 75\%, 26.1 tons of $\rm Gd_2(\rm SO_4)_3\cdot \rm 8H_2O$ was additionally loaded into Super-Kamiokande (SK) from May 31 to July 4, 2022. As the amount of loaded $\rm Gd_2(\rm SO_4)_3\cdot \rm 8H_2O$ was doubled compared to the first loading, the capacity of the powder dissolving system was doubled. We also developed new batches of gadolinium sulfate with even further reduced radioactive impurities. In addition, a more efficient screening method was devised and implemented to evaluate these new batches of $\rm Gd_2(\rm SO_4)_3\cdot \rm 8H_2O$. Following the second loading, the Gd concentration in SK was measured to be $333.5\pm2.5$ ppm via an Atomic Absorption Spectrometer (AAS). From the mean neutron capture time constant of neutrons from an Am/Be calibration source, the Gd concentration was independently measured to be 332.7 $\pm$ 6.8(sys.) $\pm$ 1.1(stat.) ppm, consistent with the AAS result. Furthermore, during the loading the Gd concentration was monitored continually using the capture time constant of each spallation neutron produced by cosmic-ray muons,and the final neutron capture efficiency was shown to become 1.5 times higher than that of the first loaded phase, as expected. △ Less

Submitted 18 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

Comments: 34 pages, 13 figures, submitted to Nuclear Inst. and Methods in Physics Research, A

Journal ref: Nuclear Inst. and Methods in Physics Research, A 1065 (2024) 169480

arXiv:2403.06760 [pdf, other]

Performance of SK-Gd's Upgraded Real-time Supernova Monitoring System

Authors: Y. Kashiwagi, K. Abe, C. Bronner, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kataoka, S. Miki, S. Mine, M. Miura, S. Moriyama, Y. Nakano, M. Nakahata, S. Nakayama, Y. Noguchi, K. Sato, H. Sekiya, H. Shiba, K. Shimizu, M. Shiozawa , et al. (214 additional authors not shown)

Abstract: Among multi-messenger observations of the next galactic core-collapse supernova, Super-Kamiokande (SK) plays a critical role in detecting the emitted supernova neutrinos, determining the direction to the supernova (SN), and notifying the astronomical community of these observations in advance of the optical signal. On 2022, SK has increased the gadolinium dissolved in its water target (SK-Gd) and… ▽ More Among multi-messenger observations of the next galactic core-collapse supernova, Super-Kamiokande (SK) plays a critical role in detecting the emitted supernova neutrinos, determining the direction to the supernova (SN), and notifying the astronomical community of these observations in advance of the optical signal. On 2022, SK has increased the gadolinium dissolved in its water target (SK-Gd) and has achieved a Gd concentration of 0.033%, resulting in enhanced neutron detection capability, which in turn enables more accurate determination of the supernova direction. Accordingly, SK-Gd's real-time supernova monitoring system (Abe te al. 2016b) has been upgraded. SK_SN Notice, a warning system that works together with this monitoring system, was released on December 13, 2021, and is available through GCN Notices (Barthelmy et al. 2000). When the monitoring system detects an SN-like burst of events, SK_SN Notice will automatically distribute an alarm with the reconstructed direction to the supernova candidate within a few minutes. In this paper, we present a systematic study of SK-Gd's response to a simulated galactic SN. Assuming a supernova situated at 10 kpc, neutrino fluxes from six supernova models are used to characterize SK-Gd's pointing accuracy using the same tools as the online monitoring system. The pointing accuracy is found to vary from 3-7$^\circ$ depending on the models. However, if the supernova is closer than 10 kpc, SK_SN Notice can issue an alarm with three-degree accuracy, which will benefit follow-up observations by optical telescopes with large fields of view. △ Less

Submitted 13 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

Comments: 38 pages, 29 figures, 6 tables

arXiv:2403.06350 [pdf, other]

doi 10.18653/v1/2024.acl-long.843

IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages

Authors: Mohammed Safi Ur Rahman Khan, Priyam Mehta, Ananth Sankar, Umashankar Kumaravelan, Sumanth Doddapaneni, Suriyaprasaad B, Varun Balan G, Sparsh Jain, Anoop Kunchukuttan, Pratyush Kumar, Raj Dabre, Mitesh M. Khapra

Abstract: Despite the considerable advancements in English LLMs, the progress in building comparable models for other languages has been hindered due to the scarcity of tailored resources. Our work aims to bridge this divide by introducing an expansive suite of resources specifically designed for the development of Indic LLMs, covering 22 languages, containing a total of 251B tokens and 74.8M instruction-re… ▽ More Despite the considerable advancements in English LLMs, the progress in building comparable models for other languages has been hindered due to the scarcity of tailored resources. Our work aims to bridge this divide by introducing an expansive suite of resources specifically designed for the development of Indic LLMs, covering 22 languages, containing a total of 251B tokens and 74.8M instruction-response pairs. Recognizing the importance of both data quality and quantity, our approach combines highly curated manually verified data, unverified yet valuable data, and synthetic data. We build a clean, open-source pipeline for curating pre-training data from diverse sources, including websites, PDFs, and videos, incorporating best practices for crawling, cleaning, flagging, and deduplication. For instruction-fine tuning, we amalgamate existing Indic datasets, translate/transliterate English datasets into Indian languages, and utilize LLaMa2 and Mixtral models to create conversations grounded in articles from Indian Wikipedia and Wikihow. Additionally, we address toxicity alignment by generating toxic prompts for multiple scenarios and then generate non-toxic responses by feeding these toxic prompts to an aligned LLaMa2 model. We hope that the datasets, tools, and resources released as a part of this work will not only propel the research and development of Indic LLMs but also establish an open-source blueprint for extending such efforts to other languages. The data and other artifacts created as part of this work are released with permissive licenses. △ Less

Submitted 28 November, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

Comments: ACL-2024 Outstanding Paper

arXiv:2403.05530 [pdf, other]

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1112 additional authors not shown)

Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February version on the great majority of capabilities and benchmarks; (2) Gemini 1.5 Flash, a more lightweight variant designed for efficiency with minimal regression in quality. Gemini 1.5 models achieve near-perfect recall on long-context retrieval tasks across modalities, improve the state-of-the-art in long-document QA, long-video QA and long-context ASR, and match or surpass Gemini 1.0 Ultra's state-of-the-art performance across a broad set of benchmarks. Studying the limits of Gemini 1.5's long-context ability, we find continued improvement in next-token prediction and near-perfect retrieval (>99%) up to at least 10M tokens, a generational leap over existing models such as Claude 3.0 (200k) and GPT-4 Turbo (128k). Finally, we highlight real-world use cases, such as Gemini 1.5 collaborating with professionals on completing their tasks achieving 26 to 75% time savings across 10 different job categories, as well as surprising new capabilities of large language models at the frontier; when given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content. △ Less

Submitted 16 December, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

arXiv:2403.05497 [pdf, other]

Les Houches Lectures on Community Ecology: From Niche Theory to Statistical Mechanics

Authors: Wenping Cui, Robert Marsland III, Pankaj Mehta

Abstract: Ecosystems are among the most interesting and well-studied examples of self-organized complex systems. Community ecology, the study of how species interact with each other and the environment, has a rich tradition. Over the last few years, there has been a growing theoretical and experimental interest in these problems from the physics and quantitative biology communities. Here, we give an overvie… ▽ More Ecosystems are among the most interesting and well-studied examples of self-organized complex systems. Community ecology, the study of how species interact with each other and the environment, has a rich tradition. Over the last few years, there has been a growing theoretical and experimental interest in these problems from the physics and quantitative biology communities. Here, we give an overview of community ecology, highlighting the deep connections between ecology and statistical physics. We start by introducing the two classes of mathematical models that have served as the workhorses of community ecology: Consumer Resource Models (CRM) and the generalized Lotka-Volterra models (GLV). We place a special emphasis on graphical methods and general principles. We then review recent works showing a deep and surprising connection between ecological dynamics and constrained optimization. We then shift our focus by analyzing these same models in "high-dimensions" (i.e. in the limit where the number of species and resources in the ecosystem becomes large) and discuss how such complex ecosystems can be analyzed using methods from the statistical physics of disordered systems such as the cavity method and Random Matrix Theory. △ Less

Submitted 8 March, 2024; originally announced March 2024.

Comments: 48 pages, 9 figures, Les Houches Theoretical Biophysics Summer School 2023

arXiv:2403.03212 [pdf, other]

Performance of a modular ton-scale pixel-readout liquid argon time projection chamber

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, T. Alves, H. Amar, P. Amedo, J. Anderson, D. A. Andrade , et al. (1340 additional authors not shown)

Abstract: The Module-0 Demonstrator is a single-phase 600 kg liquid argon time projection chamber operated as a prototype for the DUNE liquid argon near detector. Based on the ArgonCube design concept, Module-0 features a novel 80k-channel pixelated charge readout and advanced high-coverage photon detection system. In this paper, we present an analysis of an eight-day data set consisting of 25 million cosmi… ▽ More The Module-0 Demonstrator is a single-phase 600 kg liquid argon time projection chamber operated as a prototype for the DUNE liquid argon near detector. Based on the ArgonCube design concept, Module-0 features a novel 80k-channel pixelated charge readout and advanced high-coverage photon detection system. In this paper, we present an analysis of an eight-day data set consisting of 25 million cosmic ray events collected in the spring of 2021. We use this sample to demonstrate the imaging performance of the charge and light readout systems as well as the signal correlations between the two. We also report argon purity and detector uniformity measurements, and provide comparisons to detector simulations. △ Less

Submitted 5 March, 2024; originally announced March 2024.

Comments: 47 pages, 41 figures

Report number: FERMILAB-PUB-24-0073-LBNF

arXiv:2403.01276 [pdf, other]

A universal niche geometry governs the response of ecosystems to environmental perturbations

Authors: Akshit Goyal, Jason W. Rocks, Pankaj Mehta

Abstract: How ecosystems respond to environmental perturbations is a fundamental question in ecology, made especially challenging due to the strong coupling between species and their environment. Here, we introduce a theoretical framework for calculating the steady-state response of ecosystems to environmental perturbations in generalized consumer-resource. Our construction is applicable to a wide class of… ▽ More How ecosystems respond to environmental perturbations is a fundamental question in ecology, made especially challenging due to the strong coupling between species and their environment. Here, we introduce a theoretical framework for calculating the steady-state response of ecosystems to environmental perturbations in generalized consumer-resource. Our construction is applicable to a wide class of systems, including models with non-reciprocal interactions, cross-feeding, and non-linear growth/consumption rates. Within our framework, all ecological variables are embedded into four distinct vector spaces and ecological interactions are represented by geometric transformations between these spaces. We show that near a steady state, such geometric transformations directly map environmental perturbations - in resource availability and mortality rates - to shifts in niche structure. We illustrate these ideas in a variety of settings including a minimal model for pH-induced toxicity in bacterial denitrification. We end by discussing the biological implications of our framework. In particular, we show that it is extremely difficult to distinguish cooperative and competitive interactions by measuring species' responses to external perturbations. △ Less

Submitted 22 November, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

Comments: 13 pages, 5 figures

arXiv:2402.13496 [pdf, other]

doi 10.1609/aaai.v39i16.33860

Heterogeneous Graph Neural Network on Semantic Tree

Authors: Mingyu Guan, Jack W. Stokes, Qinlong Luo, Fuchen Liu, Purvanshi Mehta, Elnaz Nouri, Taesoo Kim

Abstract: The recent past has seen an increasing interest in Heterogeneous Graph Neural Networks (HGNNs), since many real-world graphs are heterogeneous in nature, from citation graphs to email graphs. However, existing methods ignore a tree hierarchy among metapaths, naturally constituted by different node types and relation types. In this paper, we present HetTree, a novel HGNN that models both the graph… ▽ More The recent past has seen an increasing interest in Heterogeneous Graph Neural Networks (HGNNs), since many real-world graphs are heterogeneous in nature, from citation graphs to email graphs. However, existing methods ignore a tree hierarchy among metapaths, naturally constituted by different node types and relation types. In this paper, we present HetTree, a novel HGNN that models both the graph structure and heterogeneous aspects in a scalable and effective manner. Specifically, HetTree builds a semantic tree data structure to capture the hierarchy among metapaths. To effectively encode the semantic tree, HetTree uses a novel subtree attention mechanism to emphasize metapaths that are more helpful in encoding parent-child relationships. Moreover, HetTree proposes carefully matching pre-computed features and labels correspondingly, constituting a complete metapath representation. Our evaluation of HetTree on a variety of real-world datasets demonstrates that it outperforms all existing baselines on open benchmarks and efficiently scales to large real-world graphs with millions of nodes and edges. △ Less

Submitted 2 March, 2025; v1 submitted 20 February, 2024; originally announced February 2024.

Comments: Accepted at AAAI 2025

arXiv:2402.05075 [pdf, other]

ARCollab: Towards Multi-User Interactive Cardiovascular Surgical Planning in Mobile Augmented Reality

Authors: Pratham Mehta, Harsha Karanth, Haoyang Yang, Timothy Slesnick, Fawwaz Shaw, Duen Horng Chau

Abstract: Surgical planning for congenital heart diseases requires a collaborative approach, traditionally involving the 3D-printing of physical heart models for inspection by surgeons and cardiologists. Recent advancements in mobile augmented reality (AR) technologies have offered a promising alternative, noted for their ease-of-use and portability. Despite this progress, there remains a gap in research ex… ▽ More Surgical planning for congenital heart diseases requires a collaborative approach, traditionally involving the 3D-printing of physical heart models for inspection by surgeons and cardiologists. Recent advancements in mobile augmented reality (AR) technologies have offered a promising alternative, noted for their ease-of-use and portability. Despite this progress, there remains a gap in research exploring the use of multi-user mobile AR environments for facilitating collaborative cardiovascular surgical planning. We are developing ARCollab, an iOS AR application designed to allow multiple surgeons and cardiologists to interact with patient-specific 3D heart models in a shared environment. ARCollab allows surgeons and cardiologists to import heart models, perform gestures to manipulate the heart, and collaborate with other users without having to produce a physical heart model. We are excited by the potential for ARCollab to make long-term real-world impact, thanks to the ubiquity of iOS devices that will allow for ARCollab's easy distribution, deployment and adoption. △ Less

Submitted 7 February, 2024; originally announced February 2024.

arXiv:2402.01877 [pdf, other]

Mobile Fitting Room: On-device Virtual Try-on via Diffusion Models

Authors: Justin Blalock, David Munechika, Harsha Karanth, Alec Helbling, Pratham Mehta, Seongmin Lee, Duen Horng Chau

Abstract: The growing digital landscape of fashion e-commerce calls for interactive and user-friendly interfaces for virtually trying on clothes. Traditional try-on methods grapple with challenges in adapting to diverse backgrounds, poses, and subjects. While newer methods, utilizing the recent advances of diffusion models, have achieved higher-quality image generation, the human-centered dimensions of mobi… ▽ More The growing digital landscape of fashion e-commerce calls for interactive and user-friendly interfaces for virtually trying on clothes. Traditional try-on methods grapple with challenges in adapting to diverse backgrounds, poses, and subjects. While newer methods, utilizing the recent advances of diffusion models, have achieved higher-quality image generation, the human-centered dimensions of mobile interface delivery and privacy concerns remain largely unexplored. We present Mobile Fitting Room, the first on-device diffusion-based virtual try-on system. To address multiple inter-related technical challenges such as high-quality garment placement and model compression for mobile devices, we present a novel technical pipeline and an interface design that enables privacy preservation and user customization. A usage scenario highlights how our tool can provide a seamless, interactive virtual try-on experience for customers and provide a valuable service for fashion e-commerce businesses. △ Less

Submitted 2 February, 2024; originally announced February 2024.

Comments: 7 pages, 3 figures

arXiv:2402.01568 [pdf, other]

Doping Liquid Argon with Xenon in ProtoDUNE Single-Phase: Effects on Scintillation Light

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, H. Amar Es-sghir, P. Amedo, J. Anderson, D. A. Andrade, C. Andreopoulos , et al. (1297 additional authors not shown)

Abstract: Doping of liquid argon TPCs (LArTPCs) with a small concentration of xenon is a technique for light-shifting and facilitates the detection of the liquid argon scintillation light. In this paper, we present the results of the first doping test ever performed in a kiloton-scale LArTPC. From February to May 2020, we carried out this special run in the single-phase DUNE Far Detector prototype (ProtoDUN… ▽ More Doping of liquid argon TPCs (LArTPCs) with a small concentration of xenon is a technique for light-shifting and facilitates the detection of the liquid argon scintillation light. In this paper, we present the results of the first doping test ever performed in a kiloton-scale LArTPC. From February to May 2020, we carried out this special run in the single-phase DUNE Far Detector prototype (ProtoDUNE-SP) at CERN, featuring 720 t of total liquid argon mass with 410 t of fiducial mass. A 5.4 ppm nitrogen contamination was present during the xenon doping campaign. The goal of the run was to measure the light and charge response of the detector to the addition of xenon, up to a concentration of 18.8 ppm. The main purpose was to test the possibility for reduction of non-uniformities in light collection, caused by deployment of photon detectors only within the anode planes. Light collection was analysed as a function of the xenon concentration, by using the pre-existing photon detection system (PDS) of ProtoDUNE-SP and an additional smaller set-up installed specifically for this run. In this paper we first summarize our current understanding of the argon-xenon energy transfer process and the impact of the presence of nitrogen in argon with and without xenon dopant. We then describe the key elements of ProtoDUNE-SP and the injection method deployed. Two dedicated photon detectors were able to collect the light produced by xenon and the total light. The ratio of these components was measured to be about 0.65 as 18.8 ppm of xenon were injected. We performed studies of the collection efficiency as a function of the distance between tracks and light detectors, demonstrating enhanced uniformity of response for the anode-mounted PDS. We also show that xenon doping can substantially recover light losses due to contamination of the liquid argon by nitrogen. △ Less

Submitted 2 August, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

Comments: 36 pages, 20 figures. Corrected author list; corrected typos across paper and polished text

Report number: CERN-EP-2024-024; FERMILAB-PUB-23-0819-LBNF

arXiv:2402.01074 [pdf, other]

Neural Models and Algorithms for Sensorimotor Control of an Octopus Arm

Authors: Tixian Wang, Udit Halder, Ekaterina Gribkova, Rhanor Gillette, Mattia Gazzola, Prashant G. Mehta

Abstract: In this article, a biophysically realistic model of a soft octopus arm with internal musculature is presented. The modeling is motivated by experimental observations of sensorimotor control where an arm localizes and reaches a target. Major contributions of this article are: (i) development of models to capture the mechanical properties of arm musculature, the electrical properties of the arm peri… ▽ More In this article, a biophysically realistic model of a soft octopus arm with internal musculature is presented. The modeling is motivated by experimental observations of sensorimotor control where an arm localizes and reaches a target. Major contributions of this article are: (i) development of models to capture the mechanical properties of arm musculature, the electrical properties of the arm peripheral nervous system (PNS), and the coupling of PNS with muscular contractions; (ii) modeling the arm sensory system, including chemosensing and proprioception; and (iii) algorithms for sensorimotor control, which include a novel feedback neural motor control law for mimicking target-oriented arm reaching motions, and a novel consensus algorithm for solving sensing problems such as locating a food source from local chemical sensory information (exogenous) and arm deformation information (endogenous). Several analytical results, including rest-state characterization and stability properties of the proposed sensing and motor control algorithms, are provided. Numerical simulations demonstrate the efficacy of our approach. Qualitative comparisons against observed arm rest shapes and target-oriented reaching motions are also reported. △ Less

Submitted 27 April, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

arXiv:2401.17155 [pdf, other]

Cell spheroid viscoelasticity is deformation-dependent

Authors: Ruben C. Boot, Anouk van der Net, Christos Gogou, Pranav Mehta, Dimphna H. Meijer, Gijsje H. Koenderink, Pouyan E. Boukany

Abstract: Tissue surface tension influences cell sorting and tissue fusion. Earlier mechanical studies suggest that multicellular spheroids actively reinforce their surface tension with applied force. Here we study this open question through high-throughput microfluidic micropipette aspiration measurements on cell spheroids to identify the role of force duration and cell contractility. We find that larger s… ▽ More Tissue surface tension influences cell sorting and tissue fusion. Earlier mechanical studies suggest that multicellular spheroids actively reinforce their surface tension with applied force. Here we study this open question through high-throughput microfluidic micropipette aspiration measurements on cell spheroids to identify the role of force duration and cell contractility. We find that larger spheroid deformations lead to faster cellular retraction once the pressure is released, regardless of the applied force and cellular contractility. These new insights demonstrate that spheroid viscoelasticity is deformation-dependent and challenge whether surface tension truly reinforces. △ Less

Submitted 30 January, 2024; originally announced January 2024.

arXiv:2401.04481 [pdf, other]

Fighting Fire with Fire: Adversarial Prompting to Generate a Misinformation Detection Dataset

Authors: Shrey Satapara, Parth Mehta, Debasis Ganguly, Sandip Modha

Abstract: The recent success in language generation capabilities of large language models (LLMs), such as GPT, Bard, Llama etc., can potentially lead to concerns about their possible misuse in inducing mass agitation and communal hatred via generating fake news and spreading misinformation. Traditional means of developing a misinformation ground-truth dataset does not scale well because of the extensive man… ▽ More The recent success in language generation capabilities of large language models (LLMs), such as GPT, Bard, Llama etc., can potentially lead to concerns about their possible misuse in inducing mass agitation and communal hatred via generating fake news and spreading misinformation. Traditional means of developing a misinformation ground-truth dataset does not scale well because of the extensive manual effort required to annotate the data. In this paper, we propose an LLM-based approach of creating silver-standard ground-truth datasets for identifying misinformation. Specifically speaking, given a trusted news article, our proposed approach involves prompting LLMs to automatically generate a summarised version of the original article. The prompts in our proposed approach act as a controlling mechanism to generate specific types of factual incorrectness in the generated summaries, e.g., incorrect quantities, false attributions etc. To investigate the usefulness of this dataset, we conduct a set of experiments where we train a range of supervised models for the task of misinformation detection. △ Less

Submitted 9 January, 2024; originally announced January 2024.

arXiv:2312.12907 [pdf, ps, other]

doi 10.1103/PhysRevD.109.092001

Solar neutrino measurements using the full data period of Super-Kamiokande-IV

Authors: Super-Kamiokande Collaboration, :, K. Abe, C. Bronner, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, S. Imaizumi, K. Iyogi, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, Y. Kato, Y. Kishimoto, S. Miki, S. Mine, M. Miura, T. Mochizuki, S. Moriyama, Y. Nagao, M. Nakahata , et al. (305 additional authors not shown)

Abstract: An analysis of solar neutrino data from the fourth phase of Super-Kamiokande~(SK-IV) from October 2008 to May 2018 is performed and the results are presented. The observation time of the data set of SK-IV corresponds to $2970$~days and the total live time for all four phases is $5805$~days. For more precise solar neutrino measurements, several improvements are applied in this analysis: lowering th… ▽ More An analysis of solar neutrino data from the fourth phase of Super-Kamiokande~(SK-IV) from October 2008 to May 2018 is performed and the results are presented. The observation time of the data set of SK-IV corresponds to $2970$~days and the total live time for all four phases is $5805$~days. For more precise solar neutrino measurements, several improvements are applied in this analysis: lowering the data acquisition threshold in May 2015, further reduction of the spallation background using neutron clustering events, precise energy reconstruction considering the time variation of the PMT gain. The observed number of solar neutrino events in $3.49$--$19.49$ MeV electron kinetic energy region during SK-IV is $65,443^{+390}_{-388}\,(\mathrm{stat.})\pm 925\,(\mathrm{syst.})$ events. Corresponding $\mathrm{^{8}B}$ solar neutrino flux is $(2.314 \pm 0.014\, \rm{(stat.)} \pm 0.040 \, \rm{(syst.)}) \times 10^{6}~\mathrm{cm^{-2}\,s^{-1}}$, assuming a pure electron-neutrino flavor component without neutrino oscillations. The flux combined with all SK phases up to SK-IV is $(2.336 \pm 0.011\, \rm{(stat.)} \pm 0.043 \, \rm{(syst.)}) \times 10^{6}~\mathrm{cm^{-2}\,s^{-1}}$. Based on the neutrino oscillation analysis from all solar experiments, including the SK $5805$~days data set, the best-fit neutrino oscillation parameters are $\rm{sin^{2} θ_{12,\,solar}} = 0.306 \pm 0.013 $ and $Δm^{2}_{21,\,\mathrm{solar}} = (6.10^{+ 0.95}_{-0.81}) \times 10^{-5}~\rm{eV}^{2}$, with a deviation of about 1.5$σ$ from the $Δm^{2}_{21}$ parameter obtained by KamLAND. The best-fit neutrino oscillation parameters obtained from all solar experiments and KamLAND are $\sin^{2} θ_{12,\,\mathrm{global}} = 0.307 \pm 0.012 $ and $Δm^{2}_{21,\,\mathrm{global}} = (7.50^{+ 0.19}_{-0.18}) \times 10^{-5}~\rm{eV}^{2}$. △ Less

Submitted 20 February, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

Comments: 47 pages, 61 figures

Journal ref: Phys. Rev. D 109, 092001 (2024)

arXiv:2312.11805 [pdf, other]

Gemini: A Family of Highly Capable Multimodal Models

Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1326 additional authors not shown)

Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultra model advances the state of the art in 30 of 32 of these benchmarks - notably being the first model to achieve human-expert performance on the well-studied exam benchmark MMLU, and improving the state of the art in every one of the 20 multimodal benchmarks we examined. We believe that the new capabilities of the Gemini family in cross-modal reasoning and language understanding will enable a wide variety of use cases. We discuss our approach toward post-training and deploying Gemini models responsibly to users through services including Gemini, Gemini Advanced, Google AI Studio, and Cloud Vertex AI. △ Less

Submitted 9 May, 2025; v1 submitted 18 December, 2023; originally announced December 2023.

arXiv:2312.03130 [pdf, other]

The DUNE Far Detector Vertical Drift Technology, Technical Design Report

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, H. Amar, P. Amedo, J. Anderson, D. A. Andrade, C. Andreopoulos , et al. (1304 additional authors not shown)

Abstract: DUNE is an international experiment dedicated to addressing some of the questions at the forefront of particle physics and astrophysics, including the mystifying preponderance of matter over antimatter in the early universe. The dual-site experiment will employ an intense neutrino beam focused on a near and a far detector as it aims to determine the neutrino mass hierarchy and to make high-precisi… ▽ More DUNE is an international experiment dedicated to addressing some of the questions at the forefront of particle physics and astrophysics, including the mystifying preponderance of matter over antimatter in the early universe. The dual-site experiment will employ an intense neutrino beam focused on a near and a far detector as it aims to determine the neutrino mass hierarchy and to make high-precision measurements of the PMNS matrix parameters, including the CP-violating phase. It will also stand ready to observe supernova neutrino bursts, and seeks to observe nucleon decay as a signature of a grand unified theory underlying the standard model. The DUNE far detector implements liquid argon time-projection chamber (LArTPC) technology, and combines the many tens-of-kiloton fiducial mass necessary for rare event searches with the sub-centimeter spatial resolution required to image those events with high precision. The addition of a photon detection system enhances physics capabilities for all DUNE physics drivers and opens prospects for further physics explorations. Given its size, the far detector will be implemented as a set of modules, with LArTPC designs that differ from one another as newer technologies arise. In the vertical drift LArTPC design, a horizontal cathode bisects the detector, creating two stacked drift volumes in which ionization charges drift towards anodes at either the top or bottom. The anodes are composed of perforated PCB layers with conductive strips, enabling reconstruction in 3D. Light-trap-style photon detection modules are placed both on the cryostat's side walls and on the central cathode where they are optically powered. This Technical Design Report describes in detail the technical implementations of each subsystem of this LArTPC that, together with the other far detector modules and the near detector, will enable DUNE to achieve its physics goals. △ Less

Submitted 5 December, 2023; originally announced December 2023.

Comments: 425 pages; 281 figures Central editing team: A. Heavey, S. Kettell, A. Marchionni, S. Palestini, S. Rajogopalan, R. J. Wilson

Report number: Fermilab Report no: TM-2813-LBNF

arXiv:2311.12789 [pdf, other]

doi 10.1007/s10509-023-04253-8

Formation of a Bose Star in a Rotating Cloud

Authors: Kuldeep J. Purohit, Pravin Kumar Natwariya, Jitesh R. Bhatt, Prashant K. Mehta

Abstract: In this paper, we study the evolutions of a self-gravitating cloud of bosonic dark matter with finite angular momentum and self-interaction. This is achieved by using the sixth-order pseudospectral operator splitting method to solve the system of nonlinear Schrödinger and Poisson equations. The initial cloud is assumed to have mass density randomly distributed throughout three-dimensional space. T… ▽ More In this paper, we study the evolutions of a self-gravitating cloud of bosonic dark matter with finite angular momentum and self-interaction. This is achieved by using the sixth-order pseudospectral operator splitting method to solve the system of nonlinear Schrödinger and Poisson equations. The initial cloud is assumed to have mass density randomly distributed throughout three-dimensional space. The dark matter particles in the initial cloud are in the kinetic regime, i.e., their de Broglie wavelength is much smaller than the halo size. It is shown that Bose stars are indeed formed in the numerical simulation presented here. The presence of angular momentum and self-interaction in the initial cloud can significantly influence the star formation time in a non-trivial fashion. Furthermore, the plots of the vorticity magnitude profile after the star formation time indicate that the formed star may not have any intrinsic angular momentum for the cases when the self-interaction among the particles is either negligible or attractive. These results are in agreement with the earlier analytical studies of an isolated rotating Bose star. However, for the case of repulsive self-interaction, the vorticity magnitude analysis shows a possibility that the star formed in the numerical simulations may possess intrinsic angular momentum. It is also shown that the average mass and radius diagrams of the star are strongly influenced by the presence of angular momentum in the initial cloud. △ Less

Submitted 21 November, 2023; originally announced November 2023.

Comments: Based on arXiv:2109.02601. Important new results have been obtained

Journal ref: Astrophys Space Sci 368, 97 (2023)

arXiv:2311.05105 [pdf, other]

doi 10.1103/PhysRevD.109.072014

Atmospheric neutrino oscillation analysis with neutron tagging and an expanded fiducial volume in Super-Kamiokande I-V

Authors: Super-Kamiokande Collaboration, :, T. Wester, K. Abe, C. Bronner, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, S. Miki, S. Mine, M. Miura, S. Moriyama, Y. Nakano, M. Nakahata, S. Nakayama, Y. Noguchi, K. Sato, H. Sekiya , et al. (212 additional authors not shown)

Abstract: We present a measurement of neutrino oscillation parameters with the Super-Kamiokande detector using atmospheric neutrinos from the complete pure-water SK I-V (April 1996-July 2020) data set, including events from an expanded fiducial volume. The data set corresponds to 6511.3 live days and an exposure of 484.2 kiloton-years. Measurements of the neutrino oscillation parameters $Δm^2_{32}$,… ▽ More We present a measurement of neutrino oscillation parameters with the Super-Kamiokande detector using atmospheric neutrinos from the complete pure-water SK I-V (April 1996-July 2020) data set, including events from an expanded fiducial volume. The data set corresponds to 6511.3 live days and an exposure of 484.2 kiloton-years. Measurements of the neutrino oscillation parameters $Δm^2_{32}$, $\sin^2θ_{23}$, $\sin^2 θ_{13}$, $δ_{CP}$, and the preference for the neutrino mass ordering are presented with atmospheric neutrino data alone, and with constraints on $\sin^2 θ_{13}$ from reactor neutrino experiments. Our analysis including constraints on $\sin^2 θ_{13}$ favors the normal mass ordering at the 92.3% level. △ Less

Submitted 8 November, 2023; originally announced November 2023.

Comments: 24 pages, 18 figures

arXiv:2311.03842 [pdf, ps, other]

Measurement of the neutrino-oxygen neutral-current quasielastic cross section using atmospheric neutrinos in the SK-Gd experiment

Authors: S. Sakai, K. Abe, C. Bronner, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, S. Miki, S. Mine, M. Miura, S. Moriyama, Y. Nakano, M. Nakahata, S. Nakayama, Y. Noguchi, K. Sato, H. Sekiya, H. Shiba, K. Shimizu , et al. (211 additional authors not shown)

Abstract: We report the first measurement of the atmospheric neutrino-oxygen neutral-current quasielastic (NCQE) cross section in the gadolinium-loaded Super-Kamiokande (SK) water Cherenkov detector. In June 2020, SK began a new experimental phase, named SK-Gd, by loading 0.011% by mass of gadolinium into the ultrapure water of the SK detector. The introduction of gadolinium to ultrapure water has the effec… ▽ More We report the first measurement of the atmospheric neutrino-oxygen neutral-current quasielastic (NCQE) cross section in the gadolinium-loaded Super-Kamiokande (SK) water Cherenkov detector. In June 2020, SK began a new experimental phase, named SK-Gd, by loading 0.011% by mass of gadolinium into the ultrapure water of the SK detector. The introduction of gadolinium to ultrapure water has the effect of improving the neutron-tagging efficiency. Using a 552.2 day data set from August 2020 to June 2022, we measure the NCQE cross section to be 0.74 $\pm$ 0.22(stat.) $^{+0.85}_{-0.15}$ (syst.) $\times$ 10$^{-38}$ cm$^{2}$/oxygen in the energy range from 160 MeV to 10 GeV, which is consistent with the atmospheric neutrino-flux-averaged theoretical NCQE cross section and the measurement in the SK pure-water phase within the uncertainties. Furthermore, we compare the models of the nucleon-nucleus interactions in water and find that the Binary Cascade model and the Liege Intranuclear Cascade model provide a somewhat better fit to the observed data than the Bertini Cascade model. Since the atmospheric neutrino-oxygen NCQE reactions are one of the main backgrounds in the search for diffuse supernova neutrino background (DSNB), these new results will contribute to future studies - and the potential discovery - of the DSNB in SK. △ Less

Submitted 7 November, 2023; originally announced November 2023.

Comments: 8 pages, 3 figures

arXiv:2311.02855 [pdf, other]

doi 10.1109/TAES.2023.3332056

Neural-based Compression Scheme for Solar Image Data

Authors: Ali Zafari, Atefeh Khoshkhahtinat, Jeremy A. Grajeda, Piyush M. Mehta, Nasser M. Nasrabadi, Laura E. Boucheron, Barbara J. Thompson, Michael S. F. Kirk, Daniel da Silva

Abstract: Studying the solar system and especially the Sun relies on the data gathered daily from space missions. These missions are data-intensive and compressing this data to make them efficiently transferable to the ground station is a twofold decision to make. Stronger compression methods, by distorting the data, can increase data throughput at the cost of accuracy which could affect scientific analysis… ▽ More Studying the solar system and especially the Sun relies on the data gathered daily from space missions. These missions are data-intensive and compressing this data to make them efficiently transferable to the ground station is a twofold decision to make. Stronger compression methods, by distorting the data, can increase data throughput at the cost of accuracy which could affect scientific analysis of the data. On the other hand, preserving subtle details in the compressed data requires a high amount of data to be transferred, reducing the desired gains from compression. In this work, we propose a neural network-based lossy compression method to be used in NASA's data-intensive imagery missions. We chose NASA's SDO mission which transmits 1.4 terabytes of data each day as a proof of concept for the proposed algorithm. In this work, we propose an adversarially trained neural network, equipped with local and non-local attention modules to capture both the local and global structure of the image resulting in a better trade-off in rate-distortion (RD) compared to conventional hand-engineered codecs. The RD variational autoencoder used in this work is jointly trained with a channel-dependent entropy model as a shared prior between the analysis and synthesis transforms to make the entropy coding of the latent code more effective. Our neural image compression algorithm outperforms currently-in-use and state-of-the-art codecs such as JPEG and JPEG-2000 in terms of the RD performance when compressing extreme-ultraviolet (EUV) data. As a proof of concept for use of this algorithm in SDO data analysis, we have performed coronal hole (CH) detection using our compressed images, and generated consistent segmentations, even at a compression rate of $\sim0.1$ bits per pixel (compared to 8 bits per pixel on the original data) using EUV data from SDO. △ Less

Submitted 5 November, 2023; originally announced November 2023.

Comments: Accepted for publication in IEEE Transactions on Aerospace and Electronic Systems (TAES). arXiv admin note: text overlap with arXiv:2210.06478

arXiv:2311.01798 [pdf, other]

doi 10.1242/jeb.247175

Passive elasticity properties of $\textit{Octopus rubescens}$ arm

Authors: Udit Halder, Ekaterina Gribkova, Rhanor Gillette, Prashant G. Mehta

Abstract: In this report, passive elasticity properties of $\textit{Octopus rubescens}$ arm tissue are investigated using a multidisciplinary approach encompassing biomechanical experiments, computational modeling, and analyses. Tensile tests are conducted to obtain stress-strain relationships of the arm under axial stretch. Rheological tests are also performed to probe into dynamic shear response of the ar… ▽ More In this report, passive elasticity properties of $\textit{Octopus rubescens}$ arm tissue are investigated using a multidisciplinary approach encompassing biomechanical experiments, computational modeling, and analyses. Tensile tests are conducted to obtain stress-strain relationships of the arm under axial stretch. Rheological tests are also performed to probe into dynamic shear response of the arm tissue. Based on these tests, comparisons against three different viscoelasticity models are reported. △ Less

Submitted 3 November, 2023; originally announced November 2023.

arXiv:2311.01159 [pdf, other]

Search for Periodic Time Variations of the Solar $^8$B Neutrino Flux between 1996 and 2018 in Super-Kamiokande

Authors: K. Abe, C. Bronner, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, S. Miki, S. Mine, M. Miura, S. Moriyama, Y. Nakano, M. Nakahata, S. Nakayama, Y. Noguchi, K. Sato, H. Sekiya, H. Shiba, K. Shimizu, M. Shiozawa , et al. (211 additional authors not shown)

Abstract: We report a search for time variations of the solar $^8$B neutrino flux using 5804 live days of Super-Kamiokande data collected between May 31, 1996, and May 30, 2018. Super-Kamiokande measured the precise time of each solar neutrino interaction over 22 calendar years to search for solar neutrino flux modulations with unprecedented precision. Periodic modulations are searched for in a dataset comp… ▽ More We report a search for time variations of the solar $^8$B neutrino flux using 5804 live days of Super-Kamiokande data collected between May 31, 1996, and May 30, 2018. Super-Kamiokande measured the precise time of each solar neutrino interaction over 22 calendar years to search for solar neutrino flux modulations with unprecedented precision. Periodic modulations are searched for in a dataset comprising five-day interval solar neutrino flux measurements with a maximum likelihood method. We also applied the Lomb-Scargle method to this dataset to compare it with previous reports. The only significant modulation found is due to the elliptic orbit of the Earth around the Sun. The observed modulation is consistent with astronomical data: we measured an eccentricity of (1.53$\pm$0.35)\%, and a perihelion shift of ($-$1.5$\pm$13.5) days. △ Less

Submitted 6 June, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

Comments: 8 pages, 5 figures, 2 tables, and data file: "sksolartimevariation5804d.txt" (the data file updated with additional 3 columns -- R^2 correction, upper-error, lower-error)

Journal ref: Phys.Rev.Lett 132, 241803 (2024)

arXiv:2310.04595 [pdf, other]

Segmented Harmonic Loss: Handling Class-Imbalanced Multi-Label Clinical Data for Medical Coding with Large Language Models

Authors: Surjya Ray, Pratik Mehta, Hongen Zhang, Ada Chaman, Jian Wang, Chung-Jen Ho, Michael Chiou, Tashfeen Suleman

Abstract: The precipitous rise and adoption of Large Language Models (LLMs) have shattered expectations with the fastest adoption rate of any consumer-facing technology in history. Healthcare, a field that traditionally uses NLP techniques, was bound to be affected by this meteoric rise. In this paper, we gauge the extent of the impact by evaluating the performance of LLMs for the task of medical coding on… ▽ More The precipitous rise and adoption of Large Language Models (LLMs) have shattered expectations with the fastest adoption rate of any consumer-facing technology in history. Healthcare, a field that traditionally uses NLP techniques, was bound to be affected by this meteoric rise. In this paper, we gauge the extent of the impact by evaluating the performance of LLMs for the task of medical coding on real-life noisy data. We conducted several experiments on MIMIC III and IV datasets with encoder-based LLMs, such as BERT. Furthermore, we developed Segmented Harmonic Loss, a new loss function to address the extreme class imbalance that we found to prevail in most medical data in a multi-label scenario by segmenting and decoupling co-occurring classes of the dataset with a new segmentation algorithm. We also devised a technique based on embedding similarity to tackle noisy data. Our experimental results show that when trained with the proposed loss, the LLMs achieve significant performance gains even on noisy long-tailed datasets, outperforming the F1 score of the state-of-the-art by over ten percentage points. △ Less

Submitted 6 October, 2023; originally announced October 2023.

Comments: 16 pages,3 figures, 3 tables

arXiv:2309.10799 [pdf, other]

Multi-Context Dual Hyper-Prior Neural Image Compression

Authors: Atefeh Khoshkhahtinat, Ali Zafari, Piyush M. Mehta, Mohammad Akyash, Hossein Kashiani, Nasser M. Nasrabadi

Abstract: Transform and entropy models are the two core components in deep image compression neural networks. Most existing learning-based image compression methods utilize convolutional-based transform, which lacks the ability to model long-range dependencies, primarily due to the limited receptive field of the convolution operation. To address this limitation, we propose a Transformer-based nonlinear tran… ▽ More Transform and entropy models are the two core components in deep image compression neural networks. Most existing learning-based image compression methods utilize convolutional-based transform, which lacks the ability to model long-range dependencies, primarily due to the limited receptive field of the convolution operation. To address this limitation, we propose a Transformer-based nonlinear transform. This transform has the remarkable ability to efficiently capture both local and global information from the input image, leading to a more decorrelated latent representation. In addition, we introduce a novel entropy model that incorporates two different hyperpriors to model cross-channel and spatial dependencies of the latent representation. To further improve the entropy model, we add a global context that leverages distant relationships to predict the current latent more accurately. This global context employs a causal attention mechanism to extract long-range information in a content-dependent manner. Our experiments show that our proposed framework performs better than the state-of-the-art methods in terms of rate-distortion performance. △ Less

Submitted 19 September, 2023; originally announced September 2023.

Comments: Accepted to IEEE 22$^nd$ International Conference on Machine Learning and Applications 2023 (ICMLA) - Selected for Oral Presentation

arXiv:2309.10791 [pdf, other]

Multi-spectral Entropy Constrained Neural Compression of Solar Imagery

Authors: Ali Zafari, Atefeh Khoshkhahtinat, Piyush M. Mehta, Nasser M. Nasrabadi, Barbara J. Thompson, Michael S. F. Kirk, Daniel da Silva

Abstract: Missions studying the dynamic behaviour of the Sun are defined to capture multi-spectral images of the sun and transmit them to the ground station in a daily basis. To make transmission efficient and feasible, image compression systems need to be exploited. Recently successful end-to-end optimized neural network-based image compression systems have shown great potential to be used in an ad-hoc man… ▽ More Missions studying the dynamic behaviour of the Sun are defined to capture multi-spectral images of the sun and transmit them to the ground station in a daily basis. To make transmission efficient and feasible, image compression systems need to be exploited. Recently successful end-to-end optimized neural network-based image compression systems have shown great potential to be used in an ad-hoc manner. In this work we have proposed a transformer-based multi-spectral neural image compressor to efficiently capture redundancies both intra/inter-wavelength. To unleash the locality of window-based self attention mechanism, we propose an inter-window aggregated token multi head self attention. Additionally to make the neural compressor autoencoder shift invariant, a randomly shifted window attention mechanism is used which makes the transformer blocks insensitive to translations in their input domain. We demonstrate that the proposed approach not only outperforms the conventional compression algorithms but also it is able to better decorrelates images along the multiple wavelengths compared to single spectral compression. △ Less

Submitted 10 October, 2023; v1 submitted 19 September, 2023; originally announced September 2023.

Comments: Accepted to IEEE 22$^{nd}$ International Conference on Machine Learning and Applications 2023 (ICMLA)

arXiv:2309.10784 [pdf, other]

Context-Aware Neural Video Compression on Solar Dynamics Observatory

Authors: Atefeh Khoshkhahtinat, Ali Zafari, Piyush M. Mehta, Nasser M. Nasrabadi, Barbara J. Thompson, Michael S. F. Kirk, Daniel da Silva

Abstract: NASA's Solar Dynamics Observatory (SDO) mission collects large data volumes of the Sun's daily activity. Data compression is crucial for space missions to reduce data storage and video bandwidth requirements by eliminating redundancies in the data. In this paper, we present a novel neural Transformer-based video compression approach specifically designed for the SDO images. Our primary objective i… ▽ More NASA's Solar Dynamics Observatory (SDO) mission collects large data volumes of the Sun's daily activity. Data compression is crucial for space missions to reduce data storage and video bandwidth requirements by eliminating redundancies in the data. In this paper, we present a novel neural Transformer-based video compression approach specifically designed for the SDO images. Our primary objective is to efficiently exploit the temporal and spatial redundancies inherent in solar images to obtain a high compression ratio. Our proposed architecture benefits from a novel Transformer block called Fused Local-aware Window (FLaWin), which incorporates window-based self-attention modules and an efficient fused local-aware feed-forward (FLaFF) network. This architectural design allows us to simultaneously capture short-range and long-range information while facilitating the extraction of rich and diverse contextual representations. Moreover, this design choice results in reduced computational complexity. Experimental results demonstrate the significant contribution of the FLaWin Transformer block to the compression performance, outperforming conventional hand-engineered video codecs such as H.264 and H.265 in terms of rate-distortion trade-off. △ Less

Submitted 19 September, 2023; originally announced September 2023.

Comments: Accepted to IEEE 22$^{nd}$ International Conference on Machine Learning and Applications 2023 (ICMLA) - Selected for Oral Presentation

arXiv:2308.16606 [pdf, ps, other]

doi 10.1103/PhysRevD.108.092009

Measurements of the $ν_μ$ and $\barν_μ$-induced Coherent Charged Pion Production Cross Sections on $^{12}C$ by the T2K experiment

Authors: K. Abe, N. Akhlaq, R. Akutsu, A. Ali, S. Alonso Monsalve, C. Alt, C. Andreopoulos, M. Antonova, S. Aoki, T. Arihara, Y. Asada, Y. Ashida, E. T. Atkin, M. Barbi, G. J. Barker, G. Barr, D. Barrow, M. Batkiewicz-Kwasniak, V. Berardi, L. Berns, S. Bhadra, A. Blanchet, A. Blondel, S. Bolognesi, T. Bonus , et al. (359 additional authors not shown)

Abstract: We report an updated measurement of the $ν_μ$-induced, and the first measurement of the $\barν_μ$-induced coherent charged pion production cross section on $^{12}C$ nuclei in the T2K experiment. This is measured in a restricted region of the final-state phase space for which $p_{μ,π} > 0.2$ GeV, $\cos(θ_μ) > 0.8$ and $\cos(θ_π) > 0.6$, and at a mean (anti)neutrino energy of 0.85 GeV using the T2K… ▽ More We report an updated measurement of the $ν_μ$-induced, and the first measurement of the $\barν_μ$-induced coherent charged pion production cross section on $^{12}C$ nuclei in the T2K experiment. This is measured in a restricted region of the final-state phase space for which $p_{μ,π} > 0.2$ GeV, $\cos(θ_μ) > 0.8$ and $\cos(θ_π) > 0.6$, and at a mean (anti)neutrino energy of 0.85 GeV using the T2K near detector. The measured $ν_μ$ CC coherent pion production flux-averaged cross section on $^{12}C$ is $(2.98 \pm 0.37 (stat.) \pm 0.31 (syst.) \substack{ +0.49 \\ -0.00 } \mathrm{ (Q^2\,model)}) \times 10^{-40}~\mathrm{cm}^{2}$. The new measurement of the $\barν_μ$-induced cross section on $^{12}{C}$ is $(3.05 \pm 0.71 (stat.) \pm 0.39 (syst.) \substack{ +0.74 \\ -0.00 } \mathrm{(Q^2\,model)}) \times 10^{-40}~\mathrm{cm}^{2}$. The results are compatible with both the NEUT 5.4.0 Berger-Sehgal (2009) and GENIE 2.8.0 Rein-Sehgal (2007) model predictions. △ Less

Submitted 14 October, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

Journal ref: Phys.Rev.D 108 (2023) 9, 092009

arXiv:2308.15757 [pdf, other]

doi 10.1103/PhysRevLett.132.127401

Phase transition to chaos in complex ecosystems with non-reciprocal species-resource interactions

Authors: Emmy Blumenthal, Jason W. Rocks, Pankaj Mehta

Abstract: Non-reciprocal interactions between microscopic constituents can profoundly shape the large-scale properties of complex systems. Here, we investigate the effects of non-reciprocity in the context of theoretical ecology by analyzing a generalization of MacArthur's consumer-resource model with asymmetric interactions between species and resources. Using a mixture of analytic cavity calculations and… ▽ More Non-reciprocal interactions between microscopic constituents can profoundly shape the large-scale properties of complex systems. Here, we investigate the effects of non-reciprocity in the context of theoretical ecology by analyzing a generalization of MacArthur's consumer-resource model with asymmetric interactions between species and resources. Using a mixture of analytic cavity calculations and numerical simulations, we show that such ecosystems generically undergo a phase transition to chaotic dynamics as the amount of non-reciprocity is increased. We analytically construct the phase diagram for this model and show that the emergence of chaos is controlled by a single quantity: the ratio of surviving species to surviving resources. We also numerically calculate the Lyapunov exponents in the chaotic phase and carefully analyze finite-size effects. Our findings show how non-reciprocal interactions can give rise to complex and unpredictable dynamical behaviors even in the simplest ecological consumer-resource models. △ Less

Submitted 26 February, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

Comments: 5 pages, 4 figures; SI: 22 pages, 19 figures

Journal ref: Phys. Rev. Lett. 132, 127401, 21 March 2024

arXiv:2308.02620 [pdf, other]

doi 10.1109/ICIP49359.2023.10222816

Frequency Disentangled Features in Neural Image Compression

Authors: Ali Zafari, Atefeh Khoshkhahtinat, Piyush Mehta, Mohammad Saeed Ebrahimi Saadabadi, Mohammad Akyash, Nasser M. Nasrabadi

Abstract: The design of a neural image compression network is governed by how well the entropy model matches the true distribution of the latent code. Apart from the model capacity, this ability is indirectly under the effect of how close the relaxed quantization is to the actual hard quantization. Optimizing the parameters of a rate-distortion variational autoencoder (R-D VAE) is ruled by this approximated… ▽ More The design of a neural image compression network is governed by how well the entropy model matches the true distribution of the latent code. Apart from the model capacity, this ability is indirectly under the effect of how close the relaxed quantization is to the actual hard quantization. Optimizing the parameters of a rate-distortion variational autoencoder (R-D VAE) is ruled by this approximated quantization scheme. In this paper, we propose a feature-level frequency disentanglement to help the relaxed scalar quantization achieve lower bit rates by guiding the high entropy latent features to include most of the low-frequency texture of the image. In addition, to strengthen the de-correlating power of the transformer-based analysis/synthesis transform, an augmented self-attention score calculation based on the Hadamard product is utilized during both encoding and decoding. Channel-wise autoregressive entropy modeling takes advantage of the proposed frequency separation as it inherently directs high-informational low-frequency channels to the first chunks and conditions the future chunks on it. The proposed network not only outperforms hand-engineered codecs, but also neural network-based codecs built on computation-heavy spatially autoregressive entropy models. △ Less

Submitted 4 August, 2023; originally announced August 2023.

Comments: Accepted to 30$^{th}$ IEEE International Conference on Image Processing (ICIP 2023)

arXiv:2307.15667 [pdf, other]

doi 10.1088/1475-7516/2024/01/058

A relook at the GZK Neutrino-Photon Connection: Impact of Extra-galactic Radio Background & UHECR properties

Authors: Sovan Chakraborty, Poonam Mehta, Prantik Sarmah

Abstract: Ultra-high energy cosmic rays (UHECRs) beyond the Greisen-Zatsepin-Kuzmin (GZK) cut-off provide us with a unique opportunity to understand the universe at extreme energies. Secondary GZK photons and GZK neutrinos associated with the same interaction are indeed interconnected and render access to multi-messenger analysis of UHECRs. The GZK photon flux is heavily attenuated due to the interaction wi… ▽ More Ultra-high energy cosmic rays (UHECRs) beyond the Greisen-Zatsepin-Kuzmin (GZK) cut-off provide us with a unique opportunity to understand the universe at extreme energies. Secondary GZK photons and GZK neutrinos associated with the same interaction are indeed interconnected and render access to multi-messenger analysis of UHECRs. The GZK photon flux is heavily attenuated due to the interaction with Cosmic Microwave Background (CMB) and the Extra-galactic Radio Background (ERB). The present estimate of the ERB comprising of several model uncertainties together with the ARCADE2 radio results in large propagation uncertainties in the GZK photon flux. On the other hand, the weakly interacting GZK neutrino flux is unaffected by these propagation effects. In this work, we make an updated estimate of the GZK photon and GZK neutrino fluxes considering a wide variation of both the production and propagation properties of the UHECR like, the spectral index, the cut-off energy of the primary spectrum, the distribution of sources and the uncertainties in the ERB estimation. We explore the detection prospects of the GZK fluxes with various present and upcoming UHECR and UHE neutrino detectors such as Auger, TA, GRAND, ANITA, ARA, IceCube and IceCube-Gen2. The predicted fluxes are found to be beyond the reach of the current detectors. In future, proposed IceCube-Gen2, Auger upgrade and GRAND experiments will have the sensitivity to the predicted GZK photon and GZK neutrino fluxes. Such detection can put constraints on the UHECR source properties and the propagation effects due to the ERB. We also propose an indirect limit on the GZK photon flux using the neutrino-photon connection for any future detection of GZK neutrinos by the IceCube-Gen2 detector. We find this limit to be consistent with our GZK flux predictions. △ Less

Submitted 11 January, 2024; v1 submitted 28 July, 2023; originally announced July 2023.

Comments: Accepted for publication in JCAP

Journal ref: JCAP01(2024)058

arXiv:2307.04496 [pdf, other]

Distinguishing between Dirac and Majorana neutrinos using temporal correlations

Authors: Bhavya Soni, Sheeba Shafaq, Poonam Mehta

Abstract: In the context of two flavour neutrino oscillations, it is understood that the $2\times 2$ mixing matrix is parameterized by one angle and a Majorana phase. However, this phase does not impact the oscillation probabilities in vacuum or in matter with constant density. Interestingly, the Majorana phase becomes relevant when we describe neutrino oscillations along with neutrino decay. This is due to… ▽ More In the context of two flavour neutrino oscillations, it is understood that the $2\times 2$ mixing matrix is parameterized by one angle and a Majorana phase. However, this phase does not impact the oscillation probabilities in vacuum or in matter with constant density. Interestingly, the Majorana phase becomes relevant when we describe neutrino oscillations along with neutrino decay. This is due to the fact that effective Hamiltonian has Hermitian and anti-Hermitian components which cannot be simultaneously diagonalized (resulting in decay eigenstates being different from the mass eigenstates). We consider the $\cal PT$ symmetric non-Hermitian Hamiltonian describing two flavour neutrino case and study the violation of Leggett-Garg Inequalities (LGI) in this context for the first time. We demonstrate that temporal correlations in the form of LGI allow us to probe whether neutrinos are Dirac or Majorana. We elucidate the role played by the mixing and decay parameters on the extent of violation of LGI. We emphasize that for optimized choice of parameters, the difference in $K_4$ ($K_3$) for Dirac and Majorana case is $\sim 15\%$ ($\sim 10\%$). △ Less

Submitted 10 July, 2023; originally announced July 2023.

Comments: 17 pages and 8 figures. Comments welcome

arXiv:2306.02169 [pdf, other]

Probabilistic Solar Proxy Forecasting with Neural Network Ensembles

Authors: Joshua D. Daniell, Piyush M. Mehta

Abstract: Space weather indices are used commonly to drive forecasts of thermosphere density, which directly affects objects in low-Earth orbit (LEO) through atmospheric drag. One of the most commonly used space weather proxies, $F_{10.7 cm}$, correlates well with solar extreme ultra-violet (EUV) energy deposition into the thermosphere. Currently, the USAF contracts Space Environment Technologies (SET), whi… ▽ More Space weather indices are used commonly to drive forecasts of thermosphere density, which directly affects objects in low-Earth orbit (LEO) through atmospheric drag. One of the most commonly used space weather proxies, $F_{10.7 cm}$, correlates well with solar extreme ultra-violet (EUV) energy deposition into the thermosphere. Currently, the USAF contracts Space Environment Technologies (SET), which uses a linear algorithm to forecast $F_{10.7 cm}$. In this work, we introduce methods using neural network ensembles with multi-layer perceptrons (MLPs) and long-short term memory (LSTMs) to improve on the SET predictions. We make predictions only from historical $F_{10.7 cm}$ values, but also investigate data manipulation to improve forecasting. We investigate data manipulation methods (backwards averaging and lookback) as well as multi step and dynamic forecasting. This work shows an improvement over the baseline when using ensemble methods. The best models found in this work are ensemble approaches using multi step or a combination of multi step and dynamic predictions. Nearly all approaches offer an improvement, with the best models improving between 45 and 55\% on relative MSE. Other relative error metrics were shown to improve greatly when ensembles methods were used. We were also able to leverage the ensemble approach to provide a distribution of predicted values; allowing an investigation into forecast uncertainty. Our work found models that produced less biased predictions at elevated and high solar activity levels. Uncertainty was also investigated through the use of a calibration error score metric (CES), our best ensemble reached similar CES as other work. △ Less

Submitted 3 June, 2023; originally announced June 2023.

Comments: 23 pages, 12 figures, 5 Tables

arXiv:2305.16824 [pdf, other]

doi 10.1140/epjc/s10052-025-13834-6

Signals of eV-scale sterile neutrino at long baseline neutrino experiments

Authors: Sabila Parveen, Kiran Sharma, Sudhanwa Patra, Poonam Mehta

Abstract: While most of the results of the neutrino oscillation experiments can be accommodated within the standard paradigm of three active flavor, there are tantalizing hints of an light eV-scale sterile neutrino from anomalous results of a few short baseline experiments. This additional light sterile neutrino is expected to leave an imprint on the physics observables pertaining to standard unknowns such… ▽ More While most of the results of the neutrino oscillation experiments can be accommodated within the standard paradigm of three active flavor, there are tantalizing hints of an light eV-scale sterile neutrino from anomalous results of a few short baseline experiments. This additional light sterile neutrino is expected to leave an imprint on the physics observables pertaining to standard unknowns such as determination of the Dirac-type leptonic $CP$ phase, $δ_{13}$, the question of neutrino mass hierarchy and the octant of $θ_{23}$. The upcoming long baseline neutrino experiments such as T2HK, DUNE and P2O will be sensitive to active - sterile mixing. In the present work, we examine and assess the capability of these long baseline experiments to probe the sterile neutrino at the level of probabilities and event rates. We perform a detailed study by taking into account the values of parameters that are presently allowed and (a) study the impact on $CP$ violation by examining the role played by various appearance and disappearance channels, (b) address the question of disentangling the intrinsic effects from extrinsic effects in the standard paradigm as well as three active plus one light sterile neutrino, and finally (c) assess the ability of these long baseline experiments to distinguish between the two scenarios. Our results indicate that for the true values of sterile parameters and for all values of $δ_{13}$, the sensitivity of P2O is the lowest while the sensitivity of T2HK is modest ($<3\,σ$) and the sensitivity of DUNE is $> 3\,σ$. For larger values of the sterile mixing angles, there is an improvement in the sensitivity for all the three considered experiments. △ Less

Submitted 7 February, 2025; v1 submitted 26 May, 2023; originally announced May 2023.

Comments: v2: to appear in EPJC

Journal ref: Eur.Phys.J.C 85 (2025) 2, 181

arXiv:2305.16443 [pdf, other]

Human-Machine Comparison for Cross-Race Face Verification: Race Bias at the Upper Limits of Performance?

Authors: Geraldine Jeckeln, Selin Yavuzcan, Kate A. Marquis, Prajay Sandipkumar Mehta, Amy N. Yates, P. Jonathon Phillips, Alice J. O'Toole

Abstract: Face recognition algorithms perform more accurately than humans in some cases, though humans and machines both show race-based accuracy differences. As algorithms continue to improve, it is important to continually assess their race bias relative to humans. We constructed a challenging test of 'cross-race' face verification and used it to compare humans and two state-of-the-art face recognition sy… ▽ More Face recognition algorithms perform more accurately than humans in some cases, though humans and machines both show race-based accuracy differences. As algorithms continue to improve, it is important to continually assess their race bias relative to humans. We constructed a challenging test of 'cross-race' face verification and used it to compare humans and two state-of-the-art face recognition systems. Pairs of same- and different-identity faces of White and Black individuals were selected to be difficult for humans and an open-source implementation of the ArcFace face recognition algorithm from 2019 (5). Human participants (54 Black; 51 White) judged whether face pairs showed the same identity or different identities on a 7-point Likert-type scale. Two top-performing face recognition systems from the Face Recognition Vendor Test-ongoing performed the same test (7). By design, the test proved challenging for humans as a group, who performed above chance, but far less than perfect. Both state-of-the-art face recognition systems scored perfectly (no errors), consequently with equal accuracy for both races. We conclude that state-of-the-art systems for identity verification between two frontal face images of Black and White individuals can surpass the general population. Whether this result generalizes to challenging in-the-wild images is a pressing concern for deploying face recognition systems in unconstrained environments. △ Less

Submitted 30 May, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

Comments: 8 pages, 6 figures

arXiv:2305.12850 [pdf, other]

doi 10.1109/TAC.2024.3413573

Variance Decay Property for Filter Stability

Authors: Jin Won Kim, Prashant G. Mehta

Abstract: This paper is concerned with the problem of nonlinear (stochastic) filter stability of a hidden Markov model (HMM) with white noise observations. A contribution is the variance decay property which is used to conclude filter stability. For this purpose, a new notion of the Poincaré inequality (PI) is introduced for the nonlinear filter. PI is related to both the ergodicity of the Markov process as… ▽ More This paper is concerned with the problem of nonlinear (stochastic) filter stability of a hidden Markov model (HMM) with white noise observations. A contribution is the variance decay property which is used to conclude filter stability. For this purpose, a new notion of the Poincaré inequality (PI) is introduced for the nonlinear filter. PI is related to both the ergodicity of the Markov process as well as the observability of the HMM. The proofs are based upon a recently discovered minimum variance duality which is used to transform the nonlinear filtering problem into a stochastic optimal control problem for a backward stochastic differential equation (BSDE). △ Less

Submitted 26 June, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

Comments: 16 pages

Journal ref: IEEE Transactions on Automatic Control, 2024

arXiv:2305.10655 [pdf, other]

doi 10.1007/978-3-031-17027-0_2

DeepEdit: Deep Editable Learning for Interactive Segmentation of 3D Medical Images

Authors: Andres Diaz-Pinto, Pritesh Mehta, Sachidanand Alle, Muhammad Asad, Richard Brown, Vishwesh Nath, Alvin Ihsani, Michela Antonelli, Daniel Palkovics, Csaba Pinter, Ron Alkalay, Steve Pieper, Holger R. Roth, Daguang Xu, Prerna Dogra, Tom Vercauteren, Andrew Feng, Abood Quraini, Sebastien Ourselin, M. Jorge Cardoso

Abstract: Automatic segmentation of medical images is a key step for diagnostic and interventional tasks. However, achieving this requires large amounts of annotated volumes, which can be tedious and time-consuming task for expert annotators. In this paper, we introduce DeepEdit, a deep learning-based method for volumetric medical image annotation, that allows automatic and semi-automatic segmentation, and… ▽ More Automatic segmentation of medical images is a key step for diagnostic and interventional tasks. However, achieving this requires large amounts of annotated volumes, which can be tedious and time-consuming task for expert annotators. In this paper, we introduce DeepEdit, a deep learning-based method for volumetric medical image annotation, that allows automatic and semi-automatic segmentation, and click-based refinement. DeepEdit combines the power of two methods: a non-interactive (i.e. automatic segmentation using nnU-Net, UNET or UNETR) and an interactive segmentation method (i.e. DeepGrow), into a single deep learning model. It allows easy integration of uncertainty-based ranking strategies (i.e. aleatoric and epistemic uncertainty computation) and active learning. We propose and implement a method for training DeepEdit by using standard training combined with user interaction simulation. Once trained, DeepEdit allows clinicians to quickly segment their datasets by using the algorithm in auto segmentation mode or by providing clicks via a user interface (i.e. 3D Slicer, OHIF). We show the value of DeepEdit through evaluation on the PROSTATEx dataset for prostate/prostatic lesions and the Multi-Atlas Labeling Beyond the Cranial Vault (BTCV) dataset for abdominal CT segmentation, using state-of-the-art network architectures as baseline for comparison. DeepEdit could reduce the time and effort annotating 3D medical images compared to DeepGrow alone. Source code is available at https://github.com/Project-MONAI/MONAILabel △ Less

Submitted 17 May, 2023; originally announced May 2023.

arXiv:2305.09916 [pdf, other]

Updated T2K measurements of muon neutrino and antineutrino disappearance using 3.6 $\times$ 10$^{21}$ protons on target

Authors: K. Abe, N. Akhlaq, R. Akutsu, H. Alarakia-Charles, A. Ali, Y. I. Alj Hakim, S. Alonso Monsalve, C. Alt, C. Andreopoulos, M. Antonova, S. Aoki, T. Arihara, Y. Asada, Y. Ashida, E. T. Atkin, M. Barbi, G. J. Barker, G. Barr, D. Barrow, M. Batkiewicz-Kwasniak, F. Bench, V. Berardi, L. Berns, S. Bhadra, A. Blanchet , et al. (385 additional authors not shown)

Abstract: Muon neutrino and antineutrino disappearance probabilities are identical in the standard three-flavor neutrino oscillation framework, but CPT violation and non-standard interactions can violate this symmetry. In this work we report the measurements of $\sin^{2} θ_{23}$ and $Δm_{32}^2$ independently for neutrinos and antineutrinos. The aforementioned symmetry violation would manifest as an inconsis… ▽ More Muon neutrino and antineutrino disappearance probabilities are identical in the standard three-flavor neutrino oscillation framework, but CPT violation and non-standard interactions can violate this symmetry. In this work we report the measurements of $\sin^{2} θ_{23}$ and $Δm_{32}^2$ independently for neutrinos and antineutrinos. The aforementioned symmetry violation would manifest as an inconsistency in the neutrino and antineutrino oscillation parameters. The analysis discussed here uses a total of 1.97$\times$10$^{21}$ and 1.63$\times$10$^{21}$ protons on target taken with a neutrino and antineutrino beam respectively, and benefits from improved flux and cross-section models, new near detector samples and more than double the data reducing the overall uncertainty of the result. No significant deviation is observed, consistent with the standard neutrino oscillation picture. △ Less

Submitted 16 October, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

arXiv:2305.05135 [pdf, other]

doi 10.3847/2041-8213/acdc9e

Search for astrophysical electron antineutrinos in Super-Kamiokande with 0.01wt% gadolinium-loaded water

Authors: M. Harada, K. Abe, C. Bronner, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, S. Miki, S. Mine, M. Miura, S. Moriyama, Y. Nakano, M. Nakahata, S. Nakayama, Y. Noguchi, K. Okamoto, K. Sato, H. Sekiya, H. Shiba , et al. (216 additional authors not shown)

Abstract: We report the first search result for the flux of astrophysical electron antineutrinos for energies O(10) MeV in the gadolinium-loaded Super-Kamiokande (SK) detector. In June 2020, gadolinium was introduced to the ultra-pure water of the SK detector in order to detect neutrons more efficiently. In this new experimental phase, SK-Gd, we can search for electron antineutrinos via inverse beta decay w… ▽ More We report the first search result for the flux of astrophysical electron antineutrinos for energies O(10) MeV in the gadolinium-loaded Super-Kamiokande (SK) detector. In June 2020, gadolinium was introduced to the ultra-pure water of the SK detector in order to detect neutrons more efficiently. In this new experimental phase, SK-Gd, we can search for electron antineutrinos via inverse beta decay with efficient background rejection and higher signal efficiency thanks to the high efficiency of the neutron tagging technique. In this paper, we report the result for the initial stage of SK-Gd with a $22.5\times552$ $\rm kton\cdot day$ exposure at 0.01% Gd mass concentration. No significant excess over the expected background in the observed events is found for the neutrino energies below 31.3 MeV. Thus, the flux upper limits are placed at the 90% confidence level. The limits and sensitivities are already comparable with the previous SK result with pure-water ($22.5 \times 2970 \rm kton\cdot day$) owing to the enhanced neutron tagging. △ Less

Submitted 30 May, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

arXiv:2305.00770 [pdf, other]

doi 10.1080/14685248.2023.2274100

Fractional and tempered fractional models for Reynolds-averaged Navier-Stokes equations

Authors: Pavan Pranjivan Mehta

Abstract: Turbulence is a non-local phenomenon and has multiple-scales. Non-locality can be addressed either implicitly or explicitly. Implicitly, by subsequent resolution of all spatio-temporal scales. However, if directly solved for the temporal or spatially averaged fields, a closure problem arises on account of missing information between two points. To solve the closure problem in Reynolds-averaged Nav… ▽ More Turbulence is a non-local phenomenon and has multiple-scales. Non-locality can be addressed either implicitly or explicitly. Implicitly, by subsequent resolution of all spatio-temporal scales. However, if directly solved for the temporal or spatially averaged fields, a closure problem arises on account of missing information between two points. To solve the closure problem in Reynolds-averaged Navier-Stokes equations (RANS), an eddy-viscosity hypotheses has been a popular modelling choice, where it follows either a linear or non-linear stress-strain relationship. Here, a non-constant diffusivity is introduced. Such a non-constant diffusivity is also characteristic of non-Fickian diffusion equation addressing anomalous diffusion process. An alternative approach, is a fractional derivative based diffusion equations. Thus, in the paper, we formulate a fractional stress-strain relationship using variable-order Caputo fractional derivative. This provides new opportunities for future modelling effort. We pedagogically study of our model construction, starting from one-sided model and followed by two-sided model. Non-locality at a point is the amalgamation of all the effects, thus we find the two-sided model is physically consistent. Further, our construction can also addresses viscous effects, which is a local process. Thus, our fractional model addresses the amalgamation of local and non-local process. We also show its validity at infinite Reynolds number limit. This study is further extended to tempered fractional calculus, where tempering ensures finite jump lengths, this is an important remark for unbounded flows. Two tempered definitions are introduced with a smooth and sharp cutoff, by the exponential term and Heaviside function, respectively and we also define the horizon of non-local interactions. We further study the equivalence between the two definitions. △ Less

Submitted 28 June, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

Comments: A part of this paper is also available as arXiv preprint arXiv:2105.03646v1. Tempered F-RANS result first presented at ICTAM 2020+1 held in Italy, 2021 chaired by Prof. A. Quarteroni (postponed by a year due to pandemic). Results submitted to ICTAM 2020 by Jan. 2020 (refer book of abstracts, Pages 1235-1236 : https://iutam.org/wp-content/uploads/2023/06/ABSTRACT_BOOK_ICTAM_2021.pdf)

MSC Class: 76F99; 26A33 (primary)

arXiv:2304.14576 [pdf, other]

doi 10.1145/3543873.3587581

Can deepfakes be created by novice users?

Authors: Pulak Mehta, Gauri Jagatap, Kevin Gallagher, Brian Timmerman, Progga Deb, Siddharth Garg, Rachel Greenstadt, Brendan Dolan-Gavitt

Abstract: Recent advancements in machine learning and computer vision have led to the proliferation of Deepfakes. As technology democratizes over time, there is an increasing fear that novice users can create Deepfakes, to discredit others and undermine public discourse. In this paper, we conduct user studies to understand whether participants with advanced computer skills and varying levels of computer sci… ▽ More Recent advancements in machine learning and computer vision have led to the proliferation of Deepfakes. As technology democratizes over time, there is an increasing fear that novice users can create Deepfakes, to discredit others and undermine public discourse. In this paper, we conduct user studies to understand whether participants with advanced computer skills and varying levels of computer science expertise can create Deepfakes of a person saying a target statement using limited media files. We conduct two studies; in the first study (n = 39) participants try creating a target Deepfake in a constrained time frame using any tool they desire. In the second study (n = 29) participants use pre-specified deep learning-based tools to create the same Deepfake. We find that for the first study, 23.1% of the participants successfully created complete Deepfakes with audio and video, whereas, for the second user study, 58.6% of the participants were successful in stitching target speech to the target video. We further use Deepfake detection software tools as well as human examiner-based analysis, to classify the successfully generated Deepfake outputs as fake, suspicious, or real. The software detector classified 80% of the Deepfakes as fake, whereas the human examiners classified 100% of the videos as fake. We conclude that creating Deepfakes is a simple enough task for a novice user given adequate tools and time; however, the resulting Deepfakes are not sufficiently real-looking and are unable to completely fool detection software as well as human examiners △ Less

Submitted 27 April, 2023; originally announced April 2023.

arXiv:2304.10694 [pdf, other]

doi 10.1103/PhysRevE.108.044409

Geometry of ecological coexistence and niche differentiation

Authors: Emmy Blumenthal, Pankaj Mehta

Abstract: A fundamental problem in ecology is to understand how competition shapes biodiversity and species coexistence. Historically, one important approach for addressing this question has been to analyze consumer resource models using geometric arguments. This has led to broadly applicable principles such as Tilman's $R^*$ and species coexistence cones. Here, we extend these arguments by constructing a n… ▽ More A fundamental problem in ecology is to understand how competition shapes biodiversity and species coexistence. Historically, one important approach for addressing this question has been to analyze consumer resource models using geometric arguments. This has led to broadly applicable principles such as Tilman's $R^*$ and species coexistence cones. Here, we extend these arguments by constructing a novel geometric framework for understanding species coexistence based on convex polytopes in the space of consumer preferences. We show how the geometry of consumer preferences can be used to predict species which may coexist and enumerate ecologically-stable steady states and transitions between them. Collectively, these results provide a framework for understanding the role of species traits within niche theory. △ Less

Submitted 29 October, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

Comments: 14 pages, 11 figures

Journal ref: Physical Review E, 10.1103/PhysRevE.108.044409, 24 October 2023

arXiv:2304.08413 [pdf, other]

Topology, dynamics, and control of an octopus-analog muscular hydrostat

Authors: Arman Tekinalp, Noel Naughton, Seung-Hyun Kim, Udit Halder, Rhanor Gillette, Prashant G. Mehta, William Kier, Mattia Gazzola

Abstract: Muscular hydrostats, such as octopus arms or elephant trunks, lack bones entirely, endowing them with exceptional dexterity and reconfigurability. Key to their unmatched ability to control nearly infinite degrees of freedom is the architecture into which muscle fibers are weaved. Their arrangement is, effectively, the instantiation of a sophisticated mechanical program that mediates, and likely fa… ▽ More Muscular hydrostats, such as octopus arms or elephant trunks, lack bones entirely, endowing them with exceptional dexterity and reconfigurability. Key to their unmatched ability to control nearly infinite degrees of freedom is the architecture into which muscle fibers are weaved. Their arrangement is, effectively, the instantiation of a sophisticated mechanical program that mediates, and likely facilitates, the control and realization of complex, dynamic morphological reconfigurations. Here, by combining medical imaging, biomechanical data, live behavioral experiments and numerical simulations, we synthesize a model octopus arm entailing ~200 continuous muscles groups, and begin to unravel its complexity. We show how 3D arm motions can be understood in terms of storage, transport, and conversion of topological quantities, effected by simple muscle activation templates. These, in turn, can be composed into higher-level control strategies that, compounded by the arm's compliance, are demonstrated in a range of object manipulation tasks rendered additionally challenging by the need to appropriately align suckers, to sense and grasp. Overall, our work exposes broad design and algorithmic principles pertinent to muscular hydrostats, robotics, and dynamics, while significantly advancing our ability to model muscular structures from medical imaging, with potential implications for human health and care. △ Less

Submitted 17 April, 2023; originally announced April 2023.

Comments: 8 pages, 4 figures

arXiv:2303.17007 [pdf]

doi 10.1103/PhysRevD.107.112012

Impact of cross-section uncertainties on supernova neutrino spectral parameter fitting in the Deep Underground Neutrino Experiment

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, Z. Ahmad, J. Ahmed, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, P. Amedo, J. Anderson, D. A. Andrade , et al. (1294 additional authors not shown)

Abstract: A primary goal of the upcoming Deep Underground Neutrino Experiment (DUNE) is to measure the $\mathcal{O}(10)$ MeV neutrinos produced by a Galactic core-collapse supernova if one should occur during the lifetime of the experiment. The liquid-argon-based detectors planned for DUNE are expected to be uniquely sensitive to the $ν_e$ component of the supernova flux, enabling a wide variety of physics… ▽ More A primary goal of the upcoming Deep Underground Neutrino Experiment (DUNE) is to measure the $\mathcal{O}(10)$ MeV neutrinos produced by a Galactic core-collapse supernova if one should occur during the lifetime of the experiment. The liquid-argon-based detectors planned for DUNE are expected to be uniquely sensitive to the $ν_e$ component of the supernova flux, enabling a wide variety of physics and astrophysics measurements. A key requirement for a correct interpretation of these measurements is a good understanding of the energy-dependent total cross section $σ(E_ν)$ for charged-current $ν_e$ absorption on argon. In the context of a simulated extraction of supernova $ν_e$ spectral parameters from a toy analysis, we investigate the impact of $σ(E_ν)$ modeling uncertainties on DUNE's supernova neutrino physics sensitivity for the first time. We find that the currently large theoretical uncertainties on $σ(E_ν)$ must be substantially reduced before the $ν_e$ flux parameters can be extracted reliably: in the absence of external constraints, a measurement of the integrated neutrino luminosity with less than 10\% bias with DUNE requires $σ(E_ν)$ to be known to about 5%. The neutrino spectral shape parameters can be known to better than 10% for a 20% uncertainty on the cross-section scale, although they will be sensitive to uncertainties on the shape of $σ(E_ν)$. A direct measurement of low-energy $ν_e$-argon scattering would be invaluable for improving the theoretical precision to the needed level. △ Less

Submitted 7 July, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

Comments: 25 pages, 21 figures

Report number: FERMILAB-PUB-23-132-CSAID-LBNF-ND-T

Journal ref: Phys. Rev. D 107, 112012 (2023)

Showing 51–100 of 379 results for author: Mehta, P