-
MIRACL-VISION: A Large, multilingual, visual document retrieval benchmark
Authors:
Radek Osmulski,
Gabriel de Souza P. Moreira,
Ronay Ak,
Mengyao Xu,
Benedikt Schifferer,
Even Oldridge
Abstract:
Document retrieval is an important task for search and Retrieval-Augmented Generation (RAG) applications. Large Language Models (LLMs) have contributed to improving the accuracy of text-based document retrieval. However, documents with complex layout and visual elements like tables, charts and infographics are not perfectly represented in textual format. Recently, image-based document retrieval pi…
▽ More
Document retrieval is an important task for search and Retrieval-Augmented Generation (RAG) applications. Large Language Models (LLMs) have contributed to improving the accuracy of text-based document retrieval. However, documents with complex layout and visual elements like tables, charts and infographics are not perfectly represented in textual format. Recently, image-based document retrieval pipelines have become popular, which use visual large language models (VLMs) to retrieve relevant page images given a query. Current evaluation benchmarks on visual document retrieval are limited, as they primarily focus only English language, rely on synthetically generated questions and offer a small corpus size. Therefore, we introduce MIRACL-VISION, a multilingual visual document retrieval evaluation benchmark. MIRACL-VISION covers 18 languages, and is an extension of the MIRACL dataset, a popular benchmark to evaluate text-based multilingual retrieval pipelines. MIRACL was built using a human-intensive annotation process to generate high-quality questions. In order to reduce MIRACL-VISION corpus size to make evaluation more compute friendly while keeping the datasets challenging, we have designed a method for eliminating the "easy" negatives from the corpus. We conducted extensive experiments comparing MIRACL-VISION with other benchmarks, using popular public text and image models. We observe a gap in state-of-the-art VLM-based embedding models on multilingual capabilities, with up to 59.7% lower retrieval accuracy than a text-based retrieval models. Even for the English language, the visual models retrieval accuracy is 12.1% lower compared to text-based models. MIRACL-VISION is a challenging, representative, multilingual evaluation benchmark for visual retrieval pipelines and will help the community build robust models for document retrieval.
△ Less
Submitted 21 May, 2025; v1 submitted 16 May, 2025;
originally announced May 2025.
-
JAX-bandflux: differentiable supernovae SALT modelling for cosmological analysis on GPUs
Authors:
Samuel Alan Kossoff Leeney
Abstract:
JAX-bandflux is a JAX implementation of critical supernova modelling functionality for cosmological analysis. The codebase implements key components of the established library SNCosmo in a differentiable framework, offering efficient parallelisation and gradient-based optimisation capabilities through GPU acceleration. The package facilitates differentiable computation of supernova light curve mea…
▽ More
JAX-bandflux is a JAX implementation of critical supernova modelling functionality for cosmological analysis. The codebase implements key components of the established library SNCosmo in a differentiable framework, offering efficient parallelisation and gradient-based optimisation capabilities through GPU acceleration. The package facilitates differentiable computation of supernova light curve measurements, supporting the inference of SALT parameters necessary for cosmological analysis.
△ Less
Submitted 10 April, 2025;
originally announced April 2025.
-
Localized Heating and Dynamics of the Solar Corona due to a Symbiosis of Waves and Reconnection
Authors:
A. K. Srivastava,
Sripan Mondal,
Eric R. Priest,
Sudheer K. Mishra,
David I. Pontin,
R. Y. Kwon,
Ding Yuan,
K. Murawski,
Ayumi Asai
Abstract:
The Sun's outer atmosphere, the corona, is maintained at mega-Kelvin temperatures and fills the heliosphere with a supersonic outflowing wind. The dissipation of magnetic waves and direct electric currents are likely to be the most significant processes for heating the corona, but a lively debate exists on their relative roles. Here, we suggest that the two are often intrinsically linked, since ma…
▽ More
The Sun's outer atmosphere, the corona, is maintained at mega-Kelvin temperatures and fills the heliosphere with a supersonic outflowing wind. The dissipation of magnetic waves and direct electric currents are likely to be the most significant processes for heating the corona, but a lively debate exists on their relative roles. Here, we suggest that the two are often intrinsically linked, since magnetic waves may trigger current dissipation, and impulsive reconnection can launch magnetic waves. We present a study of the first of these processes by using a 2D physics-based numerical simulation using the Adaptive Mesh Refined (AMR) Versatile Advection Code (VAC). Magnetic waves such as fast magnetoacoustic waves are often observed to propagate in the large-scale corona and interact with local magnetic structures. The present numerical simulations show how the propagation of magnetic disturbances towards a null point or separator can lead to the accumulation of the electric currents. Lorentz forces can laterally push and vertically stretch the magnetic fields, forming a current sheet with a strong magnetic-field gradient. The magnetic field lines then break and reconnect, and so contribute towards coronal heating. Numerical results are presented that support these ideas and support the concept of a symbiosis between waves and reconnection in heating the solar corona.
△ Less
Submitted 20 March, 2025;
originally announced March 2025.
-
Conceptual study on using Doppler backscattering to measure magnetic pitch angle in tokamak plasmas
Authors:
AK Yeoh,
VH Hall-Chen,
QT Pratt,
BS Victor,
J Damba,
TL Rhodes,
NA Crocker,
KR Fong,
JC Hillesheim,
FI Parra,
J Ruiz Ruiz
Abstract:
We introduce a new approach to measure the magnetic pitch angle profile in tokamak plasmas with Doppler backscattering (DBS), a technique traditionally used for measuring flows and density fluctuations. The DBS signal is maximised when its probe beam's wavevector is perpendicular to the magnetic field at the cutoff location, independent of the density fluctuations. Hence, if one could isolate this…
▽ More
We introduce a new approach to measure the magnetic pitch angle profile in tokamak plasmas with Doppler backscattering (DBS), a technique traditionally used for measuring flows and density fluctuations. The DBS signal is maximised when its probe beam's wavevector is perpendicular to the magnetic field at the cutoff location, independent of the density fluctuations. Hence, if one could isolate this effect, DBS would then yield information about the magnetic pitch angle. By varying the toroidal launch angle, the DBS beam reaches cutoff with different angles with respect to the magnetic field, but with other properties remaining similar. Hence, the toroidal launch angle which gives maximum backscattered power is thus that which is matched to the pitch angle at the cutoff location, enabling inference of the magnetic pitch angle. We performed systematic scans of the DBS toroidal launch angle for repeated DIII-D tokamak discharges. Experimental DBS data from this scan were analysed and combined with Gaussian beam-tracing simulations using the Scotty code. The pitch-angle inferred from DBS is consistent with that from magnetics-only and motional-Stark-effect-constrained (MSE) equilibrium reconstruction in the edge. In the core, the pitch angles from DBS and magnetics-only reconstructions differ by one to two degrees, while simultaneous MSE measurements were not available. The uncertainty in these measurements was under a degree; we show that this uncertainty is primarily due to the error in toroidal steering, the number of toroidally separated measurements, and shot-to-shot repeatability. We find that the error of pitch-angle measurements can be reduced by optimising the poloidal launch angle and initial beam properties.
△ Less
Submitted 26 February, 2025;
originally announced February 2025.
-
KMT2B-related disorders: expansion of the phenotypic spectrum and long-term efficacy of deep brain stimulation
Authors:
L Cif,
D Demailly,
JP Lin,
KE Barwick,
M Sa,
L Abela,
S Malhotra,
WK Chong,
D Steel,
A Sanchis-Juan,
A Ngoh,
N Trump,
E Meyer,
X Vasques,
J Rankin,
MW Allain,
CD Applegate,
S Attaripour Isfahani,
J Baleine,
B Balint,
JA Bassetti,
EL Baple,
KP Bhatia,
C Blanchet,
L Burglen
, et al. (90 additional authors not shown)
Abstract:
Heterozygous mutations in KMT2B are associated with an early-onset, progressive, and often complex dystonia (DYT28). Key characteristics of typical disease include focal motor features at disease presentation, evolving through a caudocranial pattern into generalized dystonia, with prominent oromandibular, laryngeal, and cervical involvement. Although KMT2B-related disease is emerging as one of the…
▽ More
Heterozygous mutations in KMT2B are associated with an early-onset, progressive, and often complex dystonia (DYT28). Key characteristics of typical disease include focal motor features at disease presentation, evolving through a caudocranial pattern into generalized dystonia, with prominent oromandibular, laryngeal, and cervical involvement. Although KMT2B-related disease is emerging as one of the most common causes of early-onset genetic dystonia, much remains to be understood about the full spectrum of the disease. We describe a cohort of 53 patients with KMT2B mutations, with detailed delineation of their clinical phenotype and molecular genetic features. We report new disease presentations, including atypical patterns of dystonia evolution and a subgroup of patients with a non-dystonic neurodevelopmental phenotype. In addition to the previously reported systemic features, our study has identified co-morbidities, including the risk of status dystonicus, intrauterine growth retardation, and endocrinopathies. Analysis of this study cohort (n = 53) in tandem with published cases (n = 80) revealed that patients with chromosomal deletions and protein-truncating variants had a significantly higher burden of systemic disease (with earlier onset of dystonia) than those with missense variants. Eighteen individuals had detailed longitudinal data available after insertion of deep brain stimulation for medically refractory dystonia. Median age at deep brain stimulation was 11.5 years (range: 4.5 to 37.0 years). Follow-up after deep brain stimulation ranged from 0.25 to 22 years. Significant improvement of motor function and disability (as assessed by the Burke-Fahn-Marsden Dystonia Rating Scales, BFMDRS-M and BFMDRS-D) was evident at 6 months, 1 year, and last follow-up (motor, P = 0.001, P = 0.004, and P = 0.012; disability, P = 0.009, P = 0.002, and P = 0.012).
△ Less
Submitted 10 February, 2025;
originally announced February 2025.
-
Kinematics of Cataclysmic Variables in the Solar Neighborhood in the Gaia Era
Authors:
R. Canbay,
T. Ak,
S. Bilir,
F. Soydugan,
Z. Eker
Abstract:
Using high-precision astrometric data from Gaia DR3 and updated systemic velocities from the literature, kinematical properties of cataclysmic variables (CVs) were investigated. By constraining the data according to the total space velocity error and Galactic population class, a reliable sample of data was obtained. Non-magnetic CVs located in the thin disk have been found to have a total space ve…
▽ More
Using high-precision astrometric data from Gaia DR3 and updated systemic velocities from the literature, kinematical properties of cataclysmic variables (CVs) were investigated. By constraining the data according to the total space velocity error and Galactic population class, a reliable sample of data was obtained. Non-magnetic CVs located in the thin disk have been found to have a total space velocity dispersion of $σ_ν = 46.33\pm4.23$ km s$^{-1}$, indicating that the thin disk CVs with a mean kinematical age of $τ= 3.95\pm0.75$ Gyr are much younger than the local thin disk of the Galaxy with $τ\sim$6-9 Gyr. Total space velocity dispersions of non-magnetic CVs belonging to the thin disk component of the Galaxy were found to be $σ_ν=47.67\pm3.94$ and $σ_ν=44.43\pm4.33$ km s$^{-1}$ for the systems below and above the orbital period gap, respectively, corresponding to kinematical ages of $τ=4.19\pm0.71$ and $τ=3.61\pm0.74$ Gyr. $γ$ velocity dispersions of the thin disk CVs below and above the gap were obtained $σ_γ = 27.52\pm2.28$ and $σ_γ = 25.65\pm2.44$ km s$^{-1}$, respectively. This study also shows that the orbital period is decreasing with increasing age, as expected from the standard theory. The age-orbital period relation for non-magnetic thin disk CVs was obtained as $dP/dt=-2.09\pm0.22\times10^{-5}$ sec yr$^{-1}$. However, a significant difference could not be found between the $γ$ velocity dispersions of the systems below and above the gap, which were calculated to be $σ_γ = 27.52\pm2.28$ and $σ_γ = 25.65\pm2.44$ km s$^{-1}$, respectively.
△ Less
Submitted 9 December, 2024;
originally announced December 2024.
-
Transformation relations for UBV photometric system of 1m telescope at the TÜBİTAK National Observatory
Authors:
T. Ak,
R. Canbay,
T. Yontan
Abstract:
UBV CCD observations of standard stars selected from Landolt (2009, 2013) were performed using the 1-meter telescope (T100) of the TÜBİTAK National Observatory equipped with a back-illuminated and UV enhanced CCD camera and Bessell UBV filters. Observations span a long time from the years 2012 to 2024, 50 photometric nights in total. Photometric measurements were used to find the standard transfor…
▽ More
UBV CCD observations of standard stars selected from Landolt (2009, 2013) were performed using the 1-meter telescope (T100) of the TÜBİTAK National Observatory equipped with a back-illuminated and UV enhanced CCD camera and Bessell UBV filters. Observations span a long time from the years 2012 to 2024, 50 photometric nights in total. Photometric measurements were used to find the standard transformation relations of the T100 photometric system. The atmospheric extinction coefficients, zero points and transformation coefficients of each night were determined. It could not be found time dependence of the secondary extinction coefficients. However, it was determined that the primary extinction coefficients decreased until the year 2019 and increased after that year. It could not be found a strong seasonal variation of the extinction coefficients. Small differences in seasonal median values of them were used to attempt to find the atmospheric extinction sources. We found calculated minus catalogue values for each standard star, $Δ(U-B)$, $Δ(B-V)$ and $ΔV$. Means and standard deviations of $Δ(U-B)$, $Δ(B-V)$ and $ΔV$ were estimated to be 1.4$\pm$76, 1.9$\pm$18 and 0.0$\pm$36 mmag, respectively. We found that our data well matched Landolt's standards for $V$ and $B-V$, i.e. there are no systematic differences. However, there are systematic differences for $U-B$ between the two photometric systems, which is probably originated from the quantum efficiency differences of the detectors used in the photometric systems, although the median differences are relatively small ($|Δ(U-B)|$< 50 mmag) for stars with $-0.5<U-B~{(\rm mag)} <1.6$ and $0.2<B-V~{(\rm mag)} <1.8$. As an overall result, we conclude that the transformation relations found in this study can be used for standardized photometry with the T100 photometric system.
△ Less
Submitted 2 December, 2024;
originally announced December 2024.
-
Radial Metallicity Gradients for the Chemically Selected Galactic Thin Disc Main-Sequence Stars
Authors:
F. Akbaba,
T. Ak,
S. Bilir,
O. Plevne,
Onal Tas. O,
G. M. Seabroke
Abstract:
{We present the radial metallicity gradients within the Galactic thin disc population through main-sequence stars selected on the chemical plane using GALAH DR3 accompanied with Gaia DR3 astrometric data. The [Fe/H], [$α$/Fe] and [Mg/H] radial gradients are estimated for guiding radius as $-0.074\pm 0.006$, $+0.004\pm0.002$, $-0.074\pm0.006$ dex kpc$^{-1}$ and for the traceback early orbital radiu…
▽ More
{We present the radial metallicity gradients within the Galactic thin disc population through main-sequence stars selected on the chemical plane using GALAH DR3 accompanied with Gaia DR3 astrometric data. The [Fe/H], [$α$/Fe] and [Mg/H] radial gradients are estimated for guiding radius as $-0.074\pm 0.006$, $+0.004\pm0.002$, $-0.074\pm0.006$ dex kpc$^{-1}$ and for the traceback early orbital radius as $-0.040\pm0.002$, $+0.003\pm 0.001$, $-0.039\pm 0.002$ dex kpc$^{-1}$ for 66,545 thin-disc stars, respectively. Alteration of the chemical structure within the Galactic disc caused by the radial orbital variations complicates results for the radial metallicity gradient. The effect of radial orbital variations on the metallicity gradients as a function on time indicates the following results: (i) The presence of a gradient along the disc throughout the time for which the model provides similar prediction, (ii) the radial orbital variations becomes more pronounced with the age of the stellar population and (iii) the effect of radial orbital variations on the metallicity gradients is minimal. The effect of radial orbital variations is found to be at most 6\% which does not statistically affect the radial gradient results. These findings contribute to a better understanding of the chemical evolution within the Galactic disc and provide an important basis for further research.
△ Less
Submitted 20 November, 2024;
originally announced November 2024.
-
Enhancing Q&A Text Retrieval with Ranking Models: Benchmarking, fine-tuning and deploying Rerankers for RAG
Authors:
Gabriel de Souza P. Moreira,
Ronay Ak,
Benedikt Schifferer,
Mengyao Xu,
Radek Osmulski,
Even Oldridge
Abstract:
Ranking models play a crucial role in enhancing overall accuracy of text retrieval systems. These multi-stage systems typically utilize either dense embedding models or sparse lexical indices to retrieve relevant passages based on a given query, followed by ranking models that refine the ordering of the candidate passages by its relevance to the query.
This paper benchmarks various publicly avai…
▽ More
Ranking models play a crucial role in enhancing overall accuracy of text retrieval systems. These multi-stage systems typically utilize either dense embedding models or sparse lexical indices to retrieve relevant passages based on a given query, followed by ranking models that refine the ordering of the candidate passages by its relevance to the query.
This paper benchmarks various publicly available ranking models and examines their impact on ranking accuracy. We focus on text retrieval for question-answering tasks, a common use case for Retrieval-Augmented Generation systems. Our evaluation benchmarks include models some of which are commercially viable for industrial applications.
We introduce a state-of-the-art ranking model, NV-RerankQA-Mistral-4B-v3, which achieves a significant accuracy increase of ~14% compared to pipelines with other rerankers. We also provide an ablation study comparing the fine-tuning of ranking models with different sizes, losses and self-attention mechanisms.
Finally, we discuss challenges of text retrieval pipelines with ranking models in real-world industry applications, in particular the trade-offs among model size, ranking accuracy and system requirements like indexing and serving latency / throughput.
△ Less
Submitted 11 September, 2024;
originally announced September 2024.
-
Widespread misidentification of SEM instruments in the peer-reviewed materials science and engineering literature
Authors:
Reese AK Richardson,
Jeonghyun Moon,
Spencer S Hong,
Luís A Nunes Amaral
Abstract:
Removed per arXiv policy. Please see version at https://doi.org/10.31219/osf.io/4wqcr
Removed per arXiv policy. Please see version at https://doi.org/10.31219/osf.io/4wqcr
△ Less
Submitted 27 August, 2024;
originally announced September 2024.
-
The Penults of Tak: Adventures in impartial, normal-play, positional games
Authors:
Boris Alexeev,
Paul Ellis,
Michael Richter,
Thotsaporn Aek Thanatipanonda
Abstract:
For normal play, impartial games, we define penults as those positions in which every option results in an immediate win for the other player. We explore the number of tokens in penults of two positional games, Impartial Tic and Impartial Tak. We obtain a complete classification in the former case. We then explore winning strategies and further directions.
For normal play, impartial games, we define penults as those positions in which every option results in an immediate win for the other player. We explore the number of tokens in penults of two positional games, Impartial Tic and Impartial Tak. We obtain a complete classification in the former case. We then explore winning strategies and further directions.
△ Less
Submitted 3 August, 2024;
originally announced August 2024.
-
von Neumann and Newman Pokers with Finite Decks
Authors:
Tipaluck Krityakierne,
Thotsaporn Aek Thanatipanonda,
Doron Zeilberger
Abstract:
John von Neumann studied a simplified version of poker where the "deck" consists of infinitely many cards, in fact, all real numbers between $0$ and $1$. We harness the power of computation, both numeric and symbolic, to investigate analogs with finitely many cards. We also study finite analogs of a simplified poker introduced by D.J. Newman, and conclude with a thorough investigation, fully imple…
▽ More
John von Neumann studied a simplified version of poker where the "deck" consists of infinitely many cards, in fact, all real numbers between $0$ and $1$. We harness the power of computation, both numeric and symbolic, to investigate analogs with finitely many cards. We also study finite analogs of a simplified poker introduced by D.J. Newman, and conclude with a thorough investigation, fully implemented in Maple, of the three-player game, doing both the finite and the infinite versions. This paper is accompanied by two Maple packages and numerous output files.
△ Less
Submitted 22 July, 2024;
originally announced July 2024.
-
NV-Retriever: Improving text embedding models with effective hard-negative mining
Authors:
Gabriel de Souza P. Moreira,
Radek Osmulski,
Mengyao Xu,
Ronay Ak,
Benedikt Schifferer,
Even Oldridge
Abstract:
Text embedding models have been popular for information retrieval applications such as semantic search and Question-Answering systems based on Retrieval-Augmented Generation (RAG). Those models are typically Transformer models that are fine-tuned with contrastive learning objectives. One of the challenging aspects of fine-tuning embedding models is the selection of high quality hard-negative passa…
▽ More
Text embedding models have been popular for information retrieval applications such as semantic search and Question-Answering systems based on Retrieval-Augmented Generation (RAG). Those models are typically Transformer models that are fine-tuned with contrastive learning objectives. One of the challenging aspects of fine-tuning embedding models is the selection of high quality hard-negative passages for contrastive learning. In this paper we introduce a family of positive-aware mining methods that use the positive relevance score as an anchor for effective false negative removal, leading to faster training and more accurate retrieval models. We provide an ablation study on hard-negative mining methods over their configurations, exploring different teacher and base models. We further demonstrate the efficacy of our proposed mining methods at scale with the NV-Retriever-v1 model, which scores 60.9 on MTEB Retrieval (BEIR) benchmark and placed 1st when it was published to the MTEB Retrieval on July, 2024.
△ Less
Submitted 7 February, 2025; v1 submitted 22 July, 2024;
originally announced July 2024.
-
Investigating Optical Variability of the Blazar S5 0716+714 On Diverse Time-scales
Authors:
Ergün Ege,
Aykut Özdönmez,
Aditi Agarwal,
Tansel Ak
Abstract:
We present the results of the observational study of the blazar S5 0716+716 in the optical bands B, V, R, and I between March 2019 and August 2023 to investigate its variability on diverse time-scales. The blazar was followed up by the T60 robotic telescope in Turkey for 416 nights to obtain long-term variability during this period. In order to search for intraday variability of the object, we hav…
▽ More
We present the results of the observational study of the blazar S5 0716+716 in the optical bands B, V, R, and I between March 2019 and August 2023 to investigate its variability on diverse time-scales. The blazar was followed up by the T60 robotic telescope in Turkey for 416 nights to obtain long-term variability during this period. In order to search for intraday variability of the object, we have carried out 21 nights of observations with the T100 telescope for at least 1 hour. The blazar showed a ~2.47 mag variation in the optical R-band during our monitoring period, the brightest state on 18.01.2020 (MJD 58866) as R=12.109$\pm$0.011 and the faintest state on 23.03.2019 (MJD 58565) as R=14.580$\pm$0.013. We employed the nested ANOVA test and the power enhanced F-test to quantify intraday variability which showed that the blazar was significantly variable in the R-band on 12 out of 21 nights. Correlation analysis of the light curves shows that the emission in the BVRI optical bands was strongly correlated both in the short and long term without any time lag. The blazar has likely quasi-periods of 186$\pm$30, 532$\pm$76 days in the optical R-band light curve according to the WWZ, and the LS periodogram. The IDV and LTV features are discussed within the frame of prospective scenarios.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Characterization of the Autonomic Nervous System Activity in Females Classified According to Mood Scores During the Follicular Phase
Authors:
Makiko Aok,
Mai Nishimura,
Masato Suzuki,
Eiriko Terasawa,
Hisayo Okayama
Abstract:
Many sexually mature females suffer from premenstrual syndrome (PMS), but effective coping methods for PMS are limited due to the complexity of symptoms and unclear pathogenesis. Awareness has shown promise in alleviating PMS symptoms but faces challenges in long-term recording and consistency. Our research goal is to establish a convenient and simple method to make individual female aware of thei…
▽ More
Many sexually mature females suffer from premenstrual syndrome (PMS), but effective coping methods for PMS are limited due to the complexity of symptoms and unclear pathogenesis. Awareness has shown promise in alleviating PMS symptoms but faces challenges in long-term recording and consistency. Our research goal is to establish a convenient and simple method to make individual female aware of their own psychological, and autonomic conditions. In previous research, we demonstrated that participants could be classified into non-PMS and PMS groups based on mood scores obtained during the follicular phase. However, the properties of neurophysiological activity in the participants classified by mood scores have not been elucidated. This study aimed to classify participants based on their scores on a mood questionnaire during the follicular phase and to evaluate their autonomic nervous system (ANS) activity using a simple device that measures pulse waves from the earlobe. Participants were grouped into Cluster I (high positive mood) and Cluster II (low mood). Cluster II participants showed reduced parasympathetic nervous system activity from the follicular to the menstrual phase, indicating potential PMS symptoms. The study demonstrates the feasibility of using mood scores to classify individuals into PMS and non-PMS groups and monitor ANS changes across menstrual phases. Despite limitations such as sample size and device variability, the findings highlight a promising avenue for convenient PMS self-monitoring.
△ Less
Submitted 8 May, 2024;
originally announced May 2024.
-
What-if Analysis Framework for Digital Twins in 6G Wireless Network Management
Authors:
Elif Ak,
Berk Canberk,
Vishal Sharma,
Octavia A. Dobre,
Trung Q. Duong
Abstract:
This study explores implementing a digital twin network (DTN) for efficient 6G wireless network management, aligning with the fault, configuration, accounting, performance, and security (FCAPS) model. The DTN architecture comprises the Physical Twin Layer, implemented using NS-3, and the Service Layer, featuring machine learning and reinforcement learning for optimizing carrier sensitivity thresho…
▽ More
This study explores implementing a digital twin network (DTN) for efficient 6G wireless network management, aligning with the fault, configuration, accounting, performance, and security (FCAPS) model. The DTN architecture comprises the Physical Twin Layer, implemented using NS-3, and the Service Layer, featuring machine learning and reinforcement learning for optimizing carrier sensitivity threshold and transmit power control in wireless networks. We introduce a robust "What-if Analysis" module, utilizing conditional tabular generative adversarial network (CTGAN) for synthetic data generation to mimic various network scenarios. These scenarios assess four network performance metrics: throughput, latency, packet loss, and coverage. Our findings demonstrate the efficiency of the proposed what-if analysis framework in managing complex network conditions, highlighting the importance of the scenario-maker step and the impact of twinning intervals on network performance.
△ Less
Submitted 24 April, 2024; v1 submitted 17 April, 2024;
originally announced April 2024.
-
X-CBA: Explainability Aided CatBoosted Anomal-E for Intrusion Detection System
Authors:
Kiymet Kaya,
Elif Ak,
Sumeyye Bas,
Berk Canberk,
Sule Gunduz Oguducu
Abstract:
The effectiveness of Intrusion Detection Systems (IDS) is critical in an era where cyber threats are becoming increasingly complex. Machine learning (ML) and deep learning (DL) models provide an efficient and accurate solution for identifying attacks and anomalies in computer networks. However, using ML and DL models in IDS has led to a trust deficit due to their non-transparent decision-making. T…
▽ More
The effectiveness of Intrusion Detection Systems (IDS) is critical in an era where cyber threats are becoming increasingly complex. Machine learning (ML) and deep learning (DL) models provide an efficient and accurate solution for identifying attacks and anomalies in computer networks. However, using ML and DL models in IDS has led to a trust deficit due to their non-transparent decision-making. This transparency gap in IDS research is significant, affecting confidence and accountability. To address, this paper introduces a novel Explainable IDS approach, called X-CBA, that leverages the structural advantages of Graph Neural Networks (GNNs) to effectively process network traffic data, while also adapting a new Explainable AI (XAI) methodology. Unlike most GNN-based IDS that depend on labeled network traffic and node features, thereby overlooking critical packet-level information, our approach leverages a broader range of traffic data through network flows, including edge attributes, to improve detection capabilities and adapt to novel threats. Through empirical testing, we establish that our approach not only achieves high accuracy with 99.47% in threat detection but also advances the field by providing clear, actionable explanations of its analytical outcomes. This research also aims to bridge the current gap and facilitate the broader integration of ML/DL technologies in cybersecurity defenses by offering a local and global explainability solution that is both precise and interpretable.
△ Less
Submitted 2 June, 2024; v1 submitted 1 February, 2024;
originally announced February 2024.
-
A YANG-aided Unified Strategy for Black Hole Detection for Backbone Networks
Authors:
Elif Ak,
Kiymet Kaya,
Eren Ozaltun,
Sule Gunduz Oguducu,
Berk Canberk
Abstract:
Despite the crucial importance of addressing Black Hole failures in Internet backbone networks, effective detection strategies in backbone networks are lacking. This is largely because previous research has been centered on Mobile Ad-hoc Networks (MANETs), which operate under entirely different dynamics, protocols, and topologies, making their findings not directly transferable to backbone network…
▽ More
Despite the crucial importance of addressing Black Hole failures in Internet backbone networks, effective detection strategies in backbone networks are lacking. This is largely because previous research has been centered on Mobile Ad-hoc Networks (MANETs), which operate under entirely different dynamics, protocols, and topologies, making their findings not directly transferable to backbone networks. Furthermore, detecting Black Hole failures in backbone networks is particularly challenging. It requires a comprehensive range of network data due to the wide variety of conditions that need to be considered, making data collection and analysis far from straightforward. Addressing this gap, our study introduces a novel approach for Black Hole detection in backbone networks using specialized Yet Another Next Generation (YANG) data models with Black Hole-sensitive Metric Matrix (BHMM) analysis. This paper details our method of selecting and analyzing four YANG models relevant to Black Hole detection in ISP networks, focusing on routing protocols and ISP-specific configurations. Our BHMM approach derived from these models demonstrates a 10% improvement in detection accuracy and a 13% increase in packet delivery rate, highlighting the efficiency of our approach. Additionally, we evaluate the Machine Learning approach leveraged with BHMM analysis in two different network settings, a commercial ISP network, and a scientific research-only network topology. This evaluation also demonstrates the practical applicability of our method, yielding significantly improved prediction outcomes in both environments.
△ Less
Submitted 1 February, 2024;
originally announced February 2024.
-
CCD UBV and Gaia DR3 Analyses of Open Clusters King 6 and NGC 1605
Authors:
S. Gokmen,
Z. Eker,
T. Yontan,
S. Bilir,
T. Ak,
S. Ak,
T. Banks,
A. Sarajedini
Abstract:
A detailed analysis of ground-based CCD UBV photometry and space-based Gaia Data Release 3 (DR3) data for the open clusters King 6 and NGC 1605 was performed. Using the pyUPMASK algorithm on Gaia astrometric data to estimate cluster membership probabilities, we have identified 112 stars in King 6 and 160 stars in NGC 1605 as the statistically most likely members of each cluster. We calculated redd…
▽ More
A detailed analysis of ground-based CCD UBV photometry and space-based Gaia Data Release 3 (DR3) data for the open clusters King 6 and NGC 1605 was performed. Using the pyUPMASK algorithm on Gaia astrometric data to estimate cluster membership probabilities, we have identified 112 stars in King 6 and 160 stars in NGC 1605 as the statistically most likely members of each cluster. We calculated reddening and metallicity separately using UBV two-color diagrams to estimate parameter values via independent methods. The color excess $E(B-V)$ and photometric metallicity [Fe/H] for King 6 are $0.515 \pm 0.030$ mag and $0.02 \pm 0.20$ dex, respectively. For NGC 1605, they are $0.840 \pm 0.054$ mag and $0.01 \pm 0.20$ dex. With reddening and metallicity kept constant, we have estimated the distances and cluster ages by fitting PARSEC isochrones to color-magnitude diagrams based on the Gaia and UBV data. Photometric distances are 723 $\pm$ 34 pc for King 6 and 3054 $\pm$ 243 pc for NGC 1605. The cluster ages are $200 \pm 20$ Myr and $400 \pm 50$ Myr for King 6 and NGC 1605, respectively. Mass function slopes were found to be 1.29 $\pm$ 0.18 and 1.63 $\pm$ 0.36 for King 6 and NGC 1605, respectively. These values are in good agreement with the value of Salpeter (1955). The relaxation times were estimated as 5.8 Myr for King 6 and 60 Myr for NGC 1605. This indicates that both clusters are dynamically relaxed since these times are less than the estimated cluster ages. Galactic orbit analysis shows that both clusters formed outside the solar circle and are members of the young thin-disc population.
△ Less
Submitted 31 October, 2023;
originally announced November 2023.
-
NICE: CVPR 2023 Challenge on Zero-shot Image Captioning
Authors:
Taehoon Kim,
Pyunghwan Ahn,
Sangyun Kim,
Sihaeng Lee,
Mark Marsden,
Alessandra Sala,
Seung Hwan Kim,
Bohyung Han,
Kyoung Mu Lee,
Honglak Lee,
Kyounghoon Bae,
Xiangyu Wu,
Yi Gao,
Hailiang Zhang,
Yang Yang,
Weili Guo,
Jianfeng Lu,
Youngtaek Oh,
Jae Won Cho,
Dong-jin Kim,
In So Kweon,
Junmo Kim,
Wooyoung Kang,
Won Young Jhoo,
Byungseok Roh
, et al. (17 additional authors not shown)
Abstract:
In this report, we introduce NICE (New frontiers for zero-shot Image Captioning Evaluation) project and share the results and outcomes of 2023 challenge. This project is designed to challenge the computer vision community to develop robust image captioning models that advance the state-of-the-art both in terms of accuracy and fairness. Through the challenge, the image captioning models were tested…
▽ More
In this report, we introduce NICE (New frontiers for zero-shot Image Captioning Evaluation) project and share the results and outcomes of 2023 challenge. This project is designed to challenge the computer vision community to develop robust image captioning models that advance the state-of-the-art both in terms of accuracy and fairness. Through the challenge, the image captioning models were tested using a new evaluation dataset that includes a large variety of visual concepts from many domains. There was no specific training data provided for the challenge, and therefore the challenge entries were required to adapt to new types of image descriptions that had not been seen during training. This report includes information on the newly proposed NICE dataset, evaluation methods, challenge results, and technical details of top-ranking entries. We expect that the outcomes of the challenge will contribute to the improvement of AI models on various vision-language tasks.
△ Less
Submitted 10 September, 2023; v1 submitted 5 September, 2023;
originally announced September 2023.
-
Two Games on Arithmetic Functions: SALIQUANT and NONTOTIENT
Authors:
Paul Ellis,
Jason Shi,
Thotsaporn Aek Thanatipanonda,
Andrew Tu
Abstract:
We investigate the Sprague-Grundy sequences for two normal-play impartial games based on arithmetic functions, first described by Iannucci and Larsson in \cite{sum}. In each game, the set of positions is N (natural numbers). In saliquant, the options are to subtract a non-divisor. Here we obtain several nice number theoretic lemmas, a fundamental theorem, and two conjectures about the eventual den…
▽ More
We investigate the Sprague-Grundy sequences for two normal-play impartial games based on arithmetic functions, first described by Iannucci and Larsson in \cite{sum}. In each game, the set of positions is N (natural numbers). In saliquant, the options are to subtract a non-divisor. Here we obtain several nice number theoretic lemmas, a fundamental theorem, and two conjectures about the eventual density of Sprague-Grundy values.
In nontotient, the only option is to subtract the number of relatively prime residues. Here are able to calculate certain Sprague-Grundy values, and start to understand an appropriate class function.
△ Less
Submitted 3 September, 2023;
originally announced September 2023.
-
Electron Cloud Measurements in Fermilab Booster
Authors:
S. A. K. Wijethunga,
N. Eddy,
J. Eldred,
C. Y. Tan,
B. Fellenz,
E. Pozdeyev,
R. V. Sharankova
Abstract:
Fermilab Booster synchrotron requires an intensity upgrade from 4.5x1012 to 6.5x1012 protons per pulse as a part of Fermilab's Proton Improvement Plan-II (PIP-II). One of the factors which may limit the high-intensity performance is the fast transverse instabilities caused by electron cloud effects. According to the experience in the Recycler, the electron cloud gradually builds up over multiple t…
▽ More
Fermilab Booster synchrotron requires an intensity upgrade from 4.5x1012 to 6.5x1012 protons per pulse as a part of Fermilab's Proton Improvement Plan-II (PIP-II). One of the factors which may limit the high-intensity performance is the fast transverse instabilities caused by electron cloud effects. According to the experience in the Recycler, the electron cloud gradually builds up over multiple turns inside the combined function magnets and can reach final intensities orders of magnitude greater than in a pure dipole. Since the Booster synchrotron also incorporates combined function magnets, it is important to measure the presence of electron cloud. The presence or apparent absence of the electron cloud was investigated using two different methods: measuring bunch-by-bunch tune shift by changing the bunch train structure at different intensities and propagating a microwave carrier signal through the beampipe and analyzing the phase modulation of the signal. This paper presents the results of the two methods and corresponding simulation results conducted using PyECLOUD software.
△ Less
Submitted 19 July, 2023;
originally announced July 2023.
-
Numerical Simulations of the Decaying Transverse Oscillations in the Cool Jet
Authors:
Abhishek K. Srivastava,
Balveer Singh
Abstract:
We describe a 2.5D MHD simulation describing the evolution of cool jets triggered by initial vertical velocity perturbations in the solar chromosphere. We implement random velocity pulses of amplitude 20-50 km/s between 1 Mm and 1.5 Mm, along with various switch-off periods between 50 s and 300 s. The applied vertical velocity pulses create a series of magnetoacoustic shocks steepening above TR. T…
▽ More
We describe a 2.5D MHD simulation describing the evolution of cool jets triggered by initial vertical velocity perturbations in the solar chromosphere. We implement random velocity pulses of amplitude 20-50 km/s between 1 Mm and 1.5 Mm, along with various switch-off periods between 50 s and 300 s. The applied vertical velocity pulses create a series of magnetoacoustic shocks steepening above TR. These shocks interact with each other in the inner corona, leading to complex localized velocity fields. The upward propagation of such perturbations creates low-pressure regions behind them, which propel a variety of cool jets and plasma flows. We study the transverse oscillations of a representative cool jet J1 , which moves up to the height of 6.2 Mm above the TR from its origin point. During its evolution, the plasma flows make the spine of jet J1 radially inhomogeneous, which is visible in the density and Alfvén speed smoothly varying across the jet. The highly dense J1 supports the propagating transverse wave of period of approximately 195 s with a phase speed of about 125 km/s. In the distance-time map of density, it is manifested as a transverse kink wave. However, the careful investigation of the distance-time maps of the x- and z-components of velocity reveals that these transverse waves are actually the mixed Alfvénic modes. The transverse wave shows evidence of damping in the jet. We conclude that the cross-field structuring of the density and characteristic Alfvén speed within J1 causes the onset of the resonant conversion and leakage of the wave energy outward to dissipate these transverse oscillations via resonant absorption. The wave energy flux is estimated as approximately of 1.0 x 10^6 ergs cm^{-2} s^{-1}. This energy, if it dissipates through the resonant absorption into the corona where the jet is propagated, is sufficient energy for the localized coronal heating.
△ Less
Submitted 23 June, 2023;
originally announced June 2023.
-
IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages
Authors:
Jay Gala,
Pranjal A. Chitale,
Raghavan AK,
Varun Gumma,
Sumanth Doddapaneni,
Aswanth Kumar,
Janki Nawale,
Anupama Sujatha,
Ratish Puduppully,
Vivek Raghavan,
Pratyush Kumar,
Mitesh M. Khapra,
Raj Dabre,
Anoop Kunchukuttan
Abstract:
India has a rich linguistic landscape with languages from 4 major language families spoken by over a billion people. 22 of these languages are listed in the Constitution of India (referred to as scheduled languages) are the focus of this work. Given the linguistic diversity, high-quality and accessible Machine Translation (MT) systems are essential in a country like India. Prior to this work, ther…
▽ More
India has a rich linguistic landscape with languages from 4 major language families spoken by over a billion people. 22 of these languages are listed in the Constitution of India (referred to as scheduled languages) are the focus of this work. Given the linguistic diversity, high-quality and accessible Machine Translation (MT) systems are essential in a country like India. Prior to this work, there was (i) no parallel training data spanning all 22 languages, (ii) no robust benchmarks covering all these languages and containing content relevant to India, and (iii) no existing translation models which support all the 22 scheduled languages of India. In this work, we aim to address this gap by focusing on the missing pieces required for enabling wide, easy, and open access to good machine translation systems for all 22 scheduled Indian languages. We identify four key areas of improvement: curating and creating larger training datasets, creating diverse and high-quality benchmarks, training multilingual models, and releasing models with open access. Our first contribution is the release of the Bharat Parallel Corpus Collection (BPCC), the largest publicly available parallel corpora for Indic languages. BPCC contains a total of 230M bitext pairs, of which a total of 126M were newly added, including 644K manually translated sentence pairs created as part of this work. Our second contribution is the release of the first n-way parallel benchmark covering all 22 Indian languages, featuring diverse domains, Indian-origin content, and source-original test sets. Next, we present IndicTrans2, the first model to support all 22 languages, surpassing existing models on multiple existing and new benchmarks created as a part of this work. Lastly, to promote accessibility and collaboration, we release our models and associated data with permissive licenses at https://github.com/AI4Bharat/IndicTrans2.
△ Less
Submitted 20 December, 2023; v1 submitted 25 May, 2023;
originally announced May 2023.
-
A Chandra X-ray Study of Supernova Remnant N63A in the Large Magellanic Cloud
Authors:
E. Karagoz,
N. Alan,
S. Bilir,
S. Ak
Abstract:
We perform extensive spectroscopy of the supernova remnant N63A in the Large Magellanic Cloud, using $\sim 43$ ks {\it Chandra} archival data. By analysing the spectra of the entire remnant, we determine the abundance distributions for O, Ne, Mg, Si, and Fe. We detect evidence of enhanced O and possibly Ne and Mg in some of the central regions which might indicate an asymmetric distribution of the…
▽ More
We perform extensive spectroscopy of the supernova remnant N63A in the Large Magellanic Cloud, using $\sim 43$ ks {\it Chandra} archival data. By analysing the spectra of the entire remnant, we determine the abundance distributions for O, Ne, Mg, Si, and Fe. We detect evidence of enhanced O and possibly Ne and Mg in some of the central regions which might indicate an asymmetric distribution of the ejecta. The average O/Ne, O/Mg, and Ne/Mg abundance ratios of the ejecta are in plausible agreement with the nucleosynthesis products from the explosion of a $\sim40$ $M_{\odot}$ progenitor. We estimate an upper limit on the Sedov age of $\sim 5,400\pm200$ yr and explosion energy of $\sim 8.9\pm 1.6\times 10^{51}$ erg for N63A. We discuss the implications of our results for the morphological structure of the remnant, its circumstellar medium and the nature of the progenitor star.
△ Less
Submitted 4 May, 2023;
originally announced May 2023.
-
Learning Failure Prevention Skills for Safe Robot Manipulation
Authors:
Abdullah Cihan Ak,
Eren Erdal Aksoy,
Sanem Sariel
Abstract:
Robots are more capable of achieving manipulation tasks for everyday activities than before. But the safety of manipulation skills that robots employ is still an open problem. Considering all possible failures during skill learning increases the complexity of the process and restrains learning an optimal policy. Beyond that, in unstructured environments, it is not easy to enumerate all possible fa…
▽ More
Robots are more capable of achieving manipulation tasks for everyday activities than before. But the safety of manipulation skills that robots employ is still an open problem. Considering all possible failures during skill learning increases the complexity of the process and restrains learning an optimal policy. Beyond that, in unstructured environments, it is not easy to enumerate all possible failures beforehand. In the context of safe skill manipulation, we reformulate skills as base and failure prevention skills where base skills aim at completing tasks and failure prevention skills focus on reducing the risk of failures to occur. Then, we propose a modular and hierarchical method for safe robot manipulation by augmenting base skills by learning failure prevention skills with reinforcement learning, forming a skill library to address different safety risks. Furthermore, a skill selection policy that considers estimated risks is used for the robot to select the best control policy for safe manipulation. Our experiments show that the proposed method achieves the given goal while ensuring safety by preventing failures. We also show that with the proposed method, skill learning is feasible, novel failures are easily adaptable, and our safe manipulation tools can be transferred to the real environment.
△ Less
Submitted 4 May, 2023;
originally announced May 2023.
-
Reconnection generated plasma flows in the quasi-separatrix layer in localised solar corona
Authors:
Sripan Mondal,
A. K. Srivastava,
Sudheer K. Mishra,
K. Sangal,
Pradeep Kayshap,
Yang Guo,
David I. Pontin,
Vadim M. Uritsky,
Leon Ofman,
T. -J. Wang,
Ding Yuan
Abstract:
Multiwavelength observations of the propagating disturbances (PDs), discovered by Atmospheric Imaging Assembly (AIA) onboard Solar Dynamics Observatory (SDO), are analyzed to determine its driving mechanism and physical nature. Two magnetic strands in the localised corona are observed to approach and merge with each other followed by the generation of brightening, which further propagates in a cus…
▽ More
Multiwavelength observations of the propagating disturbances (PDs), discovered by Atmospheric Imaging Assembly (AIA) onboard Solar Dynamics Observatory (SDO), are analyzed to determine its driving mechanism and physical nature. Two magnetic strands in the localised corona are observed to approach and merge with each other followed by the generation of brightening, which further propagates in a cusp-shaped magnetic channel. Differential emission measure analysis shows an occurrence of heating in this region-of-interest (ROI). We extrapolate potential magnetic field lines at coronal heights from observed Helioseismic and Magnetic Imager (HMI) vector magnetogram via Green's function method using MPI-AMRVAC. We analyze the field to locate magnetic nulls and quasi-separatrix layers (QSLs) which are preferential locations for magnetic reconnection. Dominant QSLs including a magnetic null are found to exist and match the geometry followed by PDs, therefore, it provides conclusive evidence of magnetic reconnection. In addition, spectroscopic analysis of Interface Region Imaging Spectrograph (IRIS) Si IV 1393.77 Å line profiles show a rise of line-width in the same time range depicting presence of mass motion in the observed cusp-shaped region. PDs are observed to exhibit periodicities of around four minutes. The speeds of PDs measured by Surfing Transform Technique are almost close to each other in four different SDO/AIA bandpasses, i.e., 304, 171, 193 and 131 Å excluding the interpretation of PDs in terms of slow magnetoacoustic waves. We describe comprehensively the observed PDs as quasi-periodic plasma flows generated due to periodic reconnection in vicinity of a coronal magnetic null.
△ Less
Submitted 3 May, 2023;
originally announced May 2023.
-
The Age-Metallicity Relation in the Solar Neighbourhood
Authors:
S. Doner,
S. Ak,
O. Onal Tas,
O. Plevne
Abstract:
Age-metallicity relation for the Galactic disc is a crucial tool and to constrain the Galactic chemical evolution models. We investigate the age-metallicity relation of the Galactic disc using the red giant branch stars in the Solar neighbourhood. The data cover the Galactocentric radius of $7\leq R_{\rm gc} (\rm kpc) \leq9.5$, but extends up to 4 kpc in height from the Galactic plane. We use kine…
▽ More
Age-metallicity relation for the Galactic disc is a crucial tool and to constrain the Galactic chemical evolution models. We investigate the age-metallicity relation of the Galactic disc using the red giant branch stars in the Solar neighbourhood. The data cover the Galactocentric radius of $7\leq R_{\rm gc} (\rm kpc) \leq9.5$, but extends up to 4 kpc in height from the Galactic plane. We use kinematic age derived from highly precise astrometric data of Gaia Data Release 2 and element abundance ratios from high-resolution spectroscopic data of APOGEE-2 catalogues. We apply a two-component Gaussian mixture model to chemically separate the programme stars into thin and thick disc populations. The stars in each population are grouped into different distance intervals from the Galactic plane. The mean metal abundances and velocity dispersions of the stars in the groups were calculated and the kinematic ages were determined from their kinematic parameters. We found a steep relation for the thin disc with -0.057$\pm$0.007 dex Gyr$^{-1}$, and even a steeper value of -0.103$\pm$0.009 dex Gyr$^{-1}$ for the thick disc. These age-metallicity relations along with the prominent differences in age, metallicity, and kinematic behaviours seen from the data, clearly show it is important to consider the distinct formation scenarios of the Galactic disc components in modelling the Milky Way.
△ Less
Submitted 28 April, 2023;
originally announced April 2023.
-
Galactic Model Parameters and Space Density of Cataclysmic Variables in Gaia Era: New Constraints to Population Models
Authors:
R. Canbay,
S. Bilir,
A. Özdönmez,
T. Ak
Abstract:
The spatial distribution, Galactic model parameters and luminosity function of cataclysmic variables (CVs) are established using re-estimated trigonometric parallaxes of {\it Gaia} DR3. The data sample of 1,587 CVs in this study is claimed to be suitable for Galactic model parameter estimation as the distances are based on trigonometric parallaxes and the {\it Gaia} DR3 photometric completeness li…
▽ More
The spatial distribution, Galactic model parameters and luminosity function of cataclysmic variables (CVs) are established using re-estimated trigonometric parallaxes of {\it Gaia} DR3. The data sample of 1,587 CVs in this study is claimed to be suitable for Galactic model parameter estimation as the distances are based on trigonometric parallaxes and the {\it Gaia} DR3 photometric completeness limits were taken into account when the sample was created. According to the analysis, the scale height of All CVs increases from 248$\pm$2 to 430$\pm$4 pc towards shorter periods near the lower limit of the period gap and suddenly drops to 300$\pm$2 pc for the shortest orbital period CVs. The exponential scale heights of All CVs and magnetic systems are found to be 375$\pm$2 and 281$\pm$3 pc, respectively, considerably larger than those suggested in previous observational studies. The local space density of All CVs and magnetic systems in the sample are $6.8^{+1.3}_{-1.1}\times$10$^{-6}$ and $2.1^{+0.5}_{-0.4}\times10^{-6}$ pc$^{-3}$, respectively. Our measurements strengthen the 1-2 order of magnitude discrepancy between CV space densities predicted by population synthesis models and observations. It is likely that this discrepancy is due to objects undetected by CV surveys, such as the systems with very low $\dot{M}$ and the ones in the period gap. The comparisons of the luminosity function of white dwarfs with the luminosity function of All CVs in this study show that 500 times the luminosity function of CVs fits very well to the luminosity function of white dwarfs. We conclude that the estimations and data sample in this study can be confidently used in further analysis of CVs.
△ Less
Submitted 22 February, 2023;
originally announced February 2023.
-
BASICS: Broad quality Assessment of Static point clouds In Compression Scenarios
Authors:
Ali Ak,
Emin Zerman,
Maurice Quach,
Aladine Chetouani,
Aljosa Smolic,
Giuseppe Valenzise,
Patrick Le Callet
Abstract:
Point clouds have become increasingly prevalent in representing 3D scenes within virtual environments, alongside 3D meshes. Their ease of capture has facilitated a wide array of applications on mobile devices, from smartphones to autonomous vehicles. Notably, point cloud compression has reached an advanced stage and has been standardized. However, the availability of quality assessment datasets, w…
▽ More
Point clouds have become increasingly prevalent in representing 3D scenes within virtual environments, alongside 3D meshes. Their ease of capture has facilitated a wide array of applications on mobile devices, from smartphones to autonomous vehicles. Notably, point cloud compression has reached an advanced stage and has been standardized. However, the availability of quality assessment datasets, which are essential for developing improved objective quality metrics, remains limited. In this paper, we introduce BASICS, a large-scale quality assessment dataset tailored for static point clouds. The BASICS dataset comprises 75 unique point clouds, each compressed with four different algorithms including a learning-based method, resulting in the evaluation of nearly 1500 point clouds by 3500 unique participants. Furthermore, we conduct a comprehensive analysis of the gathered data, benchmark existing point cloud quality assessment metrics and identify their limitations. By publicly releasing the BASICS dataset, we lay the foundation for addressing these limitations and fostering the development of more precise quality metrics.
△ Less
Submitted 18 November, 2024; v1 submitted 9 February, 2023;
originally announced February 2023.
-
Bayesian approach to radio frequency interference mitigation
Authors:
S. A. K. Leeney,
W. J. Handley,
E. de Lera Acedo
Abstract:
Interfering signals such as Radio Frequency Interference from ubiquitous satellite constellations are becoming an endemic problem in fields involving physical observations of the electromagnetic spectrum. To address this we propose a novel data cleaning methodology. Contamination is simultaneously flagged and managed at the likelihood level. It is modeled in a Bayesian fashion through a piecewise…
▽ More
Interfering signals such as Radio Frequency Interference from ubiquitous satellite constellations are becoming an endemic problem in fields involving physical observations of the electromagnetic spectrum. To address this we propose a novel data cleaning methodology. Contamination is simultaneously flagged and managed at the likelihood level. It is modeled in a Bayesian fashion through a piecewise likelihood that is constrained by a Bernoulli prior distribution. The techniques described in this paper can be implemented with just a few lines of code.
△ Less
Submitted 19 February, 2024; v1 submitted 28 November, 2022;
originally announced November 2022.
-
The slowest coupon collector's problem
Authors:
Tipaluck Krityakierne,
Thotsaporn Aek Thanatipanonda
Abstract:
In the classical coupon collector's problem, every box of breakfast cereal contains one coupon from a collection of n distinct coupons, each equally likely to appear. The goal is to find the expected number of boxes a player needs to purchase to complete the whole collection. In this work, we extend the classical problem to k players who compete with one another to be the first to collect the whol…
▽ More
In the classical coupon collector's problem, every box of breakfast cereal contains one coupon from a collection of n distinct coupons, each equally likely to appear. The goal is to find the expected number of boxes a player needs to purchase to complete the whole collection. In this work, we extend the classical problem to k players who compete with one another to be the first to collect the whole collection. We find the expected numbers of boxes required for the slowest and fastest players to finish the game. The odds of a particular player being the slowest or fastest player will also be touched upon. The solutions will be discussed from both the tractable algebraic techniques as well as the probability point of views.
△ Less
Submitted 21 February, 2023; v1 submitted 8 November, 2022;
originally announced November 2022.
-
No-feedback Card Guessing Game: Moments and distributions under the optimal strategy
Authors:
Tipaluck Krityakierne,
Poohrich Siriputcharoen,
Thotsaporn Aek Thanatipanonda,
Chaloemkiat Yapolha
Abstract:
Relying on the optimal guessing strategy recently found for a no-feedback card guessing game with $k$-time riffle shuffles, we derive an exact, closed-form formula for the expected number of correct guesses and higher moments for a $1$-time shuffle case. Our approach makes use of the fast generating function based on a recurrence relation, the method of overlapping stages, and interpolation. As fo…
▽ More
Relying on the optimal guessing strategy recently found for a no-feedback card guessing game with $k$-time riffle shuffles, we derive an exact, closed-form formula for the expected number of correct guesses and higher moments for a $1$-time shuffle case. Our approach makes use of the fast generating function based on a recurrence relation, the method of overlapping stages, and interpolation. As for $k>1$-time shuffles, we establish the expected number of correct guesses through a self-contained combinatorial proof. The proof turns out to be the answer to an open problem listed in Krityakierne and Thanatipanonda (2022), asking for a combinatorial interpretation of a generating function object introduced therein.
△ Less
Submitted 17 September, 2022; v1 submitted 9 September, 2022;
originally announced September 2022.
-
Electron Cloud Measurements in Fermilab Booster
Authors:
S. A. K. Wijethunga,
J. Eldred,
C. Y. Tan,
E. Pozdeyev
Abstract:
Fermilab Booster synchrotron requires an intensity upgrade from 4.5x1012 to 6.5x1012 protons per pulse as a part of Fermilabs Proton Improvement Plan-II (PIP-II). One of the factors which may limit the high-intensity performance is the fast transverse instabilities caused by electron cloud effects. According to the experience in the Recycler, the electron cloud gradually builds up over multiple tu…
▽ More
Fermilab Booster synchrotron requires an intensity upgrade from 4.5x1012 to 6.5x1012 protons per pulse as a part of Fermilabs Proton Improvement Plan-II (PIP-II). One of the factors which may limit the high-intensity performance is the fast transverse instabilities caused by electron cloud effects. According to the experience in the Recycler, the electron cloud gradually builds up over multiple turns in the combined function magnets and can reach final intensities orders of magnitude greater than in a pure dipole. Since the Booster synchrotron also incorporates combined function magnets, it is essential to discover any existence of an electron cloud. And if it does, its effects on the PIP-II era Booster and its mitigating techniques. As the first step, the presence or absence of the electron cloud was investigated using the clearing bunch technique. This paper presents experimental details and observations of the bunch-by-bunch tune shifts of beams with various bunch train structures at low and high intensities and simulation results conducted using PyECLOUD.
△ Less
Submitted 8 August, 2022;
originally announced August 2022.
-
V1294 Aql = HD 184279: A bad boy among Be stars or an important clue to the Be phenomenon?
Authors:
P. Harmanec,
H. Božić,
P. Koubský,
S. Yang,
D. Ruždjak,
D. Sudar,
M. Šlechta,
M. Wolf,
D. Korčáková,
P. Zasche,
A. Oplištilová,
D. Vršnak,
H. Ak,
P. Eenens,
H. Bakiş,
V. Bakiş,
S. Otero,
R. Chini,
T. Demsky,
B. N. Barlow,
P. Svoboda,
J. Jonák,
K. Vitovský,
A. Harmanec
Abstract:
A reliable determination of the basic physical properties and variability patterns of hot emission-line stars is important for understanding the Be phenomenon and ultimately, the evolutionary stage of Be stars. This study is devoted to one of the most remarkable Be stars, V1294 Aql = HD 184279. We collected and analysed spectroscopic and photometric observations covering a time interval of about 2…
▽ More
A reliable determination of the basic physical properties and variability patterns of hot emission-line stars is important for understanding the Be phenomenon and ultimately, the evolutionary stage of Be stars. This study is devoted to one of the most remarkable Be stars, V1294 Aql = HD 184279. We collected and analysed spectroscopic and photometric observations covering a time interval of about 25000 d (68 yr). We present evidence that the object is a single-line 192.9 d spectroscopic binary and estimate that the secondary probably is a hot compact object with a mass of about 1.1-1.2 solar masses. We found and documented very complicated orbital and long-term spectral, light, and colour variations, which must arise from a combination of several distinct variability patterns. Attempts at modelling them are planned for a follow-up study. We place the time behaviour of V1294 Aql into context with variations known for some other systematically studied Be stars and discuss the current ideas about the nature of the Be phenomenon.
△ Less
Submitted 23 June, 2022;
originally announced June 2022.
-
Multiwavelength observations of MAXI J1820+070 during its outburst decay and subsequent mini-outburst
Authors:
M. Özbey Arabacı,
E. Kalemci,
T. Dinçer,
C. Bailyn,
D. Altamirano,
T. Ak
Abstract:
We present results from quasi-simultaneous multiwavelength observations of the Galactic black hole X-ray transient MAXI J1820$+$070 during the decay of the 2018 outburst and its entire subsequent mini-outburst in March 2019. We fit the X-ray spectra with phenomenological and Comptonizaton models and discuss the X-ray spectral evolution comparing with the multiwavelength behaviour of the system. Th…
▽ More
We present results from quasi-simultaneous multiwavelength observations of the Galactic black hole X-ray transient MAXI J1820$+$070 during the decay of the 2018 outburst and its entire subsequent mini-outburst in March 2019. We fit the X-ray spectra with phenomenological and Comptonizaton models and discuss the X-ray spectral evolution comparing with the multiwavelength behaviour of the system. The system showed a rebrightening in UV/Optical/NIR bands 7-days after the soft-to-hard transition during the main outburst decay while it was fading in X-rays and radio. In contrast, the mini-outburst occurred 165-days after the hard state transition of the initial outburst decay and was detected in all wavelengths. For both events, the measured timescales are consistent with those observed in other black hole systems. Contemporaneous hard X-ray/soft $γ$-ray observations indicate a non-thermal electron energy distribution at the beginning of the UV/Optical/NIR rebrightening, whereas a thermal distribution can fit the data during the hard mini-outburst activity. The broadband spectral energy distributions until the rebrightening are consistent with the irradiated outer accretion disc model. However, both the SEDs produced for the peak of rebrightening and close to the peak of mini-outburst provided good fits only with an additional power-law component in the UV/Optical/NIR frequency ranges which is often interpreted with a jet origin.
△ Less
Submitted 6 June, 2022;
originally announced June 2022.
-
No Feedback? No Worries! The art of guessing the right card
Authors:
Tipaluck Krityakierne,
Thotsaporn Aek Thanatipanonda
Abstract:
In 1998, Ciucu published "No-feedback card guessing for dovetail shuffles", an article which gives the optimal guessing strategy for $n$ cards ($n$ even) after $k$ riffle shuffles whenever $k>2\log_{2}\left(n\right)$. We discuss in this article the optimal guessing strategy and the asymptotic (in $n$) expected number of correct guesses for any fixed $k\geq1$. This complements the work achieved two…
▽ More
In 1998, Ciucu published "No-feedback card guessing for dovetail shuffles", an article which gives the optimal guessing strategy for $n$ cards ($n$ even) after $k$ riffle shuffles whenever $k>2\log_{2}\left(n\right)$. We discuss in this article the optimal guessing strategy and the asymptotic (in $n$) expected number of correct guesses for any fixed $k\geq1$. This complements the work achieved two decades ago by Ciucu.
△ Less
Submitted 18 May, 2022;
originally announced May 2022.
-
Federated Learning Enables Big Data for Rare Cancer Boundary Detection
Authors:
Sarthak Pati,
Ujjwal Baid,
Brandon Edwards,
Micah Sheller,
Shih-Han Wang,
G Anthony Reina,
Patrick Foley,
Alexey Gruzdev,
Deepthi Karkada,
Christos Davatzikos,
Chiharu Sako,
Satyam Ghodasara,
Michel Bilello,
Suyash Mohan,
Philipp Vollmuth,
Gianluca Brugnara,
Chandrakanth J Preetha,
Felix Sahm,
Klaus Maier-Hein,
Maximilian Zenk,
Martin Bendszus,
Wolfgang Wick,
Evan Calabrese,
Jeffrey Rudie,
Javier Villanueva-Meyer
, et al. (254 additional authors not shown)
Abstract:
Although machine learning (ML) has shown promise in numerous domains, there are concerns about generalizability to out-of-sample data. This is currently addressed by centrally sharing ample, and importantly diverse, data from multiple sites. However, such centralization is challenging to scale (or even not feasible) due to various limitations. Federated ML (FL) provides an alternative to train acc…
▽ More
Although machine learning (ML) has shown promise in numerous domains, there are concerns about generalizability to out-of-sample data. This is currently addressed by centrally sharing ample, and importantly diverse, data from multiple sites. However, such centralization is challenging to scale (or even not feasible) due to various limitations. Federated ML (FL) provides an alternative to train accurate and generalizable ML models, by only sharing numerical model updates. Here we present findings from the largest FL study to-date, involving data from 71 healthcare institutions across 6 continents, to generate an automatic tumor boundary detector for the rare disease of glioblastoma, utilizing the largest dataset of such patients ever used in the literature (25,256 MRI scans from 6,314 patients). We demonstrate a 33% improvement over a publicly trained model to delineate the surgically targetable tumor, and 23% improvement over the tumor's entire extent. We anticipate our study to: 1) enable more studies in healthcare informed by large and diverse data, ensuring meaningful results for rare diseases and underrepresented populations, 2) facilitate further quantitative analyses for glioblastoma via performance optimization of our consensus model for eventual public release, and 3) demonstrate the effectiveness of FL at such scale and task complexity as a paradigm shift for multi-site collaborations, alleviating the need for data sharing.
△ Less
Submitted 25 April, 2022; v1 submitted 22 April, 2022;
originally announced April 2022.
-
The Arithmetic-Periodicity of \textsc{cut} for $\mathcal{C}=\{1,2c\}$
Authors:
Paul Ellis,
Thotsaporn Aek Thanatipanonda
Abstract:
\textsc{cut} is a class of partition games played on a finite number of finite piles of tokens. Each version of \textsc{cut} is specified by a cut-set $\mathcal{C}\subseteq\mathbb{N}$. A legal move consists of selecting one of the piles and partitioning it into $d+1$ nonempty piles, where $d\in\mathcal{C}$. No tokens are removed from the game. It turns out that the nim-set for any…
▽ More
\textsc{cut} is a class of partition games played on a finite number of finite piles of tokens. Each version of \textsc{cut} is specified by a cut-set $\mathcal{C}\subseteq\mathbb{N}$. A legal move consists of selecting one of the piles and partitioning it into $d+1$ nonempty piles, where $d\in\mathcal{C}$. No tokens are removed from the game. It turns out that the nim-set for any $\mathcal{C}=\{1,2c\}$ with $c\geq 2$ is arithmetic-periodic, which answers an open question of \cite{par}. The key step is to show that there is a correspondence between the nim-sets of \textsc{cut} for $\mathcal{C}=\{1,6\}$ and the nim-sets of \textsc{cut} for $\mathcal{C}=\{1,2c\}, c\geq 4$. The result easily extends to the case of $\mathcal{C} = \{1, 2c_1, 2c_2, 2c_3, ...\}$, where $c_1,c_2, ... \geq 2$.
△ Less
Submitted 4 March, 2022;
originally announced March 2022.
-
A photometric and astrometric study of the open clusters NGC 1664 and NGC 6939
Authors:
S. Koc,
T. Yontan,
S. Bilir,
R. Canbay,
T. Ak,
T. Banks,
S. Ak,
E. Paunzen
Abstract:
This study calculated astrophysical parameters, as well as kinematic and galactic orbital parameters, of the open clusters NGC 1664 and NGC 6939. The work is based on CCD UBV and Gaia photometric and astrometric data from ground and space-based observations. Considering Gaia Early Data Release 3 (EDR3) astrometric data, we determined membership probabilities of stars located in both of the cluster…
▽ More
This study calculated astrophysical parameters, as well as kinematic and galactic orbital parameters, of the open clusters NGC 1664 and NGC 6939. The work is based on CCD UBV and Gaia photometric and astrometric data from ground and space-based observations. Considering Gaia Early Data Release 3 (EDR3) astrometric data, we determined membership probabilities of stars located in both of the clusters. We used two-color diagrams to determine $E(B-V)$ color excesses for NGC 1664 and NGC 6939 as $0.190 \pm 0.018$ and $0.380 \pm 0.025$ mag, respectively. Photometric metallicities for the two clusters were estimated as [Fe/H] = $-0.10 \pm 0.02$ dex for NGC 1664 and as [Fe/H] = $-0.06 \pm 0.01$ dex for NGC 6939. Using the reddening and metallicity calculated in the study, we obtained distance moduli and ages of the clusters by fitting PARSEC isochrones to the color-magnitude diagrams based on the most likely member stars. Isochrone fitting distances are $1289 \pm 47$ pc and $1716 \pm 87$ pc, which coincide with ages of $675 \pm 50$ Myr and $1.5 \pm 0.2$ Gyr for NGC 1664 and NGC 6939, respectively. We also derived the distances to the clusters using Gaia trigonometric parallaxes and compared these estimates with the literature. We concluded that the results are in good agreement with those given by the current study. Present day mass function slopes were calculated as $Γ=-1.22\pm0.33$ and $Γ=-1.18\pm0.21$ for NGC 1664 and NGC 6939, respectively, which are compatible with the Salpeter (1955) slope. Analyses showed that both of clusters are dynamically relaxed. The kinematic and dynamic orbital parameters of the clusters were calculated, indicating that the birthplaces of the clusters are outside the solar circle.
△ Less
Submitted 22 February, 2022;
originally announced February 2022.
-
Quasi-periodic spicule-like cool jets driven by Alfvén pulses
Authors:
B. Singh,
A. K. Srivastava,
K. Sharma,
S. K. Mishra,
B. N. Dwivedi
Abstract:
We perform a 2.5 dimensional magnetohydrodynamic (MHD) simulation to understand a comprehensive view of the formation of spicule-like cool jets due to initial transverse velocity pulses akin to Alfvén pulses in the solar chromosphere. We invoke multiple velocity ($V_{z}$) pulses between 1.5 and 2.0 Mm in the solar atmosphere, which create the initial transverse velocity perturbations. These pulses…
▽ More
We perform a 2.5 dimensional magnetohydrodynamic (MHD) simulation to understand a comprehensive view of the formation of spicule-like cool jets due to initial transverse velocity pulses akin to Alfvén pulses in the solar chromosphere. We invoke multiple velocity ($V_{z}$) pulses between 1.5 and 2.0 Mm in the solar atmosphere, which create the initial transverse velocity perturbations. These pulses transfer energy non-linearly to the field aligned perturbations due to the ponderomotive force. This physical process further creates the magnetoacoustic shocks followed by quasi-periodic plasma motions in the solar atmosphere. The field aligned magnetoacoustic shocks move upward which subsequently cause quasi-periodic rise and fall of the chromospheric plasma into the overlying corona as a thin and cool spicule-like jets. The magnitude of the initial applied transverse velocity pulses are taken in the range of 50-90 km $s^{-1}$. These pulses are found to be strong enough to generate the spicule-like jets. We analyze the evolution, kinematics and energetics of these spicule-like jets. We find that the transported mass flux and kinetic energy density are substantial in the localized solar-corona. These mass motions generate $\it in$ $situ$ quasi-periodic oscillations on the scale of $\simeq$ 4.0 min above the transition region.
△ Less
Submitted 1 February, 2022;
originally announced February 2022.
-
Ansatz in a Nutshell: A comprehensive step-by-step guide to polynomial, $C$-finite, holonomic, and $C^2$-finite sequences
Authors:
Tipaluck Krityakierne,
Thotsaporn Aek Thanatipanonda
Abstract:
Given a sequence 1, 1, 5, 23, 135, 925, 7285, 64755, 641075, 6993545, 83339745,..., how can we guess a formula for it? This article will quickly walk you through the concept of ansatz for classes of polynomial, $C$-finite, holonomic, and the most recent addition $C^2$-finite sequences. For each of these classes, we discuss in detail various aspects of the guess and check, generating functions, clo…
▽ More
Given a sequence 1, 1, 5, 23, 135, 925, 7285, 64755, 641075, 6993545, 83339745,..., how can we guess a formula for it? This article will quickly walk you through the concept of ansatz for classes of polynomial, $C$-finite, holonomic, and the most recent addition $C^2$-finite sequences. For each of these classes, we discuss in detail various aspects of the guess and check, generating functions, closure properties, and closed-form solutions. Every theorem is presented with an accessible proof, followed by several examples intended to motivate the development of the theories. Each example is accompanied by a Maple program with the purpose of demonstrating use of the program in solving problems in this area. While this work aims to give a comprehensive review of existing ansatzes, we also systematically fill a research gap in the literature by providing theoretical and numerical results for the $C^2$-finite sequences. We hope the readers will enjoy the journey through our unifying framework for the study of ansatz.
△ Less
Submitted 22 January, 2022; v1 submitted 20 January, 2022;
originally announced January 2022.
-
QU-BraTS: MICCAI BraTS 2020 Challenge on Quantifying Uncertainty in Brain Tumor Segmentation - Analysis of Ranking Scores and Benchmarking Results
Authors:
Raghav Mehta,
Angelos Filos,
Ujjwal Baid,
Chiharu Sako,
Richard McKinley,
Michael Rebsamen,
Katrin Datwyler,
Raphael Meier,
Piotr Radojewski,
Gowtham Krishnan Murugesan,
Sahil Nalawade,
Chandan Ganesh,
Ben Wagner,
Fang F. Yu,
Baowei Fei,
Ananth J. Madhuranthakam,
Joseph A. Maldjian,
Laura Daza,
Catalina Gomez,
Pablo Arbelaez,
Chengliang Dai,
Shuo Wang,
Hadrien Reynaud,
Yuan-han Mo,
Elsa Angelini
, et al. (67 additional authors not shown)
Abstract:
Deep learning (DL) models have provided state-of-the-art performance in various medical imaging benchmarking challenges, including the Brain Tumor Segmentation (BraTS) challenges. However, the task of focal pathology multi-compartment segmentation (e.g., tumor and lesion sub-regions) is particularly challenging, and potential errors hinder translating DL models into clinical workflows. Quantifying…
▽ More
Deep learning (DL) models have provided state-of-the-art performance in various medical imaging benchmarking challenges, including the Brain Tumor Segmentation (BraTS) challenges. However, the task of focal pathology multi-compartment segmentation (e.g., tumor and lesion sub-regions) is particularly challenging, and potential errors hinder translating DL models into clinical workflows. Quantifying the reliability of DL model predictions in the form of uncertainties could enable clinical review of the most uncertain regions, thereby building trust and paving the way toward clinical translation. Several uncertainty estimation methods have recently been introduced for DL medical image segmentation tasks. Developing scores to evaluate and compare the performance of uncertainty measures will assist the end-user in making more informed decisions. In this study, we explore and evaluate a score developed during the BraTS 2019 and BraTS 2020 task on uncertainty quantification (QU-BraTS) and designed to assess and rank uncertainty estimates for brain tumor multi-compartment segmentation. This score (1) rewards uncertainty estimates that produce high confidence in correct assertions and those that assign low confidence levels at incorrect assertions, and (2) penalizes uncertainty measures that lead to a higher percentage of under-confident correct assertions. We further benchmark the segmentation uncertainties generated by 14 independent participating teams of QU-BraTS 2020, all of which also participated in the main BraTS segmentation task. Overall, our findings confirm the importance and complementary value that uncertainty estimates provide to segmentation algorithms, highlighting the need for uncertainty quantification in medical image analyses. Finally, in favor of transparency and reproducibility, our evaluation code is made publicly available at: https://github.com/RagMeh11/QU-BraTS.
△ Less
Submitted 23 August, 2022; v1 submitted 19 December, 2021;
originally announced December 2021.
-
Role of Non-Ideal Dissipation with Heating-Cooling Misbalance on the Phase Shifts of Standing Slow Magnetohydrodynamic Waves
Authors:
Abhinav Prasad,
A. K. Srivastava,
Tongjiang Wang,
Kartika Sangal
Abstract:
We analyse the phase shifts of standing, slow magnetohydrodynamic (MHD) waves in solar coronal loops using a linear MHD model taking into account the role of thermal conductivity, compressive viscosity, radiative losses, and heating-cooling misbalance. We estimate the phase shifts in time and space of density and temperature perturbations with respect to velocity perturbations and also calculate t…
▽ More
We analyse the phase shifts of standing, slow magnetohydrodynamic (MHD) waves in solar coronal loops using a linear MHD model taking into account the role of thermal conductivity, compressive viscosity, radiative losses, and heating-cooling misbalance. We estimate the phase shifts in time and space of density and temperature perturbations with respect to velocity perturbations and also calculate the phase difference between density and temperature perturbations. The overall significance of compressive viscosity is found to be negligible for most of the loops considered in the study. For loops with high background density and/or low background temperature, the role of radiative losses (with heating-cooling misbalance) is found to be more significant. Also the effect of heating-cooling misbalance with a temperature- and density-dependent heating function is found to be more significant in the case of longer loop lengths ($L=500$\, Mm). We derived a general expression for the polytropic index [$γ_{\rm eff}$] and found that under linear MHD the effect of compressive viscosity on polytropic index is negligible. The radiative losses with constant heating lead to a monotonic increase of $γ_{\rm eff}$ with increasing density whereas the consideration of an assumed heating function [$H(ρ,T) \propto ρ^{a}T^{b}$, where $a=-0.5$ and $b=-3$] makes the $γ_{\rm eff}$ peak at a certain loop density. We also explored the role of different heating functions by varying the free parameters $a$ and $b$ for a fixed loop of $ρ_0 = 10^{-11}$\, kg $\text{m}^{-3}$, $T_0 = 6.3$\, MK and loop length $L= 180$\, Mm. We find that the consideration of different heating functions [$H(ρ,T)$] leads to a significant variation in the phase difference between density and temperature perturbations; however, the polytropic index remains close to a value of 1.66.
△ Less
Submitted 9 December, 2021;
originally announced December 2021.
-
The Seventeenth Data Release of the Sloan Digital Sky Surveys: Complete Release of MaNGA, MaStar and APOGEE-2 Data
Authors:
Abdurro'uf,
Katherine Accetta,
Conny Aerts,
Victor Silva Aguirre,
Romina Ahumada,
Nikhil Ajgaonkar,
N. Filiz Ak,
Shadab Alam,
Carlos Allende Prieto,
Andres Almeida,
Friedrich Anders,
Scott F. Anderson,
Brett H. Andrews,
Borja Anguiano,
Erik Aquino-Ortiz,
Alfonso Aragon-Salamanca,
Maria Argudo-Fernandez,
Metin Ata,
Marie Aubert,
Vladimir Avila-Reese,
Carles Badenes,
Rodolfo H. Barba,
Kat Barger,
Jorge K. Barrera-Ballesteros,
Rachael L. Beaton
, et al. (316 additional authors not shown)
Abstract:
This paper documents the seventeenth data release (DR17) from the Sloan Digital Sky Surveys; the fifth and final release from the fourth phase (SDSS-IV). DR17 contains the complete release of the Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey, which reached its goal of surveying over 10,000 nearby galaxies. The complete release of the MaNGA Stellar Library (MaStar) accompanies…
▽ More
This paper documents the seventeenth data release (DR17) from the Sloan Digital Sky Surveys; the fifth and final release from the fourth phase (SDSS-IV). DR17 contains the complete release of the Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey, which reached its goal of surveying over 10,000 nearby galaxies. The complete release of the MaNGA Stellar Library (MaStar) accompanies this data, providing observations of almost 30,000 stars through the MaNGA instrument during bright time. DR17 also contains the complete release of the Apache Point Observatory Galactic Evolution Experiment 2 (APOGEE-2) survey which publicly releases infra-red spectra of over 650,000 stars. The main sample from the Extended Baryon Oscillation Spectroscopic Survey (eBOSS), as well as the sub-survey Time Domain Spectroscopic Survey (TDSS) data were fully released in DR16. New single-fiber optical spectroscopy released in DR17 is from the SPectroscipic IDentification of ERosita Survey (SPIDERS) sub-survey and the eBOSS-RM program. Along with the primary data sets, DR17 includes 25 new or updated Value Added Catalogs (VACs). This paper concludes the release of SDSS-IV survey data. SDSS continues into its fifth phase with observations already underway for the Milky Way Mapper (MWM), Local Volume Mapper (LVM) and Black Hole Mapper (BHM) surveys.
△ Less
Submitted 13 January, 2022; v1 submitted 3 December, 2021;
originally announced December 2021.
-
FashionSearchNet-v2: Learning Attribute Representations with Localization for Image Retrieval with Attribute Manipulation
Authors:
Kenan E. Ak,
Joo Hwee Lim,
Ying Sun,
Jo Yew Tham,
Ashraf A. Kassim
Abstract:
The focus of this paper is on the problem of image retrieval with attribute manipulation. Our proposed work is able to manipulate the desired attributes of the query image while maintaining its other attributes. For example, the collar attribute of the query image can be changed from round to v-neck to retrieve similar images from a large dataset. A key challenge in e-commerce is that images have…
▽ More
The focus of this paper is on the problem of image retrieval with attribute manipulation. Our proposed work is able to manipulate the desired attributes of the query image while maintaining its other attributes. For example, the collar attribute of the query image can be changed from round to v-neck to retrieve similar images from a large dataset. A key challenge in e-commerce is that images have multiple attributes where users would like to manipulate and it is important to estimate discriminative feature representations for each of these attributes. The proposed FashionSearchNet-v2 architecture is able to learn attribute specific representations by leveraging on its weakly-supervised localization module, which ignores the unrelated features of attributes in the feature space, thus improving the similarity learning. The network is jointly trained with the combination of attribute classification and triplet ranking loss to estimate local representations. These local representations are then merged into a single global representation based on the instructed attribute manipulation where desired images can be retrieved with a distance metric. The proposed method also provides explainability for its retrieval process to help provide additional information on the attention of the network. Experiments performed on several datasets that are rich in terms of the number of attributes show that FashionSearchNet-v2 outperforms the other state-of-the-art attribute manipulation techniques. Different than our earlier work (FashionSearchNet), we propose several improvements in the learning procedure and show that the proposed FashionSearchNet-v2 can be generalized to different domains other than fashion.
△ Less
Submitted 28 November, 2021;
originally announced November 2021.
-
The Card Guessing Game: A generating function approach
Authors:
Tipaluck Krityakierne,
Thotsaporn Aek Thanatipanonda
Abstract:
Consider a card guessing game with complete feedback in which a deck of $n$ cards ordered $1,\dots, n$ is riffle-shuffled once. With the goal to maximize the number of correct guesses, a player guesses cards from the top of the deck one at a time under the optimal strategy until no cards remain. We provide an expression for the expected number of correct guesses with arbitrary number of terms, an…
▽ More
Consider a card guessing game with complete feedback in which a deck of $n$ cards ordered $1,\dots, n$ is riffle-shuffled once. With the goal to maximize the number of correct guesses, a player guesses cards from the top of the deck one at a time under the optimal strategy until no cards remain. We provide an expression for the expected number of correct guesses with arbitrary number of terms, an accuracy improvement over the results of Liu (2021). In addition, using generating functions, we give a unified framework for systematically calculating higher-order moments. Although the extension of the framework to $k\geq2$ shuffles is not immediately straightforward, we are able to settle a long-standing McGrath's conjectured optimal strategy described in Bayer and Diaconis (1992) by showing that the optimal guessing strategy for $k=1$ riffle shuffle does not necessarily apply to $k\geq2$ shuffles.
△ Less
Submitted 21 July, 2022; v1 submitted 23 July, 2021;
originally announced July 2021.
-
The prominence driven forced reconnection in the solar corona and associated plasma dynamics
Authors:
A. K. Srivastava,
Sudheer K. Mishra,
P. Jelínek
Abstract:
Using the multi-temperature observations from SDO/AIA on 30th December 2019, we provide a signature of prominence driven forced magnetic reconnection in the corona and associated plasma dynamics during 09:20 UT to 10:38 UT. A hot prominence segment erupts with a speed of 21 km/s and destabilises the entire prominence system. Thereafter, it rose upward in the north during 09:28 UT to 09:48 UT with…
▽ More
Using the multi-temperature observations from SDO/AIA on 30th December 2019, we provide a signature of prominence driven forced magnetic reconnection in the corona and associated plasma dynamics during 09:20 UT to 10:38 UT. A hot prominence segment erupts with a speed of 21 km/s and destabilises the entire prominence system. Thereafter, it rose upward in the north during 09:28 UT to 09:48 UT with a speed of 24 km/s. The eruptive prominence stretches overlying field lines upward with the speed of 27-28 km/s , which further undergo into the forced reconnection. The coronal plasma also flows in southward direction with the speed of 7 km/s, and both these inflows trigger the reconnection at 09:48 UT. Thereafter, the east and westward magnetic channels are developed and separated. The east-west reorganization of the magnetic fields starts creating bi-directional plasma outflows towards the limb with their respective speed of 28 km/s and 37 km/s. Their upper ends are diffused in the overlying corona, transporting another set of upflows with the speed of 22 km/s and 19 km/s. The multi-temperature plasma (Te=6.0-7.2) evolves and elongated upto a length of ~10^5 km on the reorganized fields. The hot plasma and remaining prominence threads move from reconnection region towards another segment of prominence in the eastward direction. The prominence-prominence/loop interaction and associated reconnection generate jet-like eruptions with the speed of 178-183 km/s. After the formation of jet, the overlying magnetic channel is disappeared in the corona.
△ Less
Submitted 14 July, 2021;
originally announced July 2021.
-
Transformers with multi-modal features and post-fusion context for e-commerce session-based recommendation
Authors:
Gabriel de Souza P. Moreira,
Sara Rabhi,
Ronay Ak,
Md Yasin Kabir,
Even Oldridge
Abstract:
Session-based recommendation is an important task for e-commerce services, where a large number of users browse anonymously or may have very distinct interests for different sessions. In this paper we present one of the winning solutions for the Recommendation task of the SIGIR 2021 Workshop on E-commerce Data Challenge. Our solution was inspired by NLP techniques and consists of an ensemble of tw…
▽ More
Session-based recommendation is an important task for e-commerce services, where a large number of users browse anonymously or may have very distinct interests for different sessions. In this paper we present one of the winning solutions for the Recommendation task of the SIGIR 2021 Workshop on E-commerce Data Challenge. Our solution was inspired by NLP techniques and consists of an ensemble of two Transformer architectures - Transformer-XL and XLNet - trained with autoregressive and autoencoding approaches. To leverage most of the rich dataset made available for the competition, we describe how we prepared multi-model features by combining tabular events with textual and image vectors. We also present a model prediction analysis to better understand the effectiveness of our architectures for the session-based recommendation.
△ Less
Submitted 11 July, 2021;
originally announced July 2021.
-
A study of the Czernik 2 and NGC 7654 open clusters using CCD UBV photometric and Gaia EDR3 data
Authors:
B. Akbulut,
S. Ak,
T. Yontan,
S. Bilir,
T. Ak,
T. Banks,
E. Kaan Ulgen,
E. Paunzen
Abstract:
We analysed the open clusters Czernik 2 and NGC 7654 using CCD UBV photometric and Gaia Early Data Release 3 (EDR3) photometric and astrometric data. Structural parameters of the two clusters were derived, including the physical sizes of Czernik 2 being r=5 and NGC 7654 as 8 min. We calculated membership probabilities of stars based on their proper motion components as released in the Gaia EDR3. T…
▽ More
We analysed the open clusters Czernik 2 and NGC 7654 using CCD UBV photometric and Gaia Early Data Release 3 (EDR3) photometric and astrometric data. Structural parameters of the two clusters were derived, including the physical sizes of Czernik 2 being r=5 and NGC 7654 as 8 min. We calculated membership probabilities of stars based on their proper motion components as released in the Gaia EDR3. To identify member stars of the clusters, we used these membership probabilities taking into account location and the impact of binarity on main-sequence stars. We used membership probabilities higher than $P=0.5$ to identify 28 member stars for Czernik 2 and 369 for NGC 7654. We estimated colour-excesses and metallicities separately using two-colour diagrams to derive homogeneously determined parameters. The derived $E(B-V)$ colour excess is 0.46(0.02) mag for Czernik 2 and 0.57(0.04) mag for NGC 7654. Metallicities were obtained for the first time for both clusters, -0.08(0.02) dex for Czernik 2 and -0.05(0.01) dex for NGC 7654. Keeping the reddening and metallicity as constant quantities, we fitted PARSEC models using colour-magnitude diagrams, resulting in estimated distance moduli and ages of the two clusters. We obtained the distance modulus for Czernik 2 as 12.80(0.07) mag and for NGC 7654 as 13.20(0.16) mag, which coincide with ages of 1.2(0.2) Gyr and 120(20) Myr, respectively. The distances to the clusters were calculated using the Gaia EDR3 trigonometric parallaxes and compared with the literature. We found good agreement between the distances obtained in this study and the literature. Present day mass function slopes for both clusters are comparable with the value of Salpeter (1955), being X=-1.37(0.24) for Czernik 2 and X=-1.39(0.19) for NGC 7654.
△ Less
Submitted 7 July, 2021;
originally announced July 2021.