-
pathfinder: A Semantic Framework for Literature Review and Knowledge Discovery in Astronomy
Authors:
Kartheik G. Iyer,
Mikaeel Yunus,
Charles O'Neill,
Christine Ye,
Alina Hyk,
Kiera McCormick,
Ioana Ciuca,
John F. Wu,
Alberto Accomazzi,
Simone Astarita,
Rishabh Chakrabarty,
Jesse Cranney,
Anjalie Field,
Tirthankar Ghosal,
Michele Ginolfi,
Marc Huertas-Company,
Maja Jablonska,
Sandor Kruk,
Huiling Liu,
Gabriel Marchidan,
Rohit Mistry,
J. P. Naiman,
J. E. G. Peek,
Mugdha Polimera,
Sergio J. Rodriguez
, et al. (5 additional authors not shown)
Abstract:
The exponential growth of astronomical literature poses significant challenges for researchers navigating and synthesizing general insights or even domain-specific knowledge. We present Pathfinder, a machine learning framework designed to enable literature review and knowledge discovery in astronomy, focusing on semantic searching with natural language instead of syntactic searches with keywords.…
▽ More
The exponential growth of astronomical literature poses significant challenges for researchers navigating and synthesizing general insights or even domain-specific knowledge. We present Pathfinder, a machine learning framework designed to enable literature review and knowledge discovery in astronomy, focusing on semantic searching with natural language instead of syntactic searches with keywords. Utilizing state-of-the-art large language models (LLMs) and a corpus of 350,000 peer-reviewed papers from the Astrophysics Data System (ADS), Pathfinder offers an innovative approach to scientific inquiry and literature exploration. Our framework couples advanced retrieval techniques with LLM-based synthesis to search astronomical literature by semantic context as a complement to currently existing methods that use keywords or citation graphs. It addresses complexities of jargon, named entities, and temporal aspects through time-based and citation-based weighting schemes. We demonstrate the tool's versatility through case studies, showcasing its application in various research scenarios. The system's performance is evaluated using custom benchmarks, including single-paper and multi-paper tasks. Beyond literature review, Pathfinder offers unique capabilities for reformatting answers in ways that are accessible to various audiences (e.g. in a different language or as simplified text), visualizing research landscapes, and tracking the impact of observatories and methodologies. This tool represents a significant advancement in applying AI to astronomical research, aiding researchers at all career stages in navigating modern astronomy literature.
△ Less
Submitted 2 August, 2024;
originally announced August 2024.
-
Study of In-plane and Interlayer Interactions During Aluminum Fluoride Intercalation in Graphite: Implications for the Development of Rechargeable Batteries
Authors:
Sindy J. Rodríguez,
Adriana E. Candia,
Igor Stanković,
Mario C. G. Passeggi,
Gustavo D. Ruano
Abstract:
The electrolyte intercalation mechanism facilitates the insertion/extraction of charge into the electrode material in rechargeable batteries. Aluminum fluoride (AlF$_{3}$) has been used as an electrolyte in rechargeable aluminum batteries with graphite electrodes, demonstrating improved reversibility of battery charging and discharging processes; however, the intercalation mechanism of this neutra…
▽ More
The electrolyte intercalation mechanism facilitates the insertion/extraction of charge into the electrode material in rechargeable batteries. Aluminum fluoride (AlF$_{3}$) has been used as an electrolyte in rechargeable aluminum batteries with graphite electrodes, demonstrating improved reversibility of battery charging and discharging processes; however, the intercalation mechanism of this neutral molecule in graphite is so far unknown. In this work, we combine scanning tunneling microscopy (STM) in ultra-high vacuum conditions, calculations based on density functional theory, and large-scale molecular dynamics simulations to reveal the mechanism of AlF$_{3}$ intercalation in highly oriented pyrolytic graphite (HOPG). We report the formation of AlF$_{3}$ molecules clusters between graphite layers, their self-assembly by graphene buckling-mediated interactions, and explain the origin and distribution of superficial {\it blisters} in the material. Our findings have implications for understanding the relationship between the mobility and clustering of molecules and the expansion of the anode material. This, in turn, paves the way for future enhancements in the performance of energy storage systems.
△ Less
Submitted 31 August, 2023; v1 submitted 17 June, 2023;
originally announced June 2023.
-
Efficient Deep Learning Architectures for Fast Identification of Bacterial Strains in Resource-Constrained Devices
Authors:
R. Gallardo García,
S. Jarquín Rodríguez,
B. Beltrán Martínez,
C. Hernández Gracidas,
R. Martínez Torres
Abstract:
This work presents twelve fine-tuned deep learning architectures to solve the bacterial classification problem over the Digital Image of Bacterial Species Dataset. The base architectures were mainly published as mobile or efficient solutions to the ImageNet challenge, and all experiments presented in this work consisted of making several modifications to the original designs, in order to make them…
▽ More
This work presents twelve fine-tuned deep learning architectures to solve the bacterial classification problem over the Digital Image of Bacterial Species Dataset. The base architectures were mainly published as mobile or efficient solutions to the ImageNet challenge, and all experiments presented in this work consisted of making several modifications to the original designs, in order to make them able to solve the bacterial classification problem by using fine-tuning and transfer learning techniques. This work also proposes a novel data augmentation technique for this dataset, which is based on the idea of artificial zooming, strongly increasing the performance of every tested architecture, even doubling it in some cases. In order to get robust and complete evaluations, all experiments were performed with 10-fold cross-validation and evaluated with five different metrics: top-1 and top-5 accuracy, precision, recall, and F1 score. This paper presents a complete comparison of the twelve different architectures, cross-validated with the original and the augmented version of the dataset, the results are also compared with several literature methods. Overall, eight of the eleven architectures surpassed the 0.95 scores in top-1 accuracy with our data augmentation method, being 0.9738 the highest top-1 accuracy. The impact of the data augmentation technique is reported with relative improvement scores.
△ Less
Submitted 11 June, 2021;
originally announced June 2021.