-
EuroLLM-9B: Technical Report
Authors:
Pedro Henrique Martins,
João Alves,
Patrick Fernandes,
Nuno M. Guerreiro,
Ricardo Rei,
Amin Farajian,
Mateusz Klimaszewski,
Duarte M. Alves,
José Pombal,
Nicolas Boizard,
Manuel Faysse,
Pierre Colombo,
François Yvon,
Barry Haddow,
José G. C. de Souza,
Alexandra Birch,
André F. T. Martins
Abstract:
This report presents EuroLLM-9B, a large language model trained from scratch to support the needs of European citizens by covering all 24 official European Union languages and 11 additional languages. EuroLLM addresses the issue of European languages being underrepresented and underserved in existing open large language models. We provide a comprehensive overview of EuroLLM-9B's development, inclu…
▽ More
This report presents EuroLLM-9B, a large language model trained from scratch to support the needs of European citizens by covering all 24 official European Union languages and 11 additional languages. EuroLLM addresses the issue of European languages being underrepresented and underserved in existing open large language models. We provide a comprehensive overview of EuroLLM-9B's development, including tokenizer design, architectural specifications, data filtering, and training procedures. We describe the pre-training data collection and filtering pipeline, including the creation of EuroFilter, an AI-based multilingual filter, as well as the design of EuroBlocks-Synthetic, a novel synthetic dataset for post-training that enhances language coverage for European languages. Evaluation results demonstrate EuroLLM-9B's competitive performance on multilingual benchmarks and machine translation tasks, establishing it as the leading open European-made LLM of its size. To support open research and adoption, we release all major components of this work, including the base and instruction-tuned models, the EuroFilter classifier, and the synthetic post-training dataset.
△ Less
Submitted 16 June, 2025; v1 submitted 4 June, 2025;
originally announced June 2025.
-
CMS RPC Non-Physics Event Data Automation Ideology
Authors:
A. Dimitrov,
M. Tytgat,
K. Mota Amarilo,
A. Samalan,
K. Skovpen,
G. A. Alves,
E. Alves Coelho,
F. Marujo da Silva,
M. Barroso Ferreira Filho,
E. M. Da Costa,
D. De Jesus Damiao,
S. Fonseca De Souza,
R. Gomes De Souza,
L. Mundim,
H. Nogima,
J. P. Pinheiro,
A. Santoro,
M. Thiel,
A. Aleksandrov,
R. Hadjiiska,
P. Iaydjiev,
M. Shopova,
G. Sultanov,
L. Litov,
B. Pavlov
, et al. (79 additional authors not shown)
Abstract:
This paper presents a streamlined framework for real-time processing and analysis of condition data from the CMS experiment Resistive Plate Chambers (RPC). Leveraging data streaming, it uncovers correlations between RPC performance metrics, like currents and rates, and LHC luminosity or environmental conditions. The Java-based framework automates data handling and predictive modeling, integrating…
▽ More
This paper presents a streamlined framework for real-time processing and analysis of condition data from the CMS experiment Resistive Plate Chambers (RPC). Leveraging data streaming, it uncovers correlations between RPC performance metrics, like currents and rates, and LHC luminosity or environmental conditions. The Java-based framework automates data handling and predictive modeling, integrating extensive datasets into synchronized, query-optimized tables. By segmenting LHC operations and analyzing larger virtual detector objects, the automation enhances monitoring precision, accelerates visualization, and provides predictive insights, revolutionizing RPC performance evaluation and future behavior modeling.
△ Less
Submitted 11 April, 2025;
originally announced April 2025.
-
Relativistic Lévy processes
Authors:
Lucas G. B. de Souza,
M. G. E. da Luz,
E. P. Raposo,
Evaldo M. F. Curado,
G. M. Viswanathan
Abstract:
In this contribution, we investigate how to correctly describe sums of independent and identically distributed random velocities in the theory of special relativity. We derive a one-dimensional probability distribution of velocities stable under relativistic velocity addition. In a given system, this allows identifying distinct physical regimes in terms of the distribution's concavity at the origi…
▽ More
In this contribution, we investigate how to correctly describe sums of independent and identically distributed random velocities in the theory of special relativity. We derive a one-dimensional probability distribution of velocities stable under relativistic velocity addition. In a given system, this allows identifying distinct physical regimes in terms of the distribution's concavity at the origin and the probability of measuring relativistic velocities. These features provide a protocol to assess the relevance of stochastic relativistic effects in actual experiments. As examples, we find agreement with previous results about heavy-ion diffusion and show that our findings are consistent with the distribution of momentum deviations observed in measurements of antiproton cooling.
△ Less
Submitted 24 December, 2024;
originally announced December 2024.
-
Findings of the WMT 2024 Shared Task on Chat Translation
Authors:
Wafaa Mohammed,
Sweta Agrawal,
M. Amin Farajian,
Vera Cabarrão,
Bryan Eikema,
Ana C. Farinha,
José G. C. de Souza
Abstract:
This paper presents the findings from the third edition of the Chat Translation Shared Task. As with previous editions, the task involved translating bilingual customer support conversations, specifically focusing on the impact of conversation context in translation quality and evaluation. We also include two new language pairs: English-Korean and English-Dutch, in addition to the set of language…
▽ More
This paper presents the findings from the third edition of the Chat Translation Shared Task. As with previous editions, the task involved translating bilingual customer support conversations, specifically focusing on the impact of conversation context in translation quality and evaluation. We also include two new language pairs: English-Korean and English-Dutch, in addition to the set of language pairs from previous editions: English-German, English-French, and English-Brazilian Portuguese. We received 22 primary submissions and 32 contrastive submissions from eight teams, with each language pair having participation from at least three teams. We evaluated the systems comprehensively using both automatic metrics and human judgments via a direct assessment framework. The official rankings for each language pair were determined based on human evaluation scores, considering performance in both translation directions--agent and customer. Our analysis shows that while the systems excelled at translating individual turns, there is room for improvement in overall conversation-level translation quality.
△ Less
Submitted 15 October, 2024;
originally announced October 2024.
-
Modeling User Preferences with Automatic Metrics: Creating a High-Quality Preference Dataset for Machine Translation
Authors:
Sweta Agrawal,
José G. C. de Souza,
Ricardo Rei,
António Farinhas,
Gonçalo Faria,
Patrick Fernandes,
Nuno M Guerreiro,
Andre Martins
Abstract:
Alignment with human preferences is an important step in developing accurate and safe large language models. This is no exception in machine translation (MT), where better handling of language nuances and context-specific variations leads to improved quality. However, preference data based on human feedback can be very expensive to obtain and curate at a large scale. Automatic metrics, on the othe…
▽ More
Alignment with human preferences is an important step in developing accurate and safe large language models. This is no exception in machine translation (MT), where better handling of language nuances and context-specific variations leads to improved quality. However, preference data based on human feedback can be very expensive to obtain and curate at a large scale. Automatic metrics, on the other hand, can induce preferences, but they might not match human expectations perfectly. In this paper, we propose an approach that leverages the best of both worlds. We first collect sentence-level quality assessments from professional linguists on translations generated by multiple high-quality MT systems and evaluate the ability of current automatic metrics to recover these preferences. We then use this analysis to curate a new dataset, MT-Pref (metric induced translation preference) dataset, which comprises 18k instances covering 18 language directions, using texts sourced from multiple domains post-2022. We show that aligning TOWER models on MT-Pref significantly improves translation quality on WMT23 and FLORES benchmarks.
△ Less
Submitted 10 October, 2024;
originally announced October 2024.
-
EuroLLM: Multilingual Language Models for Europe
Authors:
Pedro Henrique Martins,
Patrick Fernandes,
João Alves,
Nuno M. Guerreiro,
Ricardo Rei,
Duarte M. Alves,
José Pombal,
Amin Farajian,
Manuel Faysse,
Mateusz Klimaszewski,
Pierre Colombo,
Barry Haddow,
José G. C. de Souza,
Alexandra Birch,
André F. T. Martins
Abstract:
The quality of open-weight LLMs has seen significant improvement, yet they remain predominantly focused on English. In this paper, we introduce the EuroLLM project, aimed at developing a suite of open-weight multilingual LLMs capable of understanding and generating text in all official European Union languages, as well as several additional relevant languages. We outline the progress made to date,…
▽ More
The quality of open-weight LLMs has seen significant improvement, yet they remain predominantly focused on English. In this paper, we introduce the EuroLLM project, aimed at developing a suite of open-weight multilingual LLMs capable of understanding and generating text in all official European Union languages, as well as several additional relevant languages. We outline the progress made to date, detailing our data collection and filtering process, the development of scaling laws, the creation of our multilingual tokenizer, and the data mix and modeling configurations. Additionally, we release our initial models: EuroLLM-1.7B and EuroLLM-1.7B-Instruct and report their performance on multilingual general benchmarks and machine translation.
△ Less
Submitted 24 September, 2024;
originally announced September 2024.
-
Exploring Foundation Models for Synthetic Medical Imaging: A Study on Chest X-Rays and Fine-Tuning Techniques
Authors:
Davide Clode da Silva,
Marina Musse Bernardes,
Nathalia Giacomini Ceretta,
Gabriel Vaz de Souza,
Gabriel Fonseca Silva,
Rafael Heitor Bordini,
Soraia Raupp Musse
Abstract:
Machine learning has significantly advanced healthcare by aiding in disease prevention and treatment identification. However, accessing patient data can be challenging due to privacy concerns and strict regulations. Generating synthetic, realistic data offers a potential solution for overcoming these limitations, and recent studies suggest that fine-tuning foundation models can produce such data e…
▽ More
Machine learning has significantly advanced healthcare by aiding in disease prevention and treatment identification. However, accessing patient data can be challenging due to privacy concerns and strict regulations. Generating synthetic, realistic data offers a potential solution for overcoming these limitations, and recent studies suggest that fine-tuning foundation models can produce such data effectively. In this study, we explore the potential of foundation models for generating realistic medical images, particularly chest x-rays, and assess how their performance improves with fine-tuning. We propose using a Latent Diffusion Model, starting with a pre-trained foundation model and refining it through various configurations. Additionally, we performed experiments with input from a medical professional to assess the realism of the images produced by each trained model.
△ Less
Submitted 6 September, 2024;
originally announced September 2024.
-
The Fourth S-PLUS Data Release: 12-filter photometry covering $\sim3000$ square degrees in the southern hemisphere
Authors:
Fabio R. Herpich,
Felipe Almeida-Fernandes,
Gustavo B. Oliveira Schwarz,
Erik V. R. Lima,
Lilianne Nakazono,
Javier Alonso-García,
Marcos A. Fonseca-Faria,
Marilia J. Sartori,
Guilherme F. Bolutavicius,
Gabriel Fabiano de Souza,
Eduardo A. Hartmann,
Liana Li,
Luna Espinosa,
Antonio Kanaan,
William Schoenell,
Ariel Werle,
Eduardo Machado-Pereira,
Luis A. Gutiérrez-Soto,
Thaís Santos-Silva,
Analia V. Smith Castelli,
Eduardo A. D. Lacerda,
Cassio L. Barbosa,
Hélio D. Perottoni,
Carlos E. Ferreira Lopes,
Raquel Ruiz Valença
, et al. (46 additional authors not shown)
Abstract:
The Southern Photometric Local Universe Survey (S-PLUS) is a project to map $\sim9300$ sq deg of the sky using twelve bands (seven narrow and five broadbands). Observations are performed with the T80-South telescope, a robotic telescope located at the Cerro Tololo Observatory in Chile. The survey footprint consists of several large contiguous areas, including fields at high and low galactic latitu…
▽ More
The Southern Photometric Local Universe Survey (S-PLUS) is a project to map $\sim9300$ sq deg of the sky using twelve bands (seven narrow and five broadbands). Observations are performed with the T80-South telescope, a robotic telescope located at the Cerro Tololo Observatory in Chile. The survey footprint consists of several large contiguous areas, including fields at high and low galactic latitudes, and towards the Magellanic Clouds. S-PLUS uses fixed exposure times to reach point source depths of about $21$ mag in the $griz$ and $20$ mag in the $u$ and the narrow filters. This paper describes the S-PLUS Data Release 4 (DR4), which includes calibrated images and derived catalogues for over 3000 sq deg, covering the aforementioned area. The catalogues provide multi-band photometry performed with the tools \texttt{DoPHOT} and \texttt{SExtractor} -- point spread function (\PSF) and aperture photometry, respectively. In addition to the characterization, we also present the scientific potential of the data. We use statistical tools to present and compare the photometry obtained through different methods. Overall we find good agreement between the different methods, with a slight systematic offset of 0.05\,mag between our \PSF and aperture photometry. We show that the astrometry accuracy is equivalent to that obtained in previous S-PLUS data releases, even in very crowded fields where photometric extraction is challenging. The depths of main survey (MS) photometry for a minimum signal-to-noise ratio $S/N = 3$ reach from $\sim19.5$ for the bluer bands to $\sim21.5$ mag on the red. The range of magnitudes over which accurate \PSF photometry is obtained is shallower, reaching $\sim19$ to $\sim20.5$ mag depending on the filter. Based on these photometric data, we provide star-galaxy-quasar classification and photometric redshift for millions of objects.
△ Less
Submitted 30 July, 2024;
originally announced July 2024.
-
Low latency carbon budget analysis reveals a large decline of the land carbon sink in 2023
Authors:
Piyu Ke,
Philippe Ciais,
Stephen Sitch,
Wei Li,
Ana Bastos,
Zhu Liu,
Yidi Xu,
Xiaofan Gui,
Jiang Bian,
Daniel S Goll,
Yi Xi,
Wanjing Li,
Michael O'Sullivan,
Jeffeson Goncalves de Souza,
Pierre Friedlingstein,
Frederic Chevallier
Abstract:
In 2023, the CO2 growth rate was 3.37 +/- 0.11 ppm at Mauna Loa, 86% above the previous year, and hitting a record high since observations began in 1958, while global fossil fuel CO2 emissions only increased by 0.6 +/- 0.5%. This implies an unprecedented weakening of land and ocean sinks, and raises the question of where and why this reduction happened. Here we show a global net land CO2 sink of 0…
▽ More
In 2023, the CO2 growth rate was 3.37 +/- 0.11 ppm at Mauna Loa, 86% above the previous year, and hitting a record high since observations began in 1958, while global fossil fuel CO2 emissions only increased by 0.6 +/- 0.5%. This implies an unprecedented weakening of land and ocean sinks, and raises the question of where and why this reduction happened. Here we show a global net land CO2 sink of 0.44 +/- 0.21 GtC yr-1, the weakest since 2003. We used dynamic global vegetation models, satellites fire emissions, an atmospheric inversion based on OCO-2 measurements, and emulators of ocean biogeochemical and data driven models to deliver a fast-track carbon budget in 2023. Those models ensured consistency with previous carbon budgets. Regional flux anomalies from 2015-2022 are consistent between top-down and bottom-up approaches, with the largest abnormal carbon loss in the Amazon during the drought in the second half of 2023 (0.31 +/- 0.19 GtC yr-1), extreme fire emissions of 0.58 +/- 0.10 GtC yr-1 in Canada and a loss in South-East Asia (0.13 +/- 0.12 GtC yr-1). Since 2015, land CO2 uptake north of 20 degree N declined by half to 1.13 +/- 0.24 GtC yr-1 in 2023. Meanwhile, the tropics recovered from the 2015-16 El Nino carbon loss, gained carbon during the La Nina years (2020-2023), then switched to a carbon loss during the 2023 El Nino (0.56 +/- 0.23 GtC yr-1). The ocean sink was stronger than normal in the equatorial eastern Pacific due to reduced upwelling from La Nina's retreat in early 2023 and the development of El Nino later. Land regions exposed to extreme heat in 2023 contributed a gross carbon loss of 1.73 GtC yr-1, indicating that record warming in 2023 had a strong negative impact on the capacity of terrestrial ecosystems to mitigate climate change.
△ Less
Submitted 17 July, 2024;
originally announced July 2024.
-
First results on monolithic CMOS detector with internal gain
Authors:
U. Follo,
G. Gioachin,
C. Ferrero,
M. Mandurrino,
M. Bregant,
S. Bufalino,
F. Carnesecchi,
D. Cavazza,
M. Colocci,
T. Corradino,
M. Da Rocha Rolo,
G. Di Nicolantonio,
S. Durando,
G. Margutti,
M. Mignone,
R. Nania,
L. Pancheri,
A. Rivetti,
B. Sabiu,
G. G. A. de Souza,
S. Strazzi,
R. Wheadon
Abstract:
In this paper we report on a set of characterisations carried out on the first monolithic LGAD prototype integrated in a customised 110 nm CMOS process having a depleted active volume thickness of 48 $μ$m. This prototype is formed by a pixel array where each pixel has a total size of 100 $μ$m $\times$ 250 $μ$m and includes a high-speed front-end amplifier. After describing the sensor and the elect…
▽ More
In this paper we report on a set of characterisations carried out on the first monolithic LGAD prototype integrated in a customised 110 nm CMOS process having a depleted active volume thickness of 48 $μ$m. This prototype is formed by a pixel array where each pixel has a total size of 100 $μ$m $\times$ 250 $μ$m and includes a high-speed front-end amplifier. After describing the sensor and the electronics architecture, both laboratory and in-beam measurements are reported and described. Optical characterisations performed with an IR pulsed laser setup have shown a sensor internal gain of about 2.5. With the same experimental setup, the electronic jitter was found to be between 50 ps and 150 ps, depending on the signal amplitude. Moreover, the analysis of a test beam performed at the Proton Synchrotron (PS) T10 facility of CERN with 10 GeV/c protons and pions indicated that the overall detector time resolution is in the range of 234 ps to 244 ps. Further TCAD investigations, based on the doping profile extracted from $C(V)$ measurements, confirmed the multiplication gain measured on the test devices. Finally, TCAD simulations were used to tune the future doping concentration of the gain layer implant, targeting sensors with a higher avalanche gain. This adjustment is expected to enhance the timing performance of the sensors of the future productions, in order to cope with the high event rate expected in most of the near future high-energy and high-luminosity physics experiments, where the time resolution will be essential to disentangle overlapping events and it will also be crucial for Particle IDentification (PID).
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine Translation
Authors:
Gonçalo R. A. Faria,
Sweta Agrawal,
António Farinhas,
Ricardo Rei,
José G. C. de Souza,
André F. T. Martins
Abstract:
An important challenge in machine translation (MT) is to generate high-quality and diverse translations. Prior work has shown that the estimated likelihood from the MT model correlates poorly with translation quality. In contrast, quality evaluation metrics (such as COMET or BLEURT) exhibit high correlations with human judgments, which has motivated their use as rerankers (such as quality-aware an…
▽ More
An important challenge in machine translation (MT) is to generate high-quality and diverse translations. Prior work has shown that the estimated likelihood from the MT model correlates poorly with translation quality. In contrast, quality evaluation metrics (such as COMET or BLEURT) exhibit high correlations with human judgments, which has motivated their use as rerankers (such as quality-aware and minimum Bayes risk decoding). However, relying on a single translation with high estimated quality increases the chances of "gaming the metric''. In this paper, we address the problem of sampling a set of high-quality and diverse translations. We provide a simple and effective way to avoid over-reliance on noisy quality estimates by using them as the energy function of a Gibbs distribution. Instead of looking for a mode in the distribution, we generate multiple samples from high-density areas through the Metropolis-Hastings algorithm, a simple Markov chain Monte Carlo approach. The results show that our proposed method leads to high-quality and diverse outputs across multiple language pairs (English$\leftrightarrow${German, Russian}) with two strong decoder-only LLMs (Alma-7b, Tower-7b).
△ Less
Submitted 15 October, 2024; v1 submitted 28 May, 2024;
originally announced June 2024.
-
High-order parallel-in-time method for the monodomain equation in cardiac electrophysiology
Authors:
Giacomo Rosilho de Souza,
Simone Pezzuto,
Rolf Krause
Abstract:
Simulation of the monodomain equation, crucial for modeling the heart's electrical activity, faces scalability limits when traditional numerical methods only parallelize in space. To optimize the use of large multi-processor computers by distributing the computational load more effectively, time parallelization is essential. We introduce a high-order parallel-in-time method addressing the substant…
▽ More
Simulation of the monodomain equation, crucial for modeling the heart's electrical activity, faces scalability limits when traditional numerical methods only parallelize in space. To optimize the use of large multi-processor computers by distributing the computational load more effectively, time parallelization is essential. We introduce a high-order parallel-in-time method addressing the substantial computational challenges posed by the stiff, multiscale, and nonlinear nature of cardiac dynamics. Our method combines the semi-implicit and exponential spectral deferred correction methods, yielding a hybrid method that is extended to parallel-in-time employing the PFASST framework. We thoroughly evaluate the stability, accuracy, and robustness of the proposed parallel-in-time method through extensive numerical experiments, using practical ionic models such as the ten-Tusscher-Panfilov. The results underscore the method's potential to significantly enhance real-time and high-fidelity simulations in biomedical research and clinical applications.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Optimization of GEM detectors for applications in X-ray fluorescence imaging
Authors:
Geovane G. A. de Souza,
Hugo Natal da Luz,
Marco Bregant
Abstract:
In this work a set of simulations that aim at the optimization of gaseous detectors for applications in X-ray fluorescence imaging in the energy range of 3 -- 30keV is presented. By studying the statistical distribution of the radiation interactions with gases, the energy resolution limits after charge multiplication for 6keV X-ray photons in Ar/CO$_2$(70/30) and Kr/CO$_2$(90/10) were calculated,…
▽ More
In this work a set of simulations that aim at the optimization of gaseous detectors for applications in X-ray fluorescence imaging in the energy range of 3 -- 30keV is presented. By studying the statistical distribution of the radiation interactions with gases, the energy resolution limits after charge multiplication for 6keV X-ray photons in Ar/CO$_2$(70/30) and Kr/CO$_2$(90/10) were calculated, obtaining energy resolutions of 15.4(4)% and 14.6(2)% respectively. The detector design was also studied to reduce the presence of escape peaks and complement a model to evaluate the inevitable X-ray fluorescence of copper generated by the conductive materials inside the detector.
△ Less
Submitted 30 September, 2024; v1 submitted 23 April, 2024;
originally announced April 2024.
-
CP HDR: A feature point detection and description library for LDR and HDR images
Authors:
Artur Santos Nascimento,
Valter Guilherme Silva de Souza,
Daniel Oliveira Dantas,
Beatriz Trinchão Andrade
Abstract:
In computer vision, characteristics refer to image regions with unique properties, such as corners, edges, textures, or areas with high contrast. These regions can be represented through feature points (FPs). FP detection and description are fundamental steps to many computer vision tasks. Most FP detection and description methods use low dynamic range (LDR) images, sufficient for most application…
▽ More
In computer vision, characteristics refer to image regions with unique properties, such as corners, edges, textures, or areas with high contrast. These regions can be represented through feature points (FPs). FP detection and description are fundamental steps to many computer vision tasks. Most FP detection and description methods use low dynamic range (LDR) images, sufficient for most applications involving digital images. However, LDR images may have saturated pixels in scenes with extreme light conditions, which degrade FP detection. On the other hand, high dynamic range (HDR) images usually present a greater dynamic range but FP detection algorithms do not take advantage of all the information in such images. In this study, we present a systematic review of image detection and description algorithms that use HDR images as input. We developed a library called CP_HDR that implements the Harris corner detector, SIFT detector and descriptor, and two modifications of those algorithms specialized in HDR images, called SIFT for HDR (SfHDR) and Harris for HDR (HfHDR). Previous studies investigated the use of HDR images in FP detection, but we did not find studies investigating the use of HDR images in FP description. Using uniformity, repeatability rate, mean average precision, and matching rate metrics, we compared the performance of the CP_HDR algorithms using LDR and HDR images. We observed an increase in the uniformity of the distribution of FPs among the high-light, mid-light, and low-light areas of the images. The results show that using HDR images as input to detection algorithms improves performance and that SfHDR and HfHDR enhance FP description.
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
Tower: An Open Multilingual Large Language Model for Translation-Related Tasks
Authors:
Duarte M. Alves,
José Pombal,
Nuno M. Guerreiro,
Pedro H. Martins,
João Alves,
Amin Farajian,
Ben Peters,
Ricardo Rei,
Patrick Fernandes,
Sweta Agrawal,
Pierre Colombo,
José G. C. de Souza,
André F. T. Martins
Abstract:
While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose a recipe for tailoring LLMs to multiple tasks present in translation workflows. We perform continued pretraining on a multilingual mixture of monolingual and pa…
▽ More
While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose a recipe for tailoring LLMs to multiple tasks present in translation workflows. We perform continued pretraining on a multilingual mixture of monolingual and parallel data, creating TowerBase, followed by finetuning on instructions relevant for translation processes, creating TowerInstruct. Our final model surpasses open alternatives on several tasks relevant to translation workflows and is competitive with general-purpose closed LLMs. To facilitate future research, we release the Tower models, our specialization dataset, an evaluation framework for LLMs focusing on the translation ecosystem, and a collection of model generations, including ours, on our benchmark.
△ Less
Submitted 27 February, 2024;
originally announced February 2024.
-
Explicit stabilized multirate methods for the monodomain model in cardiac electrophysiology
Authors:
Giacomo Rosilho de Souza,
Marcus J. Grote,
Simone Pezzuto,
Rolf Krause
Abstract:
Fully explicit stabilized multirate (mRKC) methods are well-suited for the numerical solution of large multiscale systems of stiff ordinary differential equations thanks to their improved stability properties. To demonstrate their efficiency for the numerical solution of stiff, multiscale, nonlinear parabolic PDE's, we apply mRKC methods to the monodomain equation from cardiac electrophysiology. I…
▽ More
Fully explicit stabilized multirate (mRKC) methods are well-suited for the numerical solution of large multiscale systems of stiff ordinary differential equations thanks to their improved stability properties. To demonstrate their efficiency for the numerical solution of stiff, multiscale, nonlinear parabolic PDE's, we apply mRKC methods to the monodomain equation from cardiac electrophysiology. In doing so, we propose an improved version, specifically tailored to the monodomain model, which leads to the explicit exponential multirate stabilized (emRKC) method. Several numerical experiments are conducted to evaluate the efficiency of both mRKC and emRKC, while taking into account different finite element meshes (structured and unstructured) and realistic ionic models. The new emRKC method typically outperforms a standard implicit-explicit baseline method for cardiac electrophysiology. Code profiling and strong scalability results further demonstrate that emRKC is faster and inherently parallel without sacrificing accuracy.
△ Less
Submitted 24 June, 2024; v1 submitted 3 January, 2024;
originally announced January 2024.
-
Ages and metallicities of stellar clusters using S-PLUS narrow-band integrated photometry: the Small Magellanic Cloud
Authors:
Gabriel Fabiano de Souza,
Pieter Westera,
Felipe Almeida-Fernandes,
Guilherme Limberg,
Bruno Dias,
José A. Hernandez-Jimenez,
Fábio R. Herpich,
Leandro O. Kerber,
Eduardo Machado-Pereira,
Hélio D. Perottoni,
Rafael Guerço,
Liana Li,
Laura Sampedro,
Antonio Kanaan,
Tiago Ribeiro,
William Schoenell,
Claudia Mendes de Oliveira
Abstract:
The Magellanic Clouds are the most massive and closest satellite galaxies of the Milky Way, with stars covering ages from a few Myr up to 13 Gyr. This makes them important for validating integrated light methods to study stellar populations and star-formation processes, which can be applied to more distant galaxies. We characterized a set of stellar clusters in the Small Magellanic Cloud (SMC), us…
▽ More
The Magellanic Clouds are the most massive and closest satellite galaxies of the Milky Way, with stars covering ages from a few Myr up to 13 Gyr. This makes them important for validating integrated light methods to study stellar populations and star-formation processes, which can be applied to more distant galaxies. We characterized a set of stellar clusters in the Small Magellanic Cloud (SMC), using the $\textit{Southern Photometric Local Universe Survey}$. This is the first age (metallicity) determination for 11 (65) clusters of this sample. Through its 7 narrow bands, centered on important spectral features, and 5 broad bands, we can retrieve detailed information about stellar populations. We obtained ages and metallicities for all stellar clusters using the Bayesian spectral energy distribution fitting code $\texttt{BAGPIPES}$. With a sample of clusters in the color range $-0.20 < r-z < +0.35$, for which our determined parameters are most reliable, we modeled the age-metallicity relation of SMC. At any given age, the metallicities of SMC clusters are lower than those of both the Gaia Sausage-Enceladus disrupted dwarf galaxy and the Milky Way. In comparison with literature values, differences are $Δ$log(age)$\approx0.31$ and $Δ$[Fe/H]$\approx0.41$, which is comparable to low-resolution spectroscopy of individual stars. Finally, we confirm a previously known gradient, with younger clusters in the center and older ones preferentially located in the outermost regions. On the other hand, we found no evidence of a significant metallicity gradient.
△ Less
Submitted 30 November, 2023; v1 submitted 23 October, 2023;
originally announced October 2023.
-
Steering Large Language Models for Machine Translation with Finetuning and In-Context Learning
Authors:
Duarte M. Alves,
Nuno M. Guerreiro,
João Alves,
José Pombal,
Ricardo Rei,
José G. C. de Souza,
Pierre Colombo,
André F. T. Martins
Abstract:
Large language models (LLMs) are a promising avenue for machine translation (MT). However, current LLM-based MT systems are brittle: their effectiveness highly depends on the choice of few-shot examples and they often require extra post-processing due to overgeneration. Alternatives such as finetuning on translation instructions are computationally expensive and may weaken in-context learning capa…
▽ More
Large language models (LLMs) are a promising avenue for machine translation (MT). However, current LLM-based MT systems are brittle: their effectiveness highly depends on the choice of few-shot examples and they often require extra post-processing due to overgeneration. Alternatives such as finetuning on translation instructions are computationally expensive and may weaken in-context learning capabilities, due to overspecialization. In this paper, we provide a closer look at this problem. We start by showing that adapter-based finetuning with LoRA matches the performance of traditional finetuning while reducing the number of training parameters by a factor of 50. This method also outperforms few-shot prompting and eliminates the need for post-processing or in-context examples. However, we show that finetuning generally degrades few-shot performance, hindering adaptation capabilities. Finally, to obtain the best of both worlds, we propose a simple approach that incorporates few-shot examples during finetuning. Experiments on 10 language pairs show that our proposed approach recovers the original few-shot capabilities while keeping the added benefits of finetuning.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
An Empirical Study of Translation Hypothesis Ensembling with Large Language Models
Authors:
António Farinhas,
José G. C. de Souza,
André F. T. Martins
Abstract:
Large language models (LLMs) are becoming a one-fits-many solution, but they sometimes hallucinate or produce unreliable output. In this paper, we investigate how hypothesis ensembling can improve the quality of the generated text for the specific problem of LLM-based machine translation. We experiment with several techniques for ensembling hypotheses produced by LLMs such as ChatGPT, LLaMA, and A…
▽ More
Large language models (LLMs) are becoming a one-fits-many solution, but they sometimes hallucinate or produce unreliable output. In this paper, we investigate how hypothesis ensembling can improve the quality of the generated text for the specific problem of LLM-based machine translation. We experiment with several techniques for ensembling hypotheses produced by LLMs such as ChatGPT, LLaMA, and Alpaca. We provide a comprehensive study along multiple dimensions, including the method to generate hypotheses (multiple prompts, temperature-based sampling, and beam search) and the strategy to produce the final translation (instruction-based, quality-based reranking, and minimum Bayes risk (MBR) decoding). Our results show that MBR decoding is a very effective method, that translation quality can be improved using a small number of samples, and that instruction tuning has a strong impact on the relation between the diversity of the hypotheses and the sampling temperature.
△ Less
Submitted 17 October, 2023;
originally announced October 2023.
-
Scaling up COMETKIWI: Unbabel-IST 2023 Submission for the Quality Estimation Shared Task
Authors:
Ricardo Rei,
Nuno M. Guerreiro,
José Pombal,
Daan van Stigt,
Marcos Treviso,
Luisa Coheur,
José G. C. de Souza,
André F. T. Martins
Abstract:
We present the joint contribution of Unbabel and Instituto Superior Técnico to the WMT 2023 Shared Task on Quality Estimation (QE). Our team participated on all tasks: sentence- and word-level quality prediction (task 1) and fine-grained error span detection (task 2). For all tasks, we build on the COMETKIWI-22 model (Rei et al., 2022b). Our multilingual approaches are ranked first for all tasks,…
▽ More
We present the joint contribution of Unbabel and Instituto Superior Técnico to the WMT 2023 Shared Task on Quality Estimation (QE). Our team participated on all tasks: sentence- and word-level quality prediction (task 1) and fine-grained error span detection (task 2). For all tasks, we build on the COMETKIWI-22 model (Rei et al., 2022b). Our multilingual approaches are ranked first for all tasks, reaching state-of-the-art performance for quality estimation at word-, span- and sentence-level granularity. Compared to the previous state-of-the-art COMETKIWI-22, we show large improvements in correlation with human judgements (up to 10 Spearman points). Moreover, we surpass the second-best multilingual submission to the shared-task with up to 3.8 absolute points.
△ Less
Submitted 21 September, 2023;
originally announced September 2023.
-
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation
Authors:
Patrick Fernandes,
Aman Madaan,
Emmy Liu,
António Farinhas,
Pedro Henrique Martins,
Amanda Bertsch,
José G. C. de Souza,
Shuyan Zhou,
Tongshuang Wu,
Graham Neubig,
André F. T. Martins
Abstract:
Many recent advances in natural language generation have been fueled by training large language models on internet-scale data. However, this paradigm can lead to models that generate toxic, inaccurate, and unhelpful content, and automatic evaluation metrics often fail to identify these behaviors. As models become more capable, human feedback is an invaluable signal for evaluating and improving mod…
▽ More
Many recent advances in natural language generation have been fueled by training large language models on internet-scale data. However, this paradigm can lead to models that generate toxic, inaccurate, and unhelpful content, and automatic evaluation metrics often fail to identify these behaviors. As models become more capable, human feedback is an invaluable signal for evaluating and improving models. This survey aims to provide an overview of the recent research that has leveraged human feedback to improve natural language generation. First, we introduce an encompassing formalization of feedback, and identify and organize existing research into a taxonomy following this formalization. Next, we discuss how feedback can be described by its format and objective, and cover the two approaches proposed to use feedback (either for training or decoding): directly using the feedback or training feedback models. We also discuss existing datasets for human-feedback data collection, and concerns surrounding feedback collection. Finally, we provide an overview of the nascent field of AI feedback, which exploits large language models to make judgments based on a set of principles and minimize the need for human intervention.
△ Less
Submitted 31 May, 2023; v1 submitted 1 May, 2023;
originally announced May 2023.
-
Impact of cross-section uncertainties on supernova neutrino spectral parameter fitting in the Deep Underground Neutrino Experiment
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
A. Alton,
R. Alvarez,
P. Amedo,
J. Anderson,
D. A. Andrade
, et al. (1294 additional authors not shown)
Abstract:
A primary goal of the upcoming Deep Underground Neutrino Experiment (DUNE) is to measure the $\mathcal{O}(10)$ MeV neutrinos produced by a Galactic core-collapse supernova if one should occur during the lifetime of the experiment. The liquid-argon-based detectors planned for DUNE are expected to be uniquely sensitive to the $ν_e$ component of the supernova flux, enabling a wide variety of physics…
▽ More
A primary goal of the upcoming Deep Underground Neutrino Experiment (DUNE) is to measure the $\mathcal{O}(10)$ MeV neutrinos produced by a Galactic core-collapse supernova if one should occur during the lifetime of the experiment. The liquid-argon-based detectors planned for DUNE are expected to be uniquely sensitive to the $ν_e$ component of the supernova flux, enabling a wide variety of physics and astrophysics measurements. A key requirement for a correct interpretation of these measurements is a good understanding of the energy-dependent total cross section $σ(E_ν)$ for charged-current $ν_e$ absorption on argon. In the context of a simulated extraction of supernova $ν_e$ spectral parameters from a toy analysis, we investigate the impact of $σ(E_ν)$ modeling uncertainties on DUNE's supernova neutrino physics sensitivity for the first time. We find that the currently large theoretical uncertainties on $σ(E_ν)$ must be substantially reduced before the $ν_e$ flux parameters can be extracted reliably: in the absence of external constraints, a measurement of the integrated neutrino luminosity with less than 10\% bias with DUNE requires $σ(E_ν)$ to be known to about 5%. The neutrino spectral shape parameters can be known to better than 10% for a 20% uncertainty on the cross-section scale, although they will be sensitive to uncertainties on the shape of $σ(E_ν)$. A direct measurement of low-energy $ν_e$-argon scattering would be invaluable for improving the theoretical precision to the needed level.
△ Less
Submitted 7 July, 2023; v1 submitted 29 March, 2023;
originally announced March 2023.
-
A data acquisition and reconstruction software for SAMPA-SRS integration
Authors:
G. G. A. de Souza,
T. S. Abelha,
T. B. Saramela,
A. F. V. Cortez,
H. N. da Luz,
C. G. Penteado,
M. Bregant
Abstract:
In this work we present the latest developments in the SAMPA-SRS integration. A software was developed to improve the acquisition configuration, acquisition, and decoding of the data. The complete framework was tested using a triple GEM-based position sensitive detector for X-rays. The detector was operated in Ar/CO$_2$ (70/30) in continuous flow, at atmospheric pressure and made use of a 1 dimens…
▽ More
In this work we present the latest developments in the SAMPA-SRS integration. A software was developed to improve the acquisition configuration, acquisition, and decoding of the data. The complete framework was tested using a triple GEM-based position sensitive detector for X-rays. The detector was operated in Ar/CO$_2$ (70/30) in continuous flow, at atmospheric pressure and made use of a 1 dimension strip readout (200$μ$m wide strips at a pitch of 400$μ$m) for charge collection. With this detector a position resolution of better than 833$μ$m was obtained, with an energy resolution of 14.2% ($σ/E$) for 5.9keV.
△ Less
Submitted 8 May, 2023; v1 submitted 20 March, 2023;
originally announced March 2023.
-
Boundary Integral Formulation of the Cell-by-Cell Model of Cardiac Electrophysiology
Authors:
Giacomo Rosilho de Souza,
Rolf Krause,
Simone Pezzuto
Abstract:
We propose a boundary element method for the accurate solution of the cell-by-cell bidomain model of electrophysiology. The cell-by-cell model, also called Extracellular-Membrane-Intracellular (EMI) model, is a system of reaction-diffusion equations describing the evolution of the electric potential within each domain: intra- and extra-cellular space and the cellular membrane. The system is parabo…
▽ More
We propose a boundary element method for the accurate solution of the cell-by-cell bidomain model of electrophysiology. The cell-by-cell model, also called Extracellular-Membrane-Intracellular (EMI) model, is a system of reaction-diffusion equations describing the evolution of the electric potential within each domain: intra- and extra-cellular space and the cellular membrane. The system is parabolic but degenerate because the time derivative is only in the membrane domain. In this work, we adopt a boundary-integral formulation for removing the degeneracy in the system and recast it to a parabolic equation on the membrane. The formulation is also numerically advantageous since the number of degrees of freedom is sensibly reduced compared to the original model. Specifically, we prove that the boundary-element discretization of the EMI model is equivalent to a system of ordinary differential equations, and we consider a time discretization based on the multirate explicit stabilized Runge-Kutta method. We numerically show that our scheme convergences exponentially in space for the single-cell case. We finally provide several numerical experiments of biological interest.
△ Less
Submitted 10 February, 2023;
originally announced February 2023.
-
Highly-parallelized simulation of a pixelated LArTPC on a GPU
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
C. Alt,
A. Alton,
R. Alvarez,
P. Amedo,
J. Anderson
, et al. (1282 additional authors not shown)
Abstract:
The rapid development of general-purpose computing on graphics processing units (GPGPU) is allowing the implementation of highly-parallelized Monte Carlo simulation chains for particle physics experiments. This technique is particularly suitable for the simulation of a pixelated charge readout for time projection chambers, given the large number of channels that this technology employs. Here we pr…
▽ More
The rapid development of general-purpose computing on graphics processing units (GPGPU) is allowing the implementation of highly-parallelized Monte Carlo simulation chains for particle physics experiments. This technique is particularly suitable for the simulation of a pixelated charge readout for time projection chambers, given the large number of channels that this technology employs. Here we present the first implementation of a full microphysical simulator of a liquid argon time projection chamber (LArTPC) equipped with light readout and pixelated charge readout, developed for the DUNE Near Detector. The software is implemented with an end-to-end set of GPU-optimized algorithms. The algorithms have been written in Python and translated into CUDA kernels using Numba, a just-in-time compiler for a subset of Python and NumPy instructions. The GPU implementation achieves a speed up of four orders of magnitude compared with the equivalent CPU version. The simulation of the current induced on $10^3$ pixels takes around 1 ms on the GPU, compared with approximately 10 s on the CPU. The results of the simulation are compared against data from a pixel-readout LArTPC prototype.
△ Less
Submitted 28 February, 2023; v1 submitted 19 December, 2022;
originally announced December 2022.
-
Identification and reconstruction of low-energy electrons in the ProtoDUNE-SP detector
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
C. Alt,
A. Alton,
R. Alvarez,
P. Amedo,
J. Anderson
, et al. (1235 additional authors not shown)
Abstract:
Measurements of electrons from $ν_e$ interactions are crucial for the Deep Underground Neutrino Experiment (DUNE) neutrino oscillation program, as well as searches for physics beyond the standard model, supernova neutrino detection, and solar neutrino measurements. This article describes the selection and reconstruction of low-energy (Michel) electrons in the ProtoDUNE-SP detector. ProtoDUNE-SP is…
▽ More
Measurements of electrons from $ν_e$ interactions are crucial for the Deep Underground Neutrino Experiment (DUNE) neutrino oscillation program, as well as searches for physics beyond the standard model, supernova neutrino detection, and solar neutrino measurements. This article describes the selection and reconstruction of low-energy (Michel) electrons in the ProtoDUNE-SP detector. ProtoDUNE-SP is one of the prototypes for the DUNE far detector, built and operated at CERN as a charged particle test beam experiment. A sample of low-energy electrons produced by the decay of cosmic muons is selected with a purity of 95%. This sample is used to calibrate the low-energy electron energy scale with two techniques. An electron energy calibration based on a cosmic ray muon sample uses calibration constants derived from measured and simulated cosmic ray muon events. Another calibration technique makes use of the theoretically well-understood Michel electron energy spectrum to convert reconstructed charge to electron energy. In addition, the effects of detector response to low-energy electron energy scale and its resolution including readout electronics threshold effects are quantified. Finally, the relation between the theoretical and reconstructed low-energy electron energy spectrum is derived and the energy resolution is characterized. The low-energy electron selection presented here accounts for about 75% of the total electron deposited energy. After the addition of lost energy using a Monte Carlo simulation, the energy resolution improves from about 40% to 25% at 50~MeV. These results are used to validate the expected capabilities of the DUNE far detector to reconstruct low-energy electrons.
△ Less
Submitted 31 May, 2023; v1 submitted 2 November, 2022;
originally announced November 2022.
-
CometKiwi: IST-Unbabel 2022 Submission for the Quality Estimation Shared Task
Authors:
Ricardo Rei,
Marcos Treviso,
Nuno M. Guerreiro,
Chrysoula Zerva,
Ana C. Farinha,
Christine Maroti,
José G. C. de Souza,
Taisiya Glushkova,
Duarte M. Alves,
Alon Lavie,
Luisa Coheur,
André F. T. Martins
Abstract:
We present the joint contribution of IST and Unbabel to the WMT 2022 Shared Task on Quality Estimation (QE). Our team participated on all three subtasks: (i) Sentence and Word-level Quality Prediction; (ii) Explainable QE; and (iii) Critical Error Detection. For all tasks we build on top of the COMET framework, connecting it with the predictor-estimator architecture of OpenKiwi, and equipping it w…
▽ More
We present the joint contribution of IST and Unbabel to the WMT 2022 Shared Task on Quality Estimation (QE). Our team participated on all three subtasks: (i) Sentence and Word-level Quality Prediction; (ii) Explainable QE; and (iii) Critical Error Detection. For all tasks we build on top of the COMET framework, connecting it with the predictor-estimator architecture of OpenKiwi, and equipping it with a word-level sequence tagger and an explanation extractor. Our results suggest that incorporating references during pretraining improves performance across several language pairs on downstream tasks, and that jointly training with sentence and word-level objectives yields a further boost. Furthermore, combining attention and gradient information proved to be the top strategy for extracting good explanations of sentence-level QE models. Overall, our submissions achieved the best results for all three tasks for almost all language pairs by a considerable margin.
△ Less
Submitted 13 September, 2022;
originally announced September 2022.
-
Reconstruction of interactions in the ProtoDUNE-SP detector with Pandora
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
F. Akbar,
B. Ali-Mohammadzadeh,
K. Allison,
S. Alonso Monsalve,
M. AlRashed,
C. Alt,
A. Alton,
R. Alvarez,
P. Amedo
, et al. (1203 additional authors not shown)
Abstract:
The Pandora Software Development Kit and algorithm libraries provide pattern-recognition logic essential to the reconstruction of particle interactions in liquid argon time projection chamber detectors. Pandora is the primary event reconstruction software used at ProtoDUNE-SP, a prototype for the Deep Underground Neutrino Experiment far detector. ProtoDUNE-SP, located at CERN, is exposed to a char…
▽ More
The Pandora Software Development Kit and algorithm libraries provide pattern-recognition logic essential to the reconstruction of particle interactions in liquid argon time projection chamber detectors. Pandora is the primary event reconstruction software used at ProtoDUNE-SP, a prototype for the Deep Underground Neutrino Experiment far detector. ProtoDUNE-SP, located at CERN, is exposed to a charged-particle test beam. This paper gives an overview of the Pandora reconstruction algorithms and how they have been tailored for use at ProtoDUNE-SP. In complex events with numerous cosmic-ray and beam background particles, the simulated reconstruction and identification efficiency for triggered test-beam particles is above 80% for the majority of particle type and beam momentum combinations. Specifically, simulated 1 GeV/$c$ charged pions and protons are correctly reconstructed and identified with efficiencies of 86.1$\pm0.6$% and 84.1$\pm0.6$%, respectively. The efficiencies measured for test-beam data are shown to be within 5% of those predicted by the simulation.
△ Less
Submitted 17 July, 2023; v1 submitted 29 June, 2022;
originally announced June 2022.
-
Double-GEM based thermal neutron detector prototype
Authors:
L. A. Serra Filho,
R. Felix dos Santos,
G. G. A. de Souza,
M. M. M. Paulino,
F. A. Souza,
M. Moralles,
H. Natal da Luz,
M. Bregant,
M. G. Munhoz,
Chung-Chuan Lai,
Carina Höglund,
Per-Olof Svensson,
Linda Robinson,
Richard Hall-Wilton
Abstract:
The Helium-3 shortage and the growing interest in neutron science constitute a driving factor in developing new neutron detection technologies. In this work, we report the development of a double-GEM detector prototype that uses a $^{10}$B$_4$C layer as a neutron converter material. GEANT4 simulations were performed predicting an efficiency of 3.14(10) %, agreeing within 2.7 $σ$ with the experimen…
▽ More
The Helium-3 shortage and the growing interest in neutron science constitute a driving factor in developing new neutron detection technologies. In this work, we report the development of a double-GEM detector prototype that uses a $^{10}$B$_4$C layer as a neutron converter material. GEANT4 simulations were performed predicting an efficiency of 3.14(10) %, agreeing within 2.7 $σ$ with the experimental and analytic detection efficiencies obtained by the detector when tested in a 41.8 meV thermal neutron beam. The detector is position sensitive, equipped with a 256+256 strip readout connected to resistive chains, and achieves a spatial resolution better than 3 mm. The gain stability over time was also measured with a fluctuation of about 0.2 %h$^{-1}$ of the signal amplitude. A simple data acquisition with only 5 electronic channels is sufficient to operate this detector.
△ Less
Submitted 19 July, 2022; v1 submitted 14 May, 2022;
originally announced May 2022.
-
Quality-Aware Decoding for Neural Machine Translation
Authors:
Patrick Fernandes,
António Farinhas,
Ricardo Rei,
José G. C. de Souza,
Perez Ogayo,
Graham Neubig,
André F. T. Martins
Abstract:
Despite the progress in machine translation quality estimation and evaluation in the last years, decoding in neural machine translation (NMT) is mostly oblivious to this and centers around finding the most probable translation according to the model (MAP decoding), approximated with beam search. In this paper, we bring together these two lines of research and propose quality-aware decoding for NMT…
▽ More
Despite the progress in machine translation quality estimation and evaluation in the last years, decoding in neural machine translation (NMT) is mostly oblivious to this and centers around finding the most probable translation according to the model (MAP decoding), approximated with beam search. In this paper, we bring together these two lines of research and propose quality-aware decoding for NMT, by leveraging recent breakthroughs in reference-free and reference-based MT evaluation through various inference methods like $N$-best reranking and minimum Bayes risk decoding. We perform an extensive comparison of various possible candidate generation and ranking methods across four datasets and two model classes and find that quality-aware decoding consistently outperforms MAP-based decoding according both to state-of-the-art automatic metrics (COMET and BLEURT) and to human assessments. Our code is available at https://github.com/deep-spin/qaware-decode.
△ Less
Submitted 2 May, 2022;
originally announced May 2022.
-
Separation of track- and shower-like energy deposits in ProtoDUNE-SP using a convolutional neural network
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
B. Ali-Mohammadzadeh,
T. Alion,
K. Allison,
S. Alonso Monsalve,
M. AlRashed,
C. Alt,
A. Alton,
R. Alvarez,
P. Amedo,
J. Anderson
, et al. (1204 additional authors not shown)
Abstract:
Liquid argon time projection chamber detector technology provides high spatial and calorimetric resolutions on the charged particles traversing liquid argon. As a result, the technology has been used in a number of recent neutrino experiments, and is the technology of choice for the Deep Underground Neutrino Experiment (DUNE). In order to perform high precision measurements of neutrinos in the det…
▽ More
Liquid argon time projection chamber detector technology provides high spatial and calorimetric resolutions on the charged particles traversing liquid argon. As a result, the technology has been used in a number of recent neutrino experiments, and is the technology of choice for the Deep Underground Neutrino Experiment (DUNE). In order to perform high precision measurements of neutrinos in the detector, final state particles need to be effectively identified, and their energy accurately reconstructed. This article proposes an algorithm based on a convolutional neural network to perform the classification of energy deposits and reconstructed particles as track-like or arising from electromagnetic cascades. Results from testing the algorithm on data from ProtoDUNE-SP, a prototype of the DUNE far detector, are presented. The network identifies track- and shower-like particles, as well as Michel electrons, with high efficiency. The performance of the algorithm is consistent between data and simulation.
△ Less
Submitted 30 June, 2022; v1 submitted 31 March, 2022;
originally announced March 2022.
-
Scintillation light detection in the 6-m drift-length ProtoDUNE Dual Phase liquid argon TPC
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
B. Ali-Mohammadzadeh,
T. Alion,
K. Allison,
S. Alonso Monsalve,
M. AlRashed,
C. Alt,
A. Alton,
R. Alvarez,
P. Amedo,
J. Anderson
, et al. (1202 additional authors not shown)
Abstract:
DUNE is a dual-site experiment for long-baseline neutrino oscillation studies, neutrino astrophysics and nucleon decay searches. ProtoDUNE Dual Phase (DP) is a 6x6x6m3 liquid argon time-projection-chamber (LArTPC) that recorded cosmic-muon data at the CERN Neutrino Platform in 2019-2020 as a prototype of the DUNE Far Detector. Charged particles propagating through the LArTPC produce ionization and…
▽ More
DUNE is a dual-site experiment for long-baseline neutrino oscillation studies, neutrino astrophysics and nucleon decay searches. ProtoDUNE Dual Phase (DP) is a 6x6x6m3 liquid argon time-projection-chamber (LArTPC) that recorded cosmic-muon data at the CERN Neutrino Platform in 2019-2020 as a prototype of the DUNE Far Detector. Charged particles propagating through the LArTPC produce ionization and scintillation light. The scintillation light signal in these detectors can provide the trigger for non-beam events. In addition, it adds precise timing capabilities and improves the calorimetry measurements. In ProtoDUNE-DP, scintillation and electroluminescence light produced by cosmic muons in the LArTPC is collected by photomultiplier tubes placed up to 7 m away from the ionizing track. In this paper, the ProtoDUNE-DP photon detection system performance is evaluated with a particular focus on the different wavelength shifters, such as PEN and TPB, and the use of Xe-doped LAr, considering its future use in giant LArTPCs. The scintillation light production and propagation processes are analyzed and a comparison of simulation to data is performed, improving understanding of the liquid argon properties
△ Less
Submitted 3 June, 2022; v1 submitted 30 March, 2022;
originally announced March 2022.
-
Application of Stabilized Explicit Runge-Kutta Methods to the Incompressible Navier-Stokes Equations by means of a Projection Method and a Differential Algebraic Approach
Authors:
Giacomo Rosilho de Souza
Abstract:
In this master thesis we have compared different second order stabilized explicit Runge-Kutta methods when applied to the incompressible Navier-Stokes equations by means of a projection method and a differential algebraic approach. We explored the stability and accuracy properties of the RKC, ROCK2 and PIROCK schemes when coupled with the projection and the differential algebraic approach. PIROCK…
▽ More
In this master thesis we have compared different second order stabilized explicit Runge-Kutta methods when applied to the incompressible Navier-Stokes equations by means of a projection method and a differential algebraic approach. We explored the stability and accuracy properties of the RKC, ROCK2 and PIROCK schemes when coupled with the projection and the differential algebraic approach. PIROCK has shown unexpected instabilities, ROCK2 resulted to be the most efficient and versatile Runge-Kutta method taken into account. The differential algebraic approach sounds computationally costly but it exhibits better accuracy and a larger stability region. These properties make it more efficient than the projection method. The theory presented in the first chapters is supported by numerical experiments.
△ Less
Submitted 28 March, 2022;
originally announced March 2022.
-
A Gaseous Argon-Based Near Detector to Enhance the Physics Capabilities of DUNE
Authors:
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
F. Akbar,
B. Ali-Mohammadzadeh,
T. Alion,
K. Allison,
S. Alonso Monsalve,
M. AlRashed,
C. Alt,
A. Alton,
R. Alvarez,
P. Amedo
, et al. (1220 additional authors not shown)
Abstract:
This document presents the concept and physics case for a magnetized gaseous argon-based detector system (ND-GAr) for the Deep Underground Neutrino Experiment (DUNE) Near Detector. This detector system is required in order for DUNE to reach its full physics potential in the measurement of CP violation and in delivering precision measurements of oscillation parameters. In addition to its critical r…
▽ More
This document presents the concept and physics case for a magnetized gaseous argon-based detector system (ND-GAr) for the Deep Underground Neutrino Experiment (DUNE) Near Detector. This detector system is required in order for DUNE to reach its full physics potential in the measurement of CP violation and in delivering precision measurements of oscillation parameters. In addition to its critical role in the long-baseline oscillation program, ND-GAr will extend the overall physics program of DUNE. The LBNF high-intensity proton beam will provide a large flux of neutrinos that is sampled by ND-GAr, enabling DUNE to discover new particles and search for new interactions and symmetries beyond those predicted in the Standard Model.
△ Less
Submitted 11 March, 2022;
originally announced March 2022.
-
Snowmass Neutrino Frontier: DUNE Physics Summary
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
F. Akbar,
B. Ali-Mohammadzadeh,
T. Alion,
K. Allison,
S. Alonso Monsalve,
M. AlRashed,
C. Alt,
A. Alton,
R. Alvarez
, et al. (1221 additional authors not shown)
Abstract:
The Deep Underground Neutrino Experiment (DUNE) is a next-generation long-baseline neutrino oscillation experiment with a primary physics goal of observing neutrino and antineutrino oscillation patterns to precisely measure the parameters governing long-baseline neutrino oscillation in a single experiment, and to test the three-flavor paradigm. DUNE's design has been developed by a large, internat…
▽ More
The Deep Underground Neutrino Experiment (DUNE) is a next-generation long-baseline neutrino oscillation experiment with a primary physics goal of observing neutrino and antineutrino oscillation patterns to precisely measure the parameters governing long-baseline neutrino oscillation in a single experiment, and to test the three-flavor paradigm. DUNE's design has been developed by a large, international collaboration of scientists and engineers to have unique capability to measure neutrino oscillation as a function of energy in a broadband beam, to resolve degeneracy among oscillation parameters, and to control systematic uncertainty using the exquisite imaging capability of massive LArTPC far detector modules and an argon-based near detector. DUNE's neutrino oscillation measurements will unambiguously resolve the neutrino mass ordering and provide the sensitivity to discover CP violation in neutrinos for a wide range of possible values of $δ_{CP}$. DUNE is also uniquely sensitive to electron neutrinos from a galactic supernova burst, and to a broad range of physics beyond the Standard Model (BSM), including nucleon decays. DUNE is anticipated to begin collecting physics data with Phase I, an initial experiment configuration consisting of two far detector modules and a minimal suite of near detector components, with a 1.2 MW proton beam. To realize its extensive, world-leading physics potential requires the full scope of DUNE be completed in Phase II. The three Phase II upgrades are all necessary to achieve DUNE's physics goals: (1) addition of far detector modules three and four for a total FD fiducial mass of at least 40 kt, (2) upgrade of the proton beam power from 1.2 MW to 2.4 MW, and (3) replacement of the near detector's temporary muon spectrometer with a magnetized, high-pressure gaseous argon TPC and calorimeter.
△ Less
Submitted 11 March, 2022;
originally announced March 2022.
-
Using the Energy probability distribution zeros to obtain the critical properties of the two-dimensional anisotropic Heisenberg model
Authors:
Gabriel Bruno Garcia de Souza,
Bismarck Vaz da Costa
Abstract:
In this paper we present a Monte Carlo study of the critical behavior of the easy axis anisotropic Heisenberg spin model in two dimensions. Based on the partial knowledge of the zeros of the energy probability distribution we determine with good precision the phase diagram of the model obtaining the critical temperature and exponents for several values of the anisotropy. Our results indicate that…
▽ More
In this paper we present a Monte Carlo study of the critical behavior of the easy axis anisotropic Heisenberg spin model in two dimensions. Based on the partial knowledge of the zeros of the energy probability distribution we determine with good precision the phase diagram of the model obtaining the critical temperature and exponents for several values of the anisotropy. Our results indicate that the model is in the Ising universality class for any anisotropy.
△ Less
Submitted 7 July, 2022; v1 submitted 1 March, 2022;
originally announced March 2022.
-
Mixed-precision explicit stabilized Runge-Kutta methods for single- and multi-scale differential equations
Authors:
Matteo Croci,
Giacomo Rosilho de Souza
Abstract:
Mixed-precision algorithms combine low- and high-precision computations in order to benefit from the performance gains of reduced-precision without sacrificing accuracy. In this work, we design mixed-precision Runge-Kutta-Chebyshev (RKC) methods, where high precision is used for accuracy, and low precision for stability. Generally speaking, RKC methods are low-order explicit schemes with a stabili…
▽ More
Mixed-precision algorithms combine low- and high-precision computations in order to benefit from the performance gains of reduced-precision without sacrificing accuracy. In this work, we design mixed-precision Runge-Kutta-Chebyshev (RKC) methods, where high precision is used for accuracy, and low precision for stability. Generally speaking, RKC methods are low-order explicit schemes with a stability domain growing quadratically with the number of function evaluations. For this reason, most of the computational effort is spent on stability rather than accuracy purposes. In this paper, we show that a naïve mixed-precision implementation of any Runge-Kutta scheme can harm the convergence order of the method and limit its accuracy, and we introduce a new class of mixed-precision RKC schemes that are instead unaffected by this limiting behaviour. We present three mixed-precision schemes: a first- and a second-order RKC method, and a first-order multirate RKC scheme for multiscale problems. These schemes perform only the few function evaluations needed for accuracy (1 or 2 for first- and second-order methods respectively) in high precision, while the rest are performed in low precision. We prove that while these methods are essentially as cheap as their fully low-precision equivalent, they retain the stability and convergence order of their high-precision counterpart. Indeed, numerical experiments confirm that these schemes are as accurate as the corresponding high-precision method.
△ Less
Submitted 6 April, 2022; v1 submitted 24 September, 2021;
originally announced September 2021.
-
Trends on 3d Transition Metal Coordination on Monolayer MoS$_2$
Authors:
He Liu,
Walner Costa Silva,
Leonardo Santana Gonçalves de Souza,
Amanda Garcez Veiga,
Leandro Seixas,
Kazunori Fujisawa,
Ethan Kahn,
Tianyi Zhang,
Fu Zhang,
Zhuohang Yu,
Katherine Thompson,
Yu Lei,
Christiano J. S. de Matos,
Maria Luiza M. Rocco,
Mauricio Terrones,
Daniel Grasseschi
Abstract:
Two-dimensional materials (2DM) have attracted much interest due to their distinct optical, electronic, and catalytic properties. These properties can be by tuned a range of methods including substitutional doping or, as recently demonstrated, by surface functionalization with single atoms, increasing even further 2DM portfolio. Here we theoretically and experimentally describe the coordination re…
▽ More
Two-dimensional materials (2DM) have attracted much interest due to their distinct optical, electronic, and catalytic properties. These properties can be by tuned a range of methods including substitutional doping or, as recently demonstrated, by surface functionalization with single atoms, increasing even further 2DM portfolio. Here we theoretically and experimentally describe the coordination reaction between MoS$_2$ monolayers with 3d transition metals (TMs), exploring the nature and the trend of MoS$_2$-TMs interaction. Density Functional Theory calculations, X-Ray Photoelectron Spectroscopy (XPS), and Photoluminescence (PL) point to the formation of MoS$_2$-TM coordination complexes, where the adsorption energy trend for 3d TM resembles the crystal-field (CF) stabilization energy for weak-field complexes. Pearson's theory for hard-soft acid-base and Ligand-field theory were applied to discuss the periodic trends on 3d TM coordination on the MoS$_2$ surface. We found that softer acids with higher ligand field stabilization energy, such as Ni$^{2+}$, tend to form bonds with more covalent character with MoS$_2$, which can be considered a soft base. On the other hand, harder acids, such as Cr$^{3+}$, tend to form bonds with more ionic character. Additionally, we studied the trends in charge transfer and doping observed in the XPS and PL results, where metals such as Ni led to an n-type of doping, while Cu functionalization results in p-type doping. Therefore, the formation of coordination complexes on TMD's surface is demonstrated to be a promising and effective way to control and to understand the nature of the single-atom functionalization of TMD.
△ Less
Submitted 17 September, 2021;
originally announced September 2021.
-
Low exposure long-baseline neutrino oscillation sensitivity of the DUNE experiment
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
D. Adams,
M. Adinolfi,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
B. Ali-Mohammadzadeh,
T. Alion,
K. Allison,
S. Alonso Monsalve,
M. AlRashed,
C. Alt,
A. Alton,
P. Amedo,
J. Anderson,
C. Andreopoulos,
M. Andreotti
, et al. (1132 additional authors not shown)
Abstract:
The Deep Underground Neutrino Experiment (DUNE) will produce world-leading neutrino oscillation measurements over the lifetime of the experiment. In this work, we explore DUNE's sensitivity to observe charge-parity violation (CPV) in the neutrino sector, and to resolve the mass ordering, for exposures of up to 100 kiloton-megawatt-years (kt-MW-yr). The analysis includes detailed uncertainties on t…
▽ More
The Deep Underground Neutrino Experiment (DUNE) will produce world-leading neutrino oscillation measurements over the lifetime of the experiment. In this work, we explore DUNE's sensitivity to observe charge-parity violation (CPV) in the neutrino sector, and to resolve the mass ordering, for exposures of up to 100 kiloton-megawatt-years (kt-MW-yr). The analysis includes detailed uncertainties on the flux prediction, the neutrino interaction model, and detector effects. We demonstrate that DUNE will be able to unambiguously resolve the neutrino mass ordering at a 3$σ$ (5$σ$) level, with a 66 (100) kt-MW-yr far detector exposure, and has the ability to make strong statements at significantly shorter exposures depending on the true value of other oscillation parameters. We also show that DUNE has the potential to make a robust measurement of CPV at a 3$σ$ level with a 100 kt-MW-yr exposure for the maximally CP-violating values $δ_{\rm CP}} = \pmπ/2$. Additionally, the dependence of DUNE's sensitivity on the exposure taken in neutrino-enhanced and antineutrino-enhanced running is discussed. An equal fraction of exposure taken in each beam mode is found to be close to optimal when considered over the entire space of interest.
△ Less
Submitted 3 September, 2021;
originally announced September 2021.
-
Design, construction and operation of the ProtoDUNE-SP Liquid Argon TPC
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
D. Adams,
M. Adinolfi,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Ali-Mohammadzadeh,
T. Alion,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
C. Alt,
A. Alton,
P. Amedo,
J. Anderson,
C. Andreopoulos,
M. Andreotti,
M. P. Andrews
, et al. (1158 additional authors not shown)
Abstract:
The ProtoDUNE-SP detector is a single-phase liquid argon time projection chamber (LArTPC) that was constructed and operated in the CERN North Area at the end of the H4 beamline. This detector is a prototype for the first far detector module of the Deep Underground Neutrino Experiment (DUNE), which will be constructed at the Sandford Underground Research Facility (SURF) in Lead, South Dakota, USA.…
▽ More
The ProtoDUNE-SP detector is a single-phase liquid argon time projection chamber (LArTPC) that was constructed and operated in the CERN North Area at the end of the H4 beamline. This detector is a prototype for the first far detector module of the Deep Underground Neutrino Experiment (DUNE), which will be constructed at the Sandford Underground Research Facility (SURF) in Lead, South Dakota, USA. The ProtoDUNE-SP detector incorporates full-size components as designed for DUNE and has an active volume of $7\times 6\times 7.2$~m$^3$. The H4 beam delivers incident particles with well-measured momenta and high-purity particle identification. ProtoDUNE-SP's successful operation between 2018 and 2020 demonstrates the effectiveness of the single-phase far detector design. This paper describes the design, construction, assembly and operation of the detector components.
△ Less
Submitted 23 September, 2021; v1 submitted 4 August, 2021;
originally announced August 2021.
-
Top-Down Model of Limescale Formation in Turbulent Pipe Flows
Authors:
L. Moriconi,
T. Nascimento,
B. G. B. de Souza,
J. B. R. Loureiro
Abstract:
We investigate calcium carbonate scale formation at high Reynolds numbers in a large pipe rig facility. The calcium carbonate solution is produced from the injection, at a T-joint inlet, of pH-stabilized sodium carbonate and calcium chloride aqueous solutions. A scanning electron microscopy analysis of the deposited mass along the pipe indicates that after an initial transient regime of ion-by-ion…
▽ More
We investigate calcium carbonate scale formation at high Reynolds numbers in a large pipe rig facility. The calcium carbonate solution is produced from the injection, at a T-joint inlet, of pH-stabilized sodium carbonate and calcium chloride aqueous solutions. A scanning electron microscopy analysis of the deposited mass along the pipe indicates that after an initial transient regime of ion-by-ion crystal growth, calcium carbonate scale is dominated by particulate deposition. While limescale formation in regions that are closer to the pipe's entrance can be described as the heterogeneous surface nucleation of calcium and carbonate ions driven by turbulent diffusion, we rely upon turbophoresis phenomenology to devise a peculiarly simple kinetic model of deposition at farther downstream regions. Letting $Φ$ and $R$ be the flow rate and the pipe's radius, respectively, the mass deposition rates per unit time and unit area are predicted to scale as $Φ^α/ R^β$ (for certain modeled values of the $α$ and $β$ parameters) with suggestive support from our experiments.
△ Less
Submitted 23 July, 2021;
originally announced July 2021.
-
Searching for solar KDAR with DUNE
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
D. Adams,
M. Adinolfi,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Ali-Mohammadzadeh,
T. Alion,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
C. Alt,
A. Alton,
P. Amedo,
J. Anderson,
C. Andreopoulos,
M. Andreotti,
M. P. Andrews
, et al. (1157 additional authors not shown)
Abstract:
The observation of 236 MeV muon neutrinos from kaon-decay-at-rest (KDAR) originating in the core of the Sun would provide a unique signature of dark matter annihilation. Since excellent angle and energy reconstruction are necessary to detect this monoenergetic, directional neutrino flux, DUNE with its vast volume and reconstruction capabilities, is a promising candidate for a KDAR neutrino search.…
▽ More
The observation of 236 MeV muon neutrinos from kaon-decay-at-rest (KDAR) originating in the core of the Sun would provide a unique signature of dark matter annihilation. Since excellent angle and energy reconstruction are necessary to detect this monoenergetic, directional neutrino flux, DUNE with its vast volume and reconstruction capabilities, is a promising candidate for a KDAR neutrino search. In this work, we evaluate the proposed KDAR neutrino search strategies by realistically modeling both neutrino-nucleus interactions and the response of DUNE. We find that, although reconstruction of the neutrino energy and direction is difficult with current techniques in the relevant energy range, the superb energy resolution, angular resolution, and particle identification offered by DUNE can still permit great signal/background discrimination. Moreover, there are non-standard scenarios in which searches at DUNE for KDAR in the Sun can probe dark matter interactions.
△ Less
Submitted 26 October, 2021; v1 submitted 19 July, 2021;
originally announced July 2021.
-
Optimal explicit stabilized postprocessed $τ$-leap method for the simulation of chemical kinetics
Authors:
Assyr Abdulle,
Lia Gander,
Giacomo Rosilho de Souza
Abstract:
The simulation of chemical kinetics involving multiple scales constitutes a modeling challenge (from ordinary differential equations to Markov chain) and a computational challenge (multiple scales, large dynamical systems, time step restrictions). In this paper we propose a new discrete stochastic simulation algorithm: the postprocessed second kind stabilized orthogonal $τ$-leap Runge-Kutta method…
▽ More
The simulation of chemical kinetics involving multiple scales constitutes a modeling challenge (from ordinary differential equations to Markov chain) and a computational challenge (multiple scales, large dynamical systems, time step restrictions). In this paper we propose a new discrete stochastic simulation algorithm: the postprocessed second kind stabilized orthogonal $τ$-leap Runge-Kutta method (PSK-$τ$-ROCK). In the context of chemical kinetics this method can be seen as a stabilization of Gillespie's explicit $τ$-leap combined with a postprocessor. The stabilized procedure allows to simulate problems with multiple scales (stiff), while the postprocessing procedure allows to approximate the invariant measure (e.g. mean and variance) of ergodic stochastic dynamical systems. We prove stability and accuracy of the PSK-$τ$-ROCK. Numerical experiments illustrate the high reliability and efficiency of the scheme when compared to other $τ$-leap methods.
△ Less
Submitted 17 June, 2021;
originally announced June 2021.
-
Liquid argon characterization of the X-ARAPUCA with alpha particles, gamma rays and cosmic muons
Authors:
H. V. Souza,
E. Segreto,
A. A. Machado,
R. R. Sarmento,
M. C. Q. Bazetto,
L. Paulucci,
F. Marinho,
V. L. Pimentel,
F. L. Demolin,
G. de Souza,
A. C. Fauth,
M. A. Ayala-Torres
Abstract:
The X-ARAPUCA device is the baseline choice for the photon detection system of the first far detector module of the DUNE experiment. We present the results of the first complete characterization of a small scale X-ARAPUCA prototype, which is a slice of a full DUNE module. Its total detection efficiency in liquid argon was measured with three different ionizing radiations: $α$ particles, $γ$'s and…
▽ More
The X-ARAPUCA device is the baseline choice for the photon detection system of the first far detector module of the DUNE experiment. We present the results of the first complete characterization of a small scale X-ARAPUCA prototype, which is a slice of a full DUNE module. Its total detection efficiency in liquid argon was measured with three different ionizing radiations: $α$ particles, $γ$'s and muons and resulted to be $\sim$2.2% when the active silicon photomultipliers were biased at +5.0 V of over voltage, corresponding to a Photon Detection Efficiency around 50% at room temperature. This value comfortably satisfies the requirements of the first DUNE far detector module (detection efficiency $>$2.0%) and allows to achieve an energy resolution comparable to the one achievable with the Time Projection Chambers for energies below 10 MeV, which is the region relevant for Supernova neutrino detection.
△ Less
Submitted 6 December, 2021; v1 submitted 8 June, 2021;
originally announced June 2021.
-
Deep Underground Neutrino Experiment (DUNE) Near Detector Conceptual Design Report
Authors:
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
G. Adamov,
D. Adams,
M. Adinolfi,
A. Aduszkiewicz,
Z. Ahmad,
J. Ahmed,
T. Alion,
S. Alonso Monsalve,
M. Alrashed,
C. Alt,
A. Alton,
P. Amedo,
J. Anderson,
C. Andreopoulos,
M. P. Andrews,
F. Andrianala,
S. Andringa,
N. Anfimov,
A. Ankowski,
M. Antonova,
S. Antusch
, et al. (1041 additional authors not shown)
Abstract:
This report describes the conceptual design of the DUNE near detector
This report describes the conceptual design of the DUNE near detector
△ Less
Submitted 25 March, 2021;
originally announced March 2021.
-
Analytic characterization of high dimension weighted special atom spaces
Authors:
Eddy Kwessi,
Geraldo de Souza
Abstract:
Special atom spaces have been around for quite awhile since the introduction of atoms by R. Coifman in his seminal paper who led to another proof that the dual of the Hardy space $H^1$ is in fact the space of functions of bounded means oscillations (BMO). Special atom spaces enjoy quite a few attributes of their own, among which the fact that they have an analytic extension to the unit disc. Recen…
▽ More
Special atom spaces have been around for quite awhile since the introduction of atoms by R. Coifman in his seminal paper who led to another proof that the dual of the Hardy space $H^1$ is in fact the space of functions of bounded means oscillations (BMO). Special atom spaces enjoy quite a few attributes of their own, among which the fact that they have an analytic extension to the unit disc. Recently, an extension of special atom spaces to higher dimensions was proposed, making ripe the possible exploration of the above extension in higher dimensions. In this paper we propose an analytic characterization of special atom spaces in higher dimensions.
△ Less
Submitted 4 February, 2021;
originally announced February 2021.
-
Explicit stabilized multirate method for stiff stochastic differential equations
Authors:
Assyr Abdulle,
Giacomo Rosilho de Souza
Abstract:
Stabilized explicit methods are particularly efficient for large systems of stiff stochastic differential equations (SDEs) due to their extended stability domain. However, they loose their efficiency when a severe stiffness is induced by very few "fast" degrees of freedom, as the stiff and nonstiff terms are evaluated concurrently. Therefore, inspired by [A. Abdulle, M. J. Grote, and G. Rosilho de…
▽ More
Stabilized explicit methods are particularly efficient for large systems of stiff stochastic differential equations (SDEs) due to their extended stability domain. However, they loose their efficiency when a severe stiffness is induced by very few "fast" degrees of freedom, as the stiff and nonstiff terms are evaluated concurrently. Therefore, inspired by [A. Abdulle, M. J. Grote, and G. Rosilho de Souza, Preprint (2020), arXiv:2006.00744] we introduce a stochastic modified equation whose stiffness depends solely on the "slow" terms. By integrating this modified equation with a stabilized explicit scheme we devise a multirate method which overcomes the bottleneck caused by a few severely stiff terms and recovers the efficiency of stabilized schemes for large systems of nonlinear SDEs. The scheme is not based on any scale separation assumption of the SDE and therefore it is employable for problems stemming from the spatial discretization of stochastic parabolic partial differential equations on locally refined grids. The multirate scheme has strong order 1/2, weak order 1 and its stability is proved on a model problem. Numerical experiments confirm the efficiency and accuracy of the scheme.
△ Less
Submitted 12 August, 2021; v1 submitted 28 October, 2020;
originally announced October 2020.
-
Regressor: A C program for Combinatorial Regressions
Authors:
Eduardo M. Vasconcelos,
Adriano Gouveia de Souza
Abstract:
In statistics, researchers use Regression models for data analysis and prediction in many productive sectors (industry, business, academy, etc.). Regression models are mathematical functions representing an approximation of dependent variable $Y$ from n independent variables $X_i \in X$. The literature presents many regression methods divided into single and multiple regressions. There are several…
▽ More
In statistics, researchers use Regression models for data analysis and prediction in many productive sectors (industry, business, academy, etc.). Regression models are mathematical functions representing an approximation of dependent variable $Y$ from n independent variables $X_i \in X$. The literature presents many regression methods divided into single and multiple regressions. There are several procedures to generate regression models and sets of commercial and academic tools that implement these procedures. This work presents one open-source program called Regressor that makes models from a specific variation of polynomial regression. These models relate the independent variables to generate an approximation of the original output dependent data. In many tests, Regressor was able to build models five times more accurate than commercial tools.
△ Less
Submitted 25 September, 2020;
originally announced September 2020.
-
Explicit stabilized multirate method for stiff differential equations
Authors:
Assyr Abdulle,
Marcus J. Grote,
Giacomo Rosilho de Souza
Abstract:
Stabilized Runge-Kutta methods are especially efficient for the numerical solution of large systems of stiff nonlinear differential equations because they are fully explicit. For semi-discrete parabolic problems, for instance, stabilized Runge-Kutta methods overcome the stringent stability condition of standard methods without sacrificing explicitness. However, when stiffness is only induced by a…
▽ More
Stabilized Runge-Kutta methods are especially efficient for the numerical solution of large systems of stiff nonlinear differential equations because they are fully explicit. For semi-discrete parabolic problems, for instance, stabilized Runge-Kutta methods overcome the stringent stability condition of standard methods without sacrificing explicitness. However, when stiffness is only induced by a few components, as in the presence of spatially local mesh refinement, their efficiency deteriorates. To remove the crippling effect of a few severely stiff components on the entire system of differential equations, we derive a modified equation, whose stiffness solely depend on the remaining mildly stiff components. By applying stabilized Runge-Kutta methods to this modified equation, we then devise an explicit multirate Runge-Kutta-Chebyshev (mRKC) method whose stability conditions are independent of a few severely stiff components. Stability of the mRKC method is proved for a model problem, whereas its efficiency and usefulness are demonstrated through a series of numerical experiments.
△ Less
Submitted 4 April, 2022; v1 submitted 1 June, 2020;
originally announced June 2020.
-
Paraconsistentization and many-valued logics
Authors:
Edelcio G. de Souza,
Alexandre Costa-Leite,
Diogo H. B. Dias
Abstract:
This paper shows how to transform explosive many-valued systems into paraconsistent logics. We investigate especially the case of three-valued systems showing how paraconsistent three-valued logics can be obtained from them.
This paper shows how to transform explosive many-valued systems into paraconsistent logics. We investigate especially the case of three-valued systems showing how paraconsistent three-valued logics can be obtained from them.
△ Less
Submitted 11 July, 2022; v1 submitted 28 April, 2020;
originally announced April 2020.