-
Synergistic interplay of morphology and metabolic activity rule response to CAR-T cells in B-cell lymphomas
Authors:
Yifan Chen,
Soukaina Sabir,
Christina Kuttler,
Juan Belmonte-Beitia,
Alvaro Mártínez-Rubio,
Lourdes Martín-Martín,
Lucía López-Corral,
Alejandro Martín-Sancho,
J. Cristobal Cañadas Salazar,
Carlos Montes-Fuentes,
M. Pilar Tamayo-Alonso,
Angel Cedillo,
Pascual Balsalobre,
Pere Barba,
Antonio Pérez-Martínez,
Víctor M. Pérez-García
Abstract:
Cellular immunotherapies are one of the mainstream cancer treatments unveiling the power of the patient's immune system to fight tumors. CAR T-cell therapy, based on genetically engineered T cells, has demonstrated significant potential in treating hematological malignancies, including B-cell lymphomas. This treatment has complex longitudinal dynamics due to the interplay of different T-cell pheno…
▽ More
Cellular immunotherapies are one of the mainstream cancer treatments unveiling the power of the patient's immune system to fight tumors. CAR T-cell therapy, based on genetically engineered T cells, has demonstrated significant potential in treating hematological malignancies, including B-cell lymphomas. This treatment has complex longitudinal dynamics due to the interplay of different T-cell phenotypes (e.g. effector and memory), the expansion of the drug and the cytotoxic effect on both normal and cancerous B-cells, the exhaustion of the immune cells, the tumor immunosupressive environments, and more. Thus, the outcome of the therapy is not yet well understood leading to a variety of responses ranging from sustained complete responses, different types of partial responses, or no response at all. We developed a mechanistic model for the interaction between CAR T- and cancerous B-cells, accounting for the role of the tumor morphology and metabolic status. The simulations showed that lesions with irregular shapes and high proliferation could contribute to long term progression by potentially increasing their immunosuppressive capabilities impairing CAR T-cell efficacy. We analyzed 18F-FDG PET/CT imaging data from 63 relapsed/refractory diffuse large B-cell lymphoma receiving CAR T-cells, quantifying radiomic features including tumor sphericity and lesion aggressiveness through standardized uptake values (SUV). Statistical analyses revealed significant correlations between these metrics and progression-free survival (PFS), emphasizing that individual lesions with complex morphology and elevated metabolism play a critical role in shaping long-term treatment outcomes. We demonstrated the potential of using data-driven mathematical models in finding molecular-imaging based biomarkers to identify lymphoma patients treated with CAR T-cell therapy having higher risk of disease progression.
△ Less
Submitted 17 May, 2025;
originally announced May 2025.
-
Multi-Objective Reinforcement Learning for Water Management
Authors:
Zuzanna Osika,
Roxana Radelescu,
Jazmin Zatarain Salazar,
Frans Oliehoek,
Pradeep K. Murukannaiah
Abstract:
Many real-world problems (e.g., resource management, autonomous driving, drug discovery) require optimizing multiple, conflicting objectives. Multi-objective reinforcement learning (MORL) extends classic reinforcement learning to handle multiple objectives simultaneously, yielding a set of policies that capture various trade-offs. However, the MORL field lacks complex, realistic environments and b…
▽ More
Many real-world problems (e.g., resource management, autonomous driving, drug discovery) require optimizing multiple, conflicting objectives. Multi-objective reinforcement learning (MORL) extends classic reinforcement learning to handle multiple objectives simultaneously, yielding a set of policies that capture various trade-offs. However, the MORL field lacks complex, realistic environments and benchmarks. We introduce a water resource (Nile river basin) management case study and model it as a MORL environment. We then benchmark existing MORL algorithms on this task. Our results show that specialized water management methods outperform state-of-the-art MORL approaches, underscoring the scalability challenges MORL algorithms face in real-world scenarios.
△ Less
Submitted 2 May, 2025;
originally announced May 2025.
-
Data-driven Fuzzy Control for Time-Optimal Aggressive Trajectory Following
Authors:
August Phelps,
Juan Augusto Paredes Salazar,
Ankit Goel
Abstract:
Optimal trajectories that minimize a user-defined cost function in dynamic systems require the solution of a two-point boundary value problem. The optimization process yields an optimal control sequence that depends on the initial conditions and system parameters. However, the optimal sequence may result in undesirable behavior if the system's initial conditions and parameters are erroneous. This…
▽ More
Optimal trajectories that minimize a user-defined cost function in dynamic systems require the solution of a two-point boundary value problem. The optimization process yields an optimal control sequence that depends on the initial conditions and system parameters. However, the optimal sequence may result in undesirable behavior if the system's initial conditions and parameters are erroneous. This work presents a data-driven fuzzy controller synthesis framework that is guided by a time-optimal trajectory for multicopter tracking problems. In particular, we consider an aggressive maneuver consisting of a mid-air flip and generate a time-optimal trajectory by numerically solving the two-point boundary value problem. A fuzzy controller consisting of a stabilizing controller near hover conditions and an autoregressive moving average (ARMA) controller, trained to mimic the time-optimal aggressive trajectory, is constructed using the Takagi-Sugeno fuzzy framework.
△ Less
Submitted 8 April, 2025;
originally announced April 2025.
-
Seeing Forests Through Clouds: Comment on "Recent global temperature surge amplified by record-low planetary albedo" (arXiv:2405.19986)
Authors:
Anastassia M. Makarieva,
Andrei V. Nefiodov,
Antonio D. Nobre,
Luz A. Cuartas,
Paulo Nobre,
Germán Poveda,
José A. Marengo,
Anja Rammig,
Susan A. Masino,
Ugo Bardi,
Juan F. Salazar,
William R. Moomaw,
Scott R. Saleska
Abstract:
Goessling et al. (1) link the record-breaking warming anomaly of 2023 to a global albedo decline due to reduced low-level cloud cover. What caused the reduction remains unclear. Goessling et al. considered several geophysical mechanisms, including ocean surface warming and declining aerosol emissions, but did not discuss the biosphere. We propose that disruption of global biospheric functioning co…
▽ More
Goessling et al. (1) link the record-breaking warming anomaly of 2023 to a global albedo decline due to reduced low-level cloud cover. What caused the reduction remains unclear. Goessling et al. considered several geophysical mechanisms, including ocean surface warming and declining aerosol emissions, but did not discuss the biosphere. We propose that disruption of global biospheric functioning could be a cause, as supported by three lines of evidence that have not yet been jointly considered.
△ Less
Submitted 28 January, 2025;
originally announced January 2025.
-
Humanity's Last Exam
Authors:
Long Phan,
Alice Gatti,
Ziwen Han,
Nathaniel Li,
Josephina Hu,
Hugh Zhang,
Chen Bo Calvin Zhang,
Mohamed Shaaban,
John Ling,
Sean Shi,
Michael Choi,
Anish Agrawal,
Arnav Chopra,
Adam Khoja,
Ryan Kim,
Richard Ren,
Jason Hausenloy,
Oliver Zhang,
Mantas Mazeika,
Dmitry Dodonov,
Tung Nguyen,
Jaeho Lee,
Daron Anderson,
Mikhail Doroshenko,
Alun Cennyth Stokes
, et al. (1084 additional authors not shown)
Abstract:
Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90\% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of…
▽ More
Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90\% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of human knowledge, designed to be the final closed-ended academic benchmark of its kind with broad subject coverage. HLE consists of 2,500 questions across dozens of subjects, including mathematics, humanities, and the natural sciences. HLE is developed globally by subject-matter experts and consists of multiple-choice and short-answer questions suitable for automated grading. Each question has a known solution that is unambiguous and easily verifiable, but cannot be quickly answered via internet retrieval. State-of-the-art LLMs demonstrate low accuracy and calibration on HLE, highlighting a significant gap between current LLM capabilities and the expert human frontier on closed-ended academic questions. To inform research and policymaking upon a clear understanding of model capabilities, we publicly release HLE at https://lastexam.ai.
△ Less
Submitted 19 April, 2025; v1 submitted 24 January, 2025;
originally announced January 2025.
-
Adaptive Numerical Differentiation for Extremum Seeking with Sensor Noise
Authors:
Shashank Verma,
Juan Augusto Paredes Salazar,
Jhon Manuel Portella Delgado,
Ankit Goel,
Dennis S. Bernstein
Abstract:
Extremum-seeking control (ESC) is widely used to optimize performance when the system dynamics are uncertain. However, sensitivity to sensor noise is an important issue in ESC implementation due to the use of high-pass filters or gradient estimators. To reduce the sensitivity of ESC to noise, this paper investigates the use of adaptive input and state estimation (AISE) for numerical differentiatio…
▽ More
Extremum-seeking control (ESC) is widely used to optimize performance when the system dynamics are uncertain. However, sensitivity to sensor noise is an important issue in ESC implementation due to the use of high-pass filters or gradient estimators. To reduce the sensitivity of ESC to noise, this paper investigates the use of adaptive input and state estimation (AISE) for numerical differentiation. In particular, this paper develops extremum-seeking control with adaptive input and state estimation (ESC/AISE), where the high-pass filter of ESC is replaced by AISE to improve performance under sensor noise. The effectiveness of ESC/AISE is illustrated via numerical examples.
△ Less
Submitted 7 January, 2025;
originally announced January 2025.
-
Long-Form Speech Generation with Spoken Language Models
Authors:
Se Jin Park,
Julian Salazar,
Aren Jansen,
Keisuke Kinoshita,
Yong Man Ro,
RJ Skerry-Ryan
Abstract:
We consider the generative modeling of speech over multiple minutes, a requirement for long-form multimedia generation and audio-native voice assistants. However, current spoken language models struggle to generate plausible speech past tens of seconds, from high temporal resolution of speech tokens causing loss of coherence, to architectural issues with long-sequence training or extrapolation, to…
▽ More
We consider the generative modeling of speech over multiple minutes, a requirement for long-form multimedia generation and audio-native voice assistants. However, current spoken language models struggle to generate plausible speech past tens of seconds, from high temporal resolution of speech tokens causing loss of coherence, to architectural issues with long-sequence training or extrapolation, to memory costs at inference time. With these considerations we propose SpeechSSM, the first speech language model to learn from and sample long-form spoken audio (e.g., 16 minutes of read or extemporaneous speech) in a single decoding session without text intermediates, based on recent advances in linear-time sequence modeling. Furthermore, to address growing challenges in spoken language evaluation, especially in this new long-form setting, we propose: new embedding-based and LLM-judged metrics; quality measurements over length and time; and a new benchmark for long-form speech processing and generation, LibriSpeech-Long. Speech samples and the dataset are released at https://google.github.io/tacotron/publications/speechssm/
△ Less
Submitted 24 December, 2024;
originally announced December 2024.
-
Zero-Shot Mono-to-Binaural Speech Synthesis
Authors:
Alon Levkovitch,
Julian Salazar,
Soroosh Mariooryad,
RJ Skerry-Ryan,
Nadav Bar,
Bastiaan Kleijn,
Eliya Nachmani
Abstract:
We present ZeroBAS, a neural method to synthesize binaural audio from monaural audio recordings and positional information without training on any binaural data. To our knowledge, this is the first published zero-shot neural approach to mono-to-binaural audio synthesis. Specifically, we show that a parameter-free geometric time warping and amplitude scaling based on source location suffices to get…
▽ More
We present ZeroBAS, a neural method to synthesize binaural audio from monaural audio recordings and positional information without training on any binaural data. To our knowledge, this is the first published zero-shot neural approach to mono-to-binaural audio synthesis. Specifically, we show that a parameter-free geometric time warping and amplitude scaling based on source location suffices to get an initial binaural synthesis that can be refined by iteratively applying a pretrained denoising vocoder. Furthermore, we find this leads to generalization across room conditions, which we measure by introducing a new dataset, TUT Mono-to-Binaural, to evaluate state-of-the-art monaural-to-binaural synthesis methods on unseen conditions. Our zero-shot method is perceptually on-par with the performance of supervised methods on the standard mono-to-binaural dataset, and even surpasses them on our out-of-distribution TUT Mono-to-Binaural dataset. Our results highlight the potential of pretrained generative audio models and zero-shot learning to unlock robust binaural audio synthesis.
△ Less
Submitted 28 May, 2025; v1 submitted 11 December, 2024;
originally announced December 2024.
-
Relativistic dissipative fluids in the trace-fixed particle frame: Strongly hyperbolic quasi-linear first-order evolution equations
Authors:
J. Félix Salazar,
Ana Laura García-Perciante,
Olivier Sarbach
Abstract:
In this paper we derive a new first-order theory of relativistic dissipative fluids by adopting the trace-fixed particle frame. Whereas in a companion letter we show that this theory is hyperbolic, causal and stable at global equilibrium states, here we prove that the full nonlinear system of equations can be cast into a first-order quasilinear system which is strongly hyperbolic. By rewriting the…
▽ More
In this paper we derive a new first-order theory of relativistic dissipative fluids by adopting the trace-fixed particle frame. Whereas in a companion letter we show that this theory is hyperbolic, causal and stable at global equilibrium states, here we prove that the full nonlinear system of equations can be cast into a first-order quasilinear system which is strongly hyperbolic. By rewriting the system in first-order form, auxiliary constraints are introduced. However, we show that these constraints propagate, and thus our theory leads to a well-posed Cauchy problem.
△ Less
Submitted 28 April, 2025; v1 submitted 4 December, 2024;
originally announced December 2024.
-
Relativistic dissipative fluids in the trace-fixed particle frame: Hyperbolicity, causality, and stability
Authors:
J. Félix Salazar,
Ana Laura García-Perciante,
Olivier Sarbach
Abstract:
We propose a first-order theory of relativistic dissipative fluids in the trace-fixed particle frame, which is similar to Eckart's frame except that the temperature is determined by fixing the trace of the stress-energy tensor. Our theory is hyperbolic and causal provided a single inequality holds. For low wave numbers, the expected damped modes in the shear, acoustic, and heat diffusion channels…
▽ More
We propose a first-order theory of relativistic dissipative fluids in the trace-fixed particle frame, which is similar to Eckart's frame except that the temperature is determined by fixing the trace of the stress-energy tensor. Our theory is hyperbolic and causal provided a single inequality holds. For low wave numbers, the expected damped modes in the shear, acoustic, and heat diffusion channels are recovered. Stability of global equilibria with respect to all wave numbers is also analyzed. The conditions for hyperbolicity, causality and stability are satisfied for a simple gas of hard spheres or disks.
△ Less
Submitted 28 April, 2025; v1 submitted 4 December, 2024;
originally announced December 2024.
-
Prompting with Phonemes: Enhancing LLMs' Multilinguality for Non-Latin Script Languages
Authors:
Hoang H Nguyen,
Khyati Mahajan,
Vikas Yadav,
Julian Salazar,
Philip S. Yu,
Masoud Hashemi,
Rishabh Maheshwary
Abstract:
Although multilingual LLMs have achieved remarkable performance across benchmarks, we find they continue to underperform on non-Latin script languages across contemporary LLM families. This discrepancy arises from the fact that LLMs are pretrained with orthographic scripts, which are dominated by Latin characters that obscure their shared phonology with non-Latin scripts. We propose leveraging pho…
▽ More
Although multilingual LLMs have achieved remarkable performance across benchmarks, we find they continue to underperform on non-Latin script languages across contemporary LLM families. This discrepancy arises from the fact that LLMs are pretrained with orthographic scripts, which are dominated by Latin characters that obscure their shared phonology with non-Latin scripts. We propose leveraging phonemic transcriptions as complementary signals to induce script-invariant representations. Our study demonstrates that integrating phonemic signals improves performance across both non-Latin and Latin script languages, with a particularly significant impact on closing the performance gap between the two. Through detailed experiments, we show that phonemic and orthographic scripts retrieve distinct examples for in-context learning (ICL). This motivates our proposed Mixed-ICL retrieval strategy, where further aggregation from both leads to our significant performance improvements for both Latin script languages (up to 12.6%) and non-Latin script languages (up to 15.1%) compared to randomized ICL retrieval.
△ Less
Submitted 6 March, 2025; v1 submitted 4 November, 2024;
originally announced November 2024.
-
Robust and Unbounded Length Generalization in Autoregressive Transformer-Based Text-to-Speech
Authors:
Eric Battenberg,
RJ Skerry-Ryan,
Daisy Stanton,
Soroosh Mariooryad,
Matt Shannon,
Julian Salazar,
David Kao
Abstract:
Autoregressive (AR) Transformer-based sequence models are known to have difficulty generalizing to sequences longer than those seen during training. When applied to text-to-speech (TTS), these models tend to drop or repeat words or produce erratic output, especially for longer utterances. In this paper, we introduce enhancements aimed at AR Transformer-based encoder-decoder TTS systems that addres…
▽ More
Autoregressive (AR) Transformer-based sequence models are known to have difficulty generalizing to sequences longer than those seen during training. When applied to text-to-speech (TTS), these models tend to drop or repeat words or produce erratic output, especially for longer utterances. In this paper, we introduce enhancements aimed at AR Transformer-based encoder-decoder TTS systems that address these robustness and length generalization issues. Our approach uses an alignment mechanism to provide cross-attention operations with relative location information. The associated alignment position is learned as a latent property of the model via backpropagation and requires no external alignment information during training. While the approach is tailored to the monotonic nature of TTS input-output alignment, it is still able to benefit from the flexible modeling power of interleaved multi-head self- and cross-attention operations. A system incorporating these improvements, which we call Very Attentive Tacotron, matches the naturalness and expressiveness of a baseline T5-based TTS system, while eliminating problems with repeated or dropped words and enabling generalization to any practical utterance length.
△ Less
Submitted 11 March, 2025; v1 submitted 29 October, 2024;
originally announced October 2024.
-
MPC-guided, Data-driven Fuzzy Controller Synthesis
Authors:
Juan Augusto Paredes Salazar,
Ankit Goel
Abstract:
Model predictive control (MPC) is a powerful control technique for online optimization using system model-based predictions over a finite time horizon. However, the computational cost MPC requires can be prohibitive in resource-constrained computer systems. This paper presents a fuzzy controller synthesis framework guided by MPC. In the proposed framework, training data is obtained from MPC closed…
▽ More
Model predictive control (MPC) is a powerful control technique for online optimization using system model-based predictions over a finite time horizon. However, the computational cost MPC requires can be prohibitive in resource-constrained computer systems. This paper presents a fuzzy controller synthesis framework guided by MPC. In the proposed framework, training data is obtained from MPC closed-loop simulations and is used to optimize a low computational complexity controller to emulate the response of MPC. In particular, autoregressive moving average (ARMA) controllers are trained using data obtained from MPC closed-loop simulations, such that each ARMA controller emulates the response of the MPC controller under particular desired conditions. Using a Takagi-Sugeno (T-S) fuzzy system, the responses of all the trained ARMA controllers are then weighted depending on the measured system conditions, resulting in the Fuzzy-Autoregressive Moving Average (F-ARMA) controller. The effectiveness of the trained F-ARMA controllers is illustrated via numerical examples.
△ Less
Submitted 2 December, 2024; v1 submitted 9 October, 2024;
originally announced October 2024.
-
Image-Based Leopard Seal Recognition: Approaches and Challenges in Current Automated Systems
Authors:
Jorge Yero Salazar,
Pablo Rivas,
Renato Borras-Chavez,
Sarah Kienle
Abstract:
This paper examines the challenges and advancements in recognizing seals within their natural habitats using conventional photography, underscored by the emergence of machine learning technologies. We used the leopard seal, \emph{Hydrurga leptonyx}, a key species within Antarctic ecosystems, to review the different available methods found. As apex predators, Leopard seals are characterized by thei…
▽ More
This paper examines the challenges and advancements in recognizing seals within their natural habitats using conventional photography, underscored by the emergence of machine learning technologies. We used the leopard seal, \emph{Hydrurga leptonyx}, a key species within Antarctic ecosystems, to review the different available methods found. As apex predators, Leopard seals are characterized by their significant ecological role and elusive nature so studying them is crucial to understand the health of their ecosystem. Traditional methods of monitoring seal species are often constrained by the labor-intensive and time-consuming processes required for collecting data, compounded by the limited insights these methods provide. The advent of machine learning, particularly through the application of vision transformers, heralds a new era of efficiency and precision in species monitoring. By leveraging state-of-the-art approaches in detection, segmentation, and recognition within digital imaging, this paper presents a synthesis of the current landscape, highlighting both the cutting-edge methodologies and the predominant challenges faced in accurately identifying seals through photographic data.
△ Less
Submitted 13 August, 2024;
originally announced August 2024.
-
Design, Construction, and Test of Compact, Distributed-Charge, X-Band Accelerator Systems that Enable Image-Guided, VHEE FLASH Radiotherapy
Authors:
Christopher P. J. Barty,
J. Martin Algots,
Alexander J. Amador,
James C. R. Barty,
Shawn M. Betts,
Marcelo A. Castañeda,
Matthew M. Chu,
Michael E. Daley,
Ricardo A. De Luna Lopez,
Derek A. Diviak,
Haytham H. Effarah,
Roberto Feliciano,
Adan Garcia,
Keith J. Grabiel,
Alex S. Griffin,
Frederic V. Hartemann,
Leslie Heid,
Yoonwoo Hwang,
Gennady Imeshev,
Michael Jentschel,
Christopher A. Johnson,
Kenneth W. Kinosian,
Agnese Lagzda,
Russell J. Lochrie,
Michael W. May
, et al. (18 additional authors not shown)
Abstract:
The design and optimization of laser-Compton x-ray systems based on compact distributed charge accelerator structures can enable micron-scale imaging of disease and the concomitant production of beams of Very High Energy Electrons (VHEEs) capable of producing FLASH-relevant dose rates. The physics of laser-Compton x-ray scattering ensures that the scattered x-rays follow exactly the trajectory of…
▽ More
The design and optimization of laser-Compton x-ray systems based on compact distributed charge accelerator structures can enable micron-scale imaging of disease and the concomitant production of beams of Very High Energy Electrons (VHEEs) capable of producing FLASH-relevant dose rates. The physics of laser-Compton x-ray scattering ensures that the scattered x-rays follow exactly the trajectory of the incident electrons, thus providing a route to image-guided, VHEE FLASH radiotherapy. The keys to a compact architecture capable of producing both laser-Compton x-rays and VHEEs are the use of X-band RF accelerator structures which have been demonstrated to operate with over 100 MeV/m acceleration gradients. The operation of these structures in a distributed charge mode in which each radiofrequency (RF) cycle of the drive RF pulse is filled with a low-charge, high-brightness electron bunch is enabled by the illumination of a high-brightness photogun with a train of UV laser pulses synchronized to the frequency of the underlying accelerator system. The UV pulse trains are created by a patented pulse synthesis approach which utilizes the RF clock of the accelerator to phase and amplitude modulate a narrow band continuous wave (CW) seed laser. In this way it is possible to produce up to 10 $μ$A of average beam current from the accelerator. Such high current from a compact accelerator enables production of sufficient x-rays via laser-Compton scattering for clinical imaging and does so from a machine of "clinical" footprint. At the same time, the production of 1000 or greater individual micro-bunches per RF pulse enables > 10 nC of charge to be produced in a macrobunch of < 100 ns. The design, construction, and test of the 100-MeV class prototype system in Irvine, CA is also presented.
△ Less
Submitted 2 January, 2025; v1 submitted 7 August, 2024;
originally announced August 2024.
-
What Lies beyond the Pareto Front? A Survey on Decision-Support Methods for Multi-Objective Optimization
Authors:
Zuzanna Osika,
Jazmin Zatarain Salazar,
Diederik M. Roijers,
Frans A. Oliehoek,
Pradeep K. Murukannaiah
Abstract:
We present a review that unifies decision-support methods for exploring the solutions produced by multi-objective optimization (MOO) algorithms. As MOO is applied to solve diverse problems, approaches for analyzing the trade-offs offered by MOO algorithms are scattered across fields. We provide an overview of the advances on this topic, including methods for visualization, mining the solution set,…
▽ More
We present a review that unifies decision-support methods for exploring the solutions produced by multi-objective optimization (MOO) algorithms. As MOO is applied to solve diverse problems, approaches for analyzing the trade-offs offered by MOO algorithms are scattered across fields. We provide an overview of the advances on this topic, including methods for visualization, mining the solution set, and uncertainty exploration as well as emerging research directions, including interactivity, explainability, and ethics. We synthesize these methods drawing from different fields of research to build a unified approach, independent of the application. Our goals are to reduce the entry barrier for researchers and practitioners on using MOO algorithms and to provide novel research directions.
△ Less
Submitted 19 November, 2023;
originally announced November 2023.
-
End-to-End Test Coverage Metrics in Microservice Systems: An Automated Approach
Authors:
Amr Elsayed,
Tomas Cerny,
Jorge Yero Salazar,
Austin Lehman,
Joshua Hunter,
Ashley Bickham,
Davide Taibi
Abstract:
Microservice architecture gains momentum by fueling systems with cloud-native benefits, scalability, and decentralized evolution. However, new challenges emerge for end-to-end (E2E) testing. Testers who see the decentralized system through the user interface might assume their tests are comprehensive, covering all middleware endpoints scattered across microservices. However, they do not have instr…
▽ More
Microservice architecture gains momentum by fueling systems with cloud-native benefits, scalability, and decentralized evolution. However, new challenges emerge for end-to-end (E2E) testing. Testers who see the decentralized system through the user interface might assume their tests are comprehensive, covering all middleware endpoints scattered across microservices. However, they do not have instruments to verify such assumptions. This paper introduces test coverage metrics for evaluating the extent of E2E test suite coverage for microservice endpoints. Next, it presents an automated approach to compute these metrics to provide feedback on the completeness of E2E test suites. Furthermore, a visual perspective is provided to highlight test coverage across the system's microservices to guide on gaps in test suites. We implement a proof-of-concept tool and perform a case study on a well-established system benchmark showing it can generate conclusive feedback on test suite coverage over system endpoints.
△ Less
Submitted 17 August, 2023;
originally announced August 2023.
-
Suppression of Chaotic Motion of Tethered Satellite Systems Using Tether Length Control
Authors:
Francisco J. T. Salazar,
Antonio F. B. A. Prado
Abstract:
This study focuses on attitude and control motion of two bodies (a base-satellite and a sub-satellite) connected by an inextensible and massless tether in a circular orbit under the influence of the Earths gravitational force. The base-satellite is assumed to be far more heavier than the sub-satellite. In such cases, the base-satellite is regarded as the reference spacecraft. Because of the comple…
▽ More
This study focuses on attitude and control motion of two bodies (a base-satellite and a sub-satellite) connected by an inextensible and massless tether in a circular orbit under the influence of the Earths gravitational force. The base-satellite is assumed to be far more heavier than the sub-satellite. In such cases, the base-satellite is regarded as the reference spacecraft. Because of the complexity of the problem, no thrusters on the sub-satellite are considered, and the effect of atmospheric drag, Earths oblateness, and electrodynamic force on the spacecraft are neglected.
△ Less
Submitted 25 July, 2023;
originally announced July 2023.
-
Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM
Authors:
Eliya Nachmani,
Alon Levkovitch,
Roy Hirsch,
Julian Salazar,
Chulayuth Asawaroengchai,
Soroosh Mariooryad,
Ehud Rivlin,
RJ Skerry-Ryan,
Michelle Tadmor Ramanovich
Abstract:
We present Spectron, a novel approach to adapting pre-trained large language models (LLMs) to perform spoken question answering (QA) and speech continuation. By endowing the LLM with a pre-trained speech encoder, our model becomes able to take speech inputs and generate speech outputs. The entire system is trained end-to-end and operates directly on spectrograms, simplifying our architecture. Key…
▽ More
We present Spectron, a novel approach to adapting pre-trained large language models (LLMs) to perform spoken question answering (QA) and speech continuation. By endowing the LLM with a pre-trained speech encoder, our model becomes able to take speech inputs and generate speech outputs. The entire system is trained end-to-end and operates directly on spectrograms, simplifying our architecture. Key to our approach is a training objective that jointly supervises speech recognition, text continuation, and speech synthesis using only paired speech-text pairs, enabling a `cross-modal' chain-of-thought within a single decoding pass. Our method surpasses existing spoken language models in speaker preservation and semantic coherence. Furthermore, the proposed model improves upon direct initialization in retaining the knowledge of the original LLM as demonstrated through spoken QA datasets. We release our audio samples (https://michelleramanovich.github.io/spectron/spectron) and spoken QA dataset (https://github.com/google-research-datasets/LLAMA1-Test-Set).
△ Less
Submitted 30 May, 2024; v1 submitted 24 May, 2023;
originally announced May 2023.
-
Zero-Shot End-to-End Spoken Language Understanding via Cross-Modal Selective Self-Training
Authors:
Jianfeng He,
Julian Salazar,
Kaisheng Yao,
Haoqi Li,
Jinglun Cai
Abstract:
End-to-end (E2E) spoken language understanding (SLU) is constrained by the cost of collecting speech-semantics pairs, especially when label domains change. Hence, we explore \textit{zero-shot} E2E SLU, which learns E2E SLU without speech-semantics pairs, instead using only speech-text and text-semantics pairs. Previous work achieved zero-shot by pseudolabeling all speech-text transcripts with a na…
▽ More
End-to-end (E2E) spoken language understanding (SLU) is constrained by the cost of collecting speech-semantics pairs, especially when label domains change. Hence, we explore \textit{zero-shot} E2E SLU, which learns E2E SLU without speech-semantics pairs, instead using only speech-text and text-semantics pairs. Previous work achieved zero-shot by pseudolabeling all speech-text transcripts with a natural language understanding (NLU) model learned on text-semantics corpora. However, this method requires the domains of speech-text and text-semantics to match, which often mismatch due to separate collections. Furthermore, using the entire collected speech-text corpus from any domains leads to \textit{imbalance} and \textit{noise} issues. To address these, we propose \textit{cross-modal selective self-training} (CMSST). CMSST tackles imbalance by clustering in a joint space of the three modalities (speech, text, and semantics) and handles label noise with a selection network. We also introduce two benchmarks for zero-shot E2E SLU, covering matched and found speech (mismatched) settings. Experiments show that CMSST improves performance in both two settings, with significantly reduced sample sizes and training time. Our code and data are released in https://github.com/amazon-science/zero-shot-E2E-slu.
△ Less
Submitted 2 February, 2024; v1 submitted 22 May, 2023;
originally announced May 2023.
-
Coronal Heating as Determined by the Solar Flare Frequency Distribution Obtained by Aggregating Case Studies
Authors:
James Paul Mason,
Alexandra Werth,
Colin G. West,
Allison A. Youngblood,
Donald L. Woodraska,
Courtney Peck,
Kevin Lacjak,
Florian G. Frick,
Moutamen Gabir,
Reema A. Alsinan,
Thomas Jacobsen,
Mohammad Alrubaie,
Kayla M. Chizmar,
Benjamin P. Lau,
Lizbeth Montoya Dominguez,
David Price,
Dylan R. Butler,
Connor J. Biron,
Nikita Feoktistov,
Kai Dewey,
N. E. Loomis,
Michal Bodzianowski,
Connor Kuybus,
Henry Dietrick,
Aubrey M. Wolfe
, et al. (977 additional authors not shown)
Abstract:
Flare frequency distributions represent a key approach to addressing one of the largest problems in solar and stellar physics: determining the mechanism that counter-intuitively heats coronae to temperatures that are orders of magnitude hotter than the corresponding photospheres. It is widely accepted that the magnetic field is responsible for the heating, but there are two competing mechanisms th…
▽ More
Flare frequency distributions represent a key approach to addressing one of the largest problems in solar and stellar physics: determining the mechanism that counter-intuitively heats coronae to temperatures that are orders of magnitude hotter than the corresponding photospheres. It is widely accepted that the magnetic field is responsible for the heating, but there are two competing mechanisms that could explain it: nanoflares or Alfvén waves. To date, neither can be directly observed. Nanoflares are, by definition, extremely small, but their aggregate energy release could represent a substantial heating mechanism, presuming they are sufficiently abundant. One way to test this presumption is via the flare frequency distribution, which describes how often flares of various energies occur. If the slope of the power law fitting the flare frequency distribution is above a critical threshold, $α=2$ as established in prior literature, then there should be a sufficient abundance of nanoflares to explain coronal heating. We performed $>$600 case studies of solar flares, made possible by an unprecedented number of data analysts via three semesters of an undergraduate physics laboratory course. This allowed us to include two crucial, but nontrivial, analysis methods: pre-flare baseline subtraction and computation of the flare energy, which requires determining flare start and stop times. We aggregated the results of these analyses into a statistical study to determine that $α= 1.63 \pm 0.03$. This is below the critical threshold, suggesting that Alfvén waves are an important driver of coronal heating.
△ Less
Submitted 9 May, 2023;
originally announced May 2023.
-
A minimal axion model for mass matrices with five texture-zeros
Authors:
Yithsbey Giraldo,
R. Martínez,
Eduardo Rojas,
Juan C. Salazar
Abstract:
A model with fermion and scalar fields charged under a Peccei-Queen~(PQ) symmetry is proposed. The PQ charges are chosen in such a way that they can reproduce mass matrices with five texture zeros, {which can generate} the fermion masses, the CKM matrix, and the PMNS matrix of the Standard Model~(SM). To obtain this result, at least 4~Higgs doublets are needed. As we will see in the manuscript thi…
▽ More
A model with fermion and scalar fields charged under a Peccei-Queen~(PQ) symmetry is proposed. The PQ charges are chosen in such a way that they can reproduce mass matrices with five texture zeros, {which can generate} the fermion masses, the CKM matrix, and the PMNS matrix of the Standard Model~(SM). To obtain this result, at least 4~Higgs doublets are needed. As we will see in the manuscript this is a highly non-trivial result since the texture zeros of the mass matrices impose a large number of restrictions. This model shows a route to understand the different scales of the SM by extending it with a multi-Higgs sector and an additional PQ symmetry. Since the PQ charges are not universal, the model presents flavor-changing neutral currents~(FCNC) at the tree level, a feature that constitutes the main source of restrictions on the parameter space. We report the allowed regions by lepton decays and compare them with those coming from the semileptonic decays $K^{\pm}\longrightarrow π\barνν$. We also show the excluded regions and the projected bounds of future experiments for the axion-photon coupling as a function of the axion mass and compare it with the parameter space of our model.
△ Less
Submitted 14 April, 2023;
originally announced April 2023.
-
Sittin'On the Dock of the (WiFi) Bay: On the Frame Aggregation under IEEE 802.11 DCF
Authors:
Ricardo J. Rodríguez,
José Luis Salazar,
Julián Fernández-Navajas
Abstract:
It is well known that frame aggregation in Internet communications improves transmission efficiency. However, it also causes a delay that for some real-time communications is inappropriate, thus creating a trade-off between efficiency and delay. In this paper, we establish the conditions for frame aggregation under the IEEE 802.11 DCF protocol to be beneficial on average delay. To do so, we first…
▽ More
It is well known that frame aggregation in Internet communications improves transmission efficiency. However, it also causes a delay that for some real-time communications is inappropriate, thus creating a trade-off between efficiency and delay. In this paper, we establish the conditions for frame aggregation under the IEEE 802.11 DCF protocol to be beneficial on average delay. To do so, we first describe the transmission time in IEEE 802.11 in a stochastic framework and then we calculate the optimal value of the frames that, when aggregated, saves transmission time in the long term. Our findings, discussed with numerical experimentation, show that frame aggregation reduces transmission congestion and transmission delays.
△ Less
Submitted 7 November, 2022;
originally announced November 2022.
-
Local thermodynamical equilibrium and relativistic dissipation
Authors:
J Félix Salazar,
Thomas Zannias
Abstract:
We introduce a class of relativistic fluid states satisfying the relativistic local thermodynamical equilibrium postulate (abbreviated as relativistic (LTE) postulate). States satisfying this postulate, are states "near equilibrium" (a term defined precisely in the course of the paper) and permit us to attach a fictitious "local thermodynamical equilibrium" state that fits event by event the actua…
▽ More
We introduce a class of relativistic fluid states satisfying the relativistic local thermodynamical equilibrium postulate (abbreviated as relativistic (LTE) postulate). States satisfying this postulate, are states "near equilibrium" (a term defined precisely in the course of the paper) and permit us to attach a fictitious "local thermodynamical equilibrium" state that fits event by event the actual fluid state. They single out an admissible class of rest frames relative to which thermodynamical variables like the energy density, thermodynamical pressure, stresses, particle number density (or densities) measured by observers at rest relative to these frames are becoming frame independent provided second (or higher) order deviations from the fictitious state of "local thermodynamical equilibrium" are ignored. We have verified this property for a large class of theories of relativistic dissipation that include the Hiscock-Lindblom class of first order theories, the Eckart and Landau-Lifshitz theories, the Israel-Stewart transient thermodynamics, the Liu-Müller-Ruggeri theory, fluids of divergence type and the latest developed (BDNK) theory. Moreover, the phenomenological equations describing first order deviations from the fictitious "local thermodynamical equilibrium" state satisfy equations that remain form invariant under change of frame within the class of admissible frames. We proved this property for the Hiscock-Lindblom class of first order theories the Eckart and Landau-Lifshitz theories, the Israel-Stewart transient thermodynamics and the Liu-Müller-Ruggeri theory of relativistic dissipation the (BDNK) theory and we expect that the same property to hold for the class of relativistic fluids of divergence type.
△ Less
Submitted 18 October, 2022;
originally announced October 2022.
-
Meta-Learning the Difference: Preparing Large Language Models for Efficient Adaptation
Authors:
Zejiang Hou,
Julian Salazar,
George Polovets
Abstract:
Large pretrained language models (PLMs) are often domain- or task-adapted via fine-tuning or prompting. Finetuning requires modifying all of the parameters and having enough data to avoid overfitting while prompting requires no training and few examples but limits performance. Instead, we prepare PLMs for data- and parameter-efficient adaptation by learning to learn the difference between general…
▽ More
Large pretrained language models (PLMs) are often domain- or task-adapted via fine-tuning or prompting. Finetuning requires modifying all of the parameters and having enough data to avoid overfitting while prompting requires no training and few examples but limits performance. Instead, we prepare PLMs for data- and parameter-efficient adaptation by learning to learn the difference between general and adapted PLMs. This difference is expressed in terms of model weights and sublayer structure through our proposed dynamic low-rank reparameterization and learned architecture controller. Experiments on few-shot dialogue completion, low-resource abstractive summarization, and multi-domain language modeling show improvements in adaptation time and performance over direct finetuning or preparation via domain-adaptive pretraining. Ablations show our task-adaptive reparameterization (TARP) and model search (TAMS) components individually improve on other parameter-efficient transfer like adapters and structure-learning methods like learned sparsification.
△ Less
Submitted 7 July, 2022;
originally announced July 2022.
-
De Sitter-invariant approach to cosmology
Authors:
A. V. Araujo,
D. F. López,
J. G. Pereira,
J. R. Salazar
Abstract:
The spacetime short-distance structure at the Planck scale is governed by the Planck length, usually interpreted as a three-dimensional Euclidian length. As such, it is not Lorentz invariant and clashes with Einstein's special relativity, which is thus unable to describe the Planck scale kinematics. The solution to this problem is twofold. First, one has to re-interpret the Planck length as a Lore…
▽ More
The spacetime short-distance structure at the Planck scale is governed by the Planck length, usually interpreted as a three-dimensional Euclidian length. As such, it is not Lorentz invariant and clashes with Einstein's special relativity, which is thus unable to describe the Planck scale kinematics. The solution to this problem is twofold. First, one has to re-interpret the Planck length as a Lorentz invariant four-dimensional pseudo-length. Second, to comply with a non-vanishing cosmological term~$Λ$, one has to replace the standard Poincaré-invariant special relativity with the de Sitter-invariant special relativity. Since the Planck pseudo-length does not clash with the de Sitter-invariant special relativity, it provides a consistent description of the Planck scale kinematics in the presence of~$Λ$. Under the above replacement, general relativity changes to the de Sitter-invariant general relativity, in which~$Λ$ is constitutive. In this paper, the ensuing Friedmann equations are derived, and some implications for cosmology are explored and discussed.
△ Less
Submitted 20 March, 2024; v1 submitted 9 March, 2022;
originally announced March 2022.
-
A Novel Assistive Controller Based on Differential Geometry for Users of the Differential-Drive Wheeled Mobile Robots
Authors:
Seyed Amir Tafrishi,
Ankit A. Ravankar,
Jose Salazar,
Yasuhisa Hirata
Abstract:
Certain wheeled mobile robots e.g., electric wheelchairs, can operate through indirect joystick controls from users. Correct steering angle becomes essential when the user should determine the vehicle direction and velocity, in particular for differential wheeled vehicles since the vehicle velocity and direction are controlled with only two actuating wheels. This problem gets more challenging when…
▽ More
Certain wheeled mobile robots e.g., electric wheelchairs, can operate through indirect joystick controls from users. Correct steering angle becomes essential when the user should determine the vehicle direction and velocity, in particular for differential wheeled vehicles since the vehicle velocity and direction are controlled with only two actuating wheels. This problem gets more challenging when complex curves should be realized by the user. A novel assistive controller with safety constraints is needed to address these problems. Also, the classic control methods mostly require the desired states beforehand which completely contradicts human's spontaneous decisions on the desired location to go. In this work, we develop a novel assistive control strategy based on differential geometry relying on only joystick inputs and vehicle states where the controller does not require any desired states. We begin with explaining the vehicle kinematics and our designed Darboux frame kinematics on a contact point of a virtual wheel and plane. Next, the geometric controller using the Darboux frame kinematics is designed for having smooth trajectories under certain safety constraints. We experiment our approach with different participants and evaluate its performance in various routes.
△ Less
Submitted 4 February, 2022;
originally announced February 2022.
-
Quantum field theory in a de Sitter universe transiting to the radiation stage
Authors:
Juan R. Salazar,
Sujoy K. Modak
Abstract:
We study some physical aspects of quantum field theory in a two stage universe starting from the inflationary de Sitter and transiting into the radiation dominated stage. We look into the time evolution of the primordial vacuum states, associated with the (i) comoving and (ii) Bunch-Davies modes. We show how the power spectrum for a comoving observer, obtained from the excitation of the aforementi…
▽ More
We study some physical aspects of quantum field theory in a two stage universe starting from the inflationary de Sitter and transiting into the radiation dominated stage. We look into the time evolution of the primordial vacuum states, associated with the (i) comoving and (ii) Bunch-Davies modes. We show how the power spectrum for a comoving observer, obtained from the excitation of the aforementioned states defined in the de Sitter stage, changes as the universe transits into the radiation stage. In addition, we also develop a methodology to transfer the well known result of particle creation in the static de Sitter frame, originating from the aforementioned vacuum states, while the universe makes a transition to the next (radiation dominated) stage.
△ Less
Submitted 26 October, 2021;
originally announced October 2021.
-
QBoost for regression problems: solving partial differential equations
Authors:
Caio B. D. Góes,
Thiago O. Maciel,
Giovani G. Pollachini,
Rafael Cuenca,
Juan P. L. C. Salazar,
Eduardo I. Duzzioni
Abstract:
A hybrid algorithm based on machine learning and quantum ensemble learning is proposed that is capable of finding a solution to a partial differential equation with good precision and favorable scaling in the required number of qubits. The classical part is composed by training several regressors (weak-learners), capable of solving a partial differential equation using machine learning. The quantu…
▽ More
A hybrid algorithm based on machine learning and quantum ensemble learning is proposed that is capable of finding a solution to a partial differential equation with good precision and favorable scaling in the required number of qubits. The classical part is composed by training several regressors (weak-learners), capable of solving a partial differential equation using machine learning. The quantum part consists of adapting the QBoost algorithm to solve regression problems. We have successfully applied our framework to solve the 1D Burgers' equation with viscosity, showing that the quantum ensemble method really improves the solutions produced by weak-learners. We also implemented the algorithm on the D-Wave Systems, confirming the best performance of the quantum solution compared to the simulated annealing and exact solver methods, given the memory limitations of our classical computer used in the comparison.
△ Less
Submitted 30 August, 2021;
originally announced August 2021.
-
Habitability Models for Astrobiology
Authors:
Abel Méndez,
Edgard E. Rivera-Valentín,
Dirk Schulze-Makuch,
Justin Filiberto,
Ramses M. Ramírez,
Tana Wood,
Alfonso Dávila,
Chris McKay,
Kevin N. Ortiz Ceballos,
Marcos Jusino-Maldonado,
Nicole J. Torres-Santiago,
Guillermo Nery,
René Heller,
Paul K. Byrne,
Michael J. Malaska,
Erica Nathan,
Marta F. Simões,
André Antunes,
Jesús Martínez-Frías,
Ludmila Carone,
Noam R. Izenberg,
Dimitra Atri,
Humberto I. Carvajal Chitty,
Priscilla Nowajewski-Barra,
Frances Rivera-Hernández
, et al. (9 additional authors not shown)
Abstract:
Habitability has been generally defined as the capability of an environment to support life. Ecologists have been using Habitat Suitability Models (HSMs) for more than four decades to study the habitability of Earth from local to global scales. Astrobiologists have been proposing different habitability models for some time, with little integration and consistency among them, being different in fun…
▽ More
Habitability has been generally defined as the capability of an environment to support life. Ecologists have been using Habitat Suitability Models (HSMs) for more than four decades to study the habitability of Earth from local to global scales. Astrobiologists have been proposing different habitability models for some time, with little integration and consistency among them, being different in function to those used by ecologists. Habitability models are not only used to determine if environments are habitable or not, but they also are used to characterize what key factors are responsible for the gradual transition from low to high habitability states. Here we review and compare some of the different models used by ecologists and astrobiologists and suggest how they could be integrated into new habitability standards. Such standards will help to improve the comparison and characterization of potentially habitable environments, prioritize target selections, and study correlations between habitability and biosignatures. Habitability models are the foundation of planetary habitability science and the synergy between ecologists and astrobiologists is necessary to expand our understanding of the habitability of Earth, the Solar System, and extrasolar planets.
△ Less
Submitted 11 August, 2021;
originally announced August 2021.
-
A hybrid classical-quantum approach to solve the heat equation using quantum annealers
Authors:
Giovani G. Pollachini,
Juan P. L. C. Salazar,
Caio B. D. Goes,
Thiago O. Maciel,
Eduardo I. Duzzioni
Abstract:
The numerical solution of partial differential equations by discretization techniques is ubiquitous in computational physics. In this work we benchmark this approach in the quantum realm by solving the heat equation for a square plate subject to fixed temperatures at the edges and random heat sources and sinks within the domain. The hybrid classical-quantum approach consists in the solution on a q…
▽ More
The numerical solution of partial differential equations by discretization techniques is ubiquitous in computational physics. In this work we benchmark this approach in the quantum realm by solving the heat equation for a square plate subject to fixed temperatures at the edges and random heat sources and sinks within the domain. The hybrid classical-quantum approach consists in the solution on a quantum computer of the coupled linear system of equations that result from the discretization step. Owing to the limitations in the number of qubits and their connectivity, we use the Gauss-Seidel method to divide the full system of linear equations into subsystems, which are solved iteratively in block fashion. Each of the linear subsystems were solved using 2000Q and Advantage quantum computers developed by D-Wave Systems Inc. By comparing classical numerical and quantum solutions, we observe that the errors and chain break fraction are, on average, greater on the 2000Q system. Unlike the classical Gauss-Seidel method, the errors of the quantum solutions level off after a few iterations of our algorithm. This is partly a result of the span of the real number line available from the mapping of the chosen size of the set of qubit states. We verified this by using techniques to progressively shrink the range mapped by the set of qubit states at each iteration (increasing floating-point accuracy). As a result, no leveling off is observed. However, an increase in qubits does not translate to an overall lower error. This is believed to be indicative of the increasing length of chains required for the mapping to real numbers and the ensuing limitations of hardware.
△ Less
Submitted 8 June, 2021;
originally announced June 2021.
-
Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment
Authors:
Ethan A. Chi,
Julian Salazar,
Katrin Kirchhoff
Abstract:
Non-autoregressive models greatly improve decoding speed over typical sequence-to-sequence models, but suffer from degraded performance. Infilling and iterative refinement models make up some of this gap by editing the outputs of a non-autoregressive model, but are constrained in the edits that they can make. We propose iterative realignment, where refinements occur over latent alignments rather t…
▽ More
Non-autoregressive models greatly improve decoding speed over typical sequence-to-sequence models, but suffer from degraded performance. Infilling and iterative refinement models make up some of this gap by editing the outputs of a non-autoregressive model, but are constrained in the edits that they can make. We propose iterative realignment, where refinements occur over latent alignments rather than output sequence space. We demonstrate this in speech recognition with Align-Refine, an end-to-end Transformer-based model which refines connectionist temporal classification (CTC) alignments to allow length-changing insertions and deletions. Align-Refine outperforms Imputer and Mask-CTC, matching an autoregressive baseline on WSJ at 1/14th the real-time factor and attaining a LibriSpeech test-other WER of 9.0% without an LM. Our model is strong even in one iteration with a shallower decoder.
△ Less
Submitted 24 October, 2020;
originally announced October 2020.
-
Unsupervised Bitext Mining and Translation via Self-trained Contextual Embeddings
Authors:
Phillip Keung,
Julian Salazar,
Yichao Lu,
Noah A. Smith
Abstract:
We describe an unsupervised method to create pseudo-parallel corpora for machine translation (MT) from unaligned text. We use multilingual BERT to create source and target sentence embeddings for nearest-neighbor search and adapt the model via self-training. We validate our technique by extracting parallel sentence pairs on the BUCC 2017 bitext mining task and observe up to a 24.5 point increase (…
▽ More
We describe an unsupervised method to create pseudo-parallel corpora for machine translation (MT) from unaligned text. We use multilingual BERT to create source and target sentence embeddings for nearest-neighbor search and adapt the model via self-training. We validate our technique by extracting parallel sentence pairs on the BUCC 2017 bitext mining task and observe up to a 24.5 point increase (absolute) in F1 scores over previous unsupervised methods. We then improve an XLM-based unsupervised neural MT system pre-trained on Wikipedia by supplementing it with pseudo-parallel text mined from the same corpus, boosting unsupervised translation performance by up to 3.5 BLEU on the WMT'14 French-English and WMT'16 German-English tasks and outperforming the previous state-of-the-art. Finally, we enrich the IWSLT'15 English-Vietnamese corpus with pseudo-parallel Wikipedia sentence pairs, yielding a 1.2 BLEU improvement on the low-resource MT task. We demonstrate that unsupervised bitext mining is an effective way of augmenting MT datasets and complements existing techniques like initializing with pre-trained contextual embeddings.
△ Less
Submitted 15 October, 2020;
originally announced October 2020.
-
Flavored axions and the flavor problem
Authors:
Yithsbey Giraldo,
R. Martinez,
Eduardo Rojas,
Juan C. Salazar
Abstract:
A Peccei-Quinn~(PQ) symmetry is proposed in order to generate in the Standard Model~(SM) quark sector a realistic mass matrix ansatz with five texture-zeros. Limiting our analysis to Hermitian mass matrices, we show that this requires a minimum of 4 Higgs doublets. This model allows assigning values close to 1 for several Yukawa couplings, giving insight into the origin of the mass scales in the S…
▽ More
A Peccei-Quinn~(PQ) symmetry is proposed in order to generate in the Standard Model~(SM) quark sector a realistic mass matrix ansatz with five texture-zeros. Limiting our analysis to Hermitian mass matrices, we show that this requires a minimum of 4 Higgs doublets. This model allows assigning values close to 1 for several Yukawa couplings, giving insight into the origin of the mass scales in the SM. Since the PQ charges are non-universal, the model features Flavor-Changing Neutral Currents~(FCNC) at the tree level. From the analytical expressions for the FCNC we report the allowed region in the parameter space obtained from the measurements of branching ratios of semileptonic meson decays.
△ Less
Submitted 29 November, 2022; v1 submitted 10 July, 2020;
originally announced July 2020.
-
Habitability Models for Planetary Sciences
Authors:
Abel Méndez,
Edgard G. Rivera-Valentín,
Dirk Schulze-Makuch,
Justin Filiberto,
Ramses Ramírez,
Tana E. Wood,
Alfonso Dávila,
Chris McKay,
Kevin Ortiz Ceballos,
Marcos Jusino-Maldonado,
Guillermo Nery,
René Heller,
Paul Byrne,
Michael J. Malaska,
Erica Nathan,
Marta Filipa Simões,
André Antunes,
Jesús Martínez-Frías,
Ludmila Carone,
Noam R. Izenberg,
Dimitra Atri,
Humberto Itic Carvajal Chitty,
Priscilla Nowajewski-Barra,
Frances Rivera-Hernández,
Corine Brown
, et al. (10 additional authors not shown)
Abstract:
Habitability has been generally defined as the capability of an environment to support life. Ecologists have been using Habitat Suitability Models (HSMs) for more than four decades to study the habitability of Earth from local to global scales. Astrobiologists have been proposing different habitability models for some time, with little integration and consistency between them and different in func…
▽ More
Habitability has been generally defined as the capability of an environment to support life. Ecologists have been using Habitat Suitability Models (HSMs) for more than four decades to study the habitability of Earth from local to global scales. Astrobiologists have been proposing different habitability models for some time, with little integration and consistency between them and different in function to those used by ecologists. In this white paper, we suggest a mass-energy habitability model as an example of how to adapt and expand the models used by ecologists to the astrobiology field. We propose to implement these models into a NASA Habitability Standard (NHS) to standardize the habitability objectives of planetary missions. These standards will help to compare and characterize potentially habitable environments, prioritize target selections, and study correlations between habitability and biosignatures. Habitability models are the foundation of planetary habitability science. The synergy between the methods used by ecologists and astrobiologists will help to integrate and expand our understanding of the habitability of Earth, the Solar System, and exoplanets.
△ Less
Submitted 14 July, 2020; v1 submitted 10 July, 2020;
originally announced July 2020.
-
Don't Use English Dev: On the Zero-Shot Cross-Lingual Evaluation of Contextual Embeddings
Authors:
Phillip Keung,
Yichao Lu,
Julian Salazar,
Vikas Bhardwaj
Abstract:
Multilingual contextual embeddings have demonstrated state-of-the-art performance in zero-shot cross-lingual transfer learning, where multilingual BERT is fine-tuned on one source language and evaluated on a different target language. However, published results for mBERT zero-shot accuracy vary as much as 17 points on the MLDoc classification task across four papers. We show that the standard prac…
▽ More
Multilingual contextual embeddings have demonstrated state-of-the-art performance in zero-shot cross-lingual transfer learning, where multilingual BERT is fine-tuned on one source language and evaluated on a different target language. However, published results for mBERT zero-shot accuracy vary as much as 17 points on the MLDoc classification task across four papers. We show that the standard practice of using English dev accuracy for model selection in the zero-shot setting makes it difficult to obtain reproducible results on the MLDoc and XNLI tasks. English dev accuracy is often uncorrelated (or even anti-correlated) with target language accuracy, and zero-shot performance varies greatly at different points in the same fine-tuning run and between different fine-tuning runs. These reproducibility issues are also present for other tasks with different pre-trained embeddings (e.g., MLQA with XLM-R). We recommend providing oracle scores alongside zero-shot results: still fine-tune using English data, but choose a checkpoint with the target dev set. Reporting this upper bound makes results more consistent by avoiding arbitrarily bad checkpoints.
△ Less
Submitted 6 October, 2020; v1 submitted 30 April, 2020;
originally announced April 2020.
-
On the Behavior of Unbounded Collatz Sequences
Authors:
Jorge Salazar
Abstract:
The aim of this paper is to show a peculiar behavior of a (hypothetical) Collatz sequence going to infinity. We study the associated Syracusa sequence (the odd elements of the former) and show that the limit set of a conveniently normalized sequence is the whole unit interval. In particular, for any positive integer there is a subsequence whose elements' expansions in base 3 begin (from the left)…
▽ More
The aim of this paper is to show a peculiar behavior of a (hypothetical) Collatz sequence going to infinity. We study the associated Syracusa sequence (the odd elements of the former) and show that the limit set of a conveniently normalized sequence is the whole unit interval. In particular, for any positive integer there is a subsequence whose elements' expansions in base 3 begin (from the left) with the expansion of the given number.
△ Less
Submitted 8 April, 2022; v1 submitted 10 March, 2020;
originally announced March 2020.
-
Attentional Speech Recognition Models Misbehave on Out-of-domain Utterances
Authors:
Phillip Keung,
Wei Niu,
Yichao Lu,
Julian Salazar,
Vikas Bhardwaj
Abstract:
We discuss the problem of echographic transcription in autoregressive sequence-to-sequence attentional architectures for automatic speech recognition, where a model produces very long sequences of repetitive outputs when presented with out-of-domain utterances. We decode audio from the British National Corpus with an attentional encoder-decoder model trained solely on the LibriSpeech corpus. We ob…
▽ More
We discuss the problem of echographic transcription in autoregressive sequence-to-sequence attentional architectures for automatic speech recognition, where a model produces very long sequences of repetitive outputs when presented with out-of-domain utterances. We decode audio from the British National Corpus with an attentional encoder-decoder model trained solely on the LibriSpeech corpus. We observe that there are many 5-second recordings that produce more than 500 characters of decoding output (i.e. more than 100 characters per second). A frame-synchronous hybrid (DNN-HMM) model trained on the same data does not produce these unusually long transcripts. These decoding issues are reproducible in a speech transformer model from ESPnet, and to a lesser extent in a self-attention CTC model, suggesting that these issues are intrinsic to the use of the attention mechanism. We create a separate length prediction model to predict the correct number of wordpieces in the output, which allows us to identify and truncate problematic decoding results without increasing word error rates on the LibriSpeech task.
△ Less
Submitted 12 February, 2020;
originally announced February 2020.
-
Design and performance of low-energy orbits for the exploration of Enceladus
Authors:
E. Fantino,
F. J. T. Salazar,
E. M. Alessi
Abstract:
The icy moons are in the focus of the exploration plans of the leading space agencies because of the indications of water-based life and geological activity observed in a number of these objects. In particular, the presence of geyser-like jets of water near Enceladus' south pole has turned this moon of Saturn into a priority candidate to search for life and habitability features. This investigatio…
▽ More
The icy moons are in the focus of the exploration plans of the leading space agencies because of the indications of water-based life and geological activity observed in a number of these objects. In particular, the presence of geyser-like jets of water near Enceladus' south pole has turned this moon of Saturn into a priority candidate to search for life and habitability features. This investigation proposes a set of trajectories between Halo orbits about Lagrangian points L1 and L2 in the Saturn-Enceladus Circular Restricted Three-Body Problem as science orbits for a future in situ mission at Enceladus. The design methodology is presented, followed by the analysis of the observational performance of the solutions. The conclusion is that the proposed orbits exhibit suitable features for their use in the scientific exploration of Enceladus, i.e., long transfer times, low altitudes, wide surface visibility windows and long times of overflight.
△ Less
Submitted 1 April, 2021; v1 submitted 5 February, 2020;
originally announced February 2020.
-
Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition
Authors:
Shaoshi Ling,
Yuzong Liu,
Julian Salazar,
Katrin Kirchhoff
Abstract:
We propose a novel approach to semi-supervised automatic speech recognition (ASR). We first exploit a large amount of unlabeled audio data via representation learning, where we reconstruct a temporal slice of filterbank features from past and future context frames. The resulting deep contextualized acoustic representations (DeCoAR) are then used to train a CTC-based end-to-end ASR system using a s…
▽ More
We propose a novel approach to semi-supervised automatic speech recognition (ASR). We first exploit a large amount of unlabeled audio data via representation learning, where we reconstruct a temporal slice of filterbank features from past and future context frames. The resulting deep contextualized acoustic representations (DeCoAR) are then used to train a CTC-based end-to-end ASR system using a smaller amount of labeled audio data. In our experiments, we show that systems trained on DeCoAR consistently outperform ones trained on conventional filterbank features, giving 42% and 19% relative improvement over the baseline on WSJ eval92 and LibriSpeech test-clean, respectively. Our approach can drastically reduce the amount of labeled data required; unsupervised training on LibriSpeech then supervision with 100 hours of labeled data achieves performance on par with training on all 960 hours directly. Pre-trained models and code will be released online.
△ Less
Submitted 9 April, 2020; v1 submitted 3 December, 2019;
originally announced December 2019.
-
Masked Language Model Scoring
Authors:
Julian Salazar,
Davis Liang,
Toan Q. Nguyen,
Katrin Kirchhoff
Abstract:
Pretrained masked language models (MLMs) require finetuning for most NLP tasks. Instead, we evaluate MLMs out of the box via their pseudo-log-likelihood scores (PLLs), which are computed by masking tokens one by one. We show that PLLs outperform scores from autoregressive language models like GPT-2 in a variety of tasks. By rescoring ASR and NMT hypotheses, RoBERTa reduces an end-to-end LibriSpeec…
▽ More
Pretrained masked language models (MLMs) require finetuning for most NLP tasks. Instead, we evaluate MLMs out of the box via their pseudo-log-likelihood scores (PLLs), which are computed by masking tokens one by one. We show that PLLs outperform scores from autoregressive language models like GPT-2 in a variety of tasks. By rescoring ASR and NMT hypotheses, RoBERTa reduces an end-to-end LibriSpeech model's WER by 30% relative and adds up to +1.7 BLEU on state-of-the-art baselines for low-resource translation pairs, with further gains from domain adaptation. We attribute this success to PLL's unsupervised expression of linguistic acceptability without a left-to-right bias, greatly improving on scores from GPT-2 (+10 points on island effects, NPI licensing in BLiMP). One can finetune MLMs to give scores without masking, enabling computation in a single inference pass. In all, PLLs and their associated pseudo-perplexities (PPPLs) enable plug-and-play use of the growing number of pretrained MLMs; e.g., we use a single cross-lingual model to rescore translations in multiple languages. We release our library for language model scoring at https://github.com/awslabs/mlm-scoring.
△ Less
Submitted 31 December, 2020; v1 submitted 31 October, 2019;
originally announced October 2019.
-
Transformers without Tears: Improving the Normalization of Self-Attention
Authors:
Toan Q. Nguyen,
Julian Salazar
Abstract:
We evaluate three simple, normalization-centric changes to improve Transformer training. First, we show that pre-norm residual connections (PreNorm) and smaller initializations enable warmup-free, validation-based training with large learning rates. Second, we propose $\ell_2$ normalization with a single scale parameter (ScaleNorm) for faster training and better performance. Finally, we reaffirm t…
▽ More
We evaluate three simple, normalization-centric changes to improve Transformer training. First, we show that pre-norm residual connections (PreNorm) and smaller initializations enable warmup-free, validation-based training with large learning rates. Second, we propose $\ell_2$ normalization with a single scale parameter (ScaleNorm) for faster training and better performance. Finally, we reaffirm the effectiveness of normalizing word embeddings to a fixed length (FixNorm). On five low-resource translation pairs from TED Talks-based corpora, these changes always converge, giving an average +1.1 BLEU over state-of-the-art bilingual baselines and a new 32.8 BLEU on IWSLT'15 English-Vietnamese. We observe sharper performance curves, more consistent gradient norms, and a linear relationship between activation scaling and decoder depth. Surprisingly, in the high-resource setting (WMT'14 English-German), ScaleNorm and FixNorm remain competitive but PreNorm degrades performance.
△ Less
Submitted 29 December, 2019; v1 submitted 13 October, 2019;
originally announced October 2019.
-
BERTphone: Phonetically-Aware Encoder Representations for Utterance-Level Speaker and Language Recognition
Authors:
Shaoshi Ling,
Julian Salazar,
Yuzong Liu,
Katrin Kirchhoff
Abstract:
We introduce BERTphone, a Transformer encoder trained on large speech corpora that outputs phonetically-aware contextual representation vectors that can be used for both speaker and language recognition. This is accomplished by training on two objectives: the first, inspired by adapting BERT to the continuous domain, involves masking spans of input frames and reconstructing the whole sequence for…
▽ More
We introduce BERTphone, a Transformer encoder trained on large speech corpora that outputs phonetically-aware contextual representation vectors that can be used for both speaker and language recognition. This is accomplished by training on two objectives: the first, inspired by adapting BERT to the continuous domain, involves masking spans of input frames and reconstructing the whole sequence for acoustic representation learning; the second, inspired by the success of bottleneck features from ASR, is a sequence-level CTC loss applied to phoneme labels for phonetic representation learning. We pretrain two BERTphone models (one on Fisher and one on TED-LIUM) and use them as feature extractors into x-vector-style DNNs for both tasks. We attain a state-of-the-art $C_{\text{avg}}$ of 6.16 on the challenging LRE07 3sec closed-set language recognition task. On Fisher and VoxCeleb speaker recognition tasks, we see an 18% relative reduction in speaker EER when training on BERTphone vectors instead of MFCCs. In general, BERTphone outperforms previous phonetic pretraining approaches on the same data. We release our code and models at https://github.com/awslabs/speech-representations.
△ Less
Submitted 29 December, 2021; v1 submitted 30 June, 2019;
originally announced July 2019.
-
On Extended Thermodynamics: From Classical to the Relativistic Regime
Authors:
Jose Felix Salazar,
Thomas Zannias
Abstract:
The recent monumental detection of gravitational waves by LIGO, the subsequent detection by the LIGO/VIRGO observatories of a binary neutron star merger seen in the gravitational wave signal $GW170817$,the first photo of the event horizon of the supermassive black hole at the center of the $M87$ galaxy released by the EHT telescope and the ongoing experiments on Relativistic Heavy Ion Collisions a…
▽ More
The recent monumental detection of gravitational waves by LIGO, the subsequent detection by the LIGO/VIRGO observatories of a binary neutron star merger seen in the gravitational wave signal $GW170817$,the first photo of the event horizon of the supermassive black hole at the center of the $M87$ galaxy released by the EHT telescope and the ongoing experiments on Relativistic Heavy Ion Collisions at the BNL and at the CERN, demonstrate that we are witnessing the second golden era of observational relativistic gravity. These new observational breakthroughs, although in the long run would influence our views regarding this Kosmos, in the short run, they suggest that relativistic dissipative fluids (or magnetofluids) and relativistic continuous media play an important role on astrophysical-and also subnuclear-scales. This realization brings into the frontiers of current research theories of irreversible thermodynamics of relativistic continuous media.
Motivated by these considerations, in this paper, we summarize the progress that has been made in the last few decades in the field of non equilibrium thermodynamics of relativistic continuous media. For coherence and completeness purposes, we begin with a brief description of the balance laws for classical (Newtonian) continuous media and introduce the classical irreversible thermodynamics (CIT) and the role of the local-equilibrium postulate within this theory. Tangentially, we touch the program of rational thermodynamics (RT), the Clausius-Duhem inequality, the theory of constitutive relations and the emergence of the entropy principle and its role in the description of continuous media.
△ Less
Submitted 8 October, 2021; v1 submitted 8 April, 2019;
originally announced April 2019.
-
Self-Attention Networks for Connectionist Temporal Classification in Speech Recognition
Authors:
Julian Salazar,
Katrin Kirchhoff,
Zhiheng Huang
Abstract:
The success of self-attention in NLP has led to recent applications in end-to-end encoder-decoder architectures for speech recognition. Separately, connectionist temporal classification (CTC) has matured as an alignment-free, non-autoregressive approach to sequence transduction, either by itself or in various multitask and decoding frameworks. We propose SAN-CTC, a deep, fully self-attentional net…
▽ More
The success of self-attention in NLP has led to recent applications in end-to-end encoder-decoder architectures for speech recognition. Separately, connectionist temporal classification (CTC) has matured as an alignment-free, non-autoregressive approach to sequence transduction, either by itself or in various multitask and decoding frameworks. We propose SAN-CTC, a deep, fully self-attentional network for CTC, and show it is tractable and competitive for end-to-end speech recognition. SAN-CTC trains quickly and outperforms existing CTC models and most encoder-decoder models, with character error rates (CERs) of 4.7% in 1 day on WSJ eval92 and 2.8% in 1 week on LibriSpeech test-clean, with a fixed architecture and one GPU. Similar improvements hold for WERs after LM decoding. We motivate the architecture for speech, evaluate position and downsampling approaches, and explore how label alphabets (character, phoneme, subword) affect attention heads and performance.
△ Less
Submitted 19 February, 2019; v1 submitted 22 January, 2019;
originally announced January 2019.
-
Fubini-Tonelli type theorem for non product measures in a product space
Authors:
Jorge Salazar
Abstract:
I prove a theorem about iterated integrals for non-product measures in a product space. The first task is to show the existence of a family of measures on the second space, indexed by the points on of the first space (outside a negligible set), such that integrating the measures on the index against the first marginal gives back the original measure (see Theorem 2.1). At the end, I give a simple a…
▽ More
I prove a theorem about iterated integrals for non-product measures in a product space. The first task is to show the existence of a family of measures on the second space, indexed by the points on of the first space (outside a negligible set), such that integrating the measures on the index against the first marginal gives back the original measure (see Theorem 2.1). At the end, I give a simple application in Optimal Transport.
△ Less
Submitted 11 June, 2018;
originally announced June 2018.
-
On the Integrability of the Geodesic Flow on a Friedmann-Robertson-Walker Spacetime
Authors:
Francisco Astorga,
J. Felix Salazar,
Thomas Zannias
Abstract:
We study the geodesic flow on the cotangent bundle of a Friedman-Robertson-Walker spacetime (M, g). On this bundle, the HamiltonJacobi equation is completely separable and this separability leads us to construct four linearly independent integrals in involution i.e. Poisson commuting amongst themselves and pointwise linearly independent. These integrals involve the six linearly independent Killing…
▽ More
We study the geodesic flow on the cotangent bundle of a Friedman-Robertson-Walker spacetime (M, g). On this bundle, the HamiltonJacobi equation is completely separable and this separability leads us to construct four linearly independent integrals in involution i.e. Poisson commuting amongst themselves and pointwise linearly independent. These integrals involve the six linearly independent Killing fields of the background metric g. As a consequence, the geodesic flow on an FRW background is completely integrable in the Liouville-Arnold sense. For the case of a spatially closed universe we construct families of invariant by the flow sub manifolds.
△ Less
Submitted 21 December, 2018; v1 submitted 24 December, 2017;
originally announced December 2017.
-
On the behavior of causal geodesics on a Kerr-de Sitter spacetime
Authors:
José Félix Salazar,
Thomas Zannias
Abstract:
We analyze the behavior of causal geodesics on a Kerr-de Sitter spacetime with particular emphasis on their completeness property. We set up an initial value problem (IVP) whose solutions lead to a global understanding of causal geodesics on these spacetime. Causal geodesics that avoid the rotation axis are complete except the ones that hit the ring-like curvature singularity and those that encoun…
▽ More
We analyze the behavior of causal geodesics on a Kerr-de Sitter spacetime with particular emphasis on their completeness property. We set up an initial value problem (IVP) whose solutions lead to a global understanding of causal geodesics on these spacetime. Causal geodesics that avoid the rotation axis are complete except the ones that hit the ring-like curvature singularity and those that encounter the ring singularity are necessary equatorial ones. We also show the existence of geodesics that cross or lie on the rotation axis. The equations governing the latter family show the repulsive nature of the ring singularity. The results of this work show, that as far as properties of causal geodesics are concerned, Kerr-de Sitter spacetimes behave in a similar manner as the family of Kerr spacetimes.
△ Less
Submitted 28 April, 2017;
originally announced May 2017.
-
A $\sim$32-70 K formation temperature range for the ice grains agglomerated by comet 67P/Churyumov-Gerasimenko
Authors:
S. Lectez,
J. M. Simon,
O. Mousis,
S. Picaud,
K. Altwegg,
M. Rubin,
J. M. Salazar
Abstract:
Grand Canonical Monte Carlo simulations are used to reproduce the N$_2$/CO ratio ranging between 1.7 $\times$ 10$^{-3}$ and 1.6 $\times$ 10$^{-2}$ observed {\it in situ} in the Jupiter family comet 67P/Churyumov-Gerasimenko by the ROSINA mass spectrometer aboard the Rosetta spacecraft, assuming that this body has been agglomerated from clathrates in the protosolar nebula. Simulations are done usin…
▽ More
Grand Canonical Monte Carlo simulations are used to reproduce the N$_2$/CO ratio ranging between 1.7 $\times$ 10$^{-3}$ and 1.6 $\times$ 10$^{-2}$ observed {\it in situ} in the Jupiter family comet 67P/Churyumov-Gerasimenko by the ROSINA mass spectrometer aboard the Rosetta spacecraft, assuming that this body has been agglomerated from clathrates in the protosolar nebula. Simulations are done using an elaborated interatomic potentials for investigating the temperature dependence of the trapping within a multiple guest clathrate formed from a gas mixture of CO and N$_2$ in proportions corresponding to those expected for the protosolar nebula. By assuming that 67P/Churyumov-Gerasimenko agglomerated from clathrates, our calculations suggest the cometary grains must have been formed at temperatures ranging between $\sim$31.8 and 69.9 K in the protosolar nebula to match the N$_2$/CO ratio measured by the ROSINA mass spectrometer. The presence of clathrates in Jupiter family comets could then explain the potential N$_2$ depletion (factor up to $\sim$87 compared to the protosolar value) measured in 67P/Churyumov-Gerasimenko.
△ Less
Submitted 7 April, 2015;
originally announced April 2015.
-
The double mass hierarchy pattern: simultaneously understanding quark and lepton mixing
Authors:
Wolfgang Gregor Hollik,
Ulises Jesus Saldana Salazar
Abstract:
The charged fermion masses of the three generations exhibit the two strong hierarchies m_3 >> m_2 >> m_1. We assume that also neutrino masses satisfy m_{nu 3} > m_{nu 2} > m_{nu 1} and derive the consequences of the hierarchical spectra on the fermionic mixing patterns. The quark and lepton mixing matrices are built in a general framework with their matrix elements expressed in terms of the four f…
▽ More
The charged fermion masses of the three generations exhibit the two strong hierarchies m_3 >> m_2 >> m_1. We assume that also neutrino masses satisfy m_{nu 3} > m_{nu 2} > m_{nu 1} and derive the consequences of the hierarchical spectra on the fermionic mixing patterns. The quark and lepton mixing matrices are built in a general framework with their matrix elements expressed in terms of the four fermion mass ratios m_u/m_c, m_c/m_t, m_d/m_s, and m_s/m_b and m_e/m_mu, m_mu/m_tau, m_{nu 1}/m_{nu 2}, and m_{nu 2}/m_{nu 3}, for the quark and lepton sector, respectively. In this framework, we show that the resulting mixing matrices are consistent with data for both quarks and leptons, despite the large leptonic mixing angles. The minimal assumption we take is the one of hierarchical masses and minimal flavour symmetry breaking that strongly follows from phenomenology. No special structure of the mass matrices has to be assumed that cannot be motivated by this minimal assumption. This analysis allows us to predict the neutrino mass spectrum and set the mass of the lightest neutrino well below 0.01 eV. The method also gives the 1 sigma allowed ranges for the leptonic mixing matrix elements. Contrary to the common expectation, leptonic mixing angles are found to be determined solely by the four leptonic mass ratios without any relation to symmetry considerations as commonly used in flavor model building. Still, our formulae can be used to build up a flavor model that predicts the observed hierarchies in the masses---the mixing follows then from the procedure which is developed in this work.
△ Less
Submitted 21 January, 2015; v1 submitted 13 November, 2014;
originally announced November 2014.