Showing 1–50 of 158 results for author: de Souza, C

  1. arXiv:2506.04079  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    EuroLLM-9B: Technical Report

    Authors: Pedro Henrique Martins, João Alves, Patrick Fernandes, Nuno M. Guerreiro, Ricardo Rei, Amin Farajian, Mateusz Klimaszewski, Duarte M. Alves, José Pombal, Manuel Faysse, Pierre Colombo, François Yvon, Barry Haddow, José G. C. de Souza, Alexandra Birch, André F. T. Martins

    Abstract: This report presents EuroLLM-9B, a large language model trained from scratch to support the needs of European citizens by covering all 24 official European Union languages and 11 additional languages. EuroLLM addresses the issue of European languages being underrepresented and underserved in existing open large language models. We provide a comprehensive overview of EuroLLM-9B's development, inclu…

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 56 pages

  2. arXiv:2503.16716  [pdf, ps, other]

    math.AC math.AG

    On defect in finite extensions of valued fields

    Authors: Caio Henrique Silva de Souza, Mark Spivakovsky

    Abstract: In recent decades, the defect of finite extensions of valued fields has emerged as the main obstacle in several fundamental problems in algebraic geometry such as the local uniformization problem. Hence, it is important to identify defectless fields and study properties related to defect. In this paper we study the relations between the following properties of valued fields: simply defectless, imm…

    Submitted 20 March, 2025; originally announced March 2025.

  3. arXiv:2503.15657  [pdf, other]

    astro-ph.GA

    The S-PLUS Fornax Project (S+FP): Mapping globular clusters systems within 5 virial radii around NGC 1399

    Authors: Luis Lomelí-Núñez, A. Cortesi, A. V. Smith Castelli, M. L. Buzzo, Y. D. Mayya, Vasiliki Fragkou, J. A. Alzate-Trujillo, R. F. Haack, J. P. Calderón, A. R. Lopes, Michael Hilker, M. Grossi, Karín Menéndez-Delmestre, Thiago S. Gonçalves, Ana L. Chies-Santos, L. A. Gutiérrez-Soto, Ciria Lima-Dias, S. V. Werner, Pedro K. Humire, R. C. Thom de Souza, A. Alvarez-Candal, Swayamtrupta Panda, Avinash Chaturvedi, E. Telles, C. Mendes de Oliveira, et al. (3 additional authors not shown)

    Abstract: We present the largest sample ($\sim$13,000 candidates, $\sim$3000 of which are bona-fide candidates) of globular cluster (GC) candidates reported in the Fornax Cluster so far. The survey is centered on the NGC 1399 galaxy, extending out to 5 virial radii (\rv) of the cluster. We carried out a photometric study using images observed in the 12-band system of the Southern Photometric Local Universe…

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: 30 pages, 18 figures

  4. arXiv:2503.08939  [pdf, other]

    cs.CV cs.AI

    KAN-Mixers: a new deep learning architecture for image classification

    Authors: Jorge Luiz dos Santos Canuto, Linnyer Beatrys Ruiz Aylon, Rodrigo Clemente Thom de Souza

    Abstract: Due to their effective performance, Convolutional Neural Network (CNN) and Vision Transformer (ViT) architectures have become the standard for solving computer vision tasks. Such architectures require large data sets and rely on convolution and self-attention operations. In 2021, MLP-Mixer emerged, an architecture that relies only on Multilayer Perceptron (MLP) and achieves extremely competitive r…

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: 8 pages, 6 figures

  5. arXiv:2502.02925  [pdf, other]

    stat.ME cs.LG math.PR math.ST

    Data denoising with self consistency, variance maximization, and the Kantorovich dominance

    Authors: Joshua Zoen-Git Hiew, Tongseok Lim, Brendan Pass, Marcelo Cruz de Souza

    Abstract: We introduce a new framework for data denoising, partially inspired by martingale optimal transport. For a given noisy distribution (the data), our approach involves finding the closest distribution to it among all distributions which 1) have a particular prescribed structure (expressed by requiring they lie in a particular domain), and 2) are self-consistent with the data. We show that this amoun…

    Submitted 5 February, 2025; originally announced February 2025.

  6. arXiv:2502.00581  [pdf, other]

    cs.RO

    Trajectory Planning and Control for Differentially Flat Fixed-Wing Aerial Systems

    Authors: Luca Morando, Sanket A. Salunkhe, Nishanth Bobbili, Jeffrey Mao, Luca Masci, Hung Nguyen, Cristino de Souza, Giuseppe Loianno

    Abstract: Efficient real-time trajectory planning and control for fixed-wing unmanned aerial vehicles is challenging due to their non-holonomic nature, complex dynamics, and the additional uncertainties introduced by unknown aerodynamic effects. In this paper, we present a fast and efficient real-time trajectory planning and control approach for fixed-wing unmanned aerial vehicles, leveraging the differenti…

    Submitted 1 February, 2025; originally announced February 2025.

    Comments: Accepted at ICRA 2025

    Journal ref: Accepted for publication at the 2025 IEEE International Conference on Robotics and Automation (ICRA 2025)

  7. arXiv:2501.07344  [pdf, other]

    cs.SE

    Affirmative Hackathon for Software Developers with Disabilities: An Industry Initiative

    Authors: Thayssa Rocha, Nicole Davila, Rafaella Vaccari, Nicoly Menezes, Marcelle Mota, Edward Monteiro, Cleidson de Souza, Gustavo Pinto

    Abstract: People with disabilities (PWD) often encounter several barriers to becoming employed. A growing body of evidence in software development highlights the benefits of diversity and inclusion in the field. However, recruiting, hiring, and fostering a supportive environment for PWD remains challenging. These challenges are exacerbated by the lack of skilled professionals with experience in inclusive hi…

    Submitted 20 January, 2025; v1 submitted 13 January, 2025; originally announced January 2025.

    Comments: 12 pages, accepted for CHASE 2025

  8. arXiv:2412.19768  [pdf]

    physics.ed-ph

    Teaching materials aligned or unaligned with the principles of the Cognitive Theory of Multimedia Learning: the choices made by Physics teachers and students

    Authors: Aline N. Braga, Antonio A. M. Neto, Alessandra N. Braga, Silvio C. F. Pereira Filho, Nelson P. C. de Souza, Danilo T. Alves

    Abstract: In a recent study [Rev. Bras. Ens. Fís. vol. 45, 2023], the absence of the Cognitive Theory of Multimedia Learning (CTML) in the curricula of Physics teacher education programs at Brazilian public universities was highlighted. Considering this gap, the present study investigates whether, even without any formal prior knowledge of CTML principles (Coherence, Signaling, Spatial Contiguity, Segmentat…

    Submitted 27 December, 2024; originally announced December 2024.

    Comments: 24 pages, 7 figures

  9. arXiv:2411.18748  [pdf, other]

    astro-ph.GA astro-ph.IM astro-ph.SR

    Stellar atmospheric parameters and chemical abundances of about 5 million stars from S-PLUS multi-band photometry

    Authors: C. E. Ferreira Lopes, L. A. Gutiérrez-Soto, V. S. Ferreira Alberice, N. Monsalves, D. Hazarika, M. Catelan, V. M. Placco, G. Limberg, F. Almeida-Fernandes, H. D. Perottoni, A. V. Smith Castelli, S. Akras, J. Alonso-García, V. Cordeiro, M. Jaque Arancibia, S. Daflon, B. Dias, D. R. Gonçalves, E. Machado-Pereira, A. R. Lopes, C. R. Bom, R. C. Thom de Souza, N. G. de Isídio, A. Alvarez-Candal, M. E. De Rossi, et al. (8 additional authors not shown)

    Abstract: Context. Spectroscopic surveys like APOGEE, GALAH, and LAMOST have significantly advanced our understanding of the Milky Way by providing extensive stellar parameters and chemical abundances. Complementing these, photometric surveys with narrow/medium-band filters, such as the Southern Photometric Local Universe Survey (S-PLUS), offer the potential to estimate stellar parameters and abundances for…

    Submitted 27 November, 2024; originally announced November 2024.

    Comments: 23 pages, 14 Figures

    Journal ref: A&A 693, A306 (2025)

  10. arXiv:2410.17376  [pdf, other]

    hep-th

    Lorentz-violating Yukawa theory at finite temperature

    Authors: D. S. Cabral, L. A. S. Evangelista, J. C. R. de Souza, L. H. A. R. Ferreira, A. F. Santos

    Abstract: This paper addresses Yukawa theory, focusing on the scattering between two identical fermions mediated by an intermediate scalar boson, considering the effects of thermal contributions and Lorentz symmetry breaking. Temperature is introduced into the theory through the TFD formalism, while Lorentz violation arises from a background tensor coupled to the kinetic part of the Klein-Gordon Lagrangian.…

    Submitted 22 October, 2024; originally announced October 2024.

    Comments: 20 pages, 4 figures

  11. arXiv:2410.11624  [pdf, other]

    cs.CL

    Findings of the WMT 2024 Shared Task on Chat Translation

    Authors: Wafaa Mohammed, Sweta Agrawal, M. Amin Farajian, Vera Cabarrão, Bryan Eikema, Ana C. Farinha, José G. C. de Souza

    Abstract: This paper presents the findings from the third edition of the Chat Translation Shared Task. As with previous editions, the task involved translating bilingual customer support conversations, specifically focusing on the impact of conversation context in translation quality and evaluation. We also include two new language pairs: English-Korean and English-Dutch, in addition to the set of language…

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 12 pages, 5 figures, 13 tables

  12. arXiv:2410.07779  [pdf, other]

    cs.CL

    Modeling User Preferences with Automatic Metrics: Creating a High-Quality Preference Dataset for Machine Translation

    Authors: Sweta Agrawal, José G. C. de Souza, Ricardo Rei, António Farinhas, Gonçalo Faria, Patrick Fernandes, Nuno M Guerreiro, Andre Martins

    Abstract: Alignment with human preferences is an important step in developing accurate and safe large language models. This is no exception in machine translation (MT), where better handling of language nuances and context-specific variations leads to improved quality. However, preference data based on human feedback can be very expensive to obtain and curate at a large scale. Automatic metrics, on the othe…

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: Accepted at EMNLP Main 2024

  13. arXiv:2409.16235  [pdf, other]

    cs.CL

    EuroLLM: Multilingual Language Models for Europe

    Authors: Pedro Henrique Martins, Patrick Fernandes, João Alves, Nuno M. Guerreiro, Ricardo Rei, Duarte M. Alves, José Pombal, Amin Farajian, Manuel Faysse, Mateusz Klimaszewski, Pierre Colombo, Barry Haddow, José G. C. de Souza, Alexandra Birch, André F. T. Martins

    Abstract: The quality of open-weight LLMs has seen significant improvement, yet they remain predominantly focused on English. In this paper, we introduce the EuroLLM project, aimed at developing a suite of open-weight multilingual LLMs capable of understanding and generating text in all official European Union languages, as well as several additional relevant languages. We outline the progress made to date,…

    Submitted 24 September, 2024; originally announced September 2024.

  14. arXiv:2408.11209  [pdf, other]

    cs.SE

    Assisting Novice Developers Learning in Flutter Through Cognitive-Driven Development

    Authors: Ronivaldo Ferreira, Victor H. S. Pinto, Cleidson R. B. de Souza, Gustavo Pinto

    Abstract: Cognitive-Driven Development (CDD) is a coding design technique that helps developers focus on designing code within cognitive limits. The imposed limit tends to enhance code readability and maintainability. While early works on CDD focused mostly on Java, its applicability extends beyond specific programming languages. In this study, we explored the use of CDD in two new dimensions: focusing on F…

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 10 pages

    Report number: SBES Education Track 2024

  15. arXiv:2408.00177  [pdf, other]

    stat.ME

    Fast variational Bayesian inference for correlated survival data: an application to invasive mechanical ventilation duration analysis

    Authors: Chengqian Xian, Camila P. E. de Souza, Wenqing He, Felipe F. Rodrigues, Renfang Tian

    Abstract: Correlated survival data are prevalent in various clinical settings and have been extensively discussed in literature. One of the most common types of correlated survival data is clustered survival data, where the survival times from individuals in a cluster are associated. Our study is motivated by invasive mechanical ventilation data from different intensive care units (ICUs) in Ontario, Canada,…

    Submitted 31 July, 2024; originally announced August 2024.

  16. Time-Machines Construct in $f(\mathcal{R},\mathcal{A},A^{μν}\,A_{μν})$ and $f(\mathcal{R})$ Modified Gravity Theories

    Authors: F. Ahmed, J. C. R. de Souza, A. F. Santos

    Abstract: In this paper, our objective is to explore a time-machine space-time formulated in general relativity, as introduced by Li (Phys. Rev. D {\bf 59}, 084016 (1999)), within the context of modified gravity theories. We consider Ricci-inverse gravity of all Classes of models, {\it i.e.}, (i) Class-{\bf I}: $f(\mathcal{R}, \mathcal{A})=(\mathcal{R}+{κ\,\mathcal{R}^2}+β\,\mathcal{A})$, (ii) Class-{\bf II…

    Submitted 4 October, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: 23 pages, 4 figures

    Journal ref: JCAP 10, 015 (2024)

  17. arXiv:2407.04596  [pdf, ps, other]

    cs.SE

    Teaching and Learning Ethnography for Software Engineering Contexts

    Authors: Yvonne Dittrich, Helen Sharp, Cleidson de Souza

    Abstract: Ethnography has become one of the established methods for empirical research on software engineering. Although there is a wide variety of introductory books available, there has been no material targeting software engineering students particularly, until now. In this chapter we provide an introduction to teaching and learning ethnography for faculty teaching ethnography to software engineering gra…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 38 pages, to be published in: Daniel Mendez, Paris Avgeriou, Marcos Kalinowski, and Nauman bin Ali (eds.) Teaching Empirical Research Methods in Software Engineering, Springer

  18. arXiv:2407.01030  [pdf, other]

    math.AC

    Tame fields, Graded Rings and Finite Complete Sequences of Key Polynomials

    Authors: Caio Henrique Silva de Souza

    Abstract: In this paper, we present a criterion for $(K,v)$ to be henselian and defectless in terms of finite complete sequences of key polynomials. For this, we use the theory of Mac Lane-Vaquié chains and abstract key polynomials. We then prove that a valued field $(K,v)$ is tame if and only if $vK$ is $p$-divisible, $Kv$ is perfect and every simple algebraic extension of $K$ admits a finite complete sequ…

    Submitted 10 January, 2025; v1 submitted 1 July, 2024; originally announced July 2024.

    MSC Class: 13A18

  19. arXiv:2406.06175  [pdf, ps, other]

    nlin.CD

    Ratchet current and scaling properties in a nontwist mapping

    Authors: Matheus Rolim Sales, Daniel Borin, Leonardo Costa de Souza, José Danilo Szezech Jr., Ricardo Luiz Viana, Iberê Luiz Caldas, Edson Denis Leonel

    Abstract: We investigate the transport of particles in the chaotic component of phase space for a two-dimensional, area-preserving nontwist map. The survival probability for particles within the chaotic sea is described by an exponential decay for regions in phase space predominantly chaotic and it is scaling invariant in this case. Alternatively, when considering mixed chaotic and regular regions, there is…

    Submitted 13 August, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  20. arXiv:2406.00245  [pdf, other]

    stat.ME stat.AP

    Model-based Clustering of Multi-Dimensional Zero-Inflated Counts via the EM Algorithm

    Authors: Zahra AghahosseinaliShirazi, Pedro A. Rangel, Camila P. E. de Souza

    Abstract: Zero-inflated count data arise in various fields, including health, biology, economics, and the social sciences. These data are often modelled using probabilistic distributions such as zero-inflated Poisson (ZIP), zero-inflated negative binomial (ZINB), or zero-inflated binomial (ZIB). To account for heterogeneity in the data, it is often useful to cluster observations into groups that may explain…

    Submitted 27 March, 2025; v1 submitted 31 May, 2024; originally announced June 2024.

    Comments: 38

  21. arXiv:2406.00049  [pdf, other]

    cs.CL cs.LG

    QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine Translation

    Authors: Gonçalo R. A. Faria, Sweta Agrawal, António Farinhas, Ricardo Rei, José G. C. de Souza, André F. T. Martins

    Abstract: An important challenge in machine translation (MT) is to generate high-quality and diverse translations. Prior work has shown that the estimated likelihood from the MT model correlates poorly with translation quality. In contrast, quality evaluation metrics (such as COMET or BLEURT) exhibit high correlations with human judgments, which has motivated their use as rerankers (such as quality-aware an…

    Submitted 15 October, 2024; v1 submitted 28 May, 2024; originally announced June 2024.

    Comments: Accepted at NeurIPS Main 2024

  22. arXiv:2405.20758  [pdf, other]

    stat.ME

    Fast Bayesian Basis Selection for Functional Data Representation with Correlated Errors

    Authors: Ana Carolina da Cruz, Camila P. E. de Souza, Pedro H. T. O. Sousa

    Abstract: Functional data analysis finds widespread application across various fields. While functional data are intrinsically infinite-dimensional, in practice, they are observed only at a finite set of points, typically over a dense grid. As a result, smoothing techniques are often used to approximate the observed data as functions. In this work, we propose a novel Bayesian approach for selecting basis fu…

    Submitted 8 November, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: 41 pages (31 in the main text and 10 in the supplementary material)

  23. arXiv:2402.17733  [pdf, other]

    cs.CL

    Tower: An Open Multilingual Large Language Model for Translation-Related Tasks

    Authors: Duarte M. Alves, José Pombal, Nuno M. Guerreiro, Pedro H. Martins, João Alves, Amin Farajian, Ben Peters, Ricardo Rei, Patrick Fernandes, Sweta Agrawal, Pierre Colombo, José G. C. de Souza, André F. T. Martins

    Abstract: While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose a recipe for tailoring LLMs to multiple tasks present in translation workflows. We perform continued pretraining on a multilingual mixture of monolingual and pa…

    Submitted 27 February, 2024; originally announced February 2024.

  24. arXiv:2402.17420  [pdf, other]

    cs.CV cs.AI

    PANDAS: Prototype-based Novel Class Discovery and Detection

    Authors: Tyler L. Hayes, César R. de Souza, Namil Kim, Jiwon Kim, Riccardo Volpi, Diane Larlus

    Abstract: Object detectors are typically trained once and for all on a fixed set of classes. However, this closed-world assumption is unrealistic in practice, as new classes will inevitably emerge after the detector is deployed in the wild. In this work, we look at ways to extend a detector trained for a set of base classes so it can i) spot the presence of novel classes, and ii) automatically enrich its re…

    Submitted 30 April, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted to the Conference on Lifelong Learning Agents (CoLLAs 2024)

  25. Cosmological constant Petrov type-N space-time in Ricci-inverse gravity

    Authors: F. Ahmed, J. C. R. de Souza, A. F. Santos

    Abstract: Our focus is on a specific type-N space-time that exhibits closed time-like curves in general relativity theory within the framework of Ricci-inverse gravity model. The matter-energy content is solely composed of a pure radiation field, and it adheres to the energy conditions while featuring a negative cosmological constant. One of the key findings in this investigation is the non-zero determinant…

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: 15 pages, accepted for publication in AOP

  26. arXiv:2312.14281  [pdf, other]

    cond-mat.soft cond-mat.stat-mech

    Polar order, shear banding, and clustering in confined active matter

    Authors: Daniel Canavello, Rubens H. Damascena, Leonardo R. E. Cabral, Clécio C. de Souza Silva

    Abstract: We investigate the collective behavior of sterically interacting self-propelled particles confined in a harmonic potential. Our theoretical and numerical study unveils the emergence of distinctive collective polar organizations, revealing how different levels of interparticle torques and noise influence the system. The observed phases include the shear-banded vortex, where the system self organize…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 12 pages, 10 figures

    MSC Class: 82

    ACM Class: J.2

    Journal ref: Soft Matter, 2024, 20, 2310-2320

  27. arXiv:2312.08472  [pdf, other]

    cs.NE cs.LG math.NA

    AutoNumerics-Zero: Automated Discovery of State-of-the-Art Mathematical Functions

    Authors: Esteban Real, Yao Chen, Mirko Rossini, Connal de Souza, Manav Garg, Akhil Verghese, Moritz Firsching, Quoc V. Le, Ekin Dogus Cubuk, David H. Park

    Abstract: Computers calculate transcendental functions by approximating them through the composition of a few limited-precision instructions. For example, an exponential can be calculated with a Taylor series. These approximation methods were developed over the centuries by mathematicians, who emphasized the attainability of arbitrary precision. Computers, however, operate on few limited precision types, su…

    Submitted 13 December, 2023; originally announced December 2023.

    ACM Class: I.2.2; I.2.6; G.1.2

  28. arXiv:2311.18452  [pdf, other]

    cs.SE

    Developer Experiences with a Contextualized AI Coding Assistant: Usability, Expectations, and Outcomes

    Authors: Gustavo Pinto, Cleidson de Souza, Thayssa Rocha, Igor Steinmacher, Alberto de Souza, Edward Monteiro

    Abstract: In the rapidly advancing field of artificial intelligence, software development has emerged as a key area of innovation. Despite the plethora of general-purpose AI assistants available, their effectiveness diminishes in complex, domain-specific scenarios. Noting this limitation, both the academic community and industry players are relying on contextualized coding AI assistants. These assistants su…

    Submitted 30 November, 2023; originally announced November 2023.

  29. arXiv:2311.18450  [pdf, other]

    cs.SE

    Lessons from Building StackSpot AI: A Contextualized AI Coding Assistant

    Authors: Gustavo Pinto, Cleidson de Souza, João Batista Neto, Alberto de Souza, Tarcísio Gotto, Edward Monteiro

    Abstract: With their exceptional natural language processing capabilities, tools based on Large Language Models (LLMs) like ChatGPT and Co-Pilot have swiftly become indispensable resources in the software developer's toolkit. While recent studies suggest the potential productivity gains these tools can unlock, users still encounter drawbacks, such as generic or incorrect answers. Additionally, the pursuit o…

    Submitted 4 January, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

  30. arXiv:2310.13448  [pdf, other]

    cs.CL

    Steering Large Language Models for Machine Translation with Finetuning and In-Context Learning

    Authors: Duarte M. Alves, Nuno M. Guerreiro, João Alves, José Pombal, Ricardo Rei, José G. C. de Souza, Pierre Colombo, André F. T. Martins

    Abstract: Large language models (LLMs) are a promising avenue for machine translation (MT). However, current LLM-based MT systems are brittle: their effectiveness highly depends on the choice of few-shot examples and they often require extra post-processing due to overgeneration. Alternatives such as finetuning on translation instructions are computationally expensive and may weaken in-context learning capa…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023 - Findings

  31. arXiv:2310.11430  [pdf, other]

    cs.CL

    An Empirical Study of Translation Hypothesis Ensembling with Large Language Models

    Authors: António Farinhas, José G. C. de Souza, André F. T. Martins

    Abstract: Large language models (LLMs) are becoming a one-fits-many solution, but they sometimes hallucinate or produce unreliable output. In this paper, we investigate how hypothesis ensembling can improve the quality of the generated text for the specific problem of LLM-based machine translation. We experiment with several techniques for ensembling hypotheses produced by LLMs such as ChatGPT, LLaMA, and A…

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 (main conference)

  32. arXiv:2310.09074  [pdf]

    eess.SY

    Effects of Distributed Generation on the Bidirectional Operation of Cascaded Step Voltage Regulators: Case Study of a Real 34.5 kV Distribution Feeder

    Authors: Hugo Rodrigues de Brito, Valéria Monteiro de Souza, João Paulo Abreu Vieira, Maria Emília de Lima Tostes, Ubiratan Holanda Bezerra, Vanderson Carvalho de Souza, Daniel da Conceição Pinheiro, Heitor Alves Barata, Hugo Nazareno de Souza Cardoso, Marcelo Sousa Costa

    Abstract: This work investigates the impact of feeder bidirectional active power flow on the operation of two cascaded step voltage regulators (SVRs) located at a 34.5 kV rural distribution feeder. It shows that, when active power flow reversal is possible both by network reconfiguration and by high penetration levels of distributed generation (DG), typical SVR control mode settings are unable to prevent th…

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 8 pages, 11 figures, submitted to XXV SNPTEE 2019

  33. arXiv:2310.01234  [pdf, other]

    physics.optics

    Back-Propagation Optimization and Multi-Valued Artificial Neural Networks for Highly Vivid Structural Color Filter Metasurfaces

    Authors: Arthur Clini de Souza, Stéphane Lanteri, Hugo Enrique Hernandez-Figueroa, Marco Abbarchi, David Grosso, Badre Kerzabi, Mahmoud Elsawy

    Abstract: We introduce a novel technique for designing color filter metasurfaces using a data-driven approach based on deep learning. Our innovative approach employs inverse design principles to identify highly efficient designs that outperform all the configurations in the dataset, which consists of 585 distinct geometries solely. By combining Multi-Valued Artificial Neural Networks and back-propagation op…

    Submitted 18 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: To be published. 25 Pages, 17 Figures

  34. arXiv:2309.11925  [pdf, other]

    cs.CL

    Scaling up COMETKIWI: Unbabel-IST 2023 Submission for the Quality Estimation Shared Task

    Authors: Ricardo Rei, Nuno M. Guerreiro, José Pombal, Daan van Stigt, Marcos Treviso, Luisa Coheur, José G. C. de Souza, André F. T. Martins

    Abstract: We present the joint contribution of Unbabel and Instituto Superior Técnico to the WMT 2023 Shared Task on Quality Estimation (QE). Our team participated on all tasks: sentence- and word-level quality prediction (task 1) and fine-grained error span detection (task 2). For all tasks, we build on the COMETKIWI-22 model (Rei et al., 2022b). Our multilingual approaches are ranked first for all tasks,…

    Submitted 21 September, 2023; originally announced September 2023.

  35. arXiv:2309.05439  [pdf, other]

    gr-qc

    An axially symmetric spacetime with causality violation in Ricci-inverse gravity

    Authors: J. C. R. de Souza, A. F. Santos

    Abstract: In this paper, Ricci-inverse gravity is investigated. It is an alternative theory of gravity that introduces into the Einstein-Hilbert action an anti-curvature scalar that is obtained from the anti-curvature tensor which is the inverse of the Ricci tensor. An axially symmetric spacetime with causality violation is studied. Two classes of the model are discussed. Different sources of matter are con…

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: 11 pages, accepted for publication in EPJC

  36. arXiv:2309.04947  [pdf, other]

    q-fin.MF math.OC math.PR

    Geometry of vectorial martingale optimal transport and robust option pricing

    Authors: Joshua Zoen-Git Hiew, Tongseok Lim, Brendan Pass, Marcelo Cruz de Souza

    Abstract: This paper addresses robust finance, which is concerned with the development of models and approaches that account for market uncertainties. Specifically, we investigate the Vectorial Martingale Optimal Transport (VMOT) problem, the geometry of its solutions, and its application with robust option pricing problems in finance. To this end, we consider two-period market models and show that when the…

    Submitted 18 September, 2023; v1 submitted 10 September, 2023; originally announced September 2023.

  37. How life-table right-censoring affected the Brazilian Social Security Factor: an application of the gamma-Gompertz-Makeham model

    Authors: Filipe Costa de Souza, Wilton Bernardino, Silvio Cabral Patricio

    Abstract: Automatic Adjustment Mechanisms (AAM) are legal instruments that help social security systems respond to demographic and economic changes. In Brazil, the Social Security Factor (SSF) was introduced in the late 1990s as an AAM to link retirement benefits to life expectancy at the retirement age, with the hope of promoting contributory justice and discouraging early retirement. Recent research has h…

    Submitted 12 September, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

  38. arXiv:2308.13322  [pdf, ps, other]

    math.AC

    Parametrizations of subsets of the space of valuations

    Authors: Josnei Antonio Novacoski, Caio Henrique Silva de Souza

    Abstract: In this paper we present different ways to parametrize subsets of the space of valuations on $K[x]$ extending a given valuation on $K$. We discuss the methods using pseudo-Cauchy sequences and approximation types. The method presented here is slightly different than the ones in the literature and we believe that our approach is more accurate.

    Submitted 25 August, 2023; originally announced August 2023.

    MSC Class: 13A18

  39. ProWis: A Visual Approach for Building, Managing, and Analyzing Weather Simulation Ensembles at Runtime

    Authors: Carolina Veiga Ferreira de Souza, Suzanna Maria Bonnet, Daniel de Oliveira, Marcio Cataldi, Fabio Miranda, Marcos Lage

    Abstract: Weather forecasting is essential for decision-making and is usually performed using numerical modeling. Numerical weather models, in turn, are complex tools that require specialized training and laborious setup and are challenging even for weather experts. Moreover, weather simulations are data-intensive computations and may take hours to days to complete. When the simulation is finished, the expe…

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted at IEEE VIS 2023

    Journal ref: Published in: IEEE Transactions on Visualization and Computer Graphics (Volume: 30, Issue: 1, January 2024)

  40. arXiv:2305.17631  [pdf, other]

    stat.ME

    BayesCPclust: A Bayesian Approach for Clustering Constant-Wise Change-Point Data

    Authors: Ana Carolina da Cruz, Camila P. E. de Souza

    Abstract: Change-point models deal with ordered data sequences. Their primary goal is to infer the locations where an aspect of the data sequence changes. In this paper, we propose and implement a nonparametric Bayesian model for clustering observations based on their constant-wise change-point profiles via Gibbs sampler. Our model incorporates a Dirichlet Process on the constant-wise change-point structure…

    Submitted 10 February, 2025; v1 submitted 28 May, 2023; originally announced May 2023.

    Comments: 30 pages, 12 figures

  41. arXiv:2305.00955  [pdf, other]

    cs.CL cs.AI cs.LG

    Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation

    Authors: Patrick Fernandes, Aman Madaan, Emmy Liu, António Farinhas, Pedro Henrique Martins, Amanda Bertsch, José G. C. de Souza, Shuyan Zhou, Tongshuang Wu, Graham Neubig, André F. T. Martins

    Abstract: Many recent advances in natural language generation have been fueled by training large language models on internet-scale data. However, this paradigm can lead to models that generate toxic, inaccurate, and unhelpful content, and automatic evaluation metrics often fail to identify these behaviors. As models become more capable, human feedback is an invaluable signal for evaluating and improving mod…

    Submitted 31 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: Work in Progress

  42. arXiv:2303.09598  [pdf, other]

    stat.ME

    Variational Bayesian analysis of survival data using a log-logistic accelerated failure time model

    Authors: Chengqian Xian, Camila P. E. de Souza, Wenqing He, Felipe F. Rodrigues, Renfang Tian

    Abstract: The log-logistic regression model is one of the most commonly used accelerated failure time (AFT) models in survival analysis, for which statistical inference methods are mainly established under the frequentist framework. Recently, Bayesian inference for log-logistic AFT models using Markov chain Monte Carlo (MCMC) techniques has also been widely developed. In this work, we develop an alternative…

    Submitted 10 October, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

  43. arXiv:2303.03521  [pdf, ps, other]

    stat.ME stat.CO

    Bayesian Variable Selection for Function-on-Scalar Regression Models: a comparative analysis

    Authors: Pedro Henrique T. O. Sousa, Camila P. E. de Souza, Ronaldo Dias

    Abstract: In this work, we developed a new Bayesian method for variable selection in function-on-scalar regression (FOSR). Our method uses a hierarchical Bayesian structure and latent variables to enable an adaptive covariate selection process for FOSR. Extensive simulation studies show the proposed method's main properties, such as its accuracy in estimating the coefficients and high capacity to select var…

    Submitted 24 April, 2024; v1 submitted 6 March, 2023; originally announced March 2023.

  44. arXiv:2302.05488  [pdf]

    cs.LG cs.AI cs.CV

    Element-Wise Attention Layers: an option for optimization

    Authors: Giovanni Araujo Bacochina, Rodrigo Clemente Thom de Souza

    Abstract: The use of Attention Layers has become a trend since the popularization of the Transformer-based models, being the key element for many state-of-the-art models that have been developed through recent years. However, one of the biggest obstacles in implementing these architectures - as well as many others in Deep Learning Field - is the enormous amount of optimizing parameters they possess, which m…

    Submitted 10 February, 2023; originally announced February 2023.

  45. arXiv:2302.05433  [pdf, other]

    cs.LG cs.NE

    Unified Functional Hashing in Automatic Machine Learning

    Authors: Ryan Gillard, Stephen Jonany, Yingjie Miao, Michael Munn, Connal de Souza, Jonathan Dungay, Chen Liang, David R. So, Quoc V. Le, Esteban Real

    Abstract: The field of Automatic Machine Learning (AutoML) has recently attained impressive results, including the discovery of state-of-the-art machine learning solutions, such as neural image classifiers. This is often done by applying an evolutionary search method, which samples multiple candidate solutions from a large space and evaluates the quality of each candidate through a long training process. As…

    Submitted 10 February, 2023; originally announced February 2023.

    ACM Class: I.2.2; I.2.6

  46. arXiv:2212.03158  [pdf, ps, other]

    eess.SY

    Robust Switching Control of DC-DC Boost Converter for EV Charging Stations

    Authors: Saif Ahmad, Ryan P. C. de Souza, Pauline Kergus, Zohra Kader, Stephane Caux

    Abstract: In this work, the problem of switching control design for DC-DC boost converter is considered, in the case of operation under uncertain equilibrium condition arising due to perturbations in the input and load parameters. Assuming that these uncertain parameters are generated via a known linear exo-system, a parameter estimator is designed to update the equilibrium point for the switching controlle…

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: 8 pages, 4 figures

  47. arXiv:2209.12985  [pdf, other]

    cs.CR

    A Bibliometrics Analysis on 28 years of Authentication and Threat Model Area

    Authors: Wesley dos Reis Bezerra, Cristiano Antônio de Souza, Carla Merkle Westphall, Carlos Becker Westphall

    Abstract: The large volume of publications in any research area can make it difficult for researchers to track their research areas' trends, challenges, and characteristics. Bibliometrics solves this problem by bringing statistical tools to help the analysis of selected publications from an online database. Although there are different works in security, our study aims to fill the bibliometric gap in the au…

    Submitted 26 September, 2022; originally announced September 2022.

  48. arXiv:2209.12984  [pdf, other]

    cs.CR cs.SE

    Characteristics and Main Threats about Multi-Factor Authentication: A Survey

    Authors: Wesley dos Reis Bezerra, Cristiano Antônio de Souza, Carla Merkle Westphall, Carlos Becker Westphall

    Abstract: This work reports that the Systematic Literature Review process is responsible for providing theoretical support to research in the Threat Model and Multi-Factor Authentication. However, different from the related works, this study aims to evaluate the main characteristics of authentication solutions and their threat model. Also, it intends to list characteristics, threats, and related content to…

    Submitted 26 September, 2022; originally announced September 2022.

  49. arXiv:2209.06243  [pdf, other]

    cs.CL cs.LG

    CometKiwi: IST-Unbabel 2022 Submission for the Quality Estimation Shared Task

    Authors: Ricardo Rei, Marcos Treviso, Nuno M. Guerreiro, Chrysoula Zerva, Ana C. Farinha, Christine Maroti, José G. C. de Souza, Taisiya Glushkova, Duarte M. Alves, Alon Lavie, Luisa Coheur, André F. T. Martins

    Abstract: We present the joint contribution of IST and Unbabel to the WMT 2022 Shared Task on Quality Estimation (QE). Our team participated on all three subtasks: (i) Sentence and Word-level Quality Prediction; (ii) Explainable QE; and (iii) Critical Error Detection. For all tasks we build on top of the COMET framework, connecting it with the predictor-estimator architecture of OpenKiwi, and equipping it w…

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: WMT 2022 Quality Estimation shared task

  50. arXiv:2205.13716  [pdf, other]

    stat.ME

    Clustering Functional Data via Variational Inference

    Authors: Chengqian Xian, Camila de Souza, John Jewell, Ronaldo Dias

    Abstract: Functional data analysis deals with data recorded densely over time (or any other continuum) with one or more observed curves per subject. Conceptually, functional data are continuously defined, but in practice, they are usually observed at discrete points. Among different kinds of functional data analyses, clustering analysis aims to determine underlying groups of curves in the dataset when there…

    Submitted 18 January, 2023; v1 submitted 26 May, 2022; originally announced May 2022.