Showing 1–50 of 71 results for author: Cavalcante, M

  1. arXiv:2505.22439  [pdf, ps, other]

    math.DG

    Rigidity of surfaces with nonpositive Euler characteristic by the second eigenvalue of the Jacobi operator

    Authors: Márcio Batista, Marcos P. Cavalcante, Abraão Mendes, Ivaldo Nunes

    Abstract: In this paper, we investigate the spectral properties of the Jacobi operator for immersed surfaces with nonpositive Euler characteristic, extending previous results in the field. We first prove a sharp upper bound for the second eigenvalue of the Jacobi operator for compact surfaces with nonpositive Euler characteristic that are fully immersed in the Euclidean sphere, and then we classify all such…

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: 15 pages. Comments welcome

    MSC Class: Primary 53C24; 58J50; Secondary 49Q05; 53A10

  2. arXiv:2505.20000  [pdf, other]

    quant-ph

    Correcting noisy quantum gates with shortcuts to adiabaticity

    Authors: Moallison F. Cavalcante, Bariş Çakmak, Marcus V. S. Bonança, Sebastian Deffner

    Abstract: Unitary quantum gates constitute the building blocks of Quantum Computing in the circuit paradigm. In this work, we engineer a locally driven two-qubit Hamiltonian whose instantaneous ground-state dynamics generates the controlled-NOT (CNOT) quantum gate. In practice, quantum gates have to be implemented in finite-time, hence non-adiabatic and external noise effects debilitate gate fidelities. Her…

    Submitted 26 May, 2025; originally announced May 2025.

  3. arXiv:2504.20145  [pdf, other]

    quant-ph

    Roadmap on Quantum Thermodynamics

    Authors: Steve Campbell, Irene D'Amico, Mario A. Ciampini, Janet Anders, Natalia Ares, Simone Artini, Alexia Auffèves, Lindsay Bassman Oftelie, Laetitia P. Bettmann, Marcus V. S. Bonança, Thomas Busch, Michele Campisi, Moallison F. Cavalcante, Luis A. Correa, Eloisa Cuestas, Ceren B. Dag, Salambô Dago, Sebastian Deffner, Adolfo Del Campo, Andreas Deutschmann-Olek, Sandro Donadi, Emery Doucet, Cyril Elouard, Klaus Ensslin, Paul Erker, et al. (44 additional authors not shown)

    Abstract: The last two decades have seen quantum thermodynamics become a well-established field of research in its own right. In that time, it has demonstrated a remarkably broad applicability, ranging from providing foundational advances in the understanding of how thermodynamic principles apply at the nano-scale and in the presence of quantum coherence, to providing a guiding framework for the development…

    Submitted 28 April, 2025; originally announced April 2025.

    Comments: 24 perspectives across 64 pages. Submission to "Focus on Thermodynamics in Quantum Coherent Platforms" in Quantum Science and Technology. Comments related to individual contributions should be directed to the relevant authors

  4. arXiv:2502.18953  [pdf, other]

    cs.AR cs.DC

    A Reliable, Time-Predictable Heterogeneous SoC for AI-Enhanced Mixed-Criticality Edge Applications

    Authors: Angelo Garofalo, Alessandro Ottaviano, Matteo Perotti, Thomas Benz, Yvan Tortorella, Robert Balas, Michael Rogenmoser, Chi Zhang, Luca Bertaccini, Nils Wistoff, Maicol Ciani, Cyril Koenig, Mattia Sinigaglia, Luca Valente, Paul Scheffler, Manuel Eggimann, Matheus Cavalcante, Francesco Restuccia, Alessandro Biondi, Francesco Conti, Frank K. Gurkaynak, Davide Rossi, Luca Benini

    Abstract: Next-generation mixed-criticality Systems-on-chip (SoCs) for robotics, automotive, and space must execute mixed-criticality AI-enhanced sensor processing and control workloads, ensuring reliable and time-predictable execution of critical tasks sharing resources with non-critical tasks, while also fitting within a sub-2W power envelope. To tackle these multi-dimensional challenges, in this brief, w…

    Submitted 26 February, 2025; originally announced February 2025.

  5. arXiv:2502.07967  [pdf, ps, other]

    math.AP

    The Korteweg-de Vries Equation on general star graphs

    Authors: Márcio Cavalcante, José Marques Neto

    Abstract: In this paper, we establish local well-posedness for the Cauchy problem associated with the Korteweg-de Vries (KdV) equation on a general metric star graph. The graph comprises m + k semi-infinite edges: k negative half-lines and m positive half-lines, all joined at a common vertex. The choice of boundary conditions is compatible with the conditions determined by the semigroup theory. The crucial…

    Submitted 11 February, 2025; originally announced February 2025.

  6. arXiv:2501.13914  [pdf, other]

    cond-mat.str-el quant-ph

    Emergence of $X$ states in a quantum impurity model

    Authors: Moallison F. Cavalcante, Marcus V. S. Bonança, Eduardo Miranda, Sebastian Deffner

    Abstract: In the present work, we demonstrate the emergence of $X$ states in the long-time response of a locally perturbed many-body quantum impurity model. The emergence of the double-qubit state is heralded by the lack of decay of the response function as well as the out-of-time order correlator, signifying the trapping of excitations and hence information in edge modes. Surprisingly, after carrying out a…

    Submitted 3 May, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

    Report number: Phys. Rev. Research 7, L022027 (2025)

    Journal ref: Phys. Rev. Research 7, L022027 (2025)

  7. Occamy: A 432-Core Dual-Chiplet Dual-HBM2E 768-DP-GFLOP/s RISC-V System for 8-to-64-bit Dense and Sparse Computing in 12nm FinFET

    Authors: Paul Scheffler, Thomas Benz, Viviane Potocnik, Tim Fischer, Luca Colagrande, Nils Wistoff, Yichao Zhang, Luca Bertaccini, Gianmarco Ottavi, Manuel Eggimann, Matheus Cavalcante, Gianna Paulin, Frank K. Gürkaynak, Davide Rossi, Luca Benini

    Abstract: ML and HPC applications increasingly combine dense and sparse memory access computations to maximize storage efficiency. However, existing CPUs and GPUs struggle to flexibly handle these heterogeneous workloads with consistently high compute efficiency. We present Occamy, a 432-Core, 768-DP-GFLOP/s, dual-HBM2E, dual-chiplet RISC-V system with a latency-tolerant hierarchical interconnect and in-cor…

    Submitted 13 January, 2025; originally announced January 2025.

    Comments: 16 pages, 13 figures, 1 table. Accepted for publication in IEEE JSSC

  8. arXiv:2412.18575  [pdf]

    physics.data-an cond-mat.mes-hall eess.IV

    New method of image processing via statistical analysis for application in intelligent systems

    Authors: Monalisa Cavalcante, José Araújo, José Holanda

    Abstract: Image processing has always been a topic of significant importance to society. Recently, this field has gained considerable prominence due to the development of intelligent systems. In this work, we present a new method of image processing that utilizes statistical analysis, specifically designed for applications in intelligent systems. We tested our method on a large collection of images to asses…

    Submitted 24 December, 2024; originally announced December 2024.

  9. arXiv:2407.05447  [pdf, other]

    cs.AR

    Spatzformer: An Efficient Reconfigurable Dual-Core RISC-V V Cluster for Mixed Scalar-Vector Workloads

    Authors: Matteo Perotti, Michele Raeber, Mattia Sinigaglia, Matheus Cavalcante, Davide Rossi, Luca Benini

    Abstract: Multi-core vector processor architectures excel in handling computationally intensive vectorizable tasks but struggle to achieve optimal resource utilization when facing sequential and control tasks that cannot be vectorized. This work presents Spatzformer, the first reconfigurable RISC-V V (RVV) architecture developed from a baseline open-source dual-core cluster based on Snitch scalar cores augm…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: To be published in the 2024 IEEE 35th International Conference on Application-Specific Systems, Architectures and Processors (ASAP)

  10. arXiv:2406.15068  [pdf, other]

    cs.AR

    Occamy: A 432-Core 28.1 DP-GFLOP/s/W 83% FPU Utilization Dual-Chiplet, Dual-HBM2E RISC-V-based Accelerator for Stencil and Sparse Linear Algebra Computations with 8-to-64-bit Floating-Point Support in 12nm FinFET

    Authors: Gianna Paulin, Paul Scheffler, Thomas Benz, Matheus Cavalcante, Tim Fischer, Manuel Eggimann, Yichao Zhang, Nils Wistoff, Luca Bertaccini, Luca Colagrande, Gianmarco Ottavi, Frank K. Gürkaynak, Davide Rossi, Luca Benini

    Abstract: We present Occamy, a 432-core RISC-V dual-chiplet 2.5D system for efficient sparse linear algebra and stencil computations on FP64 and narrow (32-, 16-, 8-bit) SIMD FP data. Occamy features 48 clusters of RISC-V cores with custom extensions, two 64-bit host cores, and a latency-tolerant multi-chiplet interconnect and memory system with 32 GiB of HBM2E. It achieves leading-edge utilization on stenc…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 2 pages, 7 figures. Accepted at the 2024 IEEE Symposium on VLSI Technology & Circuits

  11. arXiv:2406.13628  [pdf, ps, other]

    math.DG math.AP

    Stability of extremal domains for the first eigenvalue of the Laplacian operator

    Authors: Marcos P. Cavalcante, Ivaldo Nunes

    Abstract: In this paper, we compute the second variation of the first Dirichlet eigenvalue on extremal domains in general Riemannian manifolds and establish a criterion for stability. We classify the stable extremal domains in the 2-sphere and higher-dimensional spheres when the boundary is minimal. Additionally, we establish topological bounds for stable domains in a general compact Riemannian surface, ass…

    Submitted 28 July, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: This new version includes a new theorem (Theorem 1.3) and additional references. 23 pages. Comments and feedback are welcome!

  12. arXiv:2406.09927  [pdf, ps, other]

    math.DG

    Index estimates for harmonic Gauss maps

    Authors: Alcides de Carvalho, Marcos P. Cavalcante, Wagner Costa-Filho, Darlan de Oliveira

    Abstract: Let $Σ$ denote a closed surface with constant mean curvature in $\mathbb{G}^3$, a 3-dimensional Lie group equipped with a bi-invariant metric. For such surfaces, there is a harmonic Gauss map which maps values to the unit sphere within the Lie algebra of $\mathbb{G}$. We prove that the energy index of the Gauss map of $Σ$ is bounded below by its topological genus. We also obtain index estimates in…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 11 pages

  13. arXiv:2405.18233  [pdf, ps, other]

    math.DG math.AP math.SP

    First Eigenvalue of Jacobi operator and Rigidity Results for Constant Mean Curvature Hypersurfaces

    Authors: Marcio Batista, Marcos P. Cavalcante, Luiz R. Melo

    Abstract: In this paper, we obtain geometric upper bounds for the first eigenvalue $λ_1^J$ of the Jacobi operator for both closed and compact with boundary hypersurfaces having constant mean curvature (CMC). As an application, we derive new rigidity results for the area of CMC hypersurfaces under suitable conditions on $λ_1^J$ and the curvature of the ambient space. We also address the Jacobi-Steklov proble…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 17 pages. Comments welcome!

  14. TeraPool-SDR: An 1.89TOPS 1024 RV-Cores 4MiB Shared-L1 Cluster for Next-Generation Open-Source Software-Defined Radios

    Authors: Yichao Zhang, Marco Bertuletti, Samuel Riedel, Matheus Cavalcante, Alessandro Vanelli-Coralli, Luca Benini

    Abstract: Radio Access Networks (RAN) workloads are rapidly scaling up in data processing intensity and throughput as the 5G (and beyond) standards grow in number of antennas and sub-carriers. Offering flexible Processing Elements (PEs), efficient memory access, and a productive parallel programming model, many-core clusters are a well-matched architecture for next-generation software-defined RANs, but stag…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 6 pages, 6 figures and 3 tables

  15. arXiv:2404.10074  [pdf, other]

    cond-mat.str-el cond-mat.stat-mech quant-ph

    Nano-welding of quantum spin-$1/2$ chains at minimal dissipation

    Authors: Moallison F. Cavalcante, Marcus V. S. Bonança, Eduardo Miranda, Sebastian Deffner

    Abstract: We consider the optimal control of switching on a coupling term between two quantum many-body systems. Specifically, we (i) quantify the energetic cost of establishing a weak junction between two quantum spin-$1/2$ chains in finite time $τ$ and (ii) identify the energetically optimal protocol to realize it. For linear driving protocols, we find that for long times the excess (irreversible) work sc…

    Submitted 23 January, 2025; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: Article published as Editors' Suggestion in PRB

    Report number: Phys. Rev. B 110, 064304 (2024)

    Journal ref: Phys. Rev. B 110, 064304 (2024)

  16. arXiv:2402.01561  [pdf, other]

    math.AP

    Dynamics of the Korteweg-de Vries equation on a balanced metric graph

    Authors: Jaime Angulo Pava, Márcio Cavalcante

    Abstract: In this work, we establish local well-posedness for the Korteweg-de Vries model on a balanced star graph with a structure represented by semi-infinite edges, by considering a boundary condition of $δ$-type at the unique graph-vertex. Also, we extend the linear instability result of Angulo and Cavalcante (2021) to one of nonlinear instability. For the proof of local well posedness theory the prin…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 31 pages, 4 figures

    MSC Class: Primary 35Q51; 35Q53; 35J61; Secondary 47E05

  17. arXiv:2401.04012  [pdf, other]

    cs.AR

    MX: Enhancing RISC-V's Vector ISA for Ultra-Low Overhead, Energy-Efficient Matrix Multiplication

    Authors: Matteo Perotti, Yichao Zhang, Matheus Cavalcante, Enis Mustafa, Luca Benini

    Abstract: Dense Matrix Multiplication (MatMul) is arguably one of the most ubiquitous compute-intensive kernels, spanning linear algebra, DSP, graphics, and machine learning applications. Thus, MatMul optimization is crucial not only in high-performance processors but also in embedded low-power platforms. Several Instruction Set Architectures (ISAs) have recently included matrix extensions to improve MatMul…

    Submitted 8 January, 2024; originally announced January 2024.

  18. Ara2: Exploring Single- and Multi-Core Vector Processing with an Efficient RVV 1.0 Compliant Open-Source Processor

    Authors: Matteo Perotti, Matheus Cavalcante, Renzo Andri, Lukas Cavigelli, Luca Benini

    Abstract: Vector processing is highly effective in boosting processor performance and efficiency for data-parallel workloads. In this paper, we present Ara2, the first fully open-source vector processor to support the RISC-V V 1.0 frozen ISA. We evaluate Ara2's performance on a diverse set of data-parallel kernels for various problem sizes and vector-unit configurations, achieving an average functional-unit…

    Submitted 17 June, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: To be published in: IEEE Transactions on Computers

  19. arXiv:2311.06878  [pdf, ps, other]

    math.AP math.DG

    Geometric properties of extremal domains for the $p$-Laplacian operator

    Authors: Francisco G. Carvalho, Marcos P. Cavalcante

    Abstract: In this paper, we explore the geometric properties of unbounded extremal domains for the $p$-Laplacian operator in both Euclidean and hyperbolic spaces. Assuming that the nonlinearity grows at least as the nonlinearity of the eigenvalue problem, we prove that these domains exhibit remarkable geometric properties and cannot be arbitrarily wide. In two dimensions, we prove that such domains with con…

    Submitted 12 November, 2023; originally announced November 2023.

    MSC Class: 35Nxx; 35Pxx; 49Kxx; 49Qxx

  20. arXiv:2309.10137  [pdf, other]

    cs.AR

    Spatz: Clustering Compact RISC-V-Based Vector Units to Maximize Computing Efficiency

    Authors: Matteo Perotti, Samuel Riedel, Matheus Cavalcante, Luca Benini

    Abstract: The ever-increasing computational and storage requirements of modern applications and the slowdown of technology scaling pose major challenges to designing and implementing efficient computer architectures. To mitigate the bottlenecks of typical processor-based architectures on both the instruction and data sides of the memory, we present Spatz, a compact 64-bit floating-point-capable vector proce…

    Submitted 9 January, 2025; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: To be published in "IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems"

  21. arXiv:2308.00154  [pdf, other]

    cs.AR

    PATRONoC: Parallel AXI Transport Reducing Overhead for Networks-on-Chip targeting Multi-Accelerator DNN Platforms at the Edge

    Authors: Vikram Jain, Matheus Cavalcante, Nazareno Bruschi, Michael Rogenmoser, Thomas Benz, Andreas Kurth, Davide Rossi, Luca Benini, Marian Verhelst

    Abstract: Emerging deep neural network (DNN) applications require high-performance multi-core hardware acceleration with large data bursts. Classical network-on-chips (NoCs) use serial packet-based protocols suffering from significant protocol translation overheads towards the endpoints. This paper proposes PATRONoC, an open-source fully AXI-compliant NoC fabric to better address the specific needs of multi…

    Submitted 31 July, 2023; originally announced August 2023.

    Comments: Accepted and presented at 60th DAC

  22. arXiv:2305.11688  [pdf, other]

    cond-mat.str-el cond-mat.supr-con

    Raman Response of the Charge Density Wave in Cuprate Superconductors

    Authors: Moallison F. Cavalcante, S. Bag, I. Paul, A. Sacuto, M. C. O. Aguiar, M. Civelli

    Abstract: We study the Raman response, for $B_{1g}$ and $B_{2g}$ light-polarization symmetries, of the charge density wave phase appearing in the underdoped region of cuprate superconductors. We show that the $B_{2g}$ response provides a distinctive signature of the charge order, independently of the details of the electronic structure and from the concomitant presence of a pseudogap, in sharp contrast with…

    Submitted 10 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Journal ref: Phys. Rev. B 108, 165111 (2023)

  23. FlooNoC: A Multi-Tbps Wide NoC for Heterogeneous AXI4 Traffic

    Authors: Tim Fischer, Michael Rogenmoser, Matheus Cavalcante, Frank K. Gürkaynak, Luca Benini

    Abstract: Meeting the staggering bandwidth requirements of today's applications challenges the traditional narrow and serialized NoCs, which hit hard bounds on the maximum operating frequency. This paper proposes FlooNoC, an open-source, low-latency, fully AXI4-compatible NoC with wide physical channels for latency-tolerant high-bandwidth non-blocking transactions and decoupled latency-critical short messag…

    Submitted 6 August, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

  24. MemPool: A Scalable Manycore Architecture with a Low-Latency Shared L1 Memory

    Authors: Samuel Riedel, Matheus Cavalcante, Renzo Andri, Luca Benini

    Abstract: Shared L1 memory clusters are a common architectural pattern (e.g., in GPGPUs) for building efficient and flexible multi-processing-element (PE) engines. However, it is a common belief that these tightly-coupled clusters would not scale beyond a few tens of PEs. In this work, we tackle scaling shared L1 clusters to hundreds of PEs while supporting a flexible and productive programming model and ma…

    Submitted 28 November, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: 14 pages, 17 figures, 2 tables, Published in IEEE Transactions on Computers

    Journal ref: IEEE Transactions on Computers, vol. 72, no. 12, pp. 3561-3575, Dec. 2023

  25. arXiv:2302.05996  [pdf, other]

    cs.AR

    Quark: An Integer RISC-V Vector Processor for Sub-Byte Quantized DNN Inference

    Authors: MohammadHossein AskariHemmat, Theo Dupuis, Yoan Fournier, Nizar El Zarif, Matheus Cavalcante, Matteo Perotti, Frank Gurkaynak, Luca Benini, Francois Leduc-Primeau, Yvon Savaria, Jean-Pierre David

    Abstract: In this paper, we present Quark, an integer RISC-V vector processor specifically tailored for sub-byte DNN inference. Quark is implemented in GlobalFoundries' 22FDX FD-SOI technology. It is designed on top of Ara, an open-source 64-bit RISC-V vector processor. To accommodate sub-byte DNN inference, Quark extends Ara by adding specialized vector instructions to perform sub-byte quantized operations…

    Submitted 12 February, 2023; originally announced February 2023.

    Comments: 5 pages. Accepted for publication in the 56th International Symposium on Circuits and Systems (ISCAS 2023)

    ACM Class: C.1.3; C.3

  26. On the fundamental tone of the $p$-Laplacian on Riemannian manifolds and applications

    Authors: Francisco G. de S. Carvalho, Marcos Petrucio Cavalcante

    Abstract: We present a general lower bound for the fundamental tone for the $p$-Laplacian on Riemannian manifolds carrying a special kind of function. We then apply our result to the cases of negatively curved simply connected manifolds, a class of warped product manifolds and for a class of Riemannian submersions.

    Submitted 28 January, 2023; originally announced January 2023.

    Journal ref: J. Math. Anal. Appl. 506 (2022) 125703

  27. Index bounds for closed minimal surfaces in 3-manifolds with the Killing property

    Authors: Marcos P. Cavalcante, Darlan F. de Oliveira, Robson dos S. Silva

    Abstract: Let $Σ$ be a closed minimal surface immersed in a Riemannian 3-manifold carrying an orthonormal Killing frame. This class of ambient spaces includes Lie groups with a bi-invariant metric. In this paper, we prove that the sum of the Morse index and the nullity of $Σ$ is bounded from below by a constant times its genus.

    Submitted 28 January, 2023; originally announced January 2023.

    Comments: Dedicated to Professor Renato Tribuzy on the occasion of his 75th birthday

    Journal ref: Mat. Contemp. 50 (2022) 38-53

  28. arXiv:2211.13989  [pdf, other]

    cs.AR cs.DC cs.NI

    HexaMesh: Scaling to Hundreds of Chiplets with an Optimized Chiplet Arrangement

    Authors: Patrick Iff, Maciej Besta, Matheus Cavalcante, Tim Fischer, Luca Benini, Torsten Hoefler

    Abstract: 2.5D integration is an important technique to tackle the growing cost of manufacturing chips in advanced technology nodes. This poses the challenge of providing high-performance inter-chiplet interconnects (ICIs). As the number of chiplets grows to tens or hundreds, it becomes infeasible to hand-optimize their arrangement in a way that maximizes the ICI performance. In this paper, we propose HexaM…

    Submitted 8 October, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

  29. arXiv:2211.13980  [pdf, other]

    cs.AR cs.DC cs.NI

    Sparse Hamming Graph: A Customizable Network-on-Chip Topology

    Authors: Patrick Iff, Maciej Besta, Matheus Cavalcante, Tim Fischer, Luca Benini, Torsten Hoefler

    Abstract: Chips with hundreds to thousands of cores require scalable networks-on-chip (NoCs). Customization of the NoC topology is necessary to reach the diverse design goals of different chips. We introduce sparse Hamming graph, a novel NoC topology with an adjustable cost-performance trade-off that is based on four NoC topology design principles we identified. To efficiently customize this topology, we dev…

    Submitted 28 June, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

  30. Quench dynamics of the Kondo effect: transport across an impurity coupled to interacting wires

    Authors: Moallison F. Cavalcante, Rodrigo G. Pereira, Maria C. O. Aguiar

    Abstract: We study the real-time dynamics of the Kondo effect after a quantum quench in which a magnetic impurity is coupled to two metallic Hubbard chains. Using an effective field theory approach, we find that for noninteracting electrons the charge current across the impurity is given by a scaling function that involves the Kondo time. In the interacting case, we show that the Kondo time decreases with t…

    Submitted 7 February, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    Report number: Phys. Rev. B 107, 075110 (2023)

    Journal ref: Phys. Rev. B 107, 075110 (2023)

  31. A "New Ara" for Vector Computing: An Open Source Highly Efficient RISC-V V 1.0 Vector Processor Design

    Authors: Matteo Perotti, Matheus Cavalcante, Nils Wistoff, Renzo Andri, Lukas Cavigelli, Luca Benini

    Abstract: Vector architectures are gaining traction for highly efficient processing of data-parallel workloads, driven by all major ISAs (RISC-V, Arm, Intel), and boosted by landmark chips, like the Arm SVE-based Fujitsu A64FX, powering the TOP500 leader Fugaku. The RISC-V V extension has recently reached 1.0-Frozen status. Here, we present its first open-source implementation, discuss the new specification…

    Submitted 9 January, 2025; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: Accepted version of the article published in "2022 IEEE 33rd International Conference on Application-specific Systems, Architectures and Processors (ASAP)"

  32. arXiv:2209.00889  [pdf, other]

    cs.AR

    Soft Tiles: Capturing Physical Implementation Flexibility for Tightly-Coupled Parallel Processing Clusters

    Authors: Gianna Paulin, Matheus Cavalcante, Paul Scheffler, Luca Bertaccini, Yichao Zhang, Frank Gürkaynak, Luca Benini

    Abstract: Modern high-performance computing architectures (Multicore, GPU, Manycore) are based on tightly-coupled clusters of processing elements, physically implemented as rectangular tiles. Their size and aspect ratio strongly impact the achievable operating frequency and energy efficiency, but they should be as flexible as possible to achieve a high utilization for the top-level die floorplan. In this pa…

    Submitted 2 September, 2022; originally announced September 2022.

    Comments: 6 pages. Accepted for publication in the IEEE Computer Society Annual Symposium on VLSI (ISVLSI) 2022

  33. Spatz: A Compact Vector Processing Unit for High-Performance and Energy-Efficient Shared-L1 Clusters

    Authors: Matheus Cavalcante, Domenic Wüthrich, Matteo Perotti, Samuel Riedel, Luca Benini

    Abstract: While parallel architectures based on clusters of Processing Elements (PEs) sharing L1 memory are widespread, there is no consensus on how lean their PE should be. Architecting PEs as vector processors holds the promise to greatly reduce their instruction fetch bandwidth, mitigating the Von Neumann Bottleneck (VNB). However, due to their historical association with supercomputers, classical vector…

    Submitted 16 July, 2022; originally announced July 2022.

    Comments: 9 pages. Accepted for publication in the 2022 International Conference on Computer-Aided Design (ICCAD 2022)

    ACM Class: C.1.3; C.1.2

  34. arXiv:2206.02898  [pdf, other]

    math.AP math-ph

    Stability of mKdV breathers on the half-line

    Authors: Miguel A. Alejo, Márcio Cavalcante, Adán J. Corcho

    Abstract: In this paper we study the stability problem for mKdV breathers on the left half-line. We are able to show that leftwards moving breathers, initially located far away from the origin, are strongly stable for the problem posed on the left half-line, when assuming homogeneous boundary conditions. The proof involves a Lyapunov functional which is almost conserved by the mKdV flow once we control some…

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: 17 pages, 1 figure

    MSC Class: 35Q55

  35. MemPool-3D: Boosting Performance and Efficiency of Shared-L1 Memory Many-Core Clusters with 3D Integration

    Authors: Matheus Cavalcante, Anthony Agnesina, Samuel Riedel, Moritz Brunion, Alberto Garcia-Ortiz, Dragomir Milojevic, Francky Catthoor, Sung Kyu Lim, Luca Benini

    Abstract: Three-dimensional integrated circuits promise power, performance, and footprint gains compared to their 2D counterparts, thanks to drastic reductions in the interconnects' length through their smaller form factor. We can leverage the potential of 3D integration by enhancing MemPool, an open-source many-core design with 256 cores and a shared pool of L1 scratchpad memory connected with a low-latenc…

    Submitted 2 December, 2021; originally announced December 2021.

    Comments: Accepted for publication in DATE 2022 -- Design, Automation and Test in Europe Conference

  36. Controllability for Schrödinger type system with mixed dispersion on compact star graphs

    Authors: Roberto de A. Capistrano-Filho, Márcio Cavalcante, Fernando Gallego

    Abstract: In this work we are concerned with solutions to the linear Schrödinger type system with mixed dispersion, the so-called biharmonic Schrödinger equation. Precisely, we are able to prove an exact control property for these solutions with the control in the energy space posed on an oriented star graph structure $\mathcal{G}$ for $T>T_{min}$, with…

    Submitted 28 September, 2021; originally announced September 2021.

    Comments: 15 pages, 1 figure, comments are welcome

    MSC Class: 35R02; 35Q55; 35G30; 93B05; 93B07

    Journal ref: Evolution Equations & Control Theory (EECT) 2023

  37. arXiv:2104.05137  [pdf, ps, other]

    math.AP math-ph

    The nonlinear Quadratic Interactions of the Schrödinger type on the half-line

    Authors: Isnaldo Isaac Barbosa, Márcio Cavalcante

    Abstract: In this work we study the initial boundary value problem associated with the coupled Schrödinger equations with quadratic nonlinearities, which appear in nonlinear optics, on the half-line. We obtain local well-posedness for data in Sobolev spaces with low regularity, by using a forcing problem on the full line with a presence of a forcing term in order to apply the Fourier restriction method…

    Submitted 11 April, 2021; originally announced April 2021.

    Comments: 31 pages

    MSC Class: 35Q55; 35J61; 35J57

  38. Quench dynamics and relaxation of a spin coupled to interacting leads

    Authors: Helena Bragança, Moallison F. Cavalcante, R. G. Pereira, Maria C. O. Aguiar

    Abstract: We study a quantum quench in which a magnetic impurity is suddenly coupled to Hubbard chains, whose low-energy physics is described by Tomonaga-Luttinger liquid theory. Using the time-dependent density-matrix renormalization-group (tDMRG) technique, we analyze the propagation of charge, spin and entanglement in the chains after the quench and relate the light-cone velocities to the dispersion of h…

    Submitted 25 March, 2021; v1 submitted 22 January, 2021; originally announced January 2021.

    Comments: 11 pages, 5 figures

    Journal ref: Phys. Rev. B 103, 125152 (2021)

  39. MemPool: A Shared-L1 Memory Many-Core Cluster with a Low-Latency Interconnect

    Authors: Matheus Cavalcante, Samuel Riedel, Antonio Pullini, Luca Benini

    Abstract: A key challenge in scaling shared-L1 multi-core clusters towards many-core (more than 16 cores) configurations is to ensure low-latency and efficient access to the L1 memory. In this work we demonstrate that it is possible to scale up the shared-L1 architecture: We present MemPool, a 32-bit many-core system with 256 fast RV32IMA "Snitch" cores featuring application-tunable execution units, running…

    Submitted 5 December, 2020; originally announced December 2020.

    Comments: Accepted for publication in the Design, Automation and Test in Europe (DATE) Conference 2021

  40. An Open-Source Platform for High-Performance Non-Coherent On-Chip Communication

    Authors: Andreas Kurth, Wolfgang Rönninger, Thomas Benz, Matheus Cavalcante, Fabian Schuiki, Florian Zaruba, Luca Benini

    Abstract: On-chip communication infrastructure is a central component of modern systems-on-chip (SoCs), and it continues to gain importance as the number of cores, the heterogeneity of components, and the on-chip and off-chip bandwidth continue to grow. Decades of research on on-chip networks enabled cache-coherent shared-memory multiprocessors. However, communication fabrics that meet the needs of heteroge…

    Submitted 11 November, 2021; v1 submitted 11 September, 2020; originally announced September 2020.

    Comments: 14 pages, 24 figures, 4 tables

    ACM Class: B.4.3; C.1.2; C.5.4

  41. Linear instability of stationary solutions for the Korteweg-de Vries equation on a star graph

    Authors: Jaime Angulo Pava, Márcio Cavalcante

    Abstract: The aim of this work is to establish a linear instability criterion of stationary solutions for the Korteweg-de Vries model on a star graph with a structure represented by a finite collection of semi-infinite edges. By considering a boundary condition of $δ$-type interaction at the graph-vertex, we show that the continuous tail and bump profiles are linearly unstable in a balanced star graph. The…

    Submitted 22 June, 2020; originally announced June 2020.

    Comments: 34 pages, 5 figures

  42. arXiv:1911.08307  [pdf, ps, other

    math.AP

    The cubic nonlinear fractional Schrödinger equation on the half-line

    Authors: Márcio Cavalcante, Gerardo Huaroto

    Abstract: We study the cubic nonlinear fractional Schrödinger equation with Lévy indices $\frac{4}{3}<α< 2$ posed on the half-line. More precisely, we define the notion of a solution for this model and we obtain a local well-posedness result that is almost sharp with respect to known results on the full real line $\mathbb R$. Also, we prove for the same model that the solution of the nonlinear part is smoother…

    Submitted 19 November, 2019; originally announced November 2019.

    Comments: 22 pages

  43. Forcing operators on star graphs applied for the cubic fourth order Schrödinger equation

    Authors: Roberto de A. Capistrano Filho, Márcio Cavalcante, Fernando A. Gallego

    Abstract: In a recent article, "Lower regularity solutions of the biharmonic Schrödinger equation in a quarter plane", to appear in Pacific Journal of Mathematics [15], the authors gave a starting point for the study of a series of problems concerning the initial boundary value problem and control theory of the biharmonic NLS in some non-standard domains. In this direction, this article deals to present…

    Submitted 10 August, 2020; v1 submitted 15 September, 2019; originally announced September 2019.

    Comments: 28 pages, 1 figure

    MSC Class: 35R02; 35Q55; 35C15; 81Q35; 35G30

    Journal ref: Discrete & Continuous Dynamical Systems - B (2022)

  44. arXiv:1908.09952  [pdf, other

    math.DG

    Gap phenomena for constant mean curvature surfaces

    Authors: Ezequiel Barbosa, Marcos P. Cavalcante, Edno Pereira

    Abstract: In this paper, we prove gap results for constant mean curvature (CMC) surfaces. First, we find a natural inequality for CMC surfaces which implies convexity of the distance function. We then show that if $Σ$ is a complete, properly embedded CMC surface in the Euclidean space satisfying this inequality, then $Σ$ is either a sphere or a right circular cylinder. Next, we show that if $Σ$ is a free bound…

    Submitted 28 January, 2023; v1 submitted 26 August, 2019; originally announced August 2019.

    Comments: This paper was reformulated in order to include new results for the case of complete noncompact surfaces. It was also revised according to the referee's comments

  45. Ara: A 1 GHz+ Scalable and Energy-Efficient RISC-V Vector Processor with Multi-Precision Floating Point Support in 22 nm FD-SOI

    Authors: Matheus Cavalcante, Fabian Schuiki, Florian Zaruba, Michael Schaffner, Luca Benini

    Abstract: In this paper, we present Ara, a 64-bit vector processor based on the version 0.5 draft of RISC-V's vector extension, implemented in GlobalFoundries 22FDX FD-SOI technology. Ara's microarchitecture is scalable, as it is composed of a set of identical lanes, each containing part of the processor's vector register file and functional units. It achieves up to 97% FPU utilization when running a 256 x…

    Submitted 27 October, 2019; v1 submitted 2 June, 2019; originally announced June 2019.

    Comments: 13 pages. Accepted for publication in IEEE Transactions on Very Large Scale Integration Systems

  46. Lower regularity solutions of the biharmonic Schrödinger equation in a quarter plane

    Authors: Roberto A. Capistrano-Filho, Márcio Cavalcante, Fernando A. Gallego

    Abstract: This paper deals with the initial-boundary value problem of the biharmonic cubic nonlinear Schrödinger equation in a quarter plane with inhomogeneous Dirichlet-Neumann boundary data. We prove local well-posedness in the low regularity Sobolev spaces by introducing a Duhamel boundary forcing operator associated to the linear equation to construct solutions on the whole line. With this in hand, the ener…

    Submitted 10 August, 2020; v1 submitted 23 December, 2018; originally announced December 2018.

    Comments: 26 pages, 2 figures

    MSC Class: 35Q55; 35G15; 35C15; 35A07; 35G30

    Journal ref: Pacific J. Math. 309 (2020) 35-70

  47. The halfspace theorem for minimal hypersurfaces in regions bounded by minimal cones

    Authors: Marcos Petrúcio Cavalcante, Wagner Oliveira Costa-Filho

    Abstract: We prove that there are no minimal hypersurfaces properly immersed in any region of the Euclidean space bounded by unstable minimal cones. We also prove the analogous result for $r$-minimal hypersurfaces.

    Submitted 8 October, 2018; originally announced October 2018.

    Comments: 7 pages

    MSC Class: Primary 53C42; Secondary 58J05; 35B50

  48. arXiv:1810.01905  [pdf, ps, other

    math.AP

    Well-posedness and long time behavior for the Schrödinger-Korteweg-de Vries interactions on the half-Line

    Authors: Márcio Cavalcante, Adán Corcho

    Abstract: The initial-boundary value problem for the Schrödinger-Korteweg-de Vries system is considered on the left and right half-line for a wide class of initial-boundary data, including the energy regularity $H^1(\mathbb{R}^{\pm})\times H^1(\mathbb{R}^{\pm})$ for initial data. Assuming homogeneous boundary conditions, it is shown for positive coupling interactions that local solutions can be extended globally in time for…

    Submitted 3 October, 2018; originally announced October 2018.

    Comments: 26 pages

  49. arXiv:1808.06494  [pdf, ps, other

    math.AP

    Local well-posedness of the fifth-order KdV-type equations on the half-line

    Authors: Márcio Cavalcante, Chulkwang Kwak

    Abstract: This paper is a continuation of the authors' previous work \cite{CK2018-1}. We extend the argument of \cite{CK2018-1} to fifth-order KdV-type equations with different nonlinearities, specifically, where the scaling argument does not hold. We establish the $X^{s,b}$ nonlinear estimates for $b < \frac12$, which is almost optimal compared to the standard $X^{s,b}$ nonlinear estimates for $b > \frac12$ \cite{…

    Submitted 2 January, 2019; v1 submitted 16 August, 2018; originally announced August 2018.

    Comments: 60 pages. More expositions are complemented in the introduction section, and minor typos are corrected. arXiv admin note: text overlap with arXiv:1805.05229

    MSC Class: 35Q53; 35G31

  50. arXiv:1807.06849  [pdf, ps, other

    math.DG

    Vanishing theorems for the cohomology groups of free boundary hypersurfaces

    Authors: Marcos P. Cavalcante, Abraão Mendes, Feliciano Vitório

    Abstract: In this paper, we prove that there exists a universal constant $C$, depending only on positive integers $n\geq 3$ and $p\leq n-1$, such that if $M^n$ is a compact free boundary submanifold of dimension $n$ immersed in the Euclidean unit ball $\mathbb{B}^{n+k}$ whose size of the traceless second fundamental form is less than $C$, then the $p$th cohomology group of $M^n$ vanishes. Also, employing a…

    Submitted 22 November, 2018; v1 submitted 18 July, 2018; originally announced July 2018.

    Comments: Version with improved constants. The authors thank Ezequiel Barbosa for valuable comments on this paper