Skip to main content

Showing 1–23 of 23 results for author: Pasini, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.08112  [pdf, other

    cs.LG cond-mat.mtrl-sci

    Scaling Laws of Graph Neural Networks for Atomistic Materials Modeling

    Authors: Chaojian Li, Zhifan Ye, Massimiliano Lupo Pasini, Jong Youl Choi, Cheng Wan, Yingyan Celine Lin, Prasanna Balaprakash

    Abstract: Atomistic materials modeling is a critical task with wide-ranging applications, from drug discovery to materials science, where accurate predictions of the target material property can lead to significant advancements in scientific discovery. Graph Neural Networks (GNNs) represent the state-of-the-art approach for modeling atomistic material data thanks to their capacity to capture complex relatio… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: Accepted by DAC'25

  2. arXiv:2501.17578  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    Music2Latent2: Audio Compression with Summary Embeddings and Autoregressive Decoding

    Authors: Marco Pasini, Stefan Lattner, George Fazekas

    Abstract: Efficiently compressing high-dimensional audio signals into a compact and informative latent space is crucial for various tasks, including generative modeling and music information retrieval (MIR). Existing audio autoencoders, however, often struggle to achieve high compression ratios while preserving audio fidelity and facilitating efficient downstream applications. We introduce Music2Latent2, a… ▽ More

    Submitted 29 January, 2025; originally announced January 2025.

    Comments: Accepted to ICASSP 2025

  3. arXiv:2411.18447  [pdf, other

    cs.LG cs.AI cs.SD eess.AS

    Continuous Autoregressive Models with Noise Augmentation Avoid Error Accumulation

    Authors: Marco Pasini, Javier Nistal, Stefan Lattner, George Fazekas

    Abstract: Autoregressive models are typically applied to sequences of discrete tokens, but recent research indicates that generating sequences of continuous embeddings in an autoregressive manner is also feasible. However, such Continuous Autoregressive Models (CAMs) can suffer from a decline in generation quality over extended sequences due to error accumulation during inference. We introduce a novel metho… ▽ More

    Submitted 27 November, 2024; originally announced November 2024.

    Comments: Accepted to NeurIPS 2024 - Audio Imagination Workshop

  4. arXiv:2410.23005  [pdf, ps, other

    cs.SD eess.AS

    Improving Musical Accompaniment Co-creation via Diffusion Transformers

    Authors: Javier Nistal, Marco Pasini, Stefan Lattner

    Abstract: Building upon Diff-A-Riff, a latent diffusion model for musical instrument accompaniment generation, we present a series of improvements targeting quality, diversity, inference speed, and text-driven control. First, we upgrade the underlying autoencoder to a stereo-capable model with superior fidelity and replace the latent U-Net with a Diffusion Transformer. Additionally, we refine text prompting… ▽ More

    Submitted 30 October, 2024; originally announced October 2024.

    Comments: 5 pages; 1 table

  5. arXiv:2408.06500  [pdf, other

    cs.SD cs.LG eess.AS

    Music2Latent: Consistency Autoencoders for Latent Audio Compression

    Authors: Marco Pasini, Stefan Lattner, George Fazekas

    Abstract: Efficient audio representations in a compressed continuous latent space are critical for generative audio modeling and Music Information Retrieval (MIR) tasks. However, some existing audio autoencoders have limitations, such as multi-stage training procedures, slow iterative sampling, or low reconstruction quality. We introduce Music2Latent, an audio autoencoder that overcomes these limitations by… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: Accepted to ISMIR 2024

  6. arXiv:2406.12909  [pdf, other

    cs.LG physics.comp-ph

    Scalable Training of Trustworthy and Energy-Efficient Predictive Graph Foundation Models for Atomistic Materials Modeling: A Case Study with HydraGNN

    Authors: Massimiliano Lupo Pasini, Jong Youl Choi, Kshitij Mehta, Pei Zhang, David Rogers, Jonghyun Bae, Khaled Z. Ibrahim, Ashwin M. Aji, Karl W. Schulz, Jorda Polo, Prasanna Balaprakash

    Abstract: We present our work on developing and training scalable, trustworthy, and energy-efficient predictive graph foundation models (GFMs) using HydraGNN, a multi-headed graph convolutional neural network architecture. HydraGNN expands the boundaries of graph neural network (GNN) computations in both training scale and data diversity. It abstracts over message passing algorithms, allowing both reproduct… ▽ More

    Submitted 1 November, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: 51 pages, 32 figures

    MSC Class: 68T07; 68T09 ACM Class: C.2.4; I.2.11

  7. arXiv:2406.08384  [pdf, other

    cs.SD cs.AI eess.AS

    Diff-A-Riff: Musical Accompaniment Co-creation via Latent Diffusion Models

    Authors: Javier Nistal, Marco Pasini, Cyran Aouameur, Maarten Grachten, Stefan Lattner

    Abstract: Recent advancements in deep generative models present new opportunities for music production but also pose challenges, such as high computational demands and limited audio quality. Moreover, current systems frequently rely solely on text input and typically focus on producing complete musical pieces, which is incompatible with existing workflows in music production. To address these issues, we int… ▽ More

    Submitted 30 October, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: 8 pages, 2 figures, 3 tables

    Journal ref: Proc. of the 25th International Society for Music Information Retrieval, 2024

  8. arXiv:2402.01412  [pdf, other

    cs.SD cs.LG eess.AS

    Bass Accompaniment Generation via Latent Diffusion

    Authors: Marco Pasini, Maarten Grachten, Stefan Lattner

    Abstract: The ability to automatically generate music that appropriately matches an arbitrary input track is a challenging task. We present a novel controllable system for generating single stems to accompany musical mixes of arbitrary length. At the core of our method are audio autoencoders that efficiently compress audio waveform samples into invertible latent representations, and a conditional latent dif… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: ICASSP 2024

  9. arXiv:2311.13058  [pdf, other

    cs.SD eess.AS

    Self-Supervised Music Source Separation Using Vector-Quantized Source Category Estimates

    Authors: Marco Pasini, Stefan Lattner, George Fazekas

    Abstract: Music source separation is focused on extracting distinct sonic elements from composite tracks. Historically, many methods have been grounded in supervised learning, necessitating labeled data, which is occasionally constrained in its diversity. More recent methods have delved into N-shot techniques that utilize one or more audio samples to aid in the separation. However, a challenge with some of… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: 4 pages, 2 figures, 1 table; Accepted at the 37th Conference on Neural Information Processing Systems (2023), Machine Learning for Audio Workshop

  10. arXiv:2310.04610  [pdf, other

    cs.AI cs.LG

    DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies

    Authors: Shuaiwen Leon Song, Bonnie Kruft, Minjia Zhang, Conglong Li, Shiyang Chen, Chengming Zhang, Masahiro Tanaka, Xiaoxia Wu, Jeff Rasley, Ammar Ahmad Awan, Connor Holmes, Martin Cai, Adam Ghanem, Zhongzhu Zhou, Yuxiong He, Pete Luferenko, Divya Kumar, Jonathan Weyn, Ruixiong Zhang, Sylwester Klocek, Volodymyr Vragov, Mohammed AlQuraishi, Gustaf Ahdritz, Christina Floristean, Cristina Negri , et al. (67 additional authors not shown)

    Abstract: In the upcoming decade, deep learning may revolutionize the natural sciences, enhancing our capacity to model and predict natural occurrences. This could herald a new era of scientific exploration, bringing significant advancements across sectors from drug development to renewable energy. To answer this call, we present DeepSpeed4Science initiative (deepspeed4science.ai) which aims to build unique… ▽ More

    Submitted 11 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

  11. arXiv:2301.13162  [pdf, other

    math.NA cs.AI cs.LG

    A deep learning approach for adaptive zoning

    Authors: Massimiliano Lupo Pasini, Luka Malenica, Kwitae Chong, Stuart Slattery

    Abstract: We propose a supervised deep learning (DL) approach to perform adaptive zoning on time dependent partial differential equations that model the propagation of 1D shock waves in a compressible medium. We train a neural network on a dataset composed of different static shock profiles associated with the corresponding adapted meshes computed with standard adaptive zoning techniques. We show that the t… ▽ More

    Submitted 9 December, 2022; originally announced January 2023.

    Comments: 30 pages, 26 figures

    MSC Class: 68T10; 68U20; 00A72; 00A79 ACM Class: G.1.1; G.1.8; I.2

  12. arXiv:2210.03746  [pdf, other

    cs.LG cs.AI math.OC

    A deep learning approach to solve forward differential problems on graphs

    Authors: Yuanyuan Zhao, Massimiliano Lupo Pasini

    Abstract: We propose a novel deep learning (DL) approach to solve one-dimensional non-linear elliptic, parabolic, and hyperbolic problems on graphs. A system of physics-informed neural network (PINN) models is used to solve the differential equations, by assigning each PINN model to a specific edge of the graph. Kirkhoff-Neumann (KN) nodal conditions are imposed in a weak form by adding a penalization term… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: 40 pages, 27 figures

    MSC Class: 05C45; 05C85; 05C90; 68T20 ACM Class: G.1.0; G.1.6; G.1.7; G.1.8; G.2.0; G.2.3; G.4; I.2.8; I.2.11

  13. arXiv:2210.03558  [pdf, other

    cs.CV cs.LG stat.ML

    A deep learning approach for detection and localization of leaf anomalies

    Authors: Davide Calabrò, Massimiliano Lupo Pasini, Nicola Ferro, Simona Perotto

    Abstract: The detection and localization of possible diseases in crops are usually automated by resorting to supervised deep learning approaches. In this work, we tackle these goals with unsupervised models, by applying three different types of autoencoders to a specific open-source dataset of healthy and unhealthy pepper and cherry leaf images. CAE, CVAE and VQ-VAE autoencoders are deployed to screen unlab… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: 23 pages, 8 figures

    MSC Class: 68T10; 68T45; 68U10 ACM Class: I.2.5; I.2.6; I.2.10; I.3.6; I.3.8; I.4.2; I.4.5; I.4.9; I.5.0; I.5.4; J.2; J.3

  14. arXiv:2208.08706  [pdf, other

    cs.SD cs.LG eess.AS

    Musika! Fast Infinite Waveform Music Generation

    Authors: Marco Pasini, Jan Schlüter

    Abstract: Fast and user-controllable music generation could enable novel ways of composing or performing music. However, state-of-the-art music generation systems require large amounts of data and computational resources for training, and are slow at inference. This makes them impractical for real-time interactive use. In this work, we introduce Musika, a music generation system that can be trained on hundr… ▽ More

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: Accepted at ISMIR 2022

  15. arXiv:2207.12315  [pdf, other

    cs.AI cs.CV cs.DC cs.LG cs.MA

    Stable Parallel Training of Wasserstein Conditional Generative Adversarial Neural Networks

    Authors: Massimiliano Lupo Pasini, Junqi Yin

    Abstract: We propose a stable, parallel approach to train Wasserstein Conditional Generative Adversarial Neural Networks (W-CGANs) under the constraint of a fixed computational budget. Differently from previous distributed GANs training techniques, our approach avoids inter-process communications, reduces the risk of mode collapse and enhances scalability by using multiple generators, each one of them concu… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: 22 pages; 9 figures

    MSC Class: 68T01; 68T10; 68M14; 65Y05; 65Y10 ACM Class: I.2.0; I.2.11; C.1.4; C.2.4

  16. arXiv:2207.11333  [pdf, other

    cs.LG cs.DC physics.chem-ph physics.comp-ph

    Scalable training of graph convolutional neural networks for fast and accurate predictions of HOMO-LUMO gap in molecules

    Authors: Jong Youl Choi, Pei Zhang, Kshitij Mehta, Andrew Blanchard, Massimiliano Lupo Pasini

    Abstract: Graph Convolutional Neural Network (GCNN) is a popular class of deep learning (DL) models in material science to predict material properties from the graph representation of molecular structures. Training an accurate and comprehensive GCNN surrogate for molecular design requires large-scale graph datasets and is usually a time-consuming process. Recent advances in GPUs and distributed computing op… ▽ More

    Submitted 22 July, 2022; originally announced July 2022.

    Comments: 19 pages, 9 figures

    MSC Class: 68Q85; 68M14; 68W15; 68W15 ACM Class: I.2.11

  17. arXiv:2204.00538  [pdf, ps, other

    math.NA cs.LG

    Hierarchical model reduction driven by machine learning for parametric advection-diffusion-reaction problems in the presence of noisy data

    Authors: Massimiliano Lupo Pasini, Simona Perotto

    Abstract: We propose a new approach to generate a reliable reduced model for a parametric elliptic problem, in the presence of noisy data. The reference model reduction procedure is the directional HiPOD method, which combines Hierarchical Model reduction with a standard Proper Orthogonal Decomposition, according to an offline/online paradigm. In this paper we show that directional HiPOD looses in terms of… ▽ More

    Submitted 1 April, 2022; originally announced April 2022.

    Comments: 19 pages; 4 figures

    MSC Class: 68T01; 65M22; 65M60; 65M70; ACM Class: G.1.8; I.2.0

  18. arXiv:2202.01954  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.comp-ph

    Multi-task graph neural networks for simultaneous prediction of global and atomic properties in ferromagnetic systems

    Authors: Massimiliano Lupo Pasini, Pei Zhang, Samuel Temple Reeve, Jong Youl Choi

    Abstract: We introduce a multi-tasking graph convolutional neural network, HydraGNN, to simultaneously predict both global and atomic physical properties and demonstrate with ferromagnetic materials. We train HydraGNN on an open-source ab initio density functional theory (DFT) dataset for iron-platinum (FePt) with a fixed body centered tetragonal (BCT) lattice structure and fixed volume to simultaneously pr… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

    Comments: 13 pages, 6 figures

    Journal ref: Mach. Learn.: Sci. Technol. 3 025007 (2022)

  19. arXiv:2110.14813  [pdf, other

    cs.LG math.NA

    Stable Anderson Acceleration for Deep Learning

    Authors: Massimiliano Lupo Pasini, Junqi Yin, Viktor Reshniak, Miroslav Stoyanov

    Abstract: Anderson acceleration (AA) is an extrapolation technique designed to speed-up fixed-point iterations like those arising from the iterative training of DL models. Training DL models requires large datasets processed in randomly sampled batches that tend to introduce in the fixed-point iteration stochastic oscillations of amplitude roughly inversely proportional to the size of the batch. These oscil… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    MSC Class: 68T07; 68W15; 68W10; 68W25; 65B05; 65F20; 65F22; 65F55; ACM Class: G.1.10; G.1.3; I.2.11

  20. Scalable Balanced Training of Conditional Generative Adversarial Neural Networks on Image Data

    Authors: Massimiliano Lupo Pasini, Vittorio Gabbi, Junqi Yin, Simona Perotto, Nouamane Laanait

    Abstract: We propose a distributed approach to train deep convolutional generative adversarial neural network (DC-CGANs) models. Our method reduces the imbalance between generator and discriminator by partitioning the training data according to data labels, and enhances scalability by performing a parallel training where multiple generators are concurrently trained, each one of them focusing on a single dat… ▽ More

    Submitted 20 February, 2021; originally announced February 2021.

    ACM Class: I.2.10; I.2.11; I.5.1; I.6.5

    Journal ref: Journal of Supercomputing, 2021

  21. arXiv:1910.13088  [pdf, ps, other

    cs.DM cs.AI cs.LO quant-ph

    Estimating the Density of States of Boolean Satisfiability Problems on Classical and Quantum Computing Platforms

    Authors: Tuhin Sahai, Anurag Mishra, Jose Miguel Pasini, Susmit Jha

    Abstract: Given a Boolean formula $φ(x)$ in conjunctive normal form (CNF), the density of states counts the number of variable assignments that violate exactly $e$ clauses, for all values of $e$. Thus, the density of states is a histogram of the number of unsatisfied clauses over all possible assignments. This computation generalizes both maximum-satisfiability (MAX-SAT) and model counting problems and not… ▽ More

    Submitted 29 October, 2019; originally announced October 2019.

  22. arXiv:1910.03713  [pdf, other

    eess.AS cs.LG cs.SD

    MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms

    Authors: Marco Pasini

    Abstract: Traditional voice conversion methods rely on parallel recordings of multiple speakers pronouncing the same sentences. For real-world applications however, parallel data is rarely available. We propose MelGAN-VC, a voice conversion method that relies on non-parallel speech data and is able to convert audio signals of arbitrary length from a source voice to a target voice. We firstly compute spectro… ▽ More

    Submitted 5 December, 2019; v1 submitted 8 October, 2019; originally announced October 2019.

  23. arXiv:1909.03306  [pdf, other

    cs.LG cs.NE stat.ML

    A scalable constructive algorithm for the optimization of neural network architectures

    Authors: Massimiliano Lupo Pasini, Junqi Yin, Ying Wai Li, Markus Eisenbach

    Abstract: We propose a new scalable method to optimize the architecture of an artificial neural network. The proposed algorithm, called Greedy Search for Neural Network Architecture, aims to determine a neural network with minimal number of layers that is at least as performant as neural networks of the same structure identified by other hyperparameter search algorithms in terms of accuracy and computationa… ▽ More

    Submitted 21 April, 2021; v1 submitted 7 September, 2019; originally announced September 2019.

    Comments: 12 pages, 15 figures, 3 table

    MSC Class: 68T01; 68Q32; 68T05; 68T10; 68W20

    Journal ref: Parallel Computing, 2021