Skip to main content

Showing 1–10 of 10 results for author: Franco, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.17513  [pdf, ps, other

    cs.LG cs.AI cs.AR

    Improving Quantization with Post-Training Model Expansion

    Authors: Giuseppe Franco, Pablo Monteagudo-Lago, Ian Colbert, Nicholas Fraser, Michaela Blott

    Abstract: The size of a model has been a strong predictor of its quality, as well as its cost. As such, the trade-off between model cost and quality has been well-studied. Post-training optimizations like quantization and pruning have typically focused on reducing the overall volume of pre-trained models to reduce inference costs while maintaining model quality. However, recent advancements have introduced… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

  2. arXiv:2410.00340  [pdf, other

    cs.LG cs.AI cs.CL

    Sparse Attention Decomposition Applied to Circuit Tracing

    Authors: Gabriel Franco, Mark Crovella

    Abstract: Many papers have shown that attention heads work in conjunction with each other to perform complex tasks. It's frequently assumed that communication between attention heads is via the addition of specific features to token residuals. In this work we seek to isolate and identify the features used to effect communication and coordination among attention heads in GPT-2 small. Our key leverage on the… ▽ More

    Submitted 28 October, 2024; v1 submitted 30 September, 2024; originally announced October 2024.

  3. arXiv:2409.17092  [pdf, other

    cs.LG cs.AI cs.DM

    Accumulator-Aware Post-Training Quantization

    Authors: Ian Colbert, Fabian Grob, Giuseppe Franco, Jinjie Zhang, Rayan Saab

    Abstract: Several recent studies have investigated low-precision accumulation, reporting improvements in throughput, power, and area across various platforms. However, the accompanying proposals have only considered the quantization-aware training (QAT) paradigm, in which models are fine-tuned or trained from scratch with quantization in the loop. As models continue to grow in size, QAT techniques become in… ▽ More

    Submitted 25 September, 2024; originally announced September 2024.

  4. arXiv:2407.14538  [pdf, other

    cs.DC

    Alea-BFT: Practical Asynchronous Byzantine Fault Tolerance

    Authors: Diogo S. Antunes, Afonso N. Oliveira, André Breda, Matheus Guilherme Franco, Henrique Moniz, Rodrigo Rodrigues

    Abstract: Traditional Byzantine Fault Tolerance (BFT) state machine replication protocols assume a partial synchrony model, leading to a design where a leader replica drives the protocol and is replaced after a timeout. Recently, we witnessed a surge of asynchronous BFT protocols, which use randomization to remove the need for bounds on message delivery times, making them more resilient to adverse network c… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2202.02071

    ACM Class: C.2.4; D.4.5

    Journal ref: In21st USENIX Symposium on Networked Systems Design and Implementation (NSDI 24) 2024 (pp. 313-328)

  5. arXiv:2311.12359  [pdf, other

    cs.CV cs.AI cs.AR cs.LG cs.PF

    Shedding the Bits: Pushing the Boundaries of Quantization with Minifloats on FPGAs

    Authors: Shivam Aggarwal, Hans Jakob Damsgaard, Alessandro Pappalardo, Giuseppe Franco, Thomas B. Preußer, Michaela Blott, Tulika Mitra

    Abstract: Post-training quantization (PTQ) is a powerful technique for model compression, reducing the numerical precision in neural networks without additional training overhead. Recent works have investigated adopting 8-bit floating-point formats(FP8) in the context of PTQ for model inference. However, floating-point formats smaller than 8 bits and their relative comparison in terms of accuracy-hardware c… ▽ More

    Submitted 5 July, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: Accepted in FPL (International Conference on Field-Programmable Logic and Applications) 2024 conference. Revised with updated results

  6. arXiv:2310.19065  [pdf, other

    cs.LG

    Evaluating LLP Methods: Challenges and Approaches

    Authors: Gabriel Franco, Giovanni Comarela, Mark Crovella

    Abstract: Learning from Label Proportions (LLP) is an established machine learning problem with numerous real-world applications. In this setting, data items are grouped into bags, and the goal is to learn individual item labels, knowing only the features of the data and the proportions of labels in each bag. Although LLP is a well-established problem, it has several unusual aspects that create challenges f… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

  7. arXiv:2304.01405  [pdf

    cs.HC cs.CY

    The Work Avatar Face-Off: Knowledge Worker Preferences for Realism in Meetings

    Authors: Vrushank Phadnis, Kristin Moore, Mar Gonzalez Franco

    Abstract: While avatars have grown in popularity in social settings, their use in the workplace is still debatable. We conducted a large-scale survey to evaluate knowledge worker sentiment towards avatars, particularly the effects of realism on their acceptability for work meetings. Our survey of 2509 knowledge workers from multiple countries rated five avatar styles for use by managers, known colleagues an… ▽ More

    Submitted 8 October, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: 10 pages, accepted at ISMAR 2023 conference

  8. arXiv:2106.15351  [pdf, ps, other

    cs.CE q-bio.GN

    Spectral concepts in genome informational analysis

    Authors: Vincenzo Bonnici, Giuditta Franco, Vincenzo Manca

    Abstract: The concept of k-spectrum for genomes is here investigated as a basic tool to analyze genomes. Related spectral notions based on k-mers are introduced with some related mathematical properties which are relevant for informational analysis of genomes. Procedures to generate spectral segmentations of genomes are provided and are tested (under several values of length k for k-mers) on cases of real g… ▽ More

    Submitted 25 June, 2021; originally announced June 2021.

  9. arXiv:2009.10449  [pdf, other

    q-bio.GN cs.IT

    A word recurrence based algorithm to extract genomic dictionaries

    Authors: Vincenzo Bonnici, Giuditta Franco, Vincenzo Manca

    Abstract: Genomes may be analyzed from an information viewpoint as very long strings, containing functional elements of variable length, which have been assembled by evolution. In this work an innovative information theory based algorithm is proposed, to extract significant (relatively small) dictionaries of genomic words. Namely, conceptual analyses are here combined with empirical studies, to open up a me… ▽ More

    Submitted 22 September, 2020; originally announced September 2020.

  10. arXiv:1908.06399  [pdf

    eess.IV cs.CV

    Evaluation of an AI System for the Detection of Diabetic Retinopathy from Images Captured with a Handheld Portable Fundus Camera: the MAILOR AI study

    Authors: T W Rogers, J Gonzalez-Bueno, R Garcia Franco, E Lopez Star, D Méndez Marín, J Vassallo, V C Lansingh, S Trikha, N Jaccard

    Abstract: Objectives: To evaluate the performance of an Artificial Intelligence (AI) system (Pegasus, Visulytix Ltd., UK), at the detection of Diabetic Retinopathy (DR) from images captured by a handheld portable fundus camera. Methods: A cohort of 6,404 patients (~80% with diabetes mellitus) was screened for retinal diseases using a handheld portable fundus camera (Pictor Plus, Volk Optical Inc., USA) at… ▽ More

    Submitted 18 August, 2019; originally announced August 2019.