Showing 1–37 of 37 results for author: Pena, A

Searching in archive cs.
  1. arXiv:2506.10564  [pdf, ps, other]

    cs.CV cs.AI

    Balancing Tails when Comparing Distributions: Comprehensive Equity Index (CEI) with Application to Bias Evaluation in Operational Face Biometrics

    Authors: Imanol Solano, Julian Fierrez, Aythami Morales, Alejandro Peña, Ruben Tolosana, Francisco Zamora-Martinez, Javier San Agustin

    Abstract: Demographic bias in high-performance face recognition (FR) systems often eludes detection by existing metrics, especially with respect to subtle disparities in the tails of the score distribution. We introduce the Comprehensive Equity Index (CEI), a novel metric designed to address this limitation. CEI uniquely analyzes genuine and impostor score distributions separately, enabling a configurable f…

    Submitted 12 June, 2025; originally announced June 2025.

  2. arXiv:2409.01928  [pdf, other]

    cs.CV cs.AI

    Comprehensive Equity Index (CEI): Definition and Application to Bias Evaluation in Biometrics

    Authors: Imanol Solano, Alejandro Peña, Aythami Morales, Julian Fierrez, Ruben Tolosana, Francisco Zamora-Martinez, Javier San Agustin

    Abstract: We present a novel metric designed, among other applications, to quantify biased behaviors of machine learning models. At its core, the metric consists of a new similarity metric between score distributions that balances both their general shapes and tails' probabilities. In that sense, our proposed metric may be useful in many application areas. Here we focus on and apply it to the operational ev…

    Submitted 3 September, 2024; originally announced September 2024.

    Comments: Accepted paper for the 27th International Conference on Pattern Recognition (ICPR) 2024

  3. arXiv:2311.11974  [pdf, other]

    cs.CV cs.AI cs.LG

    Evaluating Supervision Levels Trade-Offs for Infrared-Based People Counting

    Authors: David Latortue, Moetez Kdayem, Fidel A Guerrero Peña, Eric Granger, Marco Pedersoli

    Abstract: Object detection models are commonly used for people counting (and localization) in many applications but require a dataset with costly bounding box annotations for training. Given the importance of privacy in people counting, these models rely more and more on infrared images, making the task even harder. In this paper, we explore how weaker levels of supervision can affect the performance of dee…

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024

  4. arXiv:2310.06670  [pdf, other]

    cs.LG cs.CV

    Domain Generalization by Rejecting Extreme Augmentations

    Authors: Masih Aminbeidokhti, Fidel A. Guerrero Peña, Heitor Rapela Medeiros, Thomas Dubail, Eric Granger, Marco Pedersoli

    Abstract: Data augmentation is one of the most effective techniques for regularizing deep learning models and improving their recognition performance in a variety of tasks and domains. However, this holds for standard in-domain settings, in which the training and test data follow the same distribution. For the out-of-domain case, where the test data follow a different and unknown distribution, the best reci…

    Submitted 10 October, 2023; originally announced October 2023.

  5. arXiv:2310.04662  [pdf, other]

    cs.CV cs.AI

    HalluciDet: Hallucinating RGB Modality for Person Detection Through Privileged Information

    Authors: Heitor Rapela Medeiros, Fidel A. Guerrero Pena, Masih Aminbeidokhti, Thomas Dubail, Eric Granger, Marco Pedersoli

    Abstract: A powerful way to adapt a visual recognition model to a new domain is through image translation. However, common image translation approaches only focus on generating data from the same distribution as the target domain. Given a cross-modal application, such as pedestrian detection from aerial images, with a considerable shift in data distribution between infrared (IR) and visible (RGB) images, a t…

    Submitted 22 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024

    Journal ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision 2024

  6. arXiv:2306.13002  [pdf, other]

    cs.DC

    ACC Saturator: Automatic Kernel Optimization for Directive-Based GPU Code

    Authors: Kazuaki Matsumura, Simon Garcia De Gonzalo, Antonio J. Peña

    Abstract: Automatic code optimization is a complex process that typically involves the application of multiple discrete algorithms that modify the program structure irreversibly. However, the design of these algorithms is often monolithic, and they require repetitive implementation to perform similar analyses due to the lack of cooperation. To address this issue, modern optimization techniques, such as equa…

    Submitted 17 September, 2024; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: To appear in: Proceedings of Eleventh Workshop on Accelerator Programming and Directives (WACCPD 2024)

  7. Document Layout Annotation: Database and Benchmark in the Domain of Public Affairs

    Authors: Alejandro Peña, Aythami Morales, Julian Fierrez, Javier Ortega-Garcia, Marcos Grande, Iñigo Puente, Jorge Cordova, Gonzalo Cordova

    Abstract: Every day, thousands of digital documents are generated with useful information for companies, public organizations, and citizens. Given the impossibility of processing them manually, the automatic processing of these documents is becoming increasingly necessary in certain sectors. However, this task remains challenging, since in most cases text-only parsing is not enough to fully understa…

    Submitted 8 August, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: Accepted in ICDAR 2023 Workshop on Machine Vision and NLP for Document Analysis

    Journal ref: Document Analysis and Recognition - ICDAR 2023 Workshops. ICDAR 2023. Lecture Notes in Computer Science, vol 14194

  8. Leveraging Large Language Models for Topic Classification in the Domain of Public Affairs

    Authors: Alejandro Peña, Aythami Morales, Julian Fierrez, Ignacio Serna, Javier Ortega-Garcia, Iñigo Puente, Jorge Cordova, Gonzalo Cordova

    Abstract: The analysis of public affairs documents is crucial for citizens as it promotes transparency, accountability, and informed decision-making. It allows citizens to understand government policies, participate in public discourse, and hold representatives accountable. This is crucial, and sometimes a matter of life or death, for companies whose operations depend on certain regulations. Large Language M…

    Submitted 8 August, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted in ICDAR 2023 Workshop on Automatic Domain-Adapted and Personalized Document Analysis

    Journal ref: Document Analysis and Recognition - ICDAR 2023 Workshops. ICDAR 2023. Lecture Notes in Computer Science, vol 14194

  9. Human-Centric Multimodal Machine Learning: Recent Advances and Testbed on AI-based Recruitment

    Authors: Alejandro Peña, Ignacio Serna, Aythami Morales, Julian Fierrez, Alfonso Ortega, Ainhoa Herrarte, Manuel Alcantara, Javier Ortega-Garcia

    Abstract: The presence of decision-making algorithms in society is rapidly increasing nowadays, while concerns about their transparency and the possibility of these algorithms becoming new sources of discrimination are arising. There is a certain consensus about the need to develop AI applications with a Human-Centric approach. Human-Centric Machine Learning needs to be developed based on four main requirem…

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2004.07173

    Journal ref: SN COMPUT. SCI. 4, 434 (2023)

  10. A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code

    Authors: Kazuaki Matsumura, Simon Garcia De Gonzalo, Antonio J. Peña

    Abstract: Various kinds of applications take advantage of GPUs through automation tools that attempt to automatically exploit the available performance of the GPU's parallel architecture. Directive-based programming models, such as OpenACC, are one such method that easily enables parallel computing by just attaching code annotations to code loops. Such abstract models, however, often prevent programmers from…

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: To appear in: Proceedings of the 32nd ACM SIGPLAN International Conference on Compiler Construction (CC '23)

  11. arXiv:2212.12042  [pdf, other]

    cs.CV

    Re-basin via implicit Sinkhorn differentiation

    Authors: Fidel A. Guerrero Peña, Heitor Rapela Medeiros, Thomas Dubail, Masih Aminbeidokhti, Eric Granger, Marco Pedersoli

    Abstract: The recent emergence of new algorithms for permuting models into functionally equivalent regions of the solution space has shed some light on the complexity of error surfaces, and some promising properties like mode connectivity. However, finding the right permutation is challenging, and current optimization techniques are not differentiable, which makes it difficult to integrate into a gradient-b…

    Submitted 22 December, 2022; originally announced December 2022.

  12. arXiv:2209.11335  [pdf, other]

    cs.CV

    Privacy-Preserving Person Detection Using Low-Resolution Infrared Cameras

    Authors: Thomas Dubail, Fidel Alejandro Guerrero Peña, Heitor Rapela Medeiros, Masih Aminbeidokhti, Eric Granger, Marco Pedersoli

    Abstract: In intelligent building management, knowing the number of people and their location in a room is important for better control of its illumination, ventilation, and heating with reduced costs and improved comfort. This is typically achieved by detecting people using compact embedded devices that are installed on the room's ceiling, and that integrate a low-resolution infrared camera, which conceals…

    Submitted 22 September, 2022; originally announced September 2022.

  13. JACC: An OpenACC Runtime Framework with Kernel-Level and Multi-GPU Parallelization

    Authors: Kazuaki Matsumura, Simon Garcia De Gonzalo, Antonio J. Peña

    Abstract: The rapid development in computing technology has paved the way for directive-based programming models towards a principal role in maintaining software portability of performance-critical applications. Efforts on such models involve minimal engineering cost for enabling computational acceleration on multiple architectures while programmers are only required to add meta information upon sequential…

    Submitted 27 April, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: Extended version of a paper to appear in: Proceedings of the 28th IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), December 17-18, 2021

  14. arXiv:2106.12485  [pdf, other]

    cs.DC physics.comp-ph physics.plasm-ph

    Particle-In-Cell Simulation using Asynchronous Tasking

    Authors: Nicolas Guidotti, Pedro Ceyrat, João Barreto, José Monteiro, Rodrigo Rodrigues, Ricardo Fonseca, Xavier Martorell, Antonio J. Peña

    Abstract: Recently, task-based programming models have emerged as a prominent alternative among shared-memory parallel programming paradigms. Inherently asynchronous, these models provide native support for dynamic load balancing and incorporate data flow concepts to selectively synchronize the tasks. However, tasking models are yet to be widely adopted by the HPC community and their effective advantages wh…

    Submitted 29 August, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

    Comments: Published at the 27th European Conference on Parallel and Distributed Computing (Euro-Par 2021)

    Journal ref: Euro-Par 2021: Parallel Processing. Lecture Notes in Computer Science, vol 12820, pp. 482-498

  15. cuConv: A CUDA Implementation of Convolution for CNN Inference

    Authors: Marc Jordà, Pedro Valero-Lara, Antonio J. Peña

    Abstract: Convolutions are the core operation of deep learning applications based on Convolutional Neural Networks (CNNs). Current GPU architectures are highly efficient for training and deploying deep CNNs, and hence, these are largely used in production for this purpose. State-of-the-art implementations, however, present a lack of efficiency for some commonly used network configurations. In this paper w…

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: This work has been submitted to Springer for possible publication

    Journal ref: Cluster Comput (2022)

  16. arXiv:2103.16139  [pdf, other]

    cs.CR cs.LG cs.PF

    Enabling Homomorphically Encrypted Inference for Large DNN Models

    Authors: Guillermo Lloret-Talavera, Marc Jorda, Harald Servat, Fabian Boemer, Chetan Chauhan, Shigeki Tomishima, Nilesh N. Shah, Antonio J. Peña

    Abstract: The proliferation of machine learning services in the last few years has raised data privacy concerns. Homomorphic encryption (HE) enables inference using encrypted data but it incurs 100x-10,000x memory and runtime overheads. Secure deep neural network (DNN) inference using HE is currently limited by computing and memory resources, with frameworks requiring hundreds of gigabytes of DRAM to evalua…

    Submitted 29 April, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: Manuscript accepted for publication in IEEE Transactions on Computers

  17. Virtual laser scanning with HELIOS++: A novel take on ray tracing-based simulation of topographic 3D laser scanning

    Authors: Lukas Winiwarter, Alberto Manuel Esmorís Pena, Hannah Weiser, Katharina Anders, Jorge Martínez Sanchez, Mark Searle, Bernhard Höfle

    Abstract: Topographic laser scanning is a remote sensing method to create detailed 3D point cloud representations of the Earth's surface. Since data acquisition is expensive, simulations can complement real data given that certain premises are available: i) a model of 3D scene and scanner, ii) a model of the beam-scene interaction, simplified to a computationally feasible yet physically realistic level, and ii…

    Submitted 21 January, 2021; originally announced January 2021.

  18. arXiv:2011.08809  [pdf, other]

    cs.CV

    Facial Expressions as a Vulnerability in Face Recognition

    Authors: Alejandro Peña, Ignacio Serna, Aythami Morales, Julian Fierrez, Agata Lapedriza

    Abstract: This work explores facial expression bias as a security vulnerability of face recognition systems. Despite the great performance achieved by state-of-the-art face recognition systems, the algorithms are still sensitive to a large range of covariates. We present a comprehensive analysis of how facial expression bias impacts the performance of face recognition technologies. Our study analyzes: i) fa…

    Submitted 18 June, 2021; v1 submitted 17 November, 2020; originally announced November 2020.

    Comments: Proc. of IEEE Int. Conf. on Image Processing (ICIP)

  19. arXiv:2009.08704  [pdf, other]

    cs.CV

    Learning Emotional-Blinded Face Representations

    Authors: Alejandro Peña, Julian Fierrez, Agata Lapedriza, Aythami Morales

    Abstract: We propose two face representations that are blind to facial expressions associated with emotional responses. This work is in part motivated by new international regulations for personal data protection, which require data controllers to protect any kind of sensitive information involved in automatic processes. The advances in Affective Computing have contributed to improving human-machine interfaces…

    Submitted 18 September, 2020; originally announced September 2020.

    Comments: IAPR Intl. Conf. on Pattern Recognition, 2020

  20. arXiv:2009.07025  [pdf, other]

    cs.CV

    FairCVtest Demo: Understanding Bias in Multimodal Learning with a Testbed in Fair Automatic Recruitment

    Authors: Alejandro Peña, Ignacio Serna, Aythami Morales, Julian Fierrez

    Abstract: With the aim of studying how current multimodal AI algorithms based on heterogeneous sources of information are affected by sensitive elements and inner biases in the data, this demonstrator experiments over an automated recruitment testbed based on Curriculum Vitae: FairCVtest. The presence of decision-making algorithms in society is rapidly increasing nowadays, while concerns about their transpa…

    Submitted 12 September, 2020; originally announced September 2020.

    Comments: ACM Intl. Conf. on Multimodal Interaction (ICMI). arXiv admin note: substantial text overlap with arXiv:2004.07173

  21. MPI+OpenMP Tasking Scalability for Multi-Morphology Simulations of the Human Brain

    Authors: Pedro Valero-Lara, Raül Sirvent, Antonio J. Peña, Jesús Labarta

    Abstract: The simulation of the behavior of the human brain is one of the most ambitious challenges today with an endless number of important applications. We can find many different initiatives in the USA, Europe and Japan which attempt to achieve such a challenging target. In this work, we focus on the most important European initiative (the Human Brain Project) and on one of the models developed in this project.…

    Submitted 13 May, 2020; originally announced May 2020.

    Journal ref: P. Valero-Lara, R. Sirvent, A. J. Peña, and J. Labarta. "MPI+OpenMP tasking scalability for multi-morphology simulations of the human brain", Parallel Computing, Elsevier, vol. 84, pp. 50-61, May 2019

  22. DMR API: Improving cluster productivity by turning applications into malleable

    Authors: Sergio Iserte, Rafael Mayo, Enrique S. Quintana-Orti, Vicenc Beltran, Antonio J. Peña

    Abstract: Adaptive workloads can change the configuration of their jobs on the fly, in terms of the number of processes. In order to carry out these job reconfigurations, we have designed a methodology which enables a job to communicate with the resource manager and, through the runtime, to change its number of MPI ranks. The collaboration between both the workload manager, aware of the queue of jobs and the…

    Submitted 28 May, 2020; v1 submitted 12 May, 2020; originally announced May 2020.

    Journal ref: S. Iserte, R. Mayo, E. S. Quintana-Orti, V. Beltran, and A. J. Peña, "DMR API: Improving cluster productivity by turning applications into malleable", Parallel Computing, Elsevier, vol. 78, pp. 54-66, Oct. 2018

  23. Understanding Memory Access Patterns Using the BSC Performance Tools

    Authors: Harald Servat, Jesús Labarta, Hans-Christian Hoppe, Judit Giménez, Antonio J. Peña

    Abstract: The growing gap between processor and memory speeds results in complex memory hierarchies as processors evolve to mitigate such divergence by taking advantage of the locality of reference. In this direction, the BSC performance analysis tools have been recently extended to provide insight relative to the application memory accesses depicting their temporal and spatial characteristics, correlating…

    Submitted 28 May, 2020; v1 submitted 12 May, 2020; originally announced May 2020.

    Journal ref: H. Servat, J. Labarta, H. C. Hoppe, J. Giménez, and A. J. Peña, "Understanding memory access patterns using the BSC performance tools", Parallel Computing, Elsevier, vol. 78, pp. 1-14, Oct. 2018

  24. arXiv:2004.07173  [pdf, other]

    cs.CV

    Bias in Multimodal AI: Testbed for Fair Automatic Recruitment

    Authors: Alejandro Peña, Ignacio Serna, Aythami Morales, Julian Fierrez

    Abstract: The presence of decision-making algorithms in society is rapidly increasing nowadays, while concerns about their transparency and the possibility of these algorithms becoming new sources of discrimination are arising. In fact, many relevant automated systems have been shown to make decisions based on sensitive information or discriminate against certain social groups (e.g. certain biometric systems for pe…

    Submitted 15 April, 2020; originally announced April 2020.

    Journal ref: IEEE CVPR Workshop on Fair, Data Efficient and Trusted Computer Vision, Seattle, Washington, USA, 2020

  25. arXiv:2004.06592  [pdf, other]

    cs.CV

    InsideBias: Measuring Bias in Deep Networks and Application to Face Gender Biometrics

    Authors: Ignacio Serna, Alejandro Peña, Aythami Morales, Julian Fierrez

    Abstract: This work explores the biases in learning processes based on deep neural network architectures. We analyze how bias affects deep learning processes through a toy example using the MNIST database and a case study in gender detection from face images. We employ two gender detection models based on popular deep neural networks. We present a comprehensive analysis of bias effects when using an unbalan…

    Submitted 22 July, 2020; v1 submitted 14 April, 2020; originally announced April 2020.

  26. arXiv:2001.06612  [pdf, other]

    cs.CV

    Deep Metric Structured Learning For Facial Expression Recognition

    Authors: Pedro D. Marrero Fernandez, Tsang Ing Ren, Tsang Ing Jyh, Fidel A. Guerrero Peña, Alexandre Cunha

    Abstract: We propose a deep metric learning model to create embedded sub-spaces with a well-defined structure. A new loss function that imposes Gaussian structures on the output space is introduced to create these sub-spaces, thus shaping the distribution of the data. Having a mixture of Gaussians solution space is advantageous given its simplified and well-established structure. It allows fast discovering o…

    Submitted 5 January, 2022; v1 submitted 18 January, 2020; originally announced January 2020.

  27. arXiv:1910.09783  [pdf, other]

    cs.CV

    J Regularization Improves Imbalanced Multiclass Segmentation

    Authors: Fidel A. Guerrero Peña, Pedro D. Marrero Fernandez, Paul T. Tarr, Tsang Ing Ren, Elliot M. Meyerowitz, Alexandre Cunha

    Abstract: We propose a new loss formulation to further advance the multiclass segmentation of cluttered cells under weakly supervised conditions. We improve the separation of touching and immediate cells, obtaining sharp segmentation boundaries with high adequacy, when we add a Youden's $J$ statistic regularization term to the cross entropy loss. This regularization intrinsically supports class imbalance th…

    Submitted 22 October, 2019; originally announced October 2019.

    Comments: Submitted to ISBI 2020

  28. arXiv:1908.10945  [pdf, other]

    cs.CV

    A Multiple Source Hourglass Deep Network for Multi-Focus Image Fusion

    Authors: Fidel Alejandro Guerrero Peña, Pedro Diamel Marrero Fernández, Tsang Ing Ren, Germano Crispim Vasconcelos, Alexandre Cunha

    Abstract: Multi-Focus Image Fusion seeks to improve the quality of an acquired burst of images with different focus planes. For solving the task, an activity level measurement and a fusion rule are typically established to select and fuse the most relevant information from the sources. However, the design of this kind of method by hand is really hard and sometimes restricted to solution spaces where the opt…

    Submitted 28 August, 2019; originally announced August 2019.

  29. arXiv:1905.05861  [pdf]

    eess.IV cs.IR q-bio.QM

    From Brain Imaging to Graph Analysis: a study on ADNI's patient cohort

    Authors: Rui Zhang, Luca Giancardo, Danilo A. Pena, Yejin Kim, Hanghang Tong, Xiaoqian Jiang

    Abstract: In this paper, we studied the association between changes in structural brain volumes and the potential development of Alzheimer's disease (AD). Using a simple abstraction technique, we converted regional cortical and subcortical volume differences over two time points for each study subject into a graph. We then obtained substructures of interest using a graph decomposition algorithm in order t…

    Submitted 14 May, 2019; originally announced May 2019.

  30. arXiv:1902.03284  [pdf, other]

    cs.CV

    FERAtt: Facial Expression Recognition with Attention Net

    Authors: Pedro D. Marrero Fernandez, Fidel A. Guerrero Peña, Tsang Ing Ren, Alexandre Cunha

    Abstract: We present a new end-to-end network architecture for facial expression recognition with an attention model. It focuses attention on the human face and uses a Gaussian space representation for expression recognition. We devise this architecture based on two fundamental complementary components: (1) facial image correction and attention and (2) facial expression representation and classification. Th…

    Submitted 8 February, 2019; originally announced February 2019.

  31. Integrating Blocking and Non-Blocking MPI Primitives with Task-Based Programming Models

    Authors: Kevin Sala, Xavier Teruel, Josep M. Perez, Antonio J. Peña, Vicenç Beltran, Jesus Labarta

    Abstract: In this paper we present the Task-Aware MPI library (TAMPI) that integrates both blocking and non-blocking MPI primitives with task-based programming models. The TAMPI library leverages two new runtime APIs to improve both programmability and performance of hybrid applications. The first API allows pausing and resuming the execution of a task depending on external events. This API is used to improv…

    Submitted 29 May, 2020; v1 submitted 10 January, 2019; originally announced January 2019.

    Comments: European Commission's projects: INTERTWinE (EC-H2020-671602), Marie Skłodowska-Curie (EC-H2020-749516). Postprint submitted to the Parallel Computing Journal (Elsevier). Figures from section 7.2 updated, typos corrected

    Journal ref: Parallel Computing, 85, 153-166 (2019)

  32. arXiv:1810.12121  [pdf, other]

    cs.CV

    Burst ranking for blind multi-image deblurring

    Authors: Fidel A. Guerrero Peña, Pedro D. Marrero Fernández, Tsang Ing Ren, Jorge J. G. Leandro, Ricardo Nishihara

    Abstract: We propose a new incremental aggregation algorithm for multi-image deblurring with automatic image selection. The primary motivation is that current burst deblurring methods do not cope well with situations in which misalignment or out-of-context frames are present in the burst. These real-life situations result in poor reconstructions or manual selection of the images that will be used to deblur. A…

    Submitted 30 October, 2018; v1 submitted 29 October, 2018; originally announced October 2018.

    Comments: Submitted to IEEE Transactions on Image Processing. 11 pages, 9 figures

  33. arXiv:1810.04150  [pdf, other]

    cs.DC

    Exploring the Vision Processing Unit as Co-processor for Inference

    Authors: Sergio Rivas-Gomez, Antonio J. Peña, David Moloney, Erwin Laure, Stefano Markidis

    Abstract: The success of the exascale supercomputer is largely debated to remain dependent on novel breakthroughs in technology that effectively reduce the power consumption and thermal dissipation requirements. In this work, we consider the integration of co-processors in high-performance computing (HPC) to enable low-power, seamless computation offloading of certain operations. In particular, we explore t…

    Submitted 9 October, 2018; originally announced October 2018.

  34. arXiv:1807.09232  [pdf]

    cs.CV cs.AI

    Deep Learning on Retina Images as Screening Tool for Diagnostic Decision Support

    Authors: Maria Camila Alvarez Trivino, Jeremie Despraz, Jesus Alfonso Lopez Sotelo, Carlos Andres Pena

    Abstract: In this project, we developed a deep learning system applied to human retina images for medical diagnostic decision support. The retina images were provided by EyePACS. These images were used in the framework of a Kaggle contest, whose purpose was to identify diabetic retinopathy signs through an automatic detection system. Using as inspiration one of the solutions proposed in the contest, we implemen…

    Submitted 24 July, 2018; originally announced July 2018.

  35. arXiv:1801.05469  [pdf, other]

    cs.HC

    ProvThreads: Analytic Provenance Visualization and Segmentation

    Authors: Sina Mohseni, Alyssa Pena, Eric D. Ragan

    Abstract: Our work aims to generate visualizations to enable meta-analysis of analytic provenance and aid better understanding of analysts' strategies during exploratory text analysis. We introduce ProvThreads, a visual analytics approach that incorporates interactive topic modeling outcomes to illustrate relationships between user interactions and the data topics under investigation. ProvThreads uses a ser…

    Submitted 16 January, 2018; originally announced January 2018.

    Comments: Presented at IEEE VIS 2017 Poster Session

  36. arXiv:1801.05076  [pdf, other]

    cs.HC

    Analytic Provenance Datasets: A Data Repository of Human Analysis Activity and Interaction Logs

    Authors: Sina Mohseni, Andrew Pachuilo, Ehsanul Haque Nirjhar, Rhema Linder, Alyssa Pena, Eric D. Ragan

    Abstract: We present an analytic provenance data repository that can be used to study human analysis activity, thought processes, and software interaction with visual analysis tools during exploratory data analysis. We conducted a series of user studies involving exploratory data analysis scenarios with textual and cyber security data. Interaction logs, think-alouds, videos and all coded data in this study…

    Submitted 15 January, 2018; originally announced January 2018.

    Comments: Datasets are available online at https://research.arch.tamu.edu/analytic-provenance/datasets/ for research purposes

  37. arXiv:1612.00881  [pdf, other]

    cs.CV

    Procedural Generation of Videos to Train Deep Action Recognition Networks

    Authors: César Roberto de Souza, Adrien Gaidon, Yohann Cabon, Antonio Manuel López Peña

    Abstract: Deep learning for human action recognition in videos is making significant progress, but is slowed down by its dependency on expensive manual labeling of large video collections. In this work, we investigate the generation of synthetic training data for action recognition, as it has recently shown promising results for a variety of other computer vision tasks. We propose an interpretable parametri…

    Submitted 19 July, 2017; v1 submitted 2 December, 2016; originally announced December 2016.

    Comments: Accepted for publication at CVPR 2017. http://adas.cvc.uab.es/phav/