Showing 1–46 of 46 results for author: Cruz, S

  1. arXiv:2502.03359  [pdf, other]

    cs.CV cs.AI cs.LG

    GHOST: Gaussian Hypothesis Open-Set Technique

    Authors: Ryan Rabinowitz, Steve Cruz, Manuel Günther, Terrance E. Boult

    Abstract: Evaluations of large-scale recognition methods typically focus on overall performance. While this approach is common, it often fails to provide insights into performance across individual classes, which can lead to fairness issues and misrepresentation. Addressing these gaps is crucial for accurately assessing how well methods handle novel or unseen classes and ensuring a fair evaluation. To addre…

    Submitted 10 February, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

    Comments: Accepted at AAAI Conference on Artificial Intelligence 2025

  2. arXiv:2501.13104  [pdf, other]

    cs.CV cs.GR

    Neural Radiance Fields for the Real World: A Survey

    Authors: Wenhui Xiao, Remi Chierchia, Rodrigo Santa Cruz, Xuesong Li, David Ahmedt-Aristizabal, Olivier Salvado, Clinton Fookes, Leo Lebrat

    Abstract: Neural Radiance Fields (NeRFs) have remodeled 3D scene representation since release. NeRFs can effectively reconstruct complex 3D scenes from 2D images, advancing different fields and applications such as scene understanding, 3D content generation, and robotics. Despite significant research progress, a thorough review of recent innovations, applications, and challenges is lacking. This survey comp…

    Submitted 22 January, 2025; originally announced January 2025.

  3. arXiv:2501.08962  [pdf, other]

    cs.CV cs.AI

    An analysis of data variation and bias in image-based dermatological datasets for machine learning classification

    Authors: Francisco Filho, Emanoel Santos, Rodrigo Mota, Kelvin Cunha, Fabio Papais, Amanda Arruda, Mateus Baltazar, Camila Vieira, José Gabriel Tavares, Rafael Barros, Othon Souza, Thales Bezerra, Natalia Lopes, Érico Moutinho, Jéssica Guido, Shirley Cruz, Paulo Borba, Tsang Ing Ren

    Abstract: AI algorithms have become valuable in aiding professionals in healthcare. The increasing confidence obtained by these models is helpful in critical decision demands. In clinical dermatology, classification models can detect malignant lesions on patients' skin using only RGB images as input. However, most learning-based methods employ data acquired from dermoscopic datasets on training, which are l…

    Submitted 11 February, 2025; v1 submitted 15 January, 2025; originally announced January 2025.

    Comments: 10 pages, 1 figure

    ACM Class: I.5.4; J.3

  4. arXiv:2410.16331  [pdf, other]

    quant-ph cs.ET cs.LG

    Exploring Quantum Neural Networks for Demand Forecasting

    Authors: Gleydson Fernandes de Jesus, Maria Heloísa Fraga da Silva, Otto Menegasso Pires, Lucas Cruz da Silva, Clebson dos Santos Cruz, Valéria Loureiro da Silva

    Abstract: Forecasting demand for assets and services can be addressed in various markets, providing a competitive advantage when the predictive models used demonstrate high accuracy. However, the training of machine learning models incurs high computational costs, which may limit the training of prediction models based on available computational capacity. In this context, this paper presents an approach for…

    Submitted 19 October, 2024; originally announced October 2024.

    Comments: 22 pages, 13 figures, 10 tables

  5. arXiv:2410.12728  [pdf, other]

    cs.LG cs.AI

    Transformer based super-resolution downscaling for regional reanalysis: Full domain vs tiling approaches

    Authors: Antonio Pérez, Mario Santa Cruz, Daniel San Martín, José Manuel Gutiérrez

    Abstract: Super-resolution (SR) is a promising cost-effective downscaling methodology for producing high-resolution climate information from coarser counterparts. A particular application is downscaling regional reanalysis outputs (predictand) from the driving global counterparts (predictor). This study conducts an intercomparison of various SR downscaling methods focusing on temperature and using the CERRA…

    Submitted 16 October, 2024; originally announced October 2024.

  6. arXiv:2407.19652  [pdf, ps, other]

    cs.CV

    SALVE: A 3D Reconstruction Benchmark of Wounds from Consumer-grade Videos

    Authors: Remi Chierchia, Leo Lebrat, David Ahmedt-Aristizabal, Olivier Salvado, Clinton Fookes, Rodrigo Santa Cruz

    Abstract: Managing chronic wounds is a global challenge that can be alleviated by the adoption of automatic systems for clinical wound assessment from consumer-grade videos. While 2D image analysis approaches are insufficient for handling the 3D features of wounds, existing approaches utilizing 3D reconstruction methods have not been thoroughly evaluated. To address this gap, this paper presents a comprehen…

    Submitted 6 June, 2025; v1 submitted 28 July, 2024; originally announced July 2024.

  7. arXiv:2407.00273  [pdf]

    cs.SE

    Please do not go: understanding turnover of software engineers from different perspectives

    Authors: Michelle Larissa Luciano Carvalho, Paulo da Silva Cruz, Eduardo Santana de Almeida, Paulo Anselmo da Mota Silveira Neto, Rafael Prikladnicki

    Abstract: Turnover consists of professional employees moving into and out of the company in a given period. Such a phenomenon significantly impacts the software industry since it generates knowledge loss, delays in the schedule, and increased costs in the final project. Despite the efforts made by researchers and professionals to minimize the turnover, more studies are needed to understand the motivation…

    Submitted 28 June, 2024; originally announced July 2024.

  8. arXiv:2406.08839  [pdf, other]

    cs.CV

    NeRF Director: Revisiting View Selection in Neural Volume Rendering

    Authors: Wenhui Xiao, Rodrigo Santa Cruz, David Ahmedt-Aristizabal, Olivier Salvado, Clinton Fookes, Leo Lebrat

    Abstract: Neural Rendering representations have significantly contributed to the field of 3D computer vision. Given their potential, considerable efforts have been invested to improve their performance. Nonetheless, the essential question of selecting training views is yet to be thoroughly investigated. This key aspect plays a vital role in achieving high-quality results and aligns with the well-known tenet…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: CVPR2024

  9. arXiv:2401.16144  [pdf, other]

    cs.CV cs.AI

    Divide and Conquer: Rethinking the Training Paradigm of Neural Radiance Fields

    Authors: Rongkai Ma, Leo Lebrat, Rodrigo Santa Cruz, Gil Avraham, Yan Zuo, Clinton Fookes, Olivier Salvado

    Abstract: Neural radiance fields (NeRFs) have exhibited potential in synthesizing high-fidelity views of 3D scenes but the standard training paradigm of NeRF presupposes an equal importance for each image in the training set. This assumption poses a significant challenge for rendering specific views presenting intricate geometries, thereby resulting in suboptimal performance. In this paper, we take a closer…

    Submitted 29 January, 2024; originally announced January 2024.

  10. arXiv:2311.15836  [pdf, other]

    cs.CV

    Syn3DWound: A Synthetic Dataset for 3D Wound Bed Analysis

    Authors: Léo Lebrat, Rodrigo Santa Cruz, Remi Chierchia, Yulia Arzhaeva, Mohammad Ali Armin, Joshua Goldsmith, Jeremy Oorloff, Prithvi Reddy, Chuong Nguyen, Lars Petersson, Michelle Barakat-Johnson, Georgina Luscombe, Clinton Fookes, Olivier Salvado, David Ahmedt-Aristizabal

    Abstract: Wound management poses a significant challenge, particularly for bedridden patients and the elderly. Accurate diagnostic and healing monitoring can significantly benefit from modern image analysis, providing accurate and precise measurements of wounds. Despite several existing techniques, the shortage of expansive and diverse training datasets remains a significant obstacle to constructing machine…

    Submitted 3 March, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: In the IEEE International Symposium on Biomedical Imaging (ISBI) 2024

  11. Level of Awareness of PSU Bayambang Campus Students towards E learning Technologies

    Authors: Matthew John F. Sino Cruz, Kim Eric B. Nanlabi, Michael Ryan C. Peoro

    Abstract: The study assesses the awareness of PSU Bayambang Campus students regarding e-learning technologies. A Quantitative Research Approach was used, gathering data through a demographic questionnaire and ICT Resources assessment. The survey measured students' familiarity and knowledge of existing e-learning technologies. Around 52.50% of respondents were familiar with e-learning concepts, but their exp…

    Submitted 6 August, 2023; originally announced August 2023.

    Comments: published in International Journal of Computing Sciences Research

    Journal ref: Journal of Computing Sciences Research. 3(2)(2019) 199-220

  12. arXiv:2307.10018  [pdf, other]

    cs.RO cs.AI

    RobôCIn Small Size League Extended Team Description Paper for RoboCup 2023

    Authors: Aline Lima de Oliveira, Cauê Addae da Silva Gomes, Cecília Virginia Santos da Silva, Charles Matheus de Sousa Alves, Danilo Andrade Martins de Souza, Driele Pires Ferreira Araújo Xavier, Edgleyson Pereira da Silva, Felipe Bezerra Martins, Lucas Henrique Cavalcanti Santos, Lucas Dias Maciel, Matheus Paixão Gumercindo dos Santos, Matheus Lafayette Vasconcelos, Matheus Vinícius Teotonio do Nascimento Andrade, João Guilherme Oliveira Carvalho de Melo, João Pedro Souza Pereira de Moura, José Ronald da Silva, José Victor Silva Cruz, Pedro Henrique Santana de Morais, Pedro Paulo Salman de Oliveira, Riei Joaquim Matos Rodrigues, Roberto Costa Fernandes, Ryan Vinicius Santos Morais, Tamara Mayara Ramos Teobaldo, Washington Igor dos Santos Silva, Edna Natividade Silva Barros

    Abstract: RobôCIn has participated in RoboCup Small Size League since 2019, won its first world title in 2022 (Division B), and is currently a three-times Latin-American champion. This paper presents our improvements to defend the Small Size League (SSL) division B title in RoboCup 2023 in Bordeaux, France. This paper aims to share some of the academic research that our team developed over the past year. Ou…

    Submitted 19 July, 2023; originally announced July 2023.

  13. Augmented Reality's Potential for Identifying and Mitigating Home Privacy Leaks

    Authors: Stefany Cruz, Logan Danek, Shinan Liu, Christopher Kraemer, Zixin Wang, Nick Feamster, Danny Yuxing Huang, Yaxing Yao, Josiah Hester

    Abstract: Users face various privacy risks in smart homes, yet there are limited ways for them to learn about the details of such risks, such as the data practices of smart home devices and their data flow. In this paper, we present Privacy Plumber, a system that enables a user to inspect and explore the privacy "leaks" in their home using an augmented reality tool. Privacy Plumber allows the user to learn…

    Submitted 27 January, 2023; originally announced January 2023.

    Journal ref: Workshop on Usable Security and Privacy (USEC) 2023

  14. arXiv:2212.03802  [pdf, other]

    cs.DC

    Distributed Load Orchestration for Vision Computing in Multi-Access Edge Computing

    Authors: Ricardo N. Boing, Hugo Vaz Sampaio, Fernando Koch, Rene N. S. Cruz, Carlos B. Westphall

    Abstract: Multi-access Edge Computing (MEC) is a type of network architecture that provides cloud computing capabilities at the edge of the network. We consider the use case of video surveillance for a university campus running on a 5G-MEC environment. A key issue is the eventual overloading of computing resources on the MEC nodes during peak demand. We propose a new strategy for distributed orchestration…

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: This work has been submitted to the IEEE for possible publication

  15. arXiv:2206.06598  [pdf, other]

    eess.IV cs.CV cs.LG

    CorticalFlow$^{++}$: Boosting Cortical Surface Reconstruction Accuracy, Regularity, and Interoperability

    Authors: Rodrigo Santa Cruz, Léo Lebrat, Darren Fu, Pierrick Bourgeat, Jurgen Fripp, Clinton Fookes, Olivier Salvado

    Abstract: The problem of Cortical Surface Reconstruction from magnetic resonance imaging has been traditionally addressed using lengthy pipelines of image processing techniques like FreeSurfer, CAT, or CIVET. These frameworks require very long runtimes deemed unfeasible for real-time applications and impractical for large-scale studies. Recently, supervised deep learning approaches have been introduced to s…

    Submitted 14 June, 2022; originally announced June 2022.

  16. arXiv:2206.02374  [pdf, other]

    cs.CV cs.AI

    CorticalFlow: A Diffeomorphic Mesh Deformation Module for Cortical Surface Reconstruction

    Authors: Léo Lebrat, Rodrigo Santa Cruz, Frédéric de Gournay, Darren Fu, Pierrick Bourgeat, Jurgen Fripp, Clinton Fookes, Olivier Salvado

    Abstract: In this paper we introduce CorticalFlow, a new geometric deep-learning model that, given a 3-dimensional image, learns to deform a reference template towards a targeted object. To conserve the template mesh's topological properties, we train our model over a set of diffeomorphic transformations. This new implementation of a flow Ordinary Differential Equation (ODE) framework benefits from a small…

    Submitted 6 June, 2022; originally announced June 2022.

  17. arXiv:2205.00452  [pdf, other]

    cs.CL

    The use of Data Augmentation as a technique for improving neural network accuracy in detecting fake news about COVID-19

    Authors: Wilton O. Júnior, Mauricio S. da Cruz, Andre Brasil Vieira Wyzykowski, Arnaldo Bispo de Jesus

    Abstract: This paper aims to present how the application of Natural Language Processing (NLP) and data augmentation techniques can improve the performance of a neural network for better detection of fake news in the Portuguese language. Fake news is one of the main controversies during the growth of the internet in the last decade. Verifying what is fact and what is false has proven to be a difficult task,…

    Submitted 1 May, 2022; originally announced May 2022.

  18. arXiv:2204.00386  [pdf, other]

    cs.CV cs.LG

    Autoencoder for Synthetic to Real Generalization: From Simple to More Complex Scenes

    Authors: Steve Dias Da Cruz, Bertram Taetz, Thomas Stifter, Didier Stricker

    Abstract: Learning on synthetic data and transferring the resulting properties to their real counterparts is an important challenge for reducing costs and increasing safety in machine learning. In this work, we focus on autoencoder architectures and aim at learning latent space representations that are invariant to inductive biases caused by the domain shift between simulated and real images showing the sam…

    Submitted 1 April, 2022; originally announced April 2022.

    Comments: This paper is accepted at IEEE International Conference on Pattern Recognition (ICPR), 2022. Supplementary material is available under https://sviro.kl.dfki.de/downloads/papers/icpr_syn2real_appendix.pdf

  19. arXiv:2204.00382  [pdf, other]

    cs.LG cs.CV

    Autoencoder Attractors for Uncertainty Estimation

    Authors: Steve Dias Da Cruz, Bertram Taetz, Thomas Stifter, Didier Stricker

    Abstract: The reliability assessment of a machine learning model's prediction is an important quantity for the deployment in safety critical applications. Not only can it be used to detect novel sceneries, either as out-of-distribution or anomaly sample, but it also helps to determine deficiencies in the training data distribution. A lot of promising research directions have either proposed traditional meth…

    Submitted 11 May, 2022; v1 submitted 1 April, 2022; originally announced April 2022.

    Comments: This paper is accepted at IEEE International Conference on Pattern Recognition (ICPR), 2022

  20. arXiv:2203.15841  [pdf, other]

    cs.LG cs.CV eess.SY math.OC

    NNLander-VeriF: A Neural Network Formal Verification Framework for Vision-Based Autonomous Aircraft Landing

    Authors: Ulices Santa Cruz, Yasser Shoukry

    Abstract: In this paper, we consider the problem of formally verifying a Neural Network (NN) based autonomous landing system. In such a system, a NN controller processes images from a camera to guide the aircraft while approaching the runway. A central challenge for the safety and liveness verification of vision-based closed-loop systems is the lack of mathematical models that capture the relation between…

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: 18 pages

  21. arXiv:2201.07894  [pdf, other]

    cs.CV cs.AI

    Enhanced Performance of Pre-Trained Networks by Matched Augmentation Distributions

    Authors: Touqeer Ahmad, Mohsen Jafarzadeh, Akshay Raj Dhamija, Ryan Rabinowitz, Steve Cruz, Chunchun Li, Terrance E. Boult

    Abstract: There exists a distribution discrepancy between training and testing, in the way images are fed to modern CNNs. Recent work tried to bridge this gap either by fine-tuning or re-training the network at different resolutions. However, re-training a network is rarely cheap and not always viable. To this end, we propose a simple solution to address the train-test distributional shift and enhance the pe…

    Submitted 19 January, 2022; originally announced January 2022.

    MSC Class: 68T45 ACM Class: I.4.8

  22. arXiv:2109.01255  [pdf, other]

    cs.LG cs.RO eess.SY

    Provably Safe Model-Based Meta Reinforcement Learning: An Abstraction-Based Approach

    Authors: Xiaowu Sun, Wael Fatnassi, Ulices Santa Cruz, Yasser Shoukry

    Abstract: While conventional reinforcement learning focuses on designing agents that can perform one task, meta-learning aims, instead, to solve the problem of designing agents that can generalize to different tasks (e.g., environments, obstacles, and goals) that were not considered during the design or the training of these agents. In this spirit, in this paper, we consider the problem of training a provab…

    Submitted 2 September, 2021; originally announced September 2021.

  23. arXiv:2105.03164  [pdf, other]

    cs.CV

    Autoencoder Based Inter-Vehicle Generalization for In-Cabin Occupant Classification

    Authors: Steve Dias Da Cruz, Bertram Taetz, Oliver Wasenmüller, Thomas Stifter, Didier Stricker

    Abstract: Common domain shift problem formulations consider the integration of multiple source domains, or the target domain during training. Regarding the generalization of machine learning models between different car interiors, we formulate the criterion of training in a single vehicle: without access to the target distribution of the vehicle the model would be deployed to, neither with access to multipl…

    Submitted 7 May, 2021; originally announced May 2021.

    Comments: This paper has been accepted at IEEE Intelligent Vehicles Symposium (IV), 2021

  24. arXiv:2105.03009  [pdf, other]

    cs.DC

    Autonomic Management of Power Consumption with IoT and Fog Computing

    Authors: Hugo Vaz Sampaio, Fernando Koch, Carlos Becker Westphall, Ricardo do Nascimento Boing, Rene Nolio Santa Cruz

    Abstract: We introduce a system for Autonomic Management of Power Consumption in setups that involve Internet of Things (IoT) and Fog Computing. The Central IoT (CIoT) is a Fog Computing based solution to provide advanced orchestration mechanisms to manage dynamic duty cycles for extra energy savings. The solution works by adjusting Home (H) and Away (A) cycles based on contextual information, like environm…

    Submitted 12 May, 2021; v1 submitted 6 May, 2021; originally announced May 2021.

    Comments: 14 pages, 7 figures, 11 tables, preprint submitted to Elsevier

  25. arXiv:2104.14554  [pdf, other]

    cs.CV cs.LG

    MongeNet: Efficient Sampler for Geometric Deep Learning

    Authors: Léo Lebrat, Rodrigo Santa Cruz, Clinton Fookes, Olivier Salvado

    Abstract: Recent advances in geometric deep-learning introduce complex computational challenges for evaluating the distance between meshes. From a mesh model, point clouds are necessary along with a robust distance metric to assess surface quality or as part of the loss function for training models. Current methods often rely on a uniform random mesh discretization, which yields irregular sampling and noisy…

    Submitted 29 April, 2021; originally announced April 2021.

  26. arXiv:2104.02788  [pdf, other]

    cs.LG eess.SY math.OC

    Safe-by-Repair: A Convex Optimization Approach for Repairing Unsafe Two-Level Lattice Neural Network Controllers

    Authors: Ulices Santa Cruz, James Ferlez, Yasser Shoukry

    Abstract: In this paper, we consider the problem of repairing a data-trained Rectified Linear Unit (ReLU) Neural Network (NN) controller for a discrete-time, input-affine system. That is, we assume that such a NN controller is available, and we seek to repair unsafe closed-loop behavior at one known "counterexample" state while simultaneously preserving a notion of safe closed-loop behavior on a separate, ve…

    Submitted 6 April, 2021; originally announced April 2021.

  27. arXiv:2101.12331  [pdf, other]

    cs.DC

    A Pub-Sub Architecture to Promote Blockchain Interoperability

    Authors: Sara Ghaemi, Sara Rouhani, Rafael Belchior, Rui S. Cruz, Hamzeh Khazaei, Petr Musilek

    Abstract: The maturing of blockchain technology leads to heterogeneity, where multiple solutions specialize in a particular use case. While the development of different blockchain networks shows great potential for blockchains, the isolated networks have led to data and asset silos, limiting the applications of this technology. Blockchain interoperability solutions are essential to enable distributed ledger…

    Submitted 28 January, 2021; originally announced January 2021.

  28. arXiv:2012.04226  [pdf, other]

    cs.AI cs.CV cs.LG

    A Unifying Framework for Formal Theories of Novelty:Framework, Examples and Discussion

    Authors: T. E. Boult, P. A. Grabowicz, D. S. Prijatelj, R. Stern, L. Holder, J. Alspector, M. Jafarzadeh, T. Ahmad, A. R. Dhamija, C. Li, S. Cruz, A. Shrivastava, C. Vondrick, W. J. Scheirer

    Abstract: Managing inputs that are novel, unknown, or out-of-distribution is critical as an agent moves from the lab to the open world. Novelty-related problems include being tolerant to novel perturbations of the normal input, detecting when the input includes novel items, and adapting to novel inputs. While significant research has been undertaken in these areas, a noticeable gap exists in the lack of a f…

    Submitted 8 December, 2020; originally announced December 2020.

    Comments: Extended version/preprint of an AAAI 2021 paper

  29. arXiv:2011.12906  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV eess.SP

    A Review of Open-World Learning and Steps Toward Open-World Learning Without Labels

    Authors: Mohsen Jafarzadeh, Akshay Raj Dhamija, Steve Cruz, Chunchun Li, Touqeer Ahmad, Terrance E. Boult

    Abstract: In open-world learning, an agent starts with a set of known classes, detects, and manages things that it does not know, and learns them over time from a non-stationary stream of data. Open-world learning is related to but also distinct from a multitude of other learning problems and this paper briefly analyzes the key differences between a wide range of problems including incremental learning, gen…

    Submitted 3 January, 2022; v1 submitted 25 November, 2020; originally announced November 2020.

    MSC Class: 68T45 ACM Class: I.4.8

  30. arXiv:2011.05506  [pdf, other]

    cs.CV cs.LG eess.IV eess.SP

    Automatic Open-World Reliability Assessment

    Authors: Mohsen Jafarzadeh, Touqeer Ahmad, Akshay Raj Dhamija, Chunchun Li, Steve Cruz, Terrance E. Boult

    Abstract: Image classification in the open-world must handle out-of-distribution (OOD) images. Systems should ideally reject OOD images, or they will map atop of known classes and reduce reliability. Using open-set classifiers that can reject OOD inputs can help. However, optimal accuracy of open-set classifiers depends on the frequency of OOD data. Thus, for either standard or open-set classifiers, it is im…

    Submitted 13 December, 2020; v1 submitted 10 November, 2020; originally announced November 2020.

    Comments: 2021 IEEE Winter Conference on Applications of Computer Vision (WACV)

    MSC Class: 68T45 ACM Class: I.4.8

    Journal ref: 2021 IEEE Winter Conference on Applications of Computer Vision (WACV)

  31. arXiv:2011.04923  [pdf, other]

    cs.LG cs.AI stat.ML

    Topological properties of basins of attraction and expressiveness of width bounded neural networks

    Authors: Hans-Peter Beise, Steve Dias Da Cruz

    Abstract: In Radhakrishnan et al. [2020], the authors empirically show that autoencoders trained with usual SGD methods shape out basins of attraction around their training data. We consider network functions of width not exceeding the input dimension and prove that in this situation basins of attraction are bounded and their complement cannot have bounded components. Our conditions in these results are met…

    Submitted 1 December, 2023; v1 submitted 10 November, 2020; originally announced November 2020.

  32. arXiv:2011.03428  [pdf, other]

    cs.CV

    Illumination Normalization by Partially Impossible Encoder-Decoder Cost Function

    Authors: Steve Dias Da Cruz, Bertram Taetz, Thomas Stifter, Didier Stricker

    Abstract: Images recorded during the lifetime of computer vision based systems undergo a wide range of illumination and environmental conditions affecting the reliability of previously trained machine learning models. Image normalization is hence a valuable preprocessing component to enhance the models' robustness. To this end, we introduce a new strategy for the cost function formulation of encoder-decoder…

    Submitted 9 November, 2020; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: This paper is accepted at IEEE Winter Conference on Applications of Computer Vision (WACV), 2021. Supplementary material is available under https://sviro.kl.dfki.de/downloads/papers/wacv2021_supplementary.pdf

  33. arXiv:2010.11423  [pdf, other]

    eess.IV cs.CV

    DeepCSR: A 3D Deep Learning Approach for Cortical Surface Reconstruction

    Authors: Rodrigo Santa Cruz, Leo Lebrat, Pierrick Bourgeat, Clinton Fookes, Jurgen Fripp, Olivier Salvado

    Abstract: The study of neurodegenerative diseases relies on the reconstruction and analysis of the brain cortex from magnetic resonance imaging (MRI). Traditional frameworks for this task like FreeSurfer demand lengthy runtimes, while its accelerated variant FastSurfer still relies on a voxel-wise segmentation which is limited by its resolution to capture narrow continuous objects as cortical surfaces. Havi…

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: Accepted in 2021 IEEE Winter Conference on Applications of Computer Vision (WACV)

  34. arXiv:2009.03303  [pdf, other]

    eess.IV cs.CV cs.LG

    Going deeper with brain morphometry using neural networks

    Authors: Rodrigo Santa Cruz, Léo Lebrat, Pierrick Bourgeat, Vincent Doré, Jason Dowling, Jurgen Fripp, Clinton Fookes, Olivier Salvado

    Abstract: Brain morphometry from magnetic resonance imaging (MRI) is a consolidated biomarker for many neurodegenerative diseases. Recent advances in this domain indicate that deep convolutional neural networks can infer morphometric measurements within a few seconds. Nevertheless, the accuracy of the devised model for insightful bio-markers (mean curvature and thickness) remains unsatisfactory. In this pap…

    Submitted 7 September, 2020; originally announced September 2020.

  35. arXiv:2008.11572  [pdf, other]

    eess.IV cs.CV cs.LG

    On the Composition and Limitations of Publicly Available COVID-19 X-Ray Imaging Datasets

    Authors: Beatriz Garcia Santa Cruz, Jan Sölter, Matias Nicolas Bossa, Andreas Dominik Husch

    Abstract: Machine learning based methods for diagnosis and progression prediction of COVID-19 from imaging data have gained significant attention in the last months, in particular by the use of deep learning models. In this context hundreds of models were proposed with the majority of them trained on public datasets. Data scarcity, mismatch between training and target population, group imbalance, and lack…

    Submitted 26 August, 2020; originally announced August 2020.

    Comments: 12 pages, 3 figures

  36. arXiv:2006.04384  [pdf, other]

    cs.CR

    Distributed Attribute-Based Access Control System Using a Permissioned Blockchain

    Authors: Sara Rouhani, Rafael Belchior, Rui S. Cruz, Ralph Deters

    Abstract: Auditing provides an essential security control in computer systems, by keeping track of all access attempts, including both legitimate and illegal access attempts. This phase can be useful to the context of audits, where eventual misbehaving parties can be held accountable. Blockchain technology can provide trusted auditability required for access control systems. In this paper, we propose a dist…

    Submitted 8 June, 2020; originally announced June 2020.

  37. arXiv:2004.13217  [pdf, other]

    cs.CV

    Inferring Temporal Compositions of Actions Using Probabilistic Automata

    Authors: Rodrigo Santa Cruz, Anoop Cherian, Basura Fernando, Dylan Campbell, Stephen Gould

    Abstract: This paper presents a framework to recognize temporal compositions of atomic actions in videos. Specifically, we propose to express temporal compositions of actions as semantic regular expressions and derive an inference framework using probabilistic automata to recognize complex actions as satisfying these expressions on the input video features. Our approach is different from existing works that…

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: Accepted in Workshop on Compositionality in Computer Vision at CVPR, 2020

  38. arXiv:2001.03483  [pdf, other]

    cs.CV

    SVIRO: Synthetic Vehicle Interior Rear Seat Occupancy Dataset and Benchmark

    Authors: Steve Dias Da Cruz, Oliver Wasenmüller, Hans-Peter Beise, Thomas Stifter, Didier Stricker

    Abstract: We release SVIRO, a synthetic dataset for sceneries in the passenger compartment of ten different vehicles, in order to analyze machine learning-based approaches for their generalization capacities and reliability when trained on a limited number of variations (e.g. identical backgrounds and textures, few instances per class). This is in contrast to the intrinsically high variability of common ben…

    Submitted 10 January, 2020; originally announced January 2020.

    Comments: This paper is accepted at IEEE Winter Conference on Applications of Computer Vision (WACV), 2020. Supplementary material is available under https://sviro.kl.dfki.de/downloads/papers/wacv_supplementary.pdf

  39. arXiv:1911.07929  [pdf]

    cs.CV cs.CY cs.LG eess.IV stat.ML

    A Smartphone-Based Skin Disease Classification Using MobileNet CNN

    Authors: Jessica Velasco, Cherry Pascion, Jean Wilmar Alberio, Jonathan Apuang, John Stephen Cruz, Mark Angelo Gomez, Benjamin Jr. Molina, Lyndon Tuala, August Thio-ac, Romeo Jr. Jorda

    Abstract: The MobileNet model was used by applying transfer learning on the 7 skin diseases to create a skin disease classification system as an Android application. The proponents gathered a total of 3,406 images, which form an imbalanced dataset because of the unequal number of images across its classes. Different sampling methods and preprocessing of the input data were explored to further improve the…

    Submitted 13 November, 2019; originally announced November 2019.

    Journal ref: International Journal of Advanced Trends in Computer Science and Engineering (2019) 2632-2637

  40. To Beta or Not To Beta: Information Bottleneck for Digital Image Forensics

    Authors: Aurobrata Ghosh, Zheng Zhong, Steve Cruz, Subbu Veeravasarapu, Terrance E Boult, Maneesh Singh

    Abstract: We consider an information theoretic approach to address the problem of identifying fake digital images. We propose an innovative method to formulate the issue of localizing manipulated regions in an image as a deep representation learning problem using the Information Bottleneck (IB), which has recently gained popularity as a framework for interpreting deep neural networks. Tampered images pose a…

    Submitted 11 August, 2019; originally announced August 2019.

    Comments: 10 pages

    Journal ref: 2020 IEEE International Conference on Image Processing (ICIP)

  41. arXiv:1807.01194  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    On decision regions of narrow deep neural networks

    Authors: Hans-Peter Beise, Steve Dias Da Cruz, Udo Schröder

    Abstract: We show that for neural network functions that have width less than or equal to the input dimension, all connected components of decision regions are unbounded. The result holds for continuous and strictly monotonic activation functions as well as for the ReLU activation function. This complements recent results on approximation capabilities by [Hanin 2017 Approximating] and connectivity of decision reg…

    Submitted 3 March, 2021; v1 submitted 3 July, 2018; originally announced July 2018.

    Comments: This paper is accepted for publication in Neural Networks (Elsevier Journal)

  42. arXiv:1801.08676  [pdf, other]

    cs.CV cs.LG

    Neural Algebra of Classifiers

    Authors: Rodrigo Santa Cruz, Basura Fernando, Anoop Cherian, Stephen Gould

    Abstract: The world is fundamentally compositional, so it is natural to think of visual recognition as the recognition of basic visual primitives that are composed according to well-defined rules. This strategy allows us to recognize unseen complex concepts from simple visual primitives. However, the current trend in visual recognition follows a data-greedy approach where huge amounts of data are required…

    Submitted 26 January, 2018; originally announced January 2018.

    Comments: 2018 IEEE Winter Conference on Applications of Computer Vision (WACV)

  43. arXiv:1705.01567  [pdf, other]

    cs.CV

    Toward Open-Set Face Recognition

    Authors: Manuel Günther, Steve Cruz, Ethan M. Rudd, Terrance E. Boult

    Abstract: Much research has been conducted on both face identification and face verification, with greater focus on the latter. Research on face identification has mostly focused on using closed-set protocols, which assume that all probe images used in evaluation contain identities of subjects that are enrolled in the gallery. Real systems, however, where only a fraction of probe sample identities are enrol…

    Submitted 18 May, 2017; v1 submitted 3 May, 2017; originally announced May 2017.

    Comments: Accepted for Publication in CVPR 2017 Biometrics Workshop

  44. arXiv:1704.02729  [pdf, other]

    cs.CV

    DeepPermNet: Visual Permutation Learning

    Authors: Rodrigo Santa Cruz, Basura Fernando, Anoop Cherian, Stephen Gould

    Abstract: We present a principled approach to uncover the structure of visual data by solving a novel deep learning task coined visual permutation learning. The goal of this task is to find the permutation that recovers the structure of data from shuffled versions of it. In the case of natural images, this task boils down to recovering the original image from patches shuffled by an unknown permutation matri…

    Submitted 10 April, 2017; originally announced April 2017.

    Comments: Accepted in IEEE International Conference on Computer Vision and Pattern Recognition CVPR 2017

  45. arXiv:1703.02244  [pdf, other]

    cs.CR

    Open Set Intrusion Recognition for Fine-Grained Attack Categorization

    Authors: Steve Cruz, Cora Coleman, Ethan M. Rudd, Terrance E. Boult

    Abstract: Confidently distinguishing a malicious intrusion over a network is an important challenge. Most intrusion detection system evaluations have been performed in a closed set protocol in which only classes seen during training are considered during classification. Thus far, there has been no realistic application in which novel types of behaviors unseen at training -- unknown classes as it were -- mus…

    Submitted 7 March, 2017; originally announced March 2017.

    Comments: Pre-print of camera-ready version to appear at the IEEE Homeland Security Technologies (HST) 2017 Conference

  46. arXiv:1607.05447  [pdf, other]

    cs.CV math.OC

    On Differentiating Parameterized Argmin and Argmax Problems with Application to Bi-level Optimization

    Authors: Stephen Gould, Basura Fernando, Anoop Cherian, Peter Anderson, Rodrigo Santa Cruz, Edison Guo

    Abstract: Some recent works in machine learning and computer vision involve the solution of a bi-level optimization problem. Here the solution of a parameterized lower-level problem binds variables that appear in the objective of an upper-level problem. The lower-level problem typically appears as an argmin or argmax optimization problem. Many techniques have been proposed to solve bi-level optimization pro…

    Submitted 20 July, 2016; v1 submitted 19 July, 2016; originally announced July 2016.

    Comments: 16 pages, 6 figures