Skip to main content

Showing 1–14 of 14 results for author: Jaja, J

.
  1. arXiv:2505.18137  [pdf, ps, other

    cs.CV cs.LG

    Boosting Open Set Recognition Performance through Modulated Representation Learning

    Authors: Amit Kumar Kundu, Vaishnavi Patil, Joseph Jaja

    Abstract: The open set recognition (OSR) problem aims to identify test samples from novel semantic classes that are not part of the training classes, a task that is crucial in many practical scenarios. However, existing OSR methods use a constant scaling factor (the temperature) to the logits before applying a loss function, which hinders the model from exploring both ends of the spectrum in representation… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  2. arXiv:2502.10562  [pdf, other

    cs.CV cs.LG

    Detecting and Monitoring Bias for Subgroups in Breast Cancer Detection AI

    Authors: Amit Kumar Kundu, Florence X. Doo, Vaishnavi Patil, Amitabh Varshney, Joseph Jaja

    Abstract: Automated mammography screening plays an important role in early breast cancer detection. However, current machine learning models, developed on some training datasets, may exhibit performance degradation and bias when deployed in real-world settings. In this paper, we analyze the performance of high-performing AI models on two mammography datasets-the Emory Breast Imaging Dataset (EMBED) and the… ▽ More

    Submitted 14 February, 2025; originally announced February 2025.

  3. arXiv:2305.09092  [pdf, other

    cs.LG cs.CV

    ProtoVAE: Prototypical Networks for Unsupervised Disentanglement

    Authors: Vaishnavi Patil, Matthew Evanusa, Joseph JaJa

    Abstract: Generative modeling and self-supervised learning have in recent years made great strides towards learning from data in a completely unsupervised way. There is still however an open area of investigation into guiding a neural network to encode the data into representations that are interpretable or explainable. The problem of unsupervised disentanglement is of particular importance as it proposes t… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  4. arXiv:2210.10920  [pdf, other

    cs.LG cs.NE

    DOT-VAE: Disentangling One Factor at a Time

    Authors: Vaishnavi Patil, Matthew Evanusa, Joseph JaJa

    Abstract: As we enter the era of machine learning characterized by an overabundance of data, discovery, organization, and interpretation of the data in an unsupervised manner becomes a critical need. One promising approach to this endeavour is the problem of Disentanglement, which aims at learning the underlying generative latent factors, called the factors of variation, of the data and encoding them in dis… ▽ More

    Submitted 20 October, 2022; v1 submitted 19 October, 2022; originally announced October 2022.

  5. arXiv:2208.01813  [pdf, other

    cs.CV

    TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation

    Authors: Jun Wang, Mingfei Gao, Yuqian Hu, Ramprasaath R. Selvaraju, Chetan Ramaiah, Ran Xu, Joseph F. JaJa, Larry S. Davis

    Abstract: Text-VQA aims at answering questions that require understanding the textual cues in an image. Despite the great progress of existing Text-VQA methods, their performance suffers from insufficient human-labeled question-answer (QA) pairs. However, we observe that, in general, the scene text is not fully exploited in the existing datasets -- only a small portion of the text in each image participates… ▽ More

    Submitted 7 October, 2022; v1 submitted 2 August, 2022; originally announced August 2022.

    Comments: BMVC 2022

  6. arXiv:2207.09568  [pdf, other

    cs.LG cs.CR cs.DC

    FedNet2Net: Saving Communication and Computations in Federated Learning with Model Growing

    Authors: Amit Kumar Kundu, Joseph Jaja

    Abstract: Federated learning (FL) is a recently developed area of machine learning, in which the private data of a large number of distributed clients is used to develop a global model under the coordination of a central server without explicitly exposing the data. The standard FL strategy has a number of significant bottlenecks including large communication requirements and high impact on the clients' reso… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: This version of the contribution has been accepted for publication in the proceedings of 31st International Conference on Artificial Neural Networks

  7. arXiv:2006.14028  [pdf, other

    cs.LG stat.ML

    Class-Similarity Based Label Smoothing for Confidence Calibration

    Authors: Chihuang Liu, Joseph JaJa

    Abstract: Generating confidence calibrated outputs is of utmost importance for the applications of deep neural networks in safety-critical decision-making systems. The output of a neural network is a probability distribution where the scores are estimated confidences of the input belonging to the corresponding classes, and hence they represent a complete estimate of the output likelihood relative to all cla… ▽ More

    Submitted 15 September, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

    Comments: ICANN 2021

  8. arXiv:1810.02424  [pdf, other

    cs.LG stat.ML

    Feature Prioritization and Regularization Improve Standard Accuracy and Adversarial Robustness

    Authors: Chihuang Liu, Joseph JaJa

    Abstract: Adversarial training has been successfully applied to build robust models at a certain cost. While the robustness of a model increases, the standard classification accuracy declines. This phenomenon is suggested to be an inherent trade-off. We propose a model that employs feature prioritization by a nonlinear attention module and $L_2$ feature regularization to improve the adversarial robustness a… ▽ More

    Submitted 12 August, 2019; v1 submitted 4 October, 2018; originally announced October 2018.

    Comments: IJCAI 2019

  9. arXiv:1805.07683  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Graph-Level Representations with Recurrent Neural Networks

    Authors: Yu Jin, Joseph F. JaJa

    Abstract: Recently a variety of methods have been developed to encode graphs into low-dimensional vectors that can be easily exploited by machine learning algorithms. The majority of these methods start by embedding the graph nodes into a low-dimensional vector space, followed by using some scheme to aggregate the node embeddings. In this work, we develop a new approach to learn graph-level representations,… ▽ More

    Submitted 11 September, 2018; v1 submitted 19 May, 2018; originally announced May 2018.

    Comments: Submit to AAAI 2019

  10. A High Performance Implementation of Spectral Clustering on CPU-GPU Platforms

    Authors: Yu Jin, Joseph F. JaJa

    Abstract: Spectral clustering is one of the most popular graph clustering algorithms, which achieves the best performance for many scientific and engineering applications. However, existing implementations in commonly used software platforms such as Matlab and Python do not scale well for many of the emerging Big Data applications. In this paper, we present a fast implementation of the spectral clustering a… ▽ More

    Submitted 12 February, 2018; originally announced February 2018.

    Comments: 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (Parallel Computing and Optimization (PCO) workshop). Codes are available on https://github.com/yuj-umd/fastsc

  11. arXiv:1802.04447  [pdf, other

    cs.SI math.NA stat.AP

    Graph Coarsening with Preserved Spectral Properties

    Authors: Yu Jin, Andreas Loukas, Joseph F. JaJa

    Abstract: Large-scale graphs are widely used to represent object relationships in many real world applications. The occurrence of large-scale graphs presents significant computational challenges to process, analyze, and extract information. Graph coarsening techniques are commonly used to reduce the computational load while attempting to maintain the basic structural properties of the original graph. As the… ▽ More

    Submitted 10 October, 2019; v1 submitted 12 February, 2018; originally announced February 2018.

    Comments: Submitted to AISTATS 2020

  12. arXiv:1802.04353  [pdf, other

    cs.CE q-bio.NC stat.AP

    A Data-Driven Approach to Extract Connectivity Structures from Diffusion Tensor Imaging Data

    Authors: Yu Jin, Joseph F. JaJa, Rong Chen, Edward H. Herskovits

    Abstract: Diffusion Tensor Imaging (DTI) is an effective tool for the analysis of structural brain connectivity in normal development and in a broad range of brain disorders. However efforts to derive inherent characteristics of structural brain networks have been hampered by the very high dimensionality of the data, relatively small sample sizes, and the lack of widely acceptable connectivity-based regions… ▽ More

    Submitted 12 February, 2018; originally announced February 2018.

    Comments: Proceedings of 2015 IEEE International Conference on Big Data

  13. arXiv:1609.03893  [pdf, other

    cs.CE q-bio.NC stat.AP

    Scalable Algorithms for Generating and Analyzing Structural Brain Networks with a Varying Number of Nodes

    Authors: Yu Jin, Joseph F. JaJa, Rong Chen, Edward H. Herskovits

    Abstract: Diffusion Magnetic Resonance Imaging (MRI) exploits the anisotropic diffusion of water molecules in the brain to enable the estimation of the brain's anatomical fiber tracts at a relatively high resolution. In particular, tractographic methods can be used to generate whole-brain anatomical connectivity matrix where each element provides an estimate of the connectivity strength between the correspo… ▽ More

    Submitted 13 September, 2016; originally announced September 2016.

  14. arXiv:1312.1909  [pdf

    cs.NE cs.CV cs.LG stat.ML

    From Maxout to Channel-Out: Encoding Information on Sparse Pathways

    Authors: Qi Wang, Joseph JaJa

    Abstract: Motivated by an important insight from neural science, we propose a new framework for understanding the success of the recently proposed "maxout" networks. The framework is based on encoding information on sparse pathways and recognizing the correct pathway at inference time. Elaborating further on this insight, we propose a novel deep network architecture, called "channel-out" network, which take… ▽ More

    Submitted 18 November, 2013; originally announced December 2013.

    Comments: 10 pages including the appendix, 9 figures