Showing 1–50 of 62 results for author: Turaga, P

  1. arXiv:2509.23077  [pdf, ps, other]

    cs.LG

    CLAD-Net: Continual Activity Recognition in Multi-Sensor Wearable Systems

    Authors: Reza Rahimi Azghan, Gautham Krishna Gudur, Mohit Malu, Edison Thomaz, Giulia Pedrielli, Pavan Turaga, Hassan Ghasemzadeh

    Abstract: The rise of deep learning has greatly advanced human behavior monitoring using wearable sensors, particularly human activity recognition (HAR). While deep models have been widely studied, most assume stationary data distributions - an assumption often violated in real-world scenarios. For example, sensor data from one subject may differ significantly from another, leading to distribution shifts. I…

    Submitted 26 September, 2025; originally announced September 2025.

  2. arXiv:2509.18457  [pdf, ps, other]

    cs.LG

    GluMind: Multimodal Parallel Attention and Knowledge Retention for Robust Cross-Population Blood Glucose Forecasting

    Authors: Ebrahim Farahmand, Reza Rahimi Azghan, Nooshin Taheri Chatrudi, Velarie Yaa Ansu-Baidoo, Eric Kim, Gautham Krishna Gudur, Mohit Malu, Owen Krueger, Edison Thomaz, Giulia Pedrielli, Pavan Turaga, Hassan Ghasemzadeh

    Abstract: This paper proposes GluMind, a transformer-based multimodal framework designed for continual and long-term blood glucose forecasting. GluMind devises two attention mechanisms, including cross-attention and multi-scale attention, which operate in parallel and deliver accurate predictive performance. Cross-attention effectively integrates blood glucose data with other physiological and behavioral si…

    Submitted 22 September, 2025; originally announced September 2025.

  3. arXiv:2506.10265  [pdf, ps, other]

    eess.SP cs.CV cs.HC

    Ground Reaction Force Estimation via Time-aware Knowledge Distillation

    Authors: Eun Som Jeon, Sinjini Mitra, Jisoo Lee, Omik M. Save, Ankita Shukla, Hyunglae Lee, Pavan Turaga

    Abstract: Human gait analysis with wearable sensors has been widely used in various applications, such as daily life healthcare, rehabilitation, physical therapy, and clinical diagnostics and monitoring. In particular, ground reaction force (GRF) provides critical information about how the body interacts with the ground during locomotion. Although instrumented treadmills have been widely used as the gold st…

    Submitted 11 June, 2025; originally announced June 2025.

    Journal ref: IEEE Internet of Things Journal, 2025

  4. Intra-class Patch Swap for Self-Distillation

    Authors: Hongjun Choi, Eun Som Jeon, Ankita Shukla, Pavan Turaga

    Abstract: Knowledge distillation (KD) is a valuable technique for compressing large deep learning models into smaller, edge-suitable networks. However, conventional KD frameworks rely on pre-trained high-capacity teacher networks, which introduce significant challenges such as increased memory/storage requirements, additional training costs, and ambiguity in selecting an appropriate teacher for a given stud…

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: Accepted for publication in Neurocomputing

  5. arXiv:2505.12486  [pdf, ps, other]

    cs.CV

    Guiding Diffusion with Deep Geometric Moments: Balancing Fidelity and Variation

    Authors: Sangmin Jung, Utkarsh Nath, Yezhou Yang, Giulia Pedrielli, Joydeep Biswas, Amy Zhang, Hassan Ghasemzadeh, Pavan Turaga

    Abstract: Text-to-image generation models have achieved remarkable capabilities in synthesizing images, but often struggle to provide fine-grained control over the output. Existing guidance approaches, such as segmentation maps and depth maps, introduce spatial rigidity that restricts the inherent diversity of diffusion models. In this work, we introduce Deep Geometric Moments (DGM) as a novel form of guida…

    Submitted 18 May, 2025; originally announced May 2025.

    Comments: Accepted in CVPR Workshop GMCV 2025

  6. arXiv:2504.03119  [pdf]

    cs.SI cs.AI stat.ML

    Graph Network Modeling Techniques for Visualizing Human Mobility Patterns

    Authors: Sinjini Mitra, Anuj Srivastava, Avipsa Roy, Pavan Turaga

    Abstract: Human mobility analysis at urban scale requires models to represent the complex nature of human movements, which in turn are affected by accessibility to nearby points of interest, underlying socioeconomic factors of a place, and local transport choices for people living in a geographic region. In this work, we represent human mobility and the associated flow of movements as a graph. Graph-based…

    Submitted 3 April, 2025; originally announced April 2025.

  7. arXiv:2503.11981  [pdf, ps, other]

    cs.CV

    DecompDreamer: A Composition-Aware Curriculum for Structured 3D Asset Generation

    Authors: Utkarsh Nath, Rajeev Goel, Rahul Khurana, Kyle Min, Mark Ollila, Pavan Turaga, Varun Jampani, Tejaswi Gowda

    Abstract: Current text-to-3D methods excel at generating single objects but falter on compositional prompts. We argue this failure is fundamental to their optimization schedules, as simultaneous or iterative heuristics predictably collapse under a combinatorial explosion of conflicting gradients, leading to entangled geometry or catastrophic divergence. In this paper, we reframe the core challenge of compos…

    Submitted 5 October, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

  8. arXiv:2502.19766  [pdf, other]

    cs.CV

    Automatic Temporal Segmentation for Post-Stroke Rehabilitation: A Keypoint Detection and Temporal Segmentation Approach for Small Datasets

    Authors: Jisoo Lee, Tamim Ahmed, Thanassis Rikakis, Pavan Turaga

    Abstract: Rehabilitation is essential and critical for post-stroke patients, addressing both physical and cognitive aspects. Stroke predominantly affects older adults, with 75% of cases occurring in individuals aged 65 and older, underscoring the urgent need for tailored rehabilitation strategies in aging populations. Despite the critical role therapists play in evaluating rehabilitation progress and ensuri…

    Submitted 27 February, 2025; originally announced February 2025.

  9. arXiv:2502.09919  [pdf, other]

    cs.LG cs.AI

    AttenGluco: Multimodal Transformer-Based Blood Glucose Forecasting on AI-READI Dataset

    Authors: Ebrahim Farahmand, Reza Rahimi Azghan, Nooshin Taheri Chatrudi, Eric Kim, Gautham Krishna Gudur, Edison Thomaz, Giulia Pedrielli, Pavan Turaga, Hassan Ghasemzadeh

    Abstract: Diabetes is a chronic metabolic disorder characterized by persistently high blood glucose levels (BGLs), leading to severe complications such as cardiovascular disease, neuropathy, and retinopathy. Predicting BGLs enables patients to maintain glucose levels within a safe range and allows caregivers to take proactive measures through lifestyle modifications. Continuous Glucose Monitoring (CGM) syst…

    Submitted 14 February, 2025; originally announced February 2025.

  10. arXiv:2502.00779  [pdf, other]

    cs.LG cs.AI eess.SP

    Role of Mixup in Topological Persistence Based Knowledge Distillation for Wearable Sensor Data

    Authors: Eun Som Jeon, Hongjun Choi, Matthew P. Buman, Pavan Turaga

    Abstract: The analysis of wearable sensor data has enabled many successes in several applications. To represent the high-sampling rate time-series with sufficient detail, the use of topological data analysis (TDA) has been considered, and it is found that TDA can complement other time-series features. Nonetheless, due to the large time consumption and high computational resource requirements of extracting t…

    Submitted 2 February, 2025; originally announced February 2025.

    Comments: IEEE Sensors Journal (2024)

  11. arXiv:2408.05938  [pdf, other]

    cs.CV

    Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation

    Authors: Utkarsh Nath, Rajeev Goel, Eun Som Jeon, Changhoon Kim, Kyle Min, Yezhou Yang, Yingzhen Yang, Pavan Turaga

    Abstract: To address the data scarcity associated with 3D assets, 2D-lifting techniques such as Score Distillation Sampling (SDS) have become a widely adopted practice in text-to-3D generation pipelines. However, the diffusion models used in these techniques are prone to viewpoint bias and thus lead to geometric inconsistencies such as the Janus problem. To counter this, we introduce MT3D, a text-to-3D gene…

    Submitted 21 January, 2025; v1 submitted 12 August, 2024; originally announced August 2024.

    Comments: This paper has been accepted to WACV 2025

  12. arXiv:2407.05316  [pdf, other]

    cs.CV

    Leveraging Topological Guidance for Improved Knowledge Distillation

    Authors: Eun Som Jeon, Rahul Khurana, Aishani Pathak, Pavan Turaga

    Abstract: Deep learning has shown its efficacy in extracting useful features to solve various computer vision tasks. However, when the structure of the data is complex and noisy, capturing effective information to improve performance is very difficult. To this end, topological data analysis (TDA) has been utilized to derive useful representations that can contribute to improving performance and robustness a…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: ICML 2024 Workshop on Geometry-grounded Representation Learning and Generative Modeling

  13. arXiv:2407.05315  [pdf, other]

    eess.SP cs.LG math.AT

    Topological Persistence Guided Knowledge Distillation for Wearable Sensor Data

    Authors: Eun Som Jeon, Hongjun Choi, Ankita Shukla, Yuan Wang, Hyunglae Lee, Matthew P. Buman, Pavan Turaga

    Abstract: Deep learning methods have achieved a lot of success in various applications involving converting wearable sensor data to actionable health insights. A common application area is activity recognition, where deep-learning methods still suffer from limitations such as sensitivity to signal quality, sensor characteristic variations, and variability between subjects. To mitigate these issues, robust…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Engineering Applications of Artificial Intelligence 130, 107719

    Journal ref: Engineering Applications of Artificial Intelligence, 130, 107719 (2024)

  14. arXiv:2403.14140  [pdf, other]

    cs.CV cs.LG

    Learning Decomposable and Debiased Representations via Attribute-Centric Information Bottlenecks

    Authors: Jinyung Hong, Eun Som Jeon, Changhoon Kim, Keun Hee Park, Utkarsh Nath, Yezhou Yang, Pavan Turaga, Theodore P. Pavlic

    Abstract: Biased attributes, spuriously correlated with target labels in a dataset, can problematically lead to neural networks that learn improper shortcuts for classifications and limit their capabilities for out-of-distribution (OOD) generalization. Although many debiasing approaches have been proposed to ensure correct predictions from biased datasets, few studies have considered learning latent embeddi…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 24 pages, 16 figures, 3 tables

  15. arXiv:2305.13284  [pdf, other]

    cs.CV cs.AI

    Target-Aware Generative Augmentations for Single-Shot Adaptation

    Authors: Kowshik Thopalli, Rakshith Subramanyam, Pavan Turaga, Jayaraman J. Thiagarajan

    Abstract: In this paper, we address the problem of adapting models from a source domain to a target domain, a task that has become increasingly important due to the brittle generalization of deep neural networks. While several test-time adaptation techniques have emerged, they typically rely on synthetic toolbox data augmentations in cases of limited target data availability. We consider the challenging set…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted at International Conference on Machine Learning (ICML) 2023

  16. arXiv:2305.10513  [pdf, other]

    cs.CV stat.ML

    Learning Pose Image Manifolds Using Geometry-Preserving GANs and Elasticae

    Authors: Shenyuan Liang, Pavan Turaga, Anuj Srivastava

    Abstract: This paper investigates the challenge of learning image manifolds, specifically pose manifolds, of 3D objects using limited training data. It proposes a DNN approach to manifold learning and for predicting images of objects for novel, continuous 3D rotations. The approach uses two distinct concepts: (1) Geometric Style-GAN (Geom-SGAN), which maps images to low-dimensional latent representations an…

    Submitted 17 May, 2023; originally announced May 2023.

  17. arXiv:2303.11424  [pdf, other]

    cs.CV

    Polynomial Implicit Neural Representations For Large Diverse Datasets

    Authors: Rajhans Singh, Ankita Shukla, Pavan Turaga

    Abstract: Implicit neural representations (INR) have gained significant popularity for signal and image representation for many end-tasks, such as superresolution, 3D modeling, and more. Most INR architectures rely on sinusoidal positional encoding, which accounts for high-frequency information in data. However, the finite encoding size restricts the model's representational power. Higher representational p…

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR 2023

  18. Leveraging Angular Distributions for Improved Knowledge Distillation

    Authors: Eun Som Jeon, Hongjun Choi, Ankita Shukla, Pavan Turaga

    Abstract: Knowledge distillation as a broad class of methods has led to the development of lightweight and memory efficient models, using a pre-trained model with a large capacity (teacher network) to train a smaller model (student network). Recently, additional variations for knowledge distillation, utilizing activation maps of intermediate layers as the source of knowledge, have been studied. Generally, i…

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: Neurocomputing, Volume 518, 21 January 2023, Pages 466-481

    Journal ref: Neurocomputing, Volume 518, 2023, Pages 466-481

  19. arXiv:2211.03946  [pdf, other]

    cs.CV

    Understanding the Role of Mixup in Knowledge Distillation: An Empirical Study

    Authors: Hongjun Choi, Eun Som Jeon, Ankita Shukla, Pavan Turaga

    Abstract: Mixup is a popular data augmentation technique based on creating new samples by linear interpolation between two given data samples, to improve both the generalization and robustness of the trained model. Knowledge distillation (KD), on the other hand, is widely used for model compression and transfer learning, which involves using a larger network's implicit knowledge to guide the learning of a s…

    Submitted 8 November, 2022; v1 submitted 7 November, 2022; originally announced November 2022.

    Comments: To be presented at WACV 2023

  20. arXiv:2210.16692  [pdf, other]

    cs.CV cs.LG stat.ML

    Single-Shot Domain Adaptation via Target-Aware Generative Augmentation

    Authors: Rakshith Subramanyam, Kowshik Thopalli, Spring Berman, Pavan Turaga, Jayaraman J. Thiagarajan

    Abstract: The problem of adapting models from a source domain using data from any target domain of interest has gained prominence, thanks to the brittle generalization in deep neural networks. While several test-time adaptation techniques have emerged, they typically rely on synthetic data augmentations in cases of limited target data availability. In this paper, we consider the challenging setting of singl…

    Submitted 29 October, 2022; originally announced October 2022.

  21. arXiv:2207.04185  [pdf, other]

    cs.CV cs.LG

    Domain Alignment Meets Fully Test-Time Adaptation

    Authors: Kowshik Thopalli, Pavan Turaga, Jayaraman J. Thiagarajan

    Abstract: A foundational requirement of a deployed ML model is to generalize to data drawn from a testing distribution that is different from training. A popular solution to this problem is to adapt a pre-trained model to novel domains using only unlabeled data. In this paper, we focus on a challenging variant of this problem, where access to the original source data is restricted. While fully test-time ada…

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: 16 Pages including references, 5 figures

  22. arXiv:2205.11722  [pdf, other]

    cs.CV

    Improving Shape Awareness and Interpretability in Deep Networks Using Geometric Moments

    Authors: Rajhans Singh, Ankita Shukla, Pavan Turaga

    Abstract: Deep networks for image classification often rely more on texture information than object shape. While efforts have been made to make deep-models shape-aware, it is often difficult to make such models simple, interpretable, or rooted in known mathematical definitions of shape. This paper presents a deep-learning model inspired by geometric moments, a classically well understood approach to measure…

    Submitted 22 May, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: Accepted at CVPR 2023 Workshop: Deep Learning for Geometric Computing

  23. arXiv:2201.01806  [pdf, other]

    cs.LG cs.CV

    Revisiting Deep Subspace Alignment for Unsupervised Domain Adaptation

    Authors: Kowshik Thopalli, Jayaraman J Thiagarajan, Rushil Anirudh, Pavan K Turaga

    Abstract: Unsupervised domain adaptation (UDA) aims to transfer and adapt knowledge from a labeled source domain to an unlabeled target domain. Traditionally, subspace-based methods form an important class of solutions to this problem. Despite their mathematical elegance and tractability, these methods are often found to be ineffective at producing domain-invariant features with complex, real-world datasets…

    Submitted 5 January, 2022; originally announced January 2022.

    Comments: arXiv admin note: text overlap with arXiv:1906.04338

  24. arXiv:2201.00111  [pdf, other]

    cs.LG cs.HC eess.SP

    Role of Data Augmentation Strategies in Knowledge Distillation for Wearable Sensor Data

    Authors: Eun Som Jeon, Anirudh Som, Ankita Shukla, Kristina Hasanaj, Matthew P. Buman, Pavan Turaga

    Abstract: Deep neural networks are parametrized by several thousands or millions of parameters, and have shown tremendous success in many classification problems. However, the large number of parameters makes it difficult to integrate these models into edge devices such as smartphones and wearable devices. To address this problem, knowledge distillation (KD) has been widely employed, that uses a pre-trained…

    Submitted 31 December, 2021; originally announced January 2022.

  25. arXiv:2112.09802  [pdf, other]

    cs.LG cs.CV

    Automated Domain Discovery from Multiple Sources to Improve Zero-Shot Generalization

    Authors: Kowshik Thopalli, Sameeksha Katoch, Pavan Turaga, Jayaraman J. Thiagarajan

    Abstract: Domain generalization (DG) methods aim to develop models that generalize to settings where the test distribution is different from the training data. In this paper, we focus on the challenging problem of multi-source zero shot DG (MDG), where labeled training data from multiple source domains is available but with no access to data from the target domain. A wide range of solutions have been propos…

    Submitted 3 November, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

  26. arXiv:2111.14053  [pdf, other]

    q-bio.BM cs.AI cs.LG physics.bio-ph

    Towards Conditional Generation of Minimal Action Potential Pathways for Molecular Dynamics

    Authors: John Kevin Cava, John Vant, Nicholas Ho, Ankita Shukla, Pavan Turaga, Ross Maciejewski, Abhishek Singharoy

    Abstract: In this paper, we utilize generative models and reformulate them for problems in molecular dynamics (MD) simulation, by introducing an MD potential energy component to our generative model. By incorporating potential energy as calculated from TorchMD into a conditional generative framework, we attempt to construct a low-potential energy route of transformation between the helix $\rightarrow$ coil…

    Submitted 5 January, 2022; v1 submitted 28 November, 2021; originally announced November 2021.

    Comments: Accepted to ELLIS ML4Molecules Workshop 2021

  27. arXiv:2111.12798  [pdf, other]

    cs.LG cs.CV

    Geometric Priors for Scientific Generative Models in Inertial Confinement Fusion

    Authors: Ankita Shukla, Rushil Anirudh, Eugene Kur, Jayaraman J. Thiagarajan, Peer-Timo Bremer, Brian K. Spears, Tammy Ma, Pavan Turaga

    Abstract: In this paper, we develop a Wasserstein autoencoder (WAE) with a hyperspherical prior for multimodal data in the application of inertial confinement fusion. Unlike a typical hyperspherical generative model that requires computationally inefficient sampling from distributions like the von Mises-Fisher, we sample from a normal distribution followed by a projection layer before the generator. Finally,…

    Submitted 24 November, 2021; originally announced November 2021.

    Comments: 5 pages, 4 figures, Fourth Workshop on Machine Learning and the Physical Sciences, NeurIPS 2021

  28. arXiv:2102.08360  [pdf, other]

    cs.LG cs.CV

    Interpretable COVID-19 Chest X-Ray Classification via Orthogonality Constraint

    Authors: Ella Y. Wang, Anirudh Som, Ankita Shukla, Hongjun Choi, Pavan Turaga

    Abstract: Deep neural networks have increasingly been used as an auxiliary tool in healthcare applications, due to their ability to improve performance of several diagnosis tasks. However, these methods are not widely adopted in clinical settings due to the practical limitations in the reliability, generalizability, and interpretability of deep learning based systems. As a result, methods have been develope…

    Submitted 21 December, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

    Comments: Accepted in the 2021 ACM CHIL Workshop track. An extended version of this work is under consideration at Pattern Recognition Letters

  29. arXiv:2012.02043  [pdf, other]

    cs.CV cs.LG

    Recovering Trajectories of Unmarked Joints in 3D Human Actions Using Latent Space Optimization

    Authors: Suhas Lohit, Rushil Anirudh, Pavan Turaga

    Abstract: Motion capture (mocap) and time-of-flight based sensing of human actions are becoming increasingly popular modalities to perform robust activity analysis. Applications range from action recognition to quantifying movement quality for health applications. While marker-less motion capture has made great progress, in critical applications such as healthcare, marker-based systems, especially active ma…

    Submitted 3 December, 2020; originally announced December 2020.

    Comments: Accepted at WACV 2021

  30. arXiv:2009.10762  [pdf, other]

    cs.CV cs.LG

    Role of Orthogonality Constraints in Improving Properties of Deep Networks for Image Classification

    Authors: Hongjun Choi, Anirudh Som, Pavan Turaga

    Abstract: Standard deep learning models that employ the categorical cross-entropy loss are known to perform well at image classification tasks. However, many standard models thus obtained often exhibit issues like feature redundancy, low interpretability, and poor calibration. A body of recent work has emerged that has tried addressing some of these challenges by proposing the use of new regularization func…

    Submitted 22 September, 2020; originally announced September 2020.

    Comments: 8 figures, 4 tables, 1 pseudo-code

  31. arXiv:2006.10873  [pdf, other]

    cs.CV cs.LG

    Generative Patch Priors for Practical Compressive Image Recovery

    Authors: Rushil Anirudh, Suhas Lohit, Pavan Turaga

    Abstract: In this paper, we propose the generative patch prior (GPP) that defines a generative prior for compressive image recovery, based on patch-manifold models. Unlike learned, image-level priors that are restricted to the range space of a pre-trained generator, GPP can recover a wide variety of natural images using a pre-trained patch generator. Additionally, GPP retains the benefits of generative prio…

    Submitted 5 October, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

  32. arXiv:2005.02936  [pdf, other]

    cs.CV cs.CR cs.LG

    GraCIAS: Grassmannian of Corrupted Images for Adversarial Security

    Authors: Ankita Shukla, Pavan Turaga, Saket Anand

    Abstract: Input transformation based defense strategies fall short in defending against strong adversarial attacks. Some successful defenses adopt approaches that either increase the randomness within the applied transformations, or make the defense computationally intensive, making it substantially more challenging for the attacker. However, it limits the applicability of such defenses as a pre-processing…

    Submitted 7 May, 2020; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: 16 pages

  33. arXiv:2005.02589  [pdf, ps, other]

    cs.LG cs.CV stat.ML

    Unsupervised Pre-trained Models from Healthy ADLs Improve Parkinson's Disease Classification of Gait Patterns

    Authors: Anirudh Som, Narayanan Krishnamurthi, Matthew Buman, Pavan Turaga

    Abstract: Application and use of deep learning algorithms for different healthcare applications is gaining interest at a steady pace. However, use of such algorithms can prove to be challenging as they require large amounts of training data that capture different possible variations. This makes it difficult to use them in a clinical setting since in most health applications researchers often have to work wi…

    Submitted 6 May, 2020; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: Accepted in the 42nd Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2020)

  34. arXiv:2004.09805  [pdf, other]

    cs.CV

    AMC-Loss: Angular Margin Contrastive Loss for Improved Explainability in Image Classification

    Authors: Hongjun Choi, Anirudh Som, Pavan Turaga

    Abstract: Deep-learning architectures for classification problems involve the cross-entropy loss sometimes assisted with auxiliary loss functions like center loss, contrastive loss and triplet loss. These auxiliary loss functions facilitate better discrimination between the different classes of interest. However, recent studies hint at the fact that these loss functions do not take into account the intrinsi…

    Submitted 21 April, 2020; originally announced April 2020.

  35. arXiv:2004.08614  [pdf, other]

    cs.CV cs.LG

    Halluci-Net: Scene Completion by Exploiting Object Co-occurrence Relationships

    Authors: Kuldeep Kulkarni, Tejas Gokhale, Rajhans Singh, Pavan Turaga, Aswin Sankaranarayanan

    Abstract: Recently, there has been substantial progress in image synthesis from semantic labelmaps. However, methods used for this task assume the availability of complete and unambiguous labelmaps, with instance boundaries of objects, and class labels for each pixel. This reliance on heavily annotated inputs restricts the application of image synthesis techniques to real-world applications, especially unde…

    Submitted 20 May, 2021; v1 submitted 18 April, 2020; originally announced April 2020.

    Comments: Accepted to AI for Content Creation Workshop @CVPR 2021

  36. arXiv:2004.07384  [pdf, other]

    cs.LG eess.IV q-bio.QM stat.ML

    Topological Descriptors for Parkinson's Disease Classification and Regression Analysis

    Authors: Afra Nawar, Farhan Rahman, Narayanan Krishnamurthi, Anirudh Som, Pavan Turaga

    Abstract: At present, the vast majority of human subjects with neurological disease are still diagnosed through in-person assessments and qualitative analysis of patient data. In this paper, we propose to use Topological Data Analysis (TDA) together with machine learning tools to automate the process of Parkinson's disease classification and severity assessment. An automated, stable, and accurate method to…

    Submitted 6 May, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

    Comments: Accepted in the 42nd Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2020)

  37. arXiv:1911.10600  [pdf, other]

    cs.CV

    Invenio: Discovering Hidden Relationships Between Tasks/Domains Using Structured Meta Learning

    Authors: Sameeksha Katoch, Kowshik Thopalli, Jayaraman J. Thiagarajan, Pavan Turaga, Andreas Spanias

    Abstract: Exploiting known semantic relationships between fine-grained tasks is critical to the success of recent model agnostic approaches. These approaches often rely on meta-optimization to make a model robust to systematic task or domain shifts. However, in practice, the performance of these methods can suffer, when there are no coherent semantic relationships between the tasks (or domains). We present…

    Submitted 1 February, 2020; v1 submitted 24 November, 2019; originally announced November 2019.

    Comments: Semantic structure development for tasks/domains essential for efficient knowledge transfer

  38. arXiv:1907.09554  [pdf, other]

    cs.CV cs.LG

    Product of Orthogonal Spheres Parameterization for Disentangled Representation Learning

    Authors: Ankita Shukla, Sarthak Bhagat, Shagun Uppal, Saket Anand, Pavan Turaga

    Abstract: Learning representations that can disentangle explanatory attributes underlying the data improves interpretability as well as provides control on data generation. Various learning frameworks such as VAEs, GANs and auto-encoders have been used in the literature to learn such representations. Most often, the latent space is constrained to a partitioned representation or structured by a prior to impos…

    Submitted 22 July, 2019; originally announced July 2019.

    Comments: Accepted at British Machine Vision Conference (BMVC) 2019

  39. arXiv:1906.05947  [pdf, other]

    cs.CV

    Temporal Transformer Networks: Joint Learning of Invariant and Discriminative Time Warping

    Authors: Suhas Lohit, Qiao Wang, Pavan Turaga

    Abstract: Many time-series classification problems involve developing metrics that are invariant to temporal misalignment. In human activity analysis, temporal misalignment arises due to various reasons including differing initial phase, sensor sampling rates, and elastic time-warps due to subject-specific biomechanics. Past work in this area has only looked at reducing intra-class variability by elastic te…

    Submitted 13 June, 2019; originally announced June 2019.

    Comments: Published in CVPR 2019. Code available at https://github.com/suhaslohit/TTN

  40. arXiv:1906.04338  [pdf, other]

    stat.ML cs.CV cs.LG

    SALT: Subspace Alignment as an Auxiliary Learning Task for Domain Adaptation

    Authors: Kowshik Thopalli, Jayaraman J. Thiagarajan, Rushil Anirudh, Pavan Turaga

    Abstract: Unsupervised domain adaptation aims to transfer and adapt knowledge learned from a labeled source domain to an unlabeled target domain. Key components of unsupervised domain adaptation include: (a) maximizing performance on the target, and (b) aligning the source and target domains. Traditionally, these tasks have either been considered as separate, or assumed to be implicitly addressed together w…

    Submitted 18 December, 2019; v1 submitted 10 June, 2019; originally announced June 2019.

  41. arXiv:1906.01769  [pdf, other]

    cs.CV cs.LG math.AT

    PI-Net: A Deep Learning Approach to Extract Topological Persistence Images

    Authors: Anirudh Som, Hongjun Choi, Karthikeyan Natesan Ramamurthy, Matthew Buman, Pavan Turaga

    Abstract: Topological features such as persistence diagrams and their functional approximations like persistence images (PIs) have been showing substantial promise for machine learning and computer vision applications. This is greatly attributed to the robustness topological representations provide against different types of physical nuisance variables seen in real-world data, such as view-point, illuminati…

    Submitted 23 May, 2020; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: 10 pages, 8 figures, 4 tables

  42. arXiv:1905.07061  [pdf, other]

    cs.CV cs.LG

    Non-Parametric Priors For Generative Adversarial Networks

    Authors: Rajhans Singh, Pavan Turaga, Suren Jayasuriya, Ravi Garg, Martin W. Braun

    Abstract: The advent of generative adversarial networks (GAN) has enabled new capabilities in synthesis, interpolation, and data augmentation heretofore considered very challenging. However, one of the common assumptions in most GAN architectures is the assumption of simple parametric latent-space distributions. While easy to implement, a simple latent-space distribution can be problematic for uses such as…

    Submitted 16 May, 2019; originally announced May 2019.

    Journal ref: International Conference on Machine Learning (2019)

  43. arXiv:1902.06964  [pdf, other]

    cs.CV

    Geometry of Deep Generative Models for Disentangled Representations

    Authors: Ankita Shukla, Shagun Uppal, Sarthak Bhagat, Saket Anand, Pavan Turaga

    Abstract: Deep generative models like variational autoencoders approximate the intrinsic geometry of high dimensional data manifolds by learning low-dimensional latent-space variables and an embedding function. The geometric properties of these latent spaces have been studied under the lens of Riemannian geometry, via analysis of the non-linearity of the generator function. In new developments, deep generati…

    Submitted 19 February, 2019; originally announced February 2019.

    Comments: Accepted at ICVGIP, 2018

  44. arXiv:1812.08882  [pdf, other]

    cs.CV

    An Optical Flow-Based Approach for Minimally-Divergent Velocimetry Data Interpolation

    Authors: Berkay Kanberoglu, Dhritiman Das, Priya Nair, Pavan Turaga, David Frakes

    Abstract: Three-dimensional (3D) biomedical image sets are often acquired with in-plane pixel spacings that are far less than the out-of-plane spacings between images. The resultant anisotropy, which can be detrimental in many applications, can be decreased using image interpolation. Optical flow and/or other registration-based interpolators have proven useful in such interpolation roles in the past. When a…

    Submitted 20 December, 2018; originally announced December 2018.

    Comments: 24 pages, 10 figures, International Journal of Biomedical Imaging, accepted manuscript

  45. arXiv:1811.04491  [pdf, other]

    cs.CV

    Multiple Subspace Alignment Improves Domain Adaptation

    Authors: Kowshik Thopalli, Rushil Anirudh, Jayaraman J. Thiagarajan, Pavan Turaga

    Abstract: We present a novel unsupervised domain adaptation (DA) method for cross-domain visual recognition. Though subspace methods have found success in DA, their performance is often limited due to the assumption of approximating an entire dataset using a single low-dimensional subspace. Instead, we develop a method to effectively represent the source and target datasets via a collection of low-dimension…

    Submitted 11 November, 2018; originally announced November 2018.

    Comments: under review in ICASSP 2019

  46. arXiv:1809.02850  [pdf, other]

    cs.CV

    Rate-Adaptive Neural Networks for Spatial Multiplexers

    Authors: Suhas Lohit, Rajhans Singh, Kuldeep Kulkarni, Pavan Turaga

    Abstract: In resource-constrained environments, one can employ spatial multiplexing cameras to acquire a small number of measurements of a scene, and perform effective reconstruction or high-level inference using purely data-driven neural networks. However, once trained, the measurement matrix and the network are valid only for a single measurement rate (MR) chosen at training time. To overcome this drawbac…

    Submitted 8 September, 2018; originally announced September 2018.

  47. arXiv:1807.10400  [pdf, other]

    cs.CV

    Perturbation Robust Representations of Topological Persistence Diagrams

    Authors: Anirudh Som, Kowshik Thopalli, Karthikeyan Natesan Ramamurthy, Vinay Venkataraman, Ankita Shukla, Pavan Turaga

    Abstract: Topological methods for data analysis present opportunities for enforcing certain invariances of broad interest in computer vision, including view-point in activity analysis, articulation in shape analysis, and measurement invariance in non-linear dynamical modeling. The increasing success of these methods is attributed to the complementary information that topology provides, as well as availabili…

    Submitted 26 July, 2018; originally announced July 2018.

    Comments: 19 pages, 4 figures, 6 tables

  48. arXiv:1806.03379  [pdf, other]

    cs.CV cs.AI

    CS-VQA: Visual Question Answering with Compressively Sensed Images

    Authors: Li-Chi Huang, Kuldeep Kulkarni, Anik Jha, Suhas Lohit, Suren Jayasuriya, Pavan Turaga

    Abstract: Visual Question Answering (VQA) is a complex semantic task requiring both natural language processing and visual recognition. In this paper, we explore whether VQA is solvable when images are captured in a sub-Nyquist compressive paradigm. We develop a series of deep-network architectures that exploit available compressive data to increasing degrees of accuracy, and show that VQA is indeed solvabl…

    Submitted 8 June, 2018; originally announced June 2018.

    Comments: 5 pages, 2 figures, accepted to ICIP 2018

    MSC Class: 68

  49. arXiv:1802.01722  [pdf, other]

    cs.CV

    Compressive Light Field Reconstructions using Deep Learning

    Authors: Mayank Gupta, Arjun Jauhari, Kuldeep Kulkarni, Suren Jayasuriya, Alyosha Molnar, Pavan Turaga

    Abstract: Light field imaging is limited in its computational processing demands of high sampling for both spatial and angular dimensions. Single-shot light field cameras sacrifice spatial resolution to sample angular viewpoints, typically by multiplexing incoming rays onto a 2D sensor array. While this resolution can be recovered using compressive sensing, these iterative solutions are slow in processing a…

    Submitted 5 February, 2018; originally announced February 2018.

    Comments: Published at CCD 2017 workshop held in conjunction with CVPR 2017

  50. arXiv:1708.09485  [pdf, other]

    cs.CV

    Learning Invariant Riemannian Geometric Representations Using Deep Nets

    Authors: Suhas Lohit, Pavan Turaga

    Abstract: Non-Euclidean constraints are inherent in many kinds of data in computer vision and machine learning, typically as a result of specific invariance requirements that need to be respected during high-level inference. Often, these geometric constraints can be expressed in the language of Riemannian geometry, where conventional vector space machine learning does not apply directly. The central questio…

    Submitted 22 September, 2017; v1 submitted 30 August, 2017; originally announced August 2017.

    Comments: Accepted at International Conference on Computer Vision Workshop (ICCVW), 2017 on Manifold Learning: from Euclid to Riemann