Skip to main content

Showing 1–50 of 102 results for author: Narasimhan, S

.
  1. arXiv:2505.13851  [pdf, ps, other

    cs.AI

    A Challenge to Build Neuro-Symbolic Video Agents

    Authors: Sahil Shah, Harsh Goel, Sai Shankar Narasimhan, Minkyu Choi, S P Sharan, Oguzhan Akcin, Sandeep Chinchali

    Abstract: Modern video understanding systems excel at tasks such as scene classification, object detection, and short video retrieval. However, as video analysis becomes increasingly central to real-world applications, there is a growing need for proactive video agents for the systems that not only interpret video streams but also reason about events and take informed actions. A key obstacle in this directi… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  2. arXiv:2504.16086  [pdf, other

    cs.GR cs.HC

    Digital Kitchen Remodeling: Editing and Relighting Intricate Indoor Scenes from a Single Panorama

    Authors: Guanzhou Ji, Azadeh O. Sawyer, Srinivasa G. Narasimhan

    Abstract: We present a novel virtual staging application for kitchen remodeling from a single panorama. To ensure the realism of the virtual rendered scene, we capture real-world High Dynamic Range (HDR) panoramas and recover the absolute scene radiance for high-quality scene relighting. Our application pipeline consists of three key components: (1) HDR photography for capturing paired indoor and outdoor pa… ▽ More

    Submitted 4 February, 2025; originally announced April 2025.

    Comments: Submitted to IES25 - The Lighting Conference, Anaheim, California, August 21 - 23, 2025

  3. arXiv:2504.13157  [pdf, other

    cs.CV

    AerialMegaDepth: Learning Aerial-Ground Reconstruction and View Synthesis

    Authors: Khiem Vuong, Anurag Ghosh, Deva Ramanan, Srinivasa Narasimhan, Shubham Tulsiani

    Abstract: We explore the task of geometric reconstruction of images captured from a mixture of ground and aerial views. Current state-of-the-art learning-based approaches fail to handle the extreme viewpoint variation between aerial-ground image pairs. Our hypothesis is that the lack of high-quality, co-registered aerial-ground datasets for training is a key reason for this failure. Such data is difficult t… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

    Comments: Appearing in CVPR 2025. Project page: https://aerial-megadepth.github.io

  4. arXiv:2503.21521  [pdf

    econ.GN

    Securing the supply of graphite for batteries

    Authors: Karan Bhuwalka, Hari Ramachandran, Swati Narasimhan, Adrian Yao, Julia Frohmann, Leopold Peiseler, William Chueh, Adam Boies, Steven J. Davis, Sally Benson

    Abstract: The increasing demand for graphite in batteries has led to concerns around supply chain security. Currently, over 92% of global anode material is produced in China, posing a geopolitical risk for other countries reliant on graphite supply for domestic industries. This paper assesses the costs of producing battery-grade graphite (natural and synthetic) in the US and China using process-based cost m… ▽ More

    Submitted 29 April, 2025; v1 submitted 27 March, 2025; originally announced March 2025.

  5. arXiv:2503.18711  [pdf

    cs.CV cs.LG

    Accenture-NVS1: A Novel View Synthesis Dataset

    Authors: Thomas Sugg, Kyle O'Brien, Lekh Poudel, Alex Dumouchelle, Michelle Jou, Marc Bosch, Deva Ramanan, Srinivasa Narasimhan, Shubham Tulsiani

    Abstract: This paper introduces ACC-NVS1, a specialized dataset designed for research on Novel View Synthesis specifically for airborne and ground imagery. Data for ACC-NVS1 was collected in Austin, TX and Pittsburgh, PA in 2023 and 2024. The collection encompasses six diverse real-world scenes captured from both airborne and ground cameras, resulting in a total of 148,000 images. ACC-NVS1 addresses challen… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: 6 pages, 7 figures

  6. arXiv:2503.03192  [pdf, other

    cs.RO

    Distributed Certifiably Correct Range-Aided SLAM

    Authors: Alexander Thoms, Alan Papalia, Jared Velasquez, David M. Rosen, Sriram Narasimhan

    Abstract: Reliable simultaneous localization and mapping (SLAM) algorithms are necessary for safety-critical autonomous navigation. In the communication-constrained multi-agent setting, navigation systems increasingly use point-to-point range sensors as they afford measurements with low bandwidth requirements and known data association. The state estimation problem for these systems takes the form of range-… ▽ More

    Submitted 13 May, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

    Comments: 8 pages, 3 figures, accepted to the 2025 International Conference on Robotics and Automation (ICRA). This version includes minor clerical edits to the published version in the conference proceedings

  7. arXiv:2502.06973  [pdf, other

    cs.CV

    Indoor Light and Heat Estimation from a Single Panorama

    Authors: Guanzhou Ji, Sriram Narayanan, Azadeh Sawyer, Srinivasa Narasimhan

    Abstract: This paper presents a novel application for directly estimating indoor light and heat maps from captured indoor-outdoor High Dynamic Range (HDR) panoramas. In our image-based rendering method, the indoor panorama is used to estimate the 3D room layout, while the corresponding outdoor panorama serves as an environment map to infer spatially-varying light and material properties. We establish a conn… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

  8. Supporting Contraceptive Decision-Making in the Intermediated Pharmacy Setting in Kenya

    Authors: Lisa Orii, Elizabeth K Harrington, Serah Gitome, Nelson Kiprotich Cheruiyot, Elizabeth Anne Bukusi, Sandy Cheng, Ariel Fu, Khushi Khandelwal, Shrimayee Narasimhan, Richard Anderson

    Abstract: Adolescent girls and young women (AGYW) in sub-Saharan Africa face unique barriers to contraceptive access and lack AGYW-centered contraceptive decision-support resources. To empower AGYW to make informed choices and improve reproductive health outcomes, we developed a tablet-based application to provide contraceptive education and decision-making support in the pharmacy setting - a key source of… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  9. arXiv:2501.14486  [pdf, other

    cs.RO

    Visual-Lidar Map Alignment for Infrastructure Inspections

    Authors: Jake McLaughlin, Nicholas Charron, Sriram Narasimhan

    Abstract: Routine and repetitive infrastructure inspections present safety, efficiency, and consistency challenges as they are performed manually, often in challenging or hazardous environments. They can also introduce subjectivity and errors into the process, resulting in undesirable outcomes. Simultaneous localization and mapping (SLAM) presents an opportunity to generate high-quality 3D maps that can be… ▽ More

    Submitted 27 January, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 8 pages, 8 figures, for associated code see https://github.com/jakemclaughlin6/vlma

    MSC Class: 68T40 (primary); 62P30 (secondary) ACM Class: J.2; I.4

  10. arXiv:2501.01424  [pdf, other

    cs.CV cs.AI cs.GR

    Object-level Visual Prompts for Compositional Image Generation

    Authors: Gaurav Parmar, Or Patashnik, Kuan-Chieh Wang, Daniil Ostashev, Srinivasa Narasimhan, Jun-Yan Zhu, Daniel Cohen-Or, Kfir Aberman

    Abstract: We introduce a method for composing object-level visual prompts within a text-to-image diffusion model. Our approach addresses the task of generating semantically coherent compositions across diverse scenes and styles, similar to the versatility and expressiveness offered by text prompts. A key challenge in this task is to preserve the identity of the objects depicted in the input visual prompts,… ▽ More

    Submitted 2 January, 2025; originally announced January 2025.

    Comments: Project: https://snap-research.github.io/visual-composer/

  11. arXiv:2411.16776  [pdf, other

    cs.CV

    SynDiff-AD: Improving Semantic Segmentation and End-to-End Autonomous Driving with Synthetic Data from Latent Diffusion Models

    Authors: Harsh Goel, Sai Shankar Narasimhan, Oguzhan Akcin, Sandeep Chinchali

    Abstract: In recent years, significant progress has been made in collecting large-scale datasets to improve segmentation and autonomous driving models. These large-scale datasets are often dominated by common environmental conditions such as "Clear and Day" weather, leading to decreased performance in under-represented conditions like "Rainy and Night". To address this issue, we introduce SynDiff-AD, a nove… ▽ More

    Submitted 25 November, 2024; originally announced November 2024.

    Comments: 15 pages, 10 figures

  12. arXiv:2410.12672  [pdf, other

    cs.LG cs.AI

    Context Matters: Leveraging Contextual Features for Time Series Forecasting

    Authors: Sameep Chattopadhyay, Pulkit Paliwal, Sai Shankar Narasimhan, Shubhankar Agarwal, Sandeep P. Chinchali

    Abstract: Time series forecasts are often influenced by exogenous contextual features in addition to their corresponding history. For example, in financial settings, it is hard to accurately predict a stock price without considering public sentiments and policy decisions in the form of news articles, tweets, etc. Though this is common knowledge, the current state-of-the-art (SOTA) forecasting models fail to… ▽ More

    Submitted 13 January, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

  13. arXiv:2410.12652  [pdf, other

    cs.LG cs.AI eess.SP

    Constrained Posterior Sampling: Time Series Generation with Hard Constraints

    Authors: Sai Shankar Narasimhan, Shubhankar Agarwal, Litu Rout, Sanjay Shakkottai, Sandeep P. Chinchali

    Abstract: Generating realistic time series samples is crucial for stress-testing models and protecting user privacy by using synthetic data. In engineering and safety-critical applications, these samples must meet certain hard constraints that are domain-specific or naturally imposed by physics or nature. Consider, for example, generating electricity demand patterns with constraints on peak demand times. Th… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  14. arXiv:2409.13675  [pdf

    cs.RO

    OLiVia-Nav: An Online Lifelong Vision Language Approach for Mobile Robot Social Navigation

    Authors: Siddarth Narasimhan, Aaron Hao Tan, Daniel Choi, Goldie Nejat

    Abstract: Service robots in human-centered environments such as hospitals, office buildings, and long-term care homes need to navigate while adhering to social norms to ensure the safety and comfortability of the people they are sharing the space with. Furthermore, they need to adapt to new social scenarios that can arise during robot navigation. In this paper, we present a novel Online Lifelong Vision Lang… ▽ More

    Submitted 8 March, 2025; v1 submitted 20 September, 2024; originally announced September 2024.

  15. arXiv:2409.03061  [pdf, other

    cs.CV cs.GR cs.RO

    Incorporating dense metric depth into neural 3D representations for view synthesis and relighting

    Authors: Arkadeep Narayan Chaudhury, Igor Vasiljevic, Sergey Zakharov, Vitor Guizilini, Rares Ambrus, Srinivasa Narasimhan, Christopher G. Atkeson

    Abstract: Synthesizing accurate geometry and photo-realistic appearance of small scenes is an active area of research with compelling use cases in gaming, virtual reality, robotic-manipulation, autonomous driving, convenient product capture, and consumer-level photography. When applying scene geometry and appearance estimation techniques to robotics, we found that the narrow cone of possible viewpoints due… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: Project webpage: https://stereomfc.github.io

  16. arXiv:2406.07661  [pdf, other

    cs.CV cs.RO

    ROADWork Dataset: Learning to Recognize, Observe, Analyze and Drive Through Work Zones

    Authors: Anurag Ghosh, Robert Tamburo, Shen Zheng, Juan R. Alvarez-Padilla, Hailiang Zhu, Michael Cardei, Nicholas Dunn, Christoph Mertz, Srinivasa G. Narasimhan

    Abstract: Perceiving and navigating through work zones is challenging and under-explored, even with major strides in self-driving research. An important reason is the lack of open datasets for developing new algorithms to address this long-tailed scenario. We propose the ROADWork dataset to learn how to recognize, observe and analyze and drive through work zones. We find that state-of-the-art foundation mod… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  17. arXiv:2404.16944  [pdf, other

    cs.CV

    Constellation Dataset: Benchmarking High-Altitude Object Detection for an Urban Intersection

    Authors: Mehmet Kerem Turkcan, Sanjeev Narasimhan, Chengbo Zang, Gyung Hyun Je, Bo Yu, Mahshid Ghasemi, Javad Ghaderi, Gil Zussman, Zoran Kostic

    Abstract: We introduce Constellation, a dataset of 13K images suitable for research on detection of objects in dense urban streetscapes observed from high-elevation cameras, collected for a variety of temporal conditions. The dataset addresses the need for curated data to explore problems in small object detection exemplified by the limited pixel footprint of pedestrians observed tens of meters from above.… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  18. arXiv:2404.03556  [pdf, other

    cs.RO

    Robot Safety Monitoring using Programmable Light Curtains

    Authors: Karnik Ram, Shobhit Aggarwal, Robert Tamburo, Siddharth Ancha, Srinivasa Narasimhan

    Abstract: As factories continue to evolve into collaborative spaces with multiple robots working together with human supervisors in the loop, ensuring safety for all actors involved becomes critical. Currently, laser-based light curtain sensors are widely used in factories for safety monitoring. While these conventional safety sensors meet high accuracy standards, they are difficult to reconfigure and can o… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Under review for IROS '24. Webpage http://cmu-mfi.github.io/plc-safety

  19. arXiv:2403.19022  [pdf, other

    cs.CV

    WALT3D: Generating Realistic Training Data from Time-Lapse Imagery for Reconstructing Dynamic Objects under Occlusion

    Authors: Khiem Vuong, N. Dinesh Reddy, Robert Tamburo, Srinivasa G. Narasimhan

    Abstract: Current methods for 2D and 3D object understanding struggle with severe occlusions in busy urban environments, partly due to the lack of large-scale labeled ground-truth annotations for learning occlusion. In this work, we introduce a novel framework for automatically generating a large, realistic dataset of dynamic objects under occlusions using freely available time-lapse imagery. By leveraging… ▽ More

    Submitted 1 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: To appear in CVPR 2024. Homepage: https://www.cs.cmu.edu/~walt3d

  20. arXiv:2403.12712  [pdf, other

    cs.CV cs.LG

    Instance-Warp: Saliency Guided Image Warping for Unsupervised Domain Adaptation

    Authors: Shen Zheng, Anurag Ghosh, Srinivasa G. Narasimhan

    Abstract: Driving is challenging in conditions like night, rain, and snow. Lack of good labeled datasets has hampered progress in scene understanding under such conditions. Unsupervised Domain Adaptation (UDA) using large labeled clear-day datasets is a promising research direction in such cases. However, many UDA methods are trained with dominant scene backgrounds (e.g., roads, sky, sidewalks) that appear… ▽ More

    Submitted 4 December, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: WACV 2025 Accepted Paper

  21. arXiv:2403.12036  [pdf, other

    cs.CV cs.GR cs.LG

    One-Step Image Translation with Text-to-Image Models

    Authors: Gaurav Parmar, Taesung Park, Srinivasa Narasimhan, Jun-Yan Zhu

    Abstract: In this work, we address two limitations of existing conditional diffusion models: their slow inference speed due to the iterative denoising process and their reliance on paired data for model fine-tuning. To tackle these issues, we introduce a general method for adapting a single-step diffusion model to new tasks and domains through adversarial learning objectives. Specifically, we consolidate va… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Github: https://github.com/GaParmar/img2img-turbo

  22. Gaze-based Human-Robot Interaction System for Infrastructure Inspections

    Authors: Sunwoong Choi, Zaid Abbas Al-Sabbag, Sriram Narasimhan, Chul Min Yeum

    Abstract: Routine inspections for critical infrastructures such as bridges are required in most jurisdictions worldwide. Such routine inspections are largely visual in nature, which are qualitative, subjective, and not repeatable. Although robotic infrastructure inspections address such limitations, they cannot replace the superior ability of experts to make decisions in complex situations, thus making huma… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 7 pages, 8 figures, 1 supplementary video; Accepted to the 2024 IEEE International Conference on Robotics and Automation (ICRA)

  23. arXiv:2403.02682  [pdf, other

    cs.LG eess.SP

    Time Weaver: A Conditional Time Series Generation Model

    Authors: Sai Shankar Narasimhan, Shubhankar Agarwal, Oguzhan Akcin, Sujay Sanghavi, Sandeep Chinchali

    Abstract: Imagine generating a city's electricity demand pattern based on weather, the presence of an electric vehicle, and location, which could be used for capacity planning during a winter freeze. Such real-world time series are often enriched with paired heterogeneous contextual metadata (weather, location, etc.). Current approaches to time series generation often ignore this paired metadata, and its he… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  24. arXiv:2402.17904  [pdf

    cs.RO

    4CNet: A Diffusion Approach to Map Prediction for Decentralized Multi-Robot Exploration

    Authors: Aaron Hao Tan, Siddarth Narasimhan, Goldie Nejat

    Abstract: Mobile robots in unknown cluttered environments with irregularly shaped obstacles often face energy and communication challenges which directly affect their ability to explore these environments. In this paper, we introduce a novel deep learning architecture, Confidence-Aware Contrastive Conditional Consistency Model (4CNet), for robot map prediction during decentralized, resource-limited multi-ro… ▽ More

    Submitted 8 April, 2025; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 17 pages, 13 figures

  25. Virtual Home Staging: Inverse Rendering and Editing an Indoor Panorama under Natural Illumination

    Authors: Guanzhou Ji, Azadeh O. Sawyer, Srinivasa G. Narasimhan

    Abstract: We propose a novel inverse rendering method that enables the transformation of existing indoor panoramas with new indoor furniture layouts under natural illumination. To achieve this, we captured indoor HDR panoramas along with real-time outdoor hemispherical HDR photographs. Indoor and outdoor HDR images were linearly calibrated with measured absolute luminance values for accurate scene relightin… ▽ More

    Submitted 28 January, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Journal ref: International Symposium on Visual Computing 2023

  26. arXiv:2311.04243  [pdf, other

    cs.CV

    Toward Planet-Wide Traffic Camera Calibration

    Authors: Khiem Vuong, Robert Tamburo, Srinivasa G. Narasimhan

    Abstract: Despite the widespread deployment of outdoor cameras, their potential for automated analysis remains largely untapped due, in part, to calibration challenges. The absence of precise camera calibration data, including intrinsic and extrinsic parameters, hinders accurate real-world distance measurements from captured videos. To address this, we present a scalable framework that utilizes street-level… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: To appear in WACV 2024. Project webpage: https://www.khiemvuong.com/OpenTrafficCam3D

  27. arXiv:2311.00660  [pdf, other

    cs.CV

    TPSeNCE: Towards Artifact-Free Realistic Rain Generation for Deraining and Object Detection in Rain

    Authors: Shen Zheng, Changjie Lu, Srinivasa G. Narasimhan

    Abstract: Rain generation algorithms have the potential to improve the generalization of deraining methods and scene understanding in rainy conditions. However, in practice, they produce artifacts and distortions and struggle to control the amount of rain generated due to a lack of proper constraints. In this paper, we propose an unpaired image-to-image translation framework for generating realistic rainy i… ▽ More

    Submitted 7 November, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: WACV 2024

  28. arXiv:2310.10980  [pdf, other

    eess.SY

    Analysis of potential flow networks: Variations in transport time with $discrete$, $continuous$, and $selfish$ operation

    Authors: Varghese Kurian, Sridharakumar Narasimhan

    Abstract: In potential flow networks, the equilibrium flow rates are usually not proportional to the demands and flow control elements are required to regulate the flow. The control elements can broadly be classified into two types - discrete and continuous. Discrete control elements can have only two operational states: fully open or fully closed. On the other hand, continuous control elements may be opera… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    MSC Class: 76B75; 90B10; 91A43

  29. arXiv:2303.17485  [pdf, other

    cs.SI cs.AI

    Edge Ranking of Graphs in Transportation Networks using a Graph Neural Network (GNN)

    Authors: Debasish Jana, Sven Malama, Sriram Narasimhan, Ertugrul Taciroglu

    Abstract: Many networks, such as transportation, power, and water distribution, can be represented as graphs. Crucial challenge in graph representations is identifying the importance of graph edges and their influence on overall network efficiency and information flow performance. For example, important edges in a transportation network are those roads that, when affected, will significantly alter the netwo… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

  30. arXiv:2303.14311  [pdf, other

    cs.CV

    Learned Two-Plane Perspective Prior based Image Resampling for Efficient Object Detection

    Authors: Anurag Ghosh, N. Dinesh Reddy, Christoph Mertz, Srinivasa G. Narasimhan

    Abstract: Real-time efficient perception is critical for autonomous navigation and city scale sensing. Orthogonal to architectural improvements, streaming perception approaches have exploited adaptive sampling improving real-time detection performance. In this work, we propose a learnable geometry-guided prior that incorporates rough geometry of the 3D scene (a ground plane and a plane above) to resample im… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: CVPR 2023 Accepted Paper, 21 pages, 16 Figures

  31. arXiv:2303.10729  [pdf, other

    cs.RO

    A Target-Based Extrinsic Calibration Framework for Non-Overlapping Camera-Lidar Systems Using a Motion Capture System

    Authors: Nicholas Charron, Huaiyuan Weng, Steven L. Waslander, Sriram Narasimhan

    Abstract: We present a novel target-based lidar-camera extrinsic calibration methodology that can be used for non-overlapping field of view (FOV) sensors. Contrary to previous work, our methodology overcomes the non-overlapping FOV challenge using a motion capture system (MCS) instead of traditional simultaneous localization and mapping approaches. Due to the high relative precision of MCSs, our methodology… ▽ More

    Submitted 2 March, 2025; v1 submitted 19 March, 2023; originally announced March 2023.

    Comments: 9 pages, 9 figures

  32. arXiv:2302.12597  [pdf, other

    cs.RO cs.AI

    Active Velocity Estimation using Light Curtains via Self-Supervised Multi-Armed Bandits

    Authors: Siddharth Ancha, Gaurav Pathak, Ji Zhang, Srinivasa Narasimhan, David Held

    Abstract: To navigate in an environment safely and autonomously, robots must accurately estimate where obstacles are and how they move. Instead of using expensive traditional 3D sensors, we explore the use of a much cheaper, faster, and higher resolution alternative: programmable light curtains. Light curtains are a controllable depth sensor that sense only along a surface that the user selects. We adapt a… ▽ More

    Submitted 29 May, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: 9 pages (main paper), 3 pages (references), 9 pages (appendix)

  33. arXiv:2302.09182  [pdf, other

    cs.RO cs.FL eess.SY

    Safe Networked Robotics with Probabilistic Verification

    Authors: Sai Shankar Narasimhan, Sharachchandra Bhat, Sandeep P. Chinchali

    Abstract: Autonomous robots must utilize rich sensory data to make safe control decisions. To process this data, compute-constrained robots often require assistance from remote computation, or the cloud, that runs compute-intensive deep neural network perception or control models. However, this assistance comes at the cost of a time delay due to network latency, resulting in past observations being used in… ▽ More

    Submitted 3 December, 2024; v1 submitted 17 February, 2023; originally announced February 2023.

  34. arXiv:2210.06394  [pdf, other

    cs.CL

    On Text Style Transfer via Style Masked Language Models

    Authors: Sharan Narasimhan, Pooja Shekar, Suvodip Dey, Maunendra Sankar Desarkar

    Abstract: Text Style Transfer (TST) is performable through approaches such as latent space disentanglement, cycle-consistency losses, prototype editing etc. The prototype editing approach, which is known to be quite successful in TST, involves two key phases a) Masking of source style-associated tokens and b) Reconstruction of this source-style masked sentence conditioned with the target style. We follow a… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  35. arXiv:2208.12278  [pdf, other

    cs.CV cs.AI cs.GR

    Learning Continuous Implicit Representation for Near-Periodic Patterns

    Authors: Bowei Chen, Tiancheng Zhi, Martial Hebert, Srinivasa G. Narasimhan

    Abstract: Near-Periodic Patterns (NPP) are ubiquitous in man-made scenes and are composed of tiled motifs with appearance differences caused by lighting, defects, or design elements. A good NPP representation is useful for many applications including image completion, segmentation, and geometric remapping. But representing NPP is challenging because it needs to maintain global consistency (tiled motifs layo… ▽ More

    Submitted 25 August, 2022; originally announced August 2022.

    Comments: ECCV 2022. Project page: https://armastuschen.github.io/projects/NPP_Net/

  36. Doppler: Automated SKU Recommendation in Migrating SQL Workloads to the Cloud

    Authors: Joyce Cahoon, Wenjing Wang, Yiwen Zhu, Katherine Lin, Sean Liu, Raymond Truong, Neetu Singh, Chengcheng Wan, Alexandra M Ciortea, Sreraman Narasimhan, Subru Krishnan

    Abstract: Selecting the optimal cloud target to migrate SQL estates from on-premises to the cloud remains a challenge. Current solutions are not only time-consuming and error-prone, requiring significant user input, but also fail to provide appropriate recommendations. We present Doppler, a scalable recommendation engine that provides right-sized Azure SQL Platform-as-a-Service (PaaS) recommendations withou… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

    Journal ref: Proceedings of the VLDB Endowment 15 (12), 3509-3521, 2022

  37. arXiv:2205.13150  [pdf, other

    cs.GR

    Semantically Supervised Appearance Decomposition for Virtual Staging from a Single Panorama

    Authors: Tiancheng Zhi, Bowei Chen, Ivaylo Boyadzhiev, Sing Bing Kang, Martial Hebert, Srinivasa G. Narasimhan

    Abstract: We describe a novel approach to decompose a single panorama of an empty indoor environment into four appearance components: specular, direct sunlight, diffuse and diffuse ambient without direct sunlight. Our system is weakly supervised by automatically generated semantic maps (with floor, wall, ceiling, lamp, window and door labels) that have shown success on perspective views and are trained for… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: To appear in SIGGRAPH 2022

  38. arXiv:2205.02309  [pdf, other

    cs.CL

    Towards Robust and Semantically Organised Latent Representations for Unsupervised Text Style Transfer

    Authors: Sharan Narasimhan, Suvodip Dey, Maunendra Sankar Desarkar

    Abstract: Recent studies show that auto-encoder based approaches successfully perform language generation, smooth sentence interpolation, and style transfer over unseen attributes using unlabelled datasets in a zero-shot manner. The latent space geometry of such models is organised well enough to perform on datasets where the style is "coarse-grained" i.e. a small fraction of words alone in a sentence are e… ▽ More

    Submitted 4 May, 2022; originally announced May 2022.

    Comments: NAACL 2022 Main Conference paper

  39. arXiv:2107.04000  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    Active Safety Envelopes using Light Curtains with Probabilistic Guarantees

    Authors: Siddharth Ancha, Gaurav Pathak, Srinivasa G. Narasimhan, David Held

    Abstract: To safely navigate unknown environments, robots must accurately perceive dynamic obstacles. Instead of directly measuring the scene depth with a LiDAR sensor, we explore the use of a much cheaper and higher resolution sensor: programmable light curtains. Light curtains are controllable depth sensors that sense only along a surface that a user selects. We use light curtains to estimate the safety e… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: 18 pages, Published at Robotics: Science and Systems (RSS) 2021

  40. arXiv:2106.04872  [pdf, ps, other

    math.AG

    Symmetric products and moduli spaces of vector bundles of curves

    Authors: Kyoung-Seog Lee, M. S. Narasimhan

    Abstract: Let $X$ be a smooth projective curve of genus $g \geq 2$ and $M$ be the moduli space of rank 2 stable vector bundles on $X$ whose determinants are isomorphic to a fixed odd degree line bundle $L$. There has been a lot of works studying the moduli and recently the bounded derived category of coherent sheaves on $M$ draws lots of attentions. It was proved that the derived category of $X$ can be embe… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

  41. Higgs bundles twisted by a vector bundle

    Authors: Guillermo Gallego, Oscar Garcia-Prada, M. S. Narasimhan

    Abstract: In this paper, we consider a generalization of the theory of Higgs bundles over a smooth complex projective curve in which the twisting of the Higgs field by the canonical bundle of the curve is replaced by a rank 2 vector bundle. We define a Hitchin map and give a spectral correspondence. We also state a Hitchin-Kobayashi correspondence for a generalization of the Hitchin equations to this situat… ▽ More

    Submitted 18 November, 2023; v1 submitted 12 May, 2021; originally announced May 2021.

    Comments: 23 pages. To appear in the International Journal of Mathematics, we have included some referee comments. We have added and expanded some remarks and comments. We have also added an extra case in Section 3.4.3

    MSC Class: Primary 14H60; Secondary 14D23; 53C07

  42. arXiv:2012.07938  [pdf, other

    physics.flu-dyn cs.LG

    NVIDIA SimNet^{TM}: an AI-accelerated multi-physics simulation framework

    Authors: Oliver Hennigh, Susheela Narasimhan, Mohammad Amin Nabian, Akshay Subramaniam, Kaustubh Tangsali, Max Rietmann, Jose del Aguila Ferrandis, Wonmin Byeon, Zhiwei Fang, Sanjay Choudhry

    Abstract: We present SimNet, an AI-driven multi-physics simulation framework, to accelerate simulations across a wide range of disciplines in science and engineering. Compared to traditional numerical solvers, SimNet addresses a wide range of use cases - coupled forward simulations without any training data, inverse and data assimilation problems. SimNet offers fast turnaround time by enabling parameterized… ▽ More

    Submitted 14 December, 2020; originally announced December 2020.

  43. arXiv:2011.14645  [pdf, other

    eess.SY cs.LG

    Identification of Errors-in-Variables ARX Models Using Modified Dynamic Iterative PCA

    Authors: Deepak Maurya, Arun K. Tangirala, Shankar Narasimhan

    Abstract: Identification of autoregressive models with exogenous input (ARX) is a classical problem in system identification. This article considers the errors-in-variables (EIV) ARX model identification problem, where input measurements are also corrupted with noise. The recently proposed DIPCA technique solves the EIV identification problem but is only applicable to white measurement errors. We propose a… ▽ More

    Submitted 30 November, 2020; originally announced November 2020.

    Comments: 10 pages

  44. arXiv:2009.01810  [pdf, other

    cs.AI

    SEDRo: A Simulated Environment for Developmental Robotics

    Authors: Aishwarya Pothula, Md Ashaduzzaman Rubel Mondol, Sanath Narasimhan, Sm Mazharul Islam, Deokgun Park

    Abstract: Even with impressive advances in application-specific models, we still lack knowledge about how to build a model that can learn in a human-like way and do multiple tasks. To learn in a human-like way, we need to provide a diverse experience that is comparable to humans. In this paper, we introduce our ongoing effort to build a simulated environment for developmental robotics (SEDRo). SEDRo provide… ▽ More

    Submitted 3 September, 2020; originally announced September 2020.

  45. arXiv:2008.05150  [pdf, other

    eess.SY

    Identification of MISO systems in Minimal Realization Form

    Authors: Chaithanya K. Donda, Deepak Maurya, Arun K. Tangirala, Shankar Narasimhan

    Abstract: The paper is concerned with identifying transfer functions of individual input channels in minimal realization form of a Multi-Input Single Output (MISO) from the input-output data corrupted by the error in all the variables. Such a framework is commonly referred to as error-in-variables (EIV). A common approach in the existing methods for identification of MISO systems is to estimate a non-minima… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

    Comments: Accepted in ACODS2020. Other related works can be found on https://d-maurya.github.io/web/

  46. arXiv:2008.04779  [pdf, other

    eess.SY

    ARX Model Identification using Generalized Spectral Decomposition

    Authors: Deepak Maurya, Arun K. Tangirala, Shankar Narasimhan

    Abstract: This article is concerned with the identification of autoregressive with exogenous inputs (ARX) models. Most of the existing approaches like prediction error minimization and state-space framework are widely accepted and utilized for the estimation of ARX models but are known to deliver unbiased and consistent parameter estimates for a correctly supplied guess of input-output orders and delay. I… ▽ More

    Submitted 11 August, 2020; originally announced August 2020.

    Comments: 8 pages, accepted at MTNS 2020

  47. Active Perception using Light Curtains for Autonomous Driving

    Authors: Siddharth Ancha, Yaadhav Raaj, Peiyun Hu, Srinivasa G. Narasimhan, David Held

    Abstract: Most real-world 3D sensors such as LiDARs perform fixed scans of the entire environment, while being decoupled from the recognition system that processes the sensor data. In this work, we propose a method for 3D object recognition using light curtains, a resource-efficient controllable sensor that measures depth at user-specified locations in the environment. Crucially, we propose using prediction… ▽ More

    Submitted 5 August, 2020; originally announced August 2020.

    Comments: Published at the European Conference on Computer Vision (ECCV), 2020

  48. arXiv:2008.00158  [pdf, ps, other

    cs.CV

    TexMesh: Reconstructing Detailed Human Texture and Geometry from RGB-D Video

    Authors: Tiancheng Zhi, Christoph Lassner, Tony Tung, Carsten Stoll, Srinivasa G. Narasimhan, Minh Vo

    Abstract: We present TexMesh, a novel approach to reconstruct detailed human meshes with high-resolution full-body texture from RGB-D video. TexMesh enables high quality free-viewpoint rendering of humans. Given the RGB frames, the captured environment map, and the coarse per-frame human mesh from RGB-D tracking, our method reconstructs spatiotemporally consistent and detailed per-frame meshes along with a… ▽ More

    Submitted 20 September, 2020; v1 submitted 31 July, 2020; originally announced August 2020.

    Comments: ECCV 2020

  49. arXiv:2007.12806  [pdf, other

    cs.CV

    Spatiotemporal Bundle Adjustment for Dynamic 3D Human Reconstruction in the Wild

    Authors: Minh Vo, Yaser Sheikh, Srinivasa G. Narasimhan

    Abstract: Bundle adjustment jointly optimizes camera intrinsics and extrinsics and 3D point triangulation to reconstruct a static scene. The triangulation constraint, however, is invalid for moving points captured in multiple unsynchronized videos and bundle adjustment is not designed to estimate the temporal alignment between cameras. We present a spatiotemporal bundle adjustment framework that jointly opt… ▽ More

    Submitted 24 July, 2020; originally announced July 2020.

    Comments: Accepted to IEEE TPAMI

  50. arXiv:2005.13532  [pdf, other

    cs.CV cs.GR

    4D Visualization of Dynamic Events from Unconstrained Multi-View Videos

    Authors: Aayush Bansal, Minh Vo, Yaser Sheikh, Deva Ramanan, Srinivasa Narasimhan

    Abstract: We present a data-driven approach for 4D space-time visualization of dynamic events from videos captured by hand-held multiple cameras. Key to our approach is the use of self-supervised neural networks specific to the scene to compose static and dynamic aspects of an event. Though captured from discrete viewpoints, this model enables us to move around the space-time of the event continuously. This… ▽ More

    Submitted 27 May, 2020; originally announced May 2020.

    Comments: Project Page - http://www.cs.cmu.edu/~aayushb/Open4D/