Skip to main content

Showing 1–40 of 40 results for author: Gupta, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2507.04966  [pdf, ps, other

    cs.SD cs.AI eess.AS

    LAPS-Diff: A Diffusion-Based Framework for Singing Voice Synthesis With Language Aware Prosody-Style Guided Learning

    Authors: Sandipan Dhar, Mayank Gupta, Preeti Rao

    Abstract: The field of Singing Voice Synthesis (SVS) has seen significant advancements in recent years due to the rapid progress of diffusion-based approaches. However, capturing vocal style, genre-specific pitch inflections, and language-dependent characteristics remains challenging, particularly in low-resource scenarios. To address this, we propose LAPS-Diff, a diffusion model integrated with language-aw… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: 10 pages, 5 figures, 3 Tables

  2. arXiv:2505.20027  [pdf, ps, other

    q-bio.NC cs.AI cs.CL cs.LG eess.AS eess.IV

    Multi-modal brain encoding models for multi-modal stimuli

    Authors: Subba Reddy Oota, Khushbu Pahwa, Mounika Marreddy, Maneesh Singh, Manish Gupta, Bapi S. Raju

    Abstract: Despite participants engaging in unimodal stimuli, such as watching images or silent videos, recent work has demonstrated that multi-modal Transformer models can predict visual brain activity impressively well, even with incongruent modality representations. This raises the question of how accurately these multi-modal models can predict brain activity when participants are engaged in multi-modal s… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 26 pages, 15 figures, The Thirteenth International Conference on Learning Representations, ICLR-2025, Singapore. https://openreview.net/pdf?id=0dELcFHig2

    Journal ref: ICLR-2025, Sinapore

  3. CurviTrack: Curvilinear Trajectory Tracking for High-speed Chase of a USV

    Authors: Parakh M. Gupta, Ondřej Procházka, Tiago Nascimento, Martin Saska

    Abstract: Heterogeneous robot teams used in marine environments incur time-and-energy penalties when the marine vehicle has to halt the mission to allow the autonomous aerial vehicle to land for recharging. In this paper, we present a solution for this problem using a novel drag-aware model formulation which is coupled with MPC, and therefore, enables tracking and landing during high-speed curvilinear traje… ▽ More

    Submitted 28 February, 2025; originally announced February 2025.

    Journal ref: in IEEE Robotics and Automation Letters, Feb. 2025

  4. Model predictive control-based trajectory generation for agile landing of unmanned aerial vehicle on a moving boat

    Authors: Ondřej Procházka, Filip Novák, Tomáš Báča, Parakh M. Gupta, Robert Pěnička, Martin Saska

    Abstract: This paper proposes a novel trajectory generation method based on Model Predictive Control (MPC) for agile landing of an Unmanned Aerial Vehicle (UAV) onto an Unmanned Surface Vehicle (USV)'s deck in harsh conditions. The trajectory generation exploits the state predictions of the USV to create periodically updated trajectories for a multirotor UAV to precisely land on the deck of a moving USV eve… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: 18 pages, 17 figures, Ocean Engineering

    Journal ref: Ocean Engineering 313:119164, 2024

  5. arXiv:2411.14453  [pdf, other

    cs.CL cs.SD eess.AS

    Direct Speech-to-Speech Neural Machine Translation: A Survey

    Authors: Mahendra Gupta, Maitreyee Dutta, Chandresh Kumar Maurya

    Abstract: Speech-to-Speech Translation (S2ST) models transform speech from one language to another target language with the same linguistic information. S2ST is important for bridging the communication gap among communities and has diverse applications. In recent years, researchers have introduced direct S2ST models, which have the potential to translate speech without relying on intermediate text generatio… ▽ More

    Submitted 13 November, 2024; originally announced November 2024.

  6. arXiv:2410.21276  [pdf, other

    cs.CL cs.AI cs.CV cs.CY cs.LG cs.SD eess.AS

    GPT-4o System Card

    Authors: OpenAI, :, Aaron Hurst, Adam Lerer, Adam P. Goucher, Adam Perelman, Aditya Ramesh, Aidan Clark, AJ Ostrow, Akila Welihinda, Alan Hayes, Alec Radford, Aleksander Mądry, Alex Baker-Whitcomb, Alex Beutel, Alex Borzunov, Alex Carney, Alex Chow, Alex Kirillov, Alex Nichol, Alex Paino, Alex Renzin, Alex Tachard Passos, Alexander Kirillov, Alexi Christakis , et al. (395 additional authors not shown)

    Abstract: GPT-4o is an autoregressive omni model that accepts as input any combination of text, audio, image, and video, and generates any combination of text, audio, and image outputs. It's trained end-to-end across text, vision, and audio, meaning all inputs and outputs are processed by the same neural network. GPT-4o can respond to audio inputs in as little as 232 milliseconds, with an average of 320 mil… ▽ More

    Submitted 25 October, 2024; originally announced October 2024.

  7. arXiv:2409.19273  [pdf

    eess.SP

    Towards ubiquitous radio access using nanodiamond based quantum receivers

    Authors: Qunsong Zeng, Jiahua Zhang, Madhav Gupta, Zhiqin Chu, Kaibin Huang

    Abstract: The development of sixth-generation (6G) wireless communication systems demands innovative solutions to address challenges in the deployment of a large number of base stations and the detection of multi-band signals. Quantum technology, specifically nitrogen vacancy (NV) centers in diamonds, offers promising potential for the development of compact, robust receivers capable of supporting multiple… ▽ More

    Submitted 28 September, 2024; originally announced September 2024.

  8. arXiv:2409.18337  [pdf, other

    eess.IV cs.CV physics.ins-det

    Photon Inhibition for Energy-Efficient Single-Photon Imaging

    Authors: Lucas J. Koerner, Shantanu Gupta, Atul Ingle, Mohit Gupta

    Abstract: Single-photon cameras (SPCs) are emerging as sensors of choice for various challenging imaging applications. One class of SPCs based on the single-photon avalanche diode (SPAD) detects individual photons using an avalanche process; the raw photon data can then be processed to extract scene information under extremely low light, high dynamic range, and rapid motion. Yet, single-photon sensitivity i… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: Accepted for ECCV 2024. Supplementary material and code available at https://wisionlab.com/project/inhibition

  9. arXiv:2407.09386  [pdf, other

    cs.CV eess.IV

    Radiance Fields from Photons

    Authors: Sacha Jungerman, Aryan Garg, Mohit Gupta

    Abstract: Neural radiance fields, or NeRFs, have become the de facto approach for high-quality view synthesis from a collection of images captured from multiple viewpoints. However, many issues remain when capturing images in-the-wild under challenging conditions, such as low light, high dynamic range, or rapid motion leading to smeared reconstructions with noticeable artifacts. In this work, we introduce q… ▽ More

    Submitted 3 December, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

  10. arXiv:2407.02683  [pdf, other

    cs.CV eess.IV

    Generalized Event Cameras

    Authors: Varun Sundar, Matthew Dutson, Andrei Ardelean, Claudio Bruschini, Edoardo Charbon, Mohit Gupta

    Abstract: Event cameras capture the world at high time resolution and with minimal bandwidth requirements. However, event streams, which only encode changes in brightness, do not contain sufficient scene information to support a wide variety of downstream tasks. In this work, we design generalized event cameras that inherently preserve scene intensity in a bandwidth-efficient manner. We generalize event cam… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: CVPR 2024

  11. arXiv:2406.00859  [pdf, other

    eess.IV cs.CV

    Streaming quanta sensors for online, high-performance imaging and vision

    Authors: Tianyi Zhang, Matthew Dutson, Vivek Boominathan, Mohit Gupta, Ashok Veeraraghavan

    Abstract: Recently quanta image sensors (QIS) -- ultra-fast, zero-read-noise binary image sensors -- have demonstrated remarkable imaging capabilities in many challenging scenarios. Despite their potential, the adoption of these sensors is severely hampered by (a) high data rates and (b) the need for new computational pipelines to handle the unconventional raw data. We introduce a simple, low-bandwidth comp… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  12. arXiv:2404.07959  [pdf

    eess.SP eess.SY

    Damage identification of offshore jacket platforms in a digital twin framework considering optimal sensor placement

    Authors: Mengmeng Wang, Atilla Incecik, Shizhe Feng, M. K. Gupta, Grzegorz Krlolczyk, Z Li

    Abstract: A new digital twin (DT) framework with optimal sensor placement (OSP) is proposed to accurately calculate the modal responses and identify the damage ratios of the offshore jacket platforms. The proposed damage identification framework consists of two models (namely one OSP model and one damage identification model). The OSP model adopts the multi-objective Lichtenberg algorithm (MOLA) to perform… ▽ More

    Submitted 26 March, 2024; originally announced April 2024.

  13. arXiv:2403.17801  [pdf, other

    cs.CV eess.IV

    Towards 3D Vision with Low-Cost Single-Photon Cameras

    Authors: Fangzhou Mu, Carter Sifferman, Sacha Jungerman, Yiquan Li, Mark Han, Michael Gleicher, Mohit Gupta, Yin Li

    Abstract: We present a method for reconstructing 3D shape of arbitrary Lambertian objects based on measurements by miniature, energy-efficient, low-cost single-photon cameras. These cameras, operating as time resolved image sensors, illuminate the scene with a very fast pulse of diffuse light and record the shape of that pulse as it returns back from the scene at a high temporal resolution. We propose to mo… ▽ More

    Submitted 29 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  14. arXiv:2403.08848  [pdf, other

    eess.IV cs.CV

    FocusMAE: Gallbladder Cancer Detection from Ultrasound Videos with Focused Masked Autoencoders

    Authors: Soumen Basu, Mayuna Gupta, Chetan Madan, Pankaj Gupta, Chetan Arora

    Abstract: In recent years, automated Gallbladder Cancer (GBC) detection has gained the attention of researchers. Current state-of-the-art (SOTA) methodologies relying on ultrasound sonography (US) images exhibit limited generalization, emphasizing the need for transformative approaches. We observe that individual US frames may lack sufficient information to capture disease manifestation. This study advocate… ▽ More

    Submitted 29 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: To Appear at CVPR 2024

  15. arXiv:2309.00066  [pdf, other

    cs.CV eess.IV

    SoDaCam: Software-defined Cameras via Single-Photon Imaging

    Authors: Varun Sundar, Andrei Ardelean, Tristan Swedish, Claudio Bruschini, Edoardo Charbon, Mohit Gupta

    Abstract: Reinterpretable cameras are defined by their post-processing capabilities that exceed traditional imaging. We present "SoDaCam" that provides reinterpretable cameras at the granularity of photons, from photon-cubes acquired by single-photon devices. Photon-cubes represent the spatio-temporal detections of photons as a sequence of binary frames, at frame-rates as high as 100 kHz. We show that simpl… ▽ More

    Submitted 8 September, 2023; v1 submitted 31 August, 2023; originally announced September 2023.

    Comments: Accepted at ICCV 2023 (oral). Project webpage can be found at https://wisionlab.com/project/sodacam/

  16. arXiv:2307.01690  [pdf

    eess.SP

    Design and Characterization of Crossbar architecture Velostat-based Flexible Writing Pad

    Authors: Mohee Datta Gupta

    Abstract: Pressure sensors are popular in a large variety of industries. For some applications, it is critical for these sensors to come in a flexible form factor. With the development of new synthetic polymers and novel fabrication techniques, flexible pressure sensing arrays are more easily accessible and can serve a variety of applications. As part of this dissertation, we demonstrate one such applicatio… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

  17. arXiv:2305.19490  [pdf

    cs.CR eess.SY

    Adoption of Blockchain Platform for Security Enhancement in Energy Transaction

    Authors: Madhuresh Gupta, Soumyakanti Giri, Prabhakar Karthikeyan Shanmugam, Mahajan Sagar Bhaskar, Jens Bo Holm-Nielsen, Sanjeevikumar Padmanaban

    Abstract: Renewable energy has become a reality in the present and is being preferred by countries to become a considerable part of the central grid. With the increasing adoption of renewables it will soon become crucial to have a platform which would facilitate secure transaction of energy for consumers as well as producers. This paper discusses and implements a Blockchain based platform which enhances and… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: 11 Pages, 6 Figures

  18. arXiv:2301.09294  [pdf, other

    eess.SY

    Forecaster-aided User Association and Load Balancing in Multi-band Mobile Networks

    Authors: Manan Gupta, Sandeep Chinchali, Paul Varkey, Jeffrey G. Andrews

    Abstract: Cellular networks are becoming increasingly heterogeneous with higher base station (BS) densities and ever more frequency bands, making BS selection and band assignment key decisions in terms of rate and coverage. In this paper, we decompose the mobility-aware user association task into (i) forecasting of user rate and then (ii) convex utility maximization for user association accounting for the e… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

  19. arXiv:2211.05094  [pdf, other

    cs.CV eess.IV

    3D Scene Inference from Transient Histograms

    Authors: Sacha Jungerman, Atul Ingle, Yin Li, Mohit Gupta

    Abstract: Time-resolved image sensors that capture light at pico-to-nanosecond timescales were once limited to niche applications but are now rapidly becoming mainstream in consumer devices. We propose low-cost and low-power imaging modalities that capture scene information from minimal time-resolved image sensors with as few as one pixel. The key idea is to flood illuminate large scene patches (or the enti… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

  20. arXiv:2207.13148  [pdf, other

    eess.IV cs.CV

    Unsupervised Contrastive Learning of Image Representations from Ultrasound Videos with Hard Negative Mining

    Authors: Soumen Basu, Somanshu Singla, Mayank Gupta, Pratyaksha Rana, Pankaj Gupta, Chetan Arora

    Abstract: Rich temporal information and variations in viewpoints make video data an attractive choice for learning image representations using unsupervised contrastive learning (UCL) techniques. State-of-the-art (SOTA) contrastive learning techniques consider frames within a video as positives in the embedding space, whereas the frames from other videos are considered negatives. We observe that unlike multi… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

    Comments: ACCEPTED for publication at MICCAI 2022

  21. arXiv:2205.13774  [pdf

    eess.IV cs.CV

    Classification of COVID-19 Patients with their Severity Level from Chest CT Scans using Transfer Learning

    Authors: Mansi Gupta, Aman Swaraj, Karan Verma

    Abstract: Background and Objective: During pandemics, the use of artificial intelligence (AI) approaches combined with biomedical science play a significant role in reducing the burden on the healthcare systems and physicians. The rapid increment in cases of COVID-19 has led to an increase in demand for hospital beds and other medical equipment. However, since medical facilities are limited, it is recommend… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

  22. arXiv:2205.12445  [pdf, other

    eess.SP cs.IT cs.LG

    Over-the-Air Design of GAN Training for mmWave MIMO Channel Estimation

    Authors: Akash Doshi, Manan Gupta, Jeffrey G. Andrews

    Abstract: Future wireless systems are trending towards higher carrier frequencies that offer larger communication bandwidth but necessitate the use of large antenna arrays. Existing signal processing techniques for channel estimation do not scale well to this "high-dimensional" regime in terms of performance and pilot overhead. Meanwhile, training deep learning based approaches for channel estimation requir… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: 34 pages, 12 figures, 5 tables. Under review for publication in IEEE Journal of Sel. Areas in Information Theory

  23. BlueSky: Activity Control: A Vision for "Active" Security Models for Smart Collaborative Systems

    Authors: Tanjila Mawla, Maanak Gupta, Ravi Sandhu

    Abstract: Cyber physical ecosystem connects different intelligent devices over heterogeneous networks. Various operations are performed on smart objects to ensure efficiency and to support automation in smart environments. An Activity (defined by Gupta and Sandhu) reflects the current state of an object, which changes in response to requested operations. Due to multiple running activities on different objec… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

  24. arXiv:2204.09564  [pdf, other

    q-bio.NC cs.AI cs.CL cs.CV cs.LG eess.IV

    Cross-view Brain Decoding

    Authors: Subba Reddy Oota, Jashn Arora, Manish Gupta, Raju S. Bapi

    Abstract: How the brain captures the meaning of linguistic stimuli across multiple views is still a critical open question in neuroscience. Consider three different views of the concept apartment: (1) picture (WP) presented with the target word label, (2) sentence (S) using the target word, and (3) word cloud (WC) containing the target word along with other semantically related words. Unlike previous effort… ▽ More

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: 11 pages, 10 figures

  25. Similarity Learning based Few Shot Learning for ECG Time Series Classification

    Authors: Priyanka Gupta, Sathvik Bhaskarpandit, Manik Gupta

    Abstract: Using deep learning models to classify time series data generated from the Internet of Things (IoT) devices requires a large amount of labeled data. However, due to constrained resources available in IoT devices, it is often difficult to accommodate training using large data sets. This paper proposes and demonstrates a Similarity Learning-based Few Shot Learning for ECG arrhythmia classification u… ▽ More

    Submitted 31 January, 2022; originally announced February 2022.

    Comments: 7 pages, 4 figures. Published as part of the DICTA 2021 conference proceedings

  26. arXiv:2201.02236  [pdf, other

    cs.CR eess.SP

    Detecting Anomalies using Overlapping Electrical Measurements in Smart Power Grids

    Authors: Sina Sontowski, Nigel Lawrence, Deepjyoti Deka, Maanak Gupta

    Abstract: As cyber-attacks against critical infrastructure become more frequent, it is increasingly important to be able to rapidly identify and respond to these threats. This work investigates two independent systems with overlapping electrical measurements with the goal to more rapidly identify anomalies. The independent systems include HIST, a SCADA historian, and ION, an automatic meter reading system (… ▽ More

    Submitted 6 January, 2022; originally announced January 2022.

  27. arXiv:2112.05263  [pdf, other

    eess.SP cs.IT

    System-Level Analysis of Full-Duplex Self-Backhauled Millimeter Wave Networks

    Authors: Manan Gupta, Ian P. Roberts, Jeffrey G. Andrews

    Abstract: Integrated access and backhaul (IAB) facilitates cost-effective deployment of millimeter wave(mmWave) cellular networks through multihop self-backhauling. Full-duplex (FD) technology, particularly for mmWave systems, is a potential means to overcome latency and throughput challenges faced by IAB networks. We derive practical and tractable throughput and latency constraints using queueing theory an… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

  28. arXiv:2107.11001  [pdf, other

    eess.IV cs.CV

    Photon-Starved Scene Inference using Single Photon Cameras

    Authors: Bhavya Goyal, Mohit Gupta

    Abstract: Scene understanding under low-light conditions is a challenging problem. This is due to the small number of photons captured by the camera and the resulting low signal-to-noise ratio (SNR). Single-photon cameras (SPCs) are an emerging sensing modality that are capable of capturing images with high sensitivity. Despite having minimal read-noise, images captured by SPCs in photon-starved conditions… ▽ More

    Submitted 16 August, 2021; v1 submitted 22 July, 2021; originally announced July 2021.

    Comments: International Conference on Computer Vision (ICCV), 2021 - Camera Ready

  29. arXiv:2105.12693  [pdf, other

    eess.SP

    Reconfigurable Architecture for Spatial Sensing in Wideband Radio Front-End

    Authors: M. Gupta, S. Sharma, H. Joshi, S. J. Darak

    Abstract: The deployment of cellular spectrum in licensed, shared and unlicensed spectrum demands wideband sensing over non-contiguous sub-6 GHz spectrum. To improve the spectrum and energy efficiency, beamforming and massive multi-antenna systems are being explored which demand spatial sensing i.e. blind identification of vacant frequency bands and direction-of-arrival (DoA) of the occupied bands. We propo… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

    Comments: 16 pages, 13 figures

  30. arXiv:2105.09046  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    Music Generation using Three-layered LSTM

    Authors: Vaishali Ingale, Anush Mohan, Divit Adlakha, Krishan Kumar, Mohit Gupta

    Abstract: This paper explores the idea of utilising Long Short-Term Memory neural networks (LSTMNN) for the generation of musical sequences in ABC notation. The proposed approach takes ABC notations from the Nottingham dataset and encodes it to be fed as input for the neural networks. The primary objective is to input the neural networks with an arbitrary note, let the network process and augment a sequence… ▽ More

    Submitted 9 June, 2021; v1 submitted 19 May, 2021; originally announced May 2021.

  31. arXiv:2104.00059  [pdf, other

    cs.CV eess.IV

    Passive Inter-Photon Imaging

    Authors: Atul Ingle, Trevor Seets, Mauro Buttafava, Shantanu Gupta, Alberto Tosi, Mohit Gupta, Andreas Velten

    Abstract: Digital camera pixels measure image intensities by converting incident light energy into an analog electrical current, and then digitizing it into a fixed-width binary representation. This direct measurement method, while conceptually simple, suffers from limited dynamic range and poor performance under extreme illumination -- electronic noise dominates under low illumination, and pixel full-well… ▽ More

    Submitted 10 April, 2021; v1 submitted 31 March, 2021; originally announced April 2021.

    Comments: Accepted to CVPR 2021 as an oral presentation

  32. iToF2dToF: A Robust and Flexible Representation for Data-Driven Time-of-Flight Imaging

    Authors: Felipe Gutierrez-Barragan, Huaijin Chen, Mohit Gupta, Andreas Velten, Jinwei Gu

    Abstract: Indirect Time-of-Flight (iToF) cameras are a promising depth sensing technology. However, they are prone to errors caused by multi-path interference (MPI) and low signal-to-noise ratio (SNR). Traditional methods, after denoising, mitigate MPI by estimating a transient image that encodes depths. Recently, data-driven methods that jointly denoise and mitigate MPI have become state-of-the-art without… ▽ More

    Submitted 21 December, 2021; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: 35 pages

  33. arXiv:2009.13175  [pdf

    eess.SY math.OC

    A Comparative Study Between a Classical and Optimal Controller for a Quadrotor

    Authors: Prathamesh Saraf, Manan Gupta, Alivelu Manga Parimi

    Abstract: This paper presents a simulation-based comparison between the two controllers, Proportional Integral Derivative (PID), a classical controller and Linear Quadratic Regulator (LQR), an optimal controller, for a linearized quadrotor model. To simplify an otherwise complicated dynamic model of a quadrotor, we derive a linear mathematical model using Newtonian and Euler's laws and applying basic princi… ▽ More

    Submitted 28 September, 2020; originally announced September 2020.

  34. arXiv:2006.11840  [pdf, other

    cs.CV cs.GR eess.IV

    Quanta Burst Photography

    Authors: Sizhuo Ma, Shantanu Gupta, Arin C. Ulku, Claudio Bruschini, Edoardo Charbon, Mohit Gupta

    Abstract: Single-photon avalanche diodes (SPADs) are an emerging sensor technology capable of detecting individual incident photons, and capturing their time-of-arrival with high timing precision. While these sensors were limited to single-pixel or low-resolution devices in the past, recently, large (up to 1 MPixel) SPAD arrays have been developed. These single-photon cameras (SPCs) are capable of capturing… ▽ More

    Submitted 21 June, 2020; originally announced June 2020.

    Comments: A version with better-quality images can be found on the project webpage: http://wisionlab.cs.wisc.edu/project/quanta-burst-photography/

  35. arXiv:1912.05198  [pdf

    cs.LG cs.NE eess.SP stat.ML

    Recurrent Transform Learning

    Authors: Megha Gupta, Angshul Majumdar

    Abstract: The objective of this work is to improve the accuracy of building demand forecasting. This is a more challenging task than grid level forecasting. For the said purpose, we develop a new technique called recurrent transform learning (RTL). Two versions are proposed. The first one (RTL) is unsupervised; this is used as a feature extraction tool that is further fed into a regression model. The second… ▽ More

    Submitted 11 December, 2019; originally announced December 2019.

    Comments: A slightly different version has been accepted at Neural Networks

  36. arXiv:1908.06372  [pdf, other

    eess.IV cs.CV

    Asynchronous Single-Photon 3D Imaging

    Authors: Anant Gupta, Atul Ingle, Mohit Gupta

    Abstract: Single-photon avalanche diodes (SPADs) are becoming popular in time-of-flight depth-ranging due to their unique ability to capture individual photons with picosecond timing resolution. However, ambient light (e.g., sunlight) incident on a SPAD-based 3D camera leads to severe non-linear distortions (pileup) in the measured waveform, resulting in large depth errors. We propose asynchronous single-ph… ▽ More

    Submitted 18 August, 2019; originally announced August 2019.

  37. Differential Scene Flow from Light Field Gradients

    Authors: Sizhuo Ma, Brandon M. Smith, Mohit Gupta

    Abstract: This paper presents novel techniques for recovering 3D dense scene flow, based on differential analysis of 4D light fields. The key enabling result is a per-ray linear equation, called the ray flow equation, that relates 3D scene flow to 4D light field gradients. The ray flow equation is invariant to 3D scene structure and applicable to a general class of scenes, but is under-constrained (3 unknow… ▽ More

    Submitted 29 July, 2019; v1 submitted 26 July, 2019; originally announced July 2019.

    ACM Class: I.4.8

  38. arXiv:1903.08347  [pdf, other

    cs.CV eess.IV

    Photon-Flooded Single-Photon 3D Cameras

    Authors: Anant Gupta, Atul Ingle, Andreas Velten, Mohit Gupta

    Abstract: Single photon avalanche diodes (SPADs) are starting to play a pivotal role in the development of photon-efficient, long-range LiDAR systems. However, due to non-linearities in their image formation model, a high photon flux (e.g., due to strong sunlight) leads to distortion of the incident temporal waveform, and potentially, large depth errors. Operating SPADs in low flux regimes can mitigate thes… ▽ More

    Submitted 29 April, 2019; v1 submitted 20 March, 2019; originally announced March 2019.

  39. arXiv:1902.10190  [pdf, other

    eess.IV cs.CV physics.ins-det

    High Flux Passive Imaging with Single-Photon Sensors

    Authors: Atul Ingle, Andreas Velten, Mohit Gupta

    Abstract: Single-photon avalanche diodes (SPADs) are an emerging technology with a unique capability of capturing individual photons with high timing precision. SPADs are being used in several active imaging systems (e.g., fluorescence lifetime microscopy and LiDAR), albeit mostly limited to low photon flux settings. We propose passive free-running SPAD (PF-SPAD) imaging, an imaging modality that uses SPADs… ▽ More

    Submitted 23 April, 2019; v1 submitted 26 February, 2019; originally announced February 2019.

    Comments: 28 pages, 15 figures, addressed reviewers's comments, fixed some errors and typos, official peer reviewed version to appear in IEEE CVPR 2019

    ACM Class: I.3.3; I.4.1

  40. arXiv:1509.00392  [pdf, other

    eess.SY

    Cascade Markov Decision Processes: Theory and Applications

    Authors: Manish Gupta

    Abstract: This paper considers the optimal control of time varying continuous time Markov chains whose transition rates are themselves Markov processes. In one set of problems the solution of an ordinary differential equation is shown to determine the optimal performance and feedback controls, while some other cases are shown to lead to singular optimal control problems which are more difficult to solve. So… ▽ More

    Submitted 1 September, 2015; originally announced September 2015.

    MSC Class: 60J20; 90C40