Skip to main content

Showing 1–50 of 50 results for author: Pillai, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.06261  [pdf, ps, other

    cs.CL cs.AI

    Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

    Authors: Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, Luke Marris, Sam Petulla, Colin Gaffney, Asaf Aharoni, Nathan Lintz, Tiago Cardal Pais, Henrik Jacobsson, Idan Szpektor, Nan-Jiang Jiang, Krishna Haridasan, Ahmed Omran, Nikunj Saunshi, Dara Bahri, Gaurav Mishra, Eric Chu , et al. (3264 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal unde… ▽ More

    Submitted 11 July, 2025; v1 submitted 7 July, 2025; originally announced July 2025.

    Comments: 72 pages, 17 figures

  2. arXiv:2506.01125  [pdf, ps, other

    cs.RO

    iRonCub 3: The Jet-Powered Flying Humanoid Robot

    Authors: Davide Gorbani, Hosameldin Awadalla Omer Mohamed, Giuseppe L'Erario, Gabriele Nava, Punith Reddy Vanteddu, Shabarish Purushothaman Pillai, Antonello Paolino, Fabio Bergonti, Saverio Taliani, Alessandro Croci, Nicholas James Tremaroli, Silvio Traversaro, Bruno Vittorio Trombetta, Daniele Pucci

    Abstract: This article presents iRonCub 3, a jet-powered humanoid robot, and its first flight experiments. Unlike traditional aerial vehicles, iRonCub 3 aims to achieve flight using a full-body humanoid form, which poses unique challenges in control, estimation, and system integration. We highlight the robot's current mechanical and software architecture, including its propulsion system, control framework,… ▽ More

    Submitted 1 June, 2025; originally announced June 2025.

  3. arXiv:2504.17780  [pdf, other

    cs.LG

    Replay to Remember: Retaining Domain Knowledge in Streaming Language Models

    Authors: Sneh Pillai

    Abstract: Continual learning in large language models (LLMs) typically encounters the critical challenge of catastrophic forgetting, where previously acquired knowledge deteriorates upon exposure to new data. While techniques like replay buffers and parameter-efficient tuning (e.g., Low-Rank Adaptation or LoRA) have been proposed, few studies investigate real-time domain adaptation under strict computationa… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

    Comments: 8 pages 3 figures, 3 tables

  4. arXiv:2503.03729  [pdf, other

    cs.LG

    Graph-Augmented LSTM for Forecasting Sparse Anomalies in Graph-Structured Time Series

    Authors: Sneh Pillai

    Abstract: Detecting anomalies in time series data is a critical task across many domains. The challenge intensifies when anomalies are sparse and the data are multivariate with relational dependencies across sensors or nodes. Traditional univariate anomaly detectors struggle to capture such cross-node dependencies, particularly in sparse anomaly settings. To address this, we propose a graph-augmented time s… ▽ More

    Submitted 5 March, 2025; originally announced March 2025.

    Comments: 12 pages

  5. arXiv:2503.03202  [pdf, other

    cs.CV

    Variance-Aware Loss Scheduling for Multimodal Alignment in Low-Data Settings

    Authors: Sneh Pillai

    Abstract: Training vision-language models for image-text alignment typically requires large datasets to achieve robust performance. In low-data scenarios, standard contrastive learning can struggle to align modalities effectively due to overfitting and unstable training dynamics. In this paper, we propose a variance-aware loss scheduling approach that dynamically adjusts the weighting of the contrastive los… ▽ More

    Submitted 5 March, 2025; originally announced March 2025.

    Comments: 8 pages, 4 figures

  6. arXiv:2409.04652  [pdf, other

    cs.LG cs.CR

    Privacy-Preserving Race/Ethnicity Estimation for Algorithmic Bias Measurement in the U.S

    Authors: Saikrishna Badrinarayanan, Osonde Osoba, Miao Cheng, Ryan Rogers, Sakshi Jain, Rahul Tandra, Natesh S. Pillai

    Abstract: AI fairness measurements, including tests for equal treatment, often take the form of disaggregated evaluations of AI systems. Such measurements are an important part of Responsible AI operations. These measurements compare system performance across demographic groups or sub-populations and typically require member-level demographic signals such as gender, race, ethnicity, and location. However, s… ▽ More

    Submitted 16 September, 2024; v1 submitted 6 September, 2024; originally announced September 2024.

    Comments: Saikrishna Badrinarayanan and Osonde Osoba contributed equally to this work. Updating text to indicate limitations of sample analyses

  7. arXiv:2409.01574  [pdf, other

    stat.CO cs.LG stat.ML

    Policy Gradients for Optimal Parallel Tempering MCMC

    Authors: Daniel Zhao, Natesh S. Pillai

    Abstract: Parallel tempering is a meta-algorithm for Markov Chain Monte Carlo that uses multiple chains to sample from tempered versions of the target distribution, enhancing mixing in multi-modal distributions that are challenging for traditional methods. The effectiveness of parallel tempering is heavily influenced by the selection of chain temperatures. Here, we present an adaptive temperature selection… ▽ More

    Submitted 26 December, 2024; v1 submitted 2 September, 2024; originally announced September 2024.

    Comments: 12 pages, 5 figures, accepted to ICML 2024 Workshop on Structured Probabilistic Inference & Generative Modeling

  8. arXiv:2408.02840  [pdf, other

    cs.CV

    GAReT: Cross-view Video Geolocalization with Adapters and Auto-Regressive Transformers

    Authors: Manu S Pillai, Mamshad Nayeem Rizve, Mubarak Shah

    Abstract: Cross-view video geo-localization (CVGL) aims to derive GPS trajectories from street-view videos by aligning them with aerial-view images. Despite their promising performance, current CVGL methods face significant challenges. These methods use camera and odometry data, typically absent in real-world scenarios. They utilize multiple adjacent frames and various encoders for feature extraction, resul… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: Accepted at ECCV 2024

  9. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1112 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 16 December, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  10. Mobile Health Text Misinformation Identification Using Mobile Data Mining

    Authors: Wen-Chen Hu, Sanjaikanth E Vadakkethil Somanathan Pillai, Abdelrahman Ahmed ElSaid

    Abstract: More than six million people died of the COVID-19 by April 2022. The heavy casualties have put people on great and urgent alert and people try to find all kinds of information to keep them from being inflected by the coronavirus. This research tries to find out whether the mobile health text information sent to peoples devices is correct as smartphones becoming the major information source for peo… ▽ More

    Submitted 5 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

  11. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1326 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 9 May, 2025; v1 submitted 18 December, 2023; originally announced December 2023.

  12. arXiv:2305.14076  [pdf, other

    math.ST cs.LG math.PR stat.CO stat.ML

    Towards Understanding the Dynamics of Gaussian-Stein Variational Gradient Descent

    Authors: Tianle Liu, Promit Ghosal, Krishnakumar Balasubramanian, Natesh S. Pillai

    Abstract: Stein Variational Gradient Descent (SVGD) is a nonparametric particle-based deterministic sampling algorithm. Despite its wide usage, understanding the theoretical properties of SVGD has remained a challenging problem. For sampling from a Gaussian target, the SVGD dynamics with a bilinear kernel will remain Gaussian as long as the initializer is Gaussian. Inspired by this fact, we undertake a deta… ▽ More

    Submitted 27 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023; 60 pages, 8 figures

  13. Detecting Fake Job Postings Using Bidirectional LSTM

    Authors: Aravind Sasidharan Pillai

    Abstract: Fake job postings have become prevalent in the online job market, posing significant challenges to job seekers and employers. Despite the growing need to address this problem, there is limited research that leverages deep learning techniques for the detection of fraudulent job advertisements. This study aims to fill the gap by employing a Bidirectional Long Short-Term Memory (Bi-LSTM) model to ide… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Journal ref: International Research Journal of Modernization in Engineering Technology and Science, Volume:05/Issue:03/March-2023

  14. Multi-Label Chest X-Ray Classification via Deep Learning

    Authors: Aravind Sasidharan Pillai

    Abstract: In this era of pandemic, the future of healthcare industry has never been more exciting. Artificial intelligence and machine learning (AI & ML) present opportunities to develop solutions that cater for very specific needs within the industry. Deep learning in healthcare had become incredibly powerful for supporting clinics and in transforming patient care in general. Deep learning is increasingly… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    Journal ref: Journal of Intelligent Learning Systems and Applications Vol.14 No.4, November 1, 2022

  15. arXiv:2204.02311  [pdf, other

    cs.CL

    PaLM: Scaling Language Modeling with Pathways

    Authors: Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin , et al. (42 additional authors not shown)

    Abstract: Large language models have been shown to achieve remarkable performance across a variety of natural language tasks using few-shot learning, which drastically reduces the number of task-specific training examples needed to adapt the model to a particular application. To further our understanding of the impact of scale on few-shot learning, we trained a 540-billion parameter, densely activated, Tran… ▽ More

    Submitted 5 October, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

  16. arXiv:2201.05797  [pdf, other

    cs.DB

    Finding Label and Model Errors in Perception Data With Learned Observation Assertions

    Authors: Daniel Kang, Nikos Arechiga, Sudeep Pillai, Peter Bailis, Matei Zaharia

    Abstract: ML is being deployed in complex, real-world scenarios where errors have impactful consequences. In these systems, thorough testing of the ML pipelines is critical. A key component in ML deployment pipelines is the curation of labeled training data. Common practice in the ML literature assumes that labels are the ground truth. However, in our experience in a large autonomous vehicle development cen… ▽ More

    Submitted 15 January, 2022; originally announced January 2022.

    Journal ref: SIGMOD 2022

  17. arXiv:2109.05752  [pdf, other

    cs.IT

    On the Age of Information of a Queuing System with Heterogeneous Servers

    Authors: Anhad Bhati, Sibi Raj B. Pillai, Rahul Vaze

    Abstract: An optimal control problem with heterogeneous servers to minimize the average age of information (AoI) is considered. Each server maintains a separate queue, and each packet arriving to the system is randomly routed to one of the servers. Assuming Poisson arrivals and exponentially distributed service times, we first derive an exact expression of the average AoI for two heterogeneous servers. Next… ▽ More

    Submitted 14 September, 2021; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: 6 pages, 4 figures. Appeared in NCC 2021, IIT Kanpur

    MSC Class: 94A15

  18. Multiple Access Channel Simulation

    Authors: Gowtham R. Kurri, Viswanathan Ramachandran, Sibi Raj B. Pillai, Vinod M. Prabhakaran

    Abstract: We study the problem of simulating a two-user multiple-access channel (MAC) over a multiple access network of noiseless links. Two encoders observe independent and identically distributed (i.i.d.) copies of a source random variable each, while a decoder observes i.i.d. copies of a side-information random variable. There are rate-limited noiseless communication links between each encoder and the de… ▽ More

    Submitted 16 June, 2022; v1 submitted 23 February, 2021; originally announced February 2021.

    Comments: 30 pages, 3 figures

  19. arXiv:2009.08765  [pdf, ps, other

    cs.IT

    On the Capacity Enlargement of Gaussian Broadcast Channels with Passive Noisy Feedback

    Authors: Aditya Narayan Ravi, Sibi Raj B. Pillai, Vinod Prabhakaran, Michèle Wigger

    Abstract: It is well known that the capacity region of an average transmit power constrained Gaussian Broadcast Channel (GBC) with independent noise realizations at the receivers is enlarged by the presence of causal noiseless feedback. Capacity region enlargement is also known to be possible by using only passive noisy feedback, when the GBC has identical noise variances at the receivers. The last fact rem… ▽ More

    Submitted 18 September, 2020; originally announced September 2020.

    Comments: 23 single column pages, 4 Figures

  20. arXiv:2008.06630  [pdf, other

    cs.CV cs.LG cs.RO

    Neural Ray Surfaces for Self-Supervised Learning of Depth and Ego-motion

    Authors: Igor Vasiljevic, Vitor Guizilini, Rares Ambrus, Sudeep Pillai, Wolfram Burgard, Greg Shakhnarovich, Adrien Gaidon

    Abstract: Self-supervised learning has emerged as a powerful tool for depth and ego-motion estimation, leading to state-of-the-art results on benchmark datasets. However, one significant limitation shared by current methods is the assumption of a known parametric camera model -- usually the standard pinhole geometry -- leading to failure when applied to imaging systems that deviate significantly from this a… ▽ More

    Submitted 14 August, 2020; originally announced August 2020.

  21. arXiv:2008.01179  [pdf, other

    cs.CV cs.LG cs.RO

    PillarFlow: End-to-end Birds-eye-view Flow Estimation for Autonomous Driving

    Authors: Kuan-Hui Lee, Matthew Kliemann, Adrien Gaidon, Jie Li, Chao Fang, Sudeep Pillai, Wolfram Burgard

    Abstract: In autonomous driving, accurately estimating the state of surrounding obstacles is critical for safe and robust path planning. However, this perception task is difficult, particularly for generic obstacles/objects, due to appearance and occlusion changes. To tackle this problem, we propose an end-to-end deep learning framework for LIDAR-based flow estimation in bird's eye view (BeV). Our method ta… ▽ More

    Submitted 29 August, 2020; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: Accepted by IROS 2020

  22. arXiv:2003.10069  [pdf, ps, other

    cs.DS math.PR

    Fast and memory-optimal dimension reduction using Kac's walk

    Authors: Vishesh Jain, Natesh S. Pillai, Ashwin Sah, Mehtaab Sawhney, Aaron Smith

    Abstract: In this work, we analyze dimension reduction algorithms based on the Kac walk and discrete variants. (1) For $n$ points in $\mathbb{R}^{d}$, we design an optimal Johnson-Lindenstrauss (JL) transform based on the Kac walk which can be applied to any vector in time $O(d\log{d})$ for essentially the same restriction on $n$ as in the best-known transforms due to Ailon and Liberty [SODA, 2008], and B… ▽ More

    Submitted 14 July, 2020; v1 submitted 22 March, 2020; originally announced March 2020.

    Comments: 27 pages, comments welcome! This version: significant new results; added two co-authors

  23. arXiv:1912.10615  [pdf, other

    cs.CV cs.RO

    Neural Outlier Rejection for Self-Supervised Keypoint Learning

    Authors: Jiexiong Tang, Hanme Kim, Vitor Guizilini, Sudeep Pillai, Rares Ambrus

    Abstract: Identifying salient points in images is a crucial component for visual odometry, Structure-from-Motion or SLAM algorithms. Recently, several learned keypoint methods have demonstrated compelling performance on challenging benchmarks. However, generating consistent and accurate training data for interest-point detection in natural images still remains challenging, especially for human annotators. W… ▽ More

    Submitted 22 December, 2019; originally announced December 2019.

  24. arXiv:1912.03426  [pdf, other

    cs.CV cs.LG cs.RO

    Self-Supervised 3D Keypoint Learning for Ego-motion Estimation

    Authors: Jiexiong Tang, Rares Ambrus, Vitor Guizilini, Sudeep Pillai, Hanme Kim, Patric Jensfelt, Adrien Gaidon

    Abstract: Detecting and matching robust viewpoint-invariant keypoints is critical for visual SLAM and Structure-from-Motion. State-of-the-art learning-based methods generate training samples via homography adaptation to create 2D synthetic views with known keypoint matches from a single image. This approach, however, does not generalize to non-planar 3D scenes with illumination variations commonly seen in r… ▽ More

    Submitted 17 November, 2020; v1 submitted 6 December, 2019; originally announced December 2019.

  25. arXiv:1910.01765  [pdf, other

    cs.CV

    Robust Semi-Supervised Monocular Depth Estimation with Reprojected Distances

    Authors: Vitor Guizilini, Jie Li, Rares Ambrus, Sudeep Pillai, Adrien Gaidon

    Abstract: Dense depth estimation from a single image is a key problem in computer vision, with exciting applications in a multitude of robotic tasks. Initially viewed as a direct regression problem, requiring annotated labels as supervision at training time, in the past few years a substantial amount of work has been done in self-supervised depth training based on strong geometric cues, both from stereo cam… ▽ More

    Submitted 19 November, 2019; v1 submitted 3 October, 2019; originally announced October 2019.

    Comments: Conference on Robot Learning (CoRL 2019)

  26. arXiv:1910.01764  [pdf, other

    cs.CV eess.IV

    Two Stream Networks for Self-Supervised Ego-Motion Estimation

    Authors: Rares Ambrus, Vitor Guizilini, Jie Li, Sudeep Pillai, Adrien Gaidon

    Abstract: Learning depth and camera ego-motion from raw unlabeled RGB video streams is seeing exciting progress through self-supervision from strong geometric cues. To leverage not only appearance but also scene geometry, we propose a novel self-supervised two-stream network using RGB and inferred depth information for accurate visual odometry. In addition, we introduce a sparsity-inducing data augmentation… ▽ More

    Submitted 19 November, 2019; v1 submitted 3 October, 2019; originally announced October 2019.

    Comments: Conference on Robot Learning (CoRL 2019)

  27. arXiv:1908.10003  [pdf, ps, other

    cs.IT

    Online Energy Harvesting Problem Over An Arbitrary Directed Acyclic Graph Network

    Authors: Rahul Vaze, Sibi Raj B Pillai

    Abstract: A communication network modelled by a directed acyclic graph (DAG) is considered, over which a source wishes to send a specified number of bits to a destination node. Each node of the DAG is powered by a separate renewable energy source, and the harvested energy is used to facilitate the source destination data flow. The challenge here is to find the optimal rate and power allocations across time… ▽ More

    Submitted 26 August, 2019; originally announced August 2019.

    Comments: Sub-result: An optimal algorithm to find the max flow in a DAG with non-polymatroidal rate constraints on subsets of edges

  28. arXiv:1905.08990  [pdf, other

    eess.SP cs.IT cs.LG

    MIST: A Novel Training Strategy for Low-latency Scalable Neural Net Decoders

    Authors: Kumar Yashashwi, Deepak Anand, Sibi Raj B Pillai, Prasanna Chaporkar, K Ganesh

    Abstract: In this paper, we propose a low latency, robust and scalable neural net based decoder for convolutional and low-density parity-check (LPDC) coding schemes. The proposed decoders are demonstrated to have bit error rate (BER) and block error rate (BLER) performances at par with the state-of-the-art neural net based decoders while achieving more than 8 times higher decoding speed. The enhanced decodi… ▽ More

    Submitted 22 May, 2019; originally announced May 2019.

  29. arXiv:1905.04453  [pdf, other

    cs.CV cs.LG cs.RO

    Self-Supervised Visual Place Recognition Learning in Mobile Robots

    Authors: Sudeep Pillai, John Leonard

    Abstract: Place recognition is a critical component in robot navigation that enables it to re-establish previously visited locations, and simultaneously use this information to correct the drift incurred in its dead-reckoned estimate. In this work, we develop a self-supervised approach to place recognition in robots. The task of visual loop-closure identification is cast as a metric learning problem, where… ▽ More

    Submitted 11 May, 2019; originally announced May 2019.

    Comments: Presented at Learning for Localization and Mapping Workshop at IROS 2017

  30. arXiv:1905.02693  [pdf, other

    cs.CV cs.LG cs.RO

    3D Packing for Self-Supervised Monocular Depth Estimation

    Authors: Vitor Guizilini, Rares Ambrus, Sudeep Pillai, Allan Raventos, Adrien Gaidon

    Abstract: Although cameras are ubiquitous, robotic platforms typically rely on active sensors like LiDAR for direct 3D perception. In this work, we propose a novel self-supervised monocular depth estimation method combining geometry with a new deep network, PackNet, learned only from unlabeled monocular videos. Our architecture leverages novel symmetrical packing and unpacking blocks to jointly learn to com… ▽ More

    Submitted 28 March, 2020; v1 submitted 6 May, 2019; originally announced May 2019.

  31. arXiv:1812.00509  [pdf, other

    cs.LG q-bio.QM stat.ML

    Knowledge-driven generative subspaces for modeling multi-view dependencies in medical data

    Authors: Parvathy Sudhir Pillai, Tze-Yun Leong

    Abstract: Early detection of Alzheimer's disease (AD) and identification of potential risk/beneficial factors are important for planning and administering timely interventions or preventive measures. In this paper, we learn a disease model for AD that combines genotypic and phenotypic profiles, and cognitive health metrics of patients. We propose a probabilistic generative subspace that describes the correl… ▽ More

    Submitted 2 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/84

  32. arXiv:1811.09973  [pdf, other

    cs.IT

    Joint State Estimation and Communication over a State-Dependent Gaussian Multiple Access Channel

    Authors: Viswanathan Ramachandran, Sibi Raj B Pillai, Vinod M Prabhakaran

    Abstract: A hybrid communication network with a common analog signal and an independent digital data stream as input to each node in a multiple access network is considered. The receiver/base-station has to estimate the analog signal with a given fidelity, and decode the digital streams with a low error probability. Treating the analog signal as a common state process, we set up a joint state estimation and… ▽ More

    Submitted 25 November, 2018; originally announced November 2018.

    Comments: 12 pages, Journal submission

  33. arXiv:1810.01849  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    SuperDepth: Self-Supervised, Super-Resolved Monocular Depth Estimation

    Authors: Sudeep Pillai, Rares Ambrus, Adrien Gaidon

    Abstract: Recent techniques in self-supervised monocular depth estimation are approaching the performance of supervised methods, but operate in low resolution only. We show that high resolution is key towards high-fidelity self-supervised monocular depth prediction. Inspired by recent deep learning methods for Single-Image Super-Resolution, we propose a sub-pixel convolutional layer extension for depth supe… ▽ More

    Submitted 3 October, 2018; originally announced October 2018.

    Comments: 6 pages, 5 figures, 2 tables, ICRA 2019 Submission

  34. arXiv:1808.03230  [pdf, other

    math.PR cs.LG stat.CO stat.ME stat.ML

    Does Hamiltonian Monte Carlo mix faster than a random walk on multimodal densities?

    Authors: Oren Mangoubi, Natesh S. Pillai, Aaron Smith

    Abstract: Hamiltonian Monte Carlo (HMC) is a very popular and generic collection of Markov chain Monte Carlo (MCMC) algorithms. One explanation for the popularity of HMC algorithms is their excellent performance as the dimension $d$ of the target becomes large: under conditions that are satisfied for many common statistical models, optimally-tuned HMC algorithms have a running time that scales like… ▽ More

    Submitted 4 September, 2018; v1 submitted 9 August, 2018; originally announced August 2018.

  35. arXiv:1705.10279  [pdf, other

    cs.RO cs.AI cs.CV

    Towards Visual Ego-motion Learning in Robots

    Authors: Sudeep Pillai, John J. Leonard

    Abstract: Many model-based Visual Odometry (VO) algorithms have been proposed in the past decade, often restricted to the type of camera optics, or the underlying motion manifold observed. We envision robots to be able to learn and perform these tasks, in a minimally supervised setting, as they gain more experience. To this end, we propose a fully trainable solution to visual ego-motion estimation for varie… ▽ More

    Submitted 29 May, 2017; originally announced May 2017.

    Comments: Conference paper; Submitted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2017, Vancouver CA; 8 pages, 8 figures, 2 tables

  36. arXiv:1704.05479  [pdf, other

    cs.IT

    Feedback-Capacity of Degraded Gaussian Vector BC using Directed Information and Concave Envelopes

    Authors: Viswanathan Ramachandran, S. R. B. Pillai

    Abstract: It is known that the capacity region of a two user physically degraded discrete memoryless (DM) broadcast channel (BC) is not enlarged by feedback. An identical result holds true for a physically degraded Gaussian BC, established later using a variant of the Entropy Power Inequality (EPI). In this paper, we extend the latter result to a physically degraded Gaussian Vector BC (PD-GVBC). However, th… ▽ More

    Submitted 18 April, 2017; originally announced April 2017.

  37. Robust Spatial Filtering with Graph Convolutional Neural Networks

    Authors: Felipe Petroski Such, Shagan Sah, Miguel Dominguez, Suhas Pillai, Chao Zhang, Andrew Michael, Nathan Cahill, Raymond Ptucha

    Abstract: Convolutional Neural Networks (CNNs) have recently led to incredible breakthroughs on a variety of pattern recognition problems. Banks of finite impulse response filters are learned on a hierarchy of layers, each contributing more abstract information than the previous layer. The simplicity and elegance of the convolutional filtering process makes them perfect for structured problems such as image… ▽ More

    Submitted 14 July, 2017; v1 submitted 2 March, 2017; originally announced March 2017.

  38. arXiv:1602.00883  [pdf, other

    cs.IT

    Distributed Scheduling in Multiple Access with Bursty Arrivals and Delay Constraints

    Authors: Sakshi Kapoor, Sreejith Sreekumar, Sibi Raj B Pillai

    Abstract: A multiple access system with bursty data arrivals to the terminals is considered. The users are frame-synchronized, with variable sized packets independently arriving in each slot at every transmitter. Each packet needs to be delivered to a common receiver within a certain number of slots specified by a maximum delay constraint. The key assumption is that the terminals know only their own packet… ▽ More

    Submitted 28 November, 2016; v1 submitted 2 February, 2016; originally announced February 2016.

    Comments: 39 pages, 16 figures, presented in part at ISIT 2014

  39. arXiv:1511.00758  [pdf, other

    cs.RO cs.CV

    High-Performance and Tunable Stereo Reconstruction

    Authors: Sudeep Pillai, Srikumar Ramalingam, John J. Leonard

    Abstract: Traditional stereo algorithms have focused their efforts on reconstruction quality and have largely avoided prioritizing for run time performance. Robots, on the other hand, require quick maneuverability and effective computation to observe its immediate environment and perform tasks within it. In this work, we propose a high-performance and tunable stereo disparity estimation method, with a peak… ▽ More

    Submitted 17 February, 2016; v1 submitted 2 November, 2015; originally announced November 2015.

    Comments: Accepted to International Conference on Robotics and Automation (ICRA) 2016; 8 pages, 5 figures

  40. arXiv:1506.01732  [pdf, other

    cs.RO cs.CV

    Monocular SLAM Supported Object Recognition

    Authors: Sudeep Pillai, John Leonard

    Abstract: In this work, we develop a monocular SLAM-aware object recognition system that is able to achieve considerably stronger recognition performance, as compared to classical object recognition systems that function on a frame-by-frame basis. By incorporating several key ideas including multi-view object proposals and efficient feature encoding methods, our proposed system is able to detect and robustl… ▽ More

    Submitted 4 June, 2015; originally announced June 2015.

    Comments: Accepted to appear at Robotics: Science and Systems 2015, Rome, Italy

  41. arXiv:1502.04806  [pdf, ps, other

    cs.IT

    On the Noisy Feedback Capacity of Gaussian Broadcast Channels

    Authors: Sibi Raj B. Pillai, Vinod M. Prabhakaran

    Abstract: It is well known that, in general, feedback may enlarge the capacity region of Gaussian broadcast channels. This has been demonstrated even when the feedback is noisy (or partial-but-perfect) and only from one of the receivers. The only case known where feedback has been shown not to enlarge the capacity region is when the channel is physically degraded (El Gamal 1978, 1981). In this paper, we sho… ▽ More

    Submitted 17 February, 2015; originally announced February 2015.

    Comments: 5 pages, 3 figures, to appear in IEEE Information Theory Workshop 2015, Jerusalem

  42. arXiv:1502.01659  [pdf, other

    cs.RO cs.CV

    Learning Articulated Motions From Visual Demonstration

    Authors: Sudeep Pillai, Matthew R. Walter, Seth Teller

    Abstract: Many functional elements of human homes and workplaces consist of rigid components which are connected through one or more sliding or rotating linkages. Examples include doors and drawers of cabinets and appliances; laptops; and swivel office chairs. A robotic mobile manipulator would benefit from the ability to acquire kinematic models of such objects from observation. This paper describes a meth… ▽ More

    Submitted 5 February, 2015; originally announced February 2015.

    Comments: Published in Robotics: Science and Systems X, Berkeley, CA. ISBN: 978-0-9923747-0-9

  43. arXiv:1502.01657  [pdf, other

    cs.CR cs.IR cs.SI

    Bitcoin Transaction Graph Analysis

    Authors: Michael Fleder, Michael S. Kester, Sudeep Pillai

    Abstract: Bitcoins have recently become an increasingly popular cryptocurrency through which users trade electronically and more anonymously than via traditional electronic transfers. Bitcoin's design keeps all transactions in a public ledger. The sender and receiver for each transaction are identified only by cryptographic public-key ids. This leads to a common misconception that it inherently provides ano… ▽ More

    Submitted 5 February, 2015; originally announced February 2015.

  44. arXiv:1501.03271  [pdf

    cs.CV physics.med-ph

    Higher dimensional homodyne filtering for suppression of incidental phase artifacts in multichannel MRI

    Authors: Joseph Suresh Paul, Uma Krishna Swamy Pillai

    Abstract: The aim of this paper is to introduce procedural steps for extension of the 1D homodyne phase correction for k-space truncation in all gradient encoding directions. Compared to the existing method applied to 2D partial k-space, signal losses introduced by the phase correction filter is observed to be minimal for the extended approach. In addition, the modified form of phase correction mitigates In… ▽ More

    Submitted 14 January, 2015; originally announced January 2015.

  45. arXiv:1410.7528  [pdf, other

    cs.IT

    Optimal WiFi Sensing via Dynamic Programming

    Authors: Abhinav Kumar, Rahul Vaze, Sibi Raj B Pillai, Aditya Gopalan

    Abstract: The problem of finding an optimal sensing schedule for a mobile device that encounters an intermittent WiFi access opportunity is considered. At any given time, the WiFi is in any of the two modes, ON or OFF, and the mobile's incentive is to connect to the WiFi in the ON mode as soon as possible, while spending as little sensing energy. We introduce a dynamic programming framework which enables th… ▽ More

    Submitted 28 October, 2014; originally announced October 2014.

  46. arXiv:1409.4489  [pdf, other

    cs.IT

    Distributed Rate Adaptation and Power Control in Fading Multiple Access Channels

    Authors: Sreejith Sreekumar, Bikash K. Dey, Sibi Raj B. Pillai

    Abstract: Traditionally, the capacity region of a coherent fading multiple access channel (MAC) is analyzed in two popular contexts. In the first, a centralized system with full channel state information at the transmitters (CSIT) is assumed, and the communication parameters like transmit power and data-rate are jointly chosen for every fading vector realization. On the other hand, in fast-fading links with… ▽ More

    Submitted 15 September, 2014; originally announced September 2014.

    Comments: 26 pages, 11 pictures, presented in parts at ISIT2013 and ITW 2014

  47. Performance Comparison of Linear Prediction based Vocoders in Linux Platform

    Authors: Lani Rachel Mathew, Ancy S. Anselam, Sakuntala S. Pillai

    Abstract: Linear predictive coders form an important class of speech coders. This paper describes the software level implementation of linear prediction based vocoders, viz. Code Excited Linear Prediction (CELP), Low-Delay CELP (LD-CELP) and Mixed Excitation Linear Prediction (MELP) at bit rates of 4.8 kb/s, 16 kb/s and 2.4 kb/s respectively. The C programs of the vocoders have been compiled and executed in… ▽ More

    Submitted 25 June, 2014; originally announced June 2014.

    Comments: 5 pages, 5 figures, Published with International Journal of Engineering Trends and Technology (IJETT)

    Journal ref: International Journal of Engineering Trends and Technology (IJETT),V10(11),554-558 April 2014

  48. arXiv:1303.2437  [pdf

    cs.CV

    Least-Squares FIR Models of Low-Resolution MR data for Efficient Phase-Error Compensation with Simultaneous Artefact Removal

    Authors: Joseph Suresh Paul, Uma Krishna Swamy Pillai, Nyjin Thomas

    Abstract: Signal space models in both phase-encode, and frequency-encode directions are presented for extrapolation of 2D partial kspace. Using the boxcar representation of low-resolution spatial data, and a geometrical representation of signal space vectors in both positive and negative phase-encode directions, a robust predictor is constructed using a series of signal space projections. Compared to some o… ▽ More

    Submitted 11 March, 2013; originally announced March 2013.

  49. arXiv:1208.4777  [pdf, other

    cs.IT

    Power Controlled Adaptive Sum-Capacity of Fading MACs with Distributed CSI

    Authors: Sibi Raj B. Pillai, Bikash K. Dey, Yash Deshpande, Krishnamoorthy Iyer

    Abstract: We consider the problem of finding optimal, fair and distributed power-rate strategies to achieve the sum capacity of the Gaussian multiple-access block-fading channel. In here, the transmitters have access to only their own fading coefficients, while the receiver has global access to all the fading coefficients. Outage is not permitted in any communication block. The resulting average sum-through… ▽ More

    Submitted 23 August, 2012; originally announced August 2012.

    Comments: 15 pages, 5 figures, combined and extended version of ITW 2011 and ISITA 2012 papers

  50. arXiv:0904.4525  [pdf, ps, other

    cs.IT

    Number of Measurements in Sparse Signal Recovery

    Authors: Paul Tune, Sibiraj Bhaskaran Pillai, Stephen Hanly

    Abstract: We analyze the asymptotic performance of sparse signal recovery from noisy measurements. In particular, we generalize some of the existing results for the Gaussian case to subgaussian and other ensembles. An achievable result is presented for the linear sparsity regime. A converse on the number of required measurements in the sub-linear regime is also presented, which cover many of the widely us… ▽ More

    Submitted 28 April, 2009; originally announced April 2009.

    Comments: 6 pages, 1 figure. Extended from conference version with proofs included