Skip to main content

Showing 1–7 of 7 results for author: Ravishankar, R

.
  1. arXiv:2501.05453  [pdf, other

    cs.CV cs.AI

    An Empirical Study of Autoregressive Pre-training from Videos

    Authors: Jathushan Rajasegaran, Ilija Radosavovic, Rahul Ravishankar, Yossi Gandelsman, Christoph Feichtenhofer, Jitendra Malik

    Abstract: We empirically study autoregressive pre-training from videos. To perform our study, we construct a series of autoregressive video models, called Toto. We treat videos as sequences of visual tokens and train transformer models to autoregressively predict future tokens. Our models are pre-trained on a diverse dataset of videos and images comprising over 1 trillion visual tokens. We explore different… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

  2. arXiv:2411.08034  [pdf, other

    cs.CV cs.AI

    Scaling Properties of Diffusion Models for Perceptual Tasks

    Authors: Rahul Ravishankar, Zeeshan Patel, Jathushan Rajasegaran, Jitendra Malik

    Abstract: In this paper, we argue that iterative computation with diffusion models offers a powerful paradigm for not only generation but also visual perception tasks. We unify tasks such as depth estimation, optical flow, and amodal segmentation under the framework of image-to-image translation, and show how diffusion models benefit from scaling training and test-time compute for these perceptual tasks. Th… ▽ More

    Submitted 16 November, 2024; v1 submitted 12 November, 2024; originally announced November 2024.

  3. arXiv:2405.10716  [pdf

    physics.app-ph physics.ins-det

    Scanning Acoustic Microscopy for Quantifying Two-phase Transfer in Operando Alkaline Water Electrolyzer

    Authors: Zehua Dou, Hannes Rox, Zyzi Ramos, Robert Baumann, Rachappa Ravishankar, Peter Czurratis, Xuegeng Yang, Andrés Fabian Lasagni, Kerstin Eckert, Juergen Czarske, David Weik

    Abstract: Improved understandings of two-phase transport in electrochemical gas-evolving systems are increasingly demanded, while high-performance imaging techniques using simplified instrumentations are not readily available. This work presents volumetric scanning acoustic microscopy (SAM) imaging for quantifying the dynamics of gas bubbles and electrolyte in porous Nickel electrodes with different wettabi… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Research artical on an emerging field. 33 pages, 6 figures, 61 references, 10 supplementary figures available. Journal submission in progress

  4. arXiv:2404.10151  [pdf, other

    cs.DC

    Distributing Context-Aware Shared Memory Data Structures: A Case Study on Singly-Linked Lists

    Authors: Raaghav Ravishankar, Sandeep Kulkarni, Sathya Peri, Gokarna Sharma

    Abstract: In this paper, we study the partitioning of a context-aware shared memory data structure so that it can be implemented as a distributed data structure running on multiple machines. By context-aware data structures, we mean that the result of an operation not only depends upon the value of the shared data but also upon the previous operations performed by the same client. While there is substantial… ▽ More

    Submitted 24 May, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  5. arXiv:2404.09138  [pdf, other

    cs.CL cs.AI cs.LG

    From Bytes to Borsch: Fine-Tuning Gemma and Mistral for the Ukrainian Language Representation

    Authors: Artur Kiulian, Anton Polishko, Mykola Khandoga, Oryna Chubych, Jack Connor, Raghav Ravishankar, Adarsh Shirawalmath

    Abstract: In the rapidly advancing field of AI and NLP, generative large language models (LLMs) stand at the forefront of innovation, showcasing unparalleled abilities in text understanding and generation. However, the limited representation of low-resource languages like Ukrainian poses a notable challenge, restricting the reach and relevance of this technology. Our paper addresses this by fine-tuning the… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  6. arXiv:2402.07912  [pdf, other

    cs.HC cs.AI

    Spatial Computing: Concept, Applications, Challenges and Future Directions

    Authors: Gokul Yenduri, Ramalingam M, Praveen Kumar Reddy Maddikunta, Thippa Reddy Gadekallu, Rutvij H Jhaveri, Ajay Bandi, Junxin Chen, Wei Wang, Adarsh Arunkumar Shirawalmath, Raghav Ravishankar, Weizheng Wang

    Abstract: Spatial computing is a technological advancement that facilitates the seamless integration of devices into the physical environment, resulting in a more natural and intuitive digital world user experience. Spatial computing has the potential to become a significant advancement in the field of computing. From GPS and location-based services to healthcare, spatial computing technologies have influen… ▽ More

    Submitted 30 January, 2024; originally announced February 2024.

    Comments: Submitted to peer reviewe

  7. arXiv:2211.13856  [pdf

    cs.CV cs.AI

    WSSL: Weighted Self-supervised Learning Framework For Image-inpainting

    Authors: Shubham Gupta, Rahul Kunigal Ravishankar, Madhoolika Gangaraju, Poojasree Dwarkanath, Natarajan Subramanyam

    Abstract: Image inpainting is the process of regenerating lost parts of the image. Supervised algorithm-based methods have shown excellent results but have two significant drawbacks. They do not perform well when tested with unseen data. They fail to capture the global context of the image, resulting in a visually unappealing result. We propose a novel self-supervised learning framework for image-inpainting… ▽ More

    Submitted 24 August, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: 9 Pages, document submitted for publication at CGVCVIP 2022 - ISBN 978-989-8704-42-9