Showing 1–9 of 9 results for author: Doshi, S

Searching in archive cs.
  1. arXiv:2409.10695  [pdf, other]

    cs.CV cs.AI cs.GR

    Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models

    Authors: Bingchen Liu, Ehsan Akhgari, Alexander Visheratin, Aleks Kamko, Linmiao Xu, Shivam Shrirao, Chase Lambert, Joao Souza, Suhail Doshi, Daiqing Li

    Abstract: We introduce Playground v3 (PGv3), our latest text-to-image model that achieves state-of-the-art (SoTA) performance across multiple testing benchmarks, excels in graphic design abilities and introduces new capabilities. Unlike traditional text-to-image generative models that rely on pre-trained language models like T5 or CLIP text encoders, our approach fully integrates Large Language Models (LLMs…

    Submitted 21 October, 2024; v1 submitted 16 September, 2024; originally announced September 2024.

    Comments: Project page: https://playground.com/pg-v3

  2. arXiv:2404.04877  [pdf, other]

    cs.IT cs.CY cs.ET

    A Bird-Eye view on DNA Storage Simulators

    Authors: Sanket Doshi, Mihir Gohel, Manish K. Gupta

    Abstract: In the current world due to the huge demand for storage, DNA-based storage solution sounds quite promising because of their longevity, low power consumption, and high capacity. However in real life storing data in the form of DNA is quite expensive, and challenging. Therefore researchers and developers develop such kind of software that helps simulate real-life DNA storage without worrying about t…

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 19 pages, 19 figures, draft, review

  3. arXiv:2402.17245  [pdf, other]

    cs.CV cs.AI

    Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in Text-to-Image Generation

    Authors: Daiqing Li, Aleks Kamko, Ehsan Akhgari, Ali Sabet, Linmiao Xu, Suhail Doshi

    Abstract: In this work, we share three insights for achieving state-of-the-art aesthetic quality in text-to-image generative models. We focus on three critical aspects for model improvement: enhancing color and contrast, improving generation across multiple aspect ratios, and improving human-centric fine details. First, we delve into the significance of the noise schedule in training a diffusion model, demo…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: Model weights: https://huggingface.co/playgroundai/playground-v2.5-1024px-aesthetic

  4. arXiv:2304.11110  [pdf, other]

    cs.HC cs.RO

    Immersive Virtual Reality and Robotics for Upper Extremity Rehabilitation

    Authors: Vuthea Chheang, Rakshith Lokesh, Amit Chaudhari, Qile Wang, Lauren Baron, Behdokht Kiafar, Sagar Doshi, Erik Thostenson, Joshua Cashaback, Roghayeh Leila Barmaki

    Abstract: Stroke patients often experience upper limb impairments that restrict their mobility and daily activities. Physical therapy (PT) is the most effective method to improve impairments, but low patient adherence and participation in PT exercises pose significant challenges. To overcome these barriers, a combination of virtual reality (VR) and robotics in PT is promising. However, few systems effective…

    Submitted 29 June, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

    Comments: 9 pages, 6 figures

  5. arXiv:2303.06381  [pdf, other]

    eess.SP cs.IT cs.LG

    Learning to Precode for Integrated Sensing and Communications Systems

    Authors: R. S. Prasobh Sankar, Sidharth S. Nair, Siddhant Doshi, Sundeep Prabhakar Chepuri

    Abstract: In this paper, we present an unsupervised learning neural model to design transmit precoders for integrated sensing and communication (ISAC) systems to maximize the worst-case target illumination power while ensuring a minimum signal-to-interference-plus-noise ratio (SINR) for all the users. The problem of learning transmit precoders from uplink pilots and echoes can be viewed as a parameterized f…

    Submitted 11 March, 2023; originally announced March 2023.

  6. arXiv:2204.07705  [pdf, other]

    cs.CL cs.AI

    Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

    Authors: Yizhong Wang, Swaroop Mishra, Pegah Alipoormolabashi, Yeganeh Kordi, Amirreza Mirzaei, Anjana Arunkumar, Arjun Ashok, Arut Selvan Dhanasekaran, Atharva Naik, David Stap, Eshaan Pathak, Giannis Karamanolakis, Haizhi Gary Lai, Ishan Purohit, Ishani Mondal, Jacob Anderson, Kirby Kuznia, Krima Doshi, Maitreya Patel, Kuntal Kumar Pal, Mehrad Moradshahi, Mihir Parmar, Mirali Purohit, Neeraj Varshney, Phani Rohitha Kaza, et al. (15 additional authors not shown)

    Abstract: How well can NLP models generalize to a variety of unseen tasks when provided with task instructions? To address this question, we first introduce Super-NaturalInstructions, a benchmark of 1,616 diverse NLP tasks and their expert-written instructions. Our collection covers 76 distinct task types, including but not limited to classification, extraction, infilling, sequence tagging, text rewriting,…

    Submitted 24 October, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

    Comments: Accepted to EMNLP 2022, 25 pages

  7. arXiv:2111.11482  [pdf, other]

    cs.LG eess.SP stat.ML

    Graph Neural Networks with Parallel Neighborhood Aggregations for Graph Classification

    Authors: Siddhant Doshi, Sundeep Prabhakar Chepuri

    Abstract: We focus on graph classification using a graph neural network (GNN) model that precomputes the node features using a bank of neighborhood aggregation graph operators arranged in parallel. These GNN models have a natural advantage of reduced training and inference time due to the precomputations but are also fundamentally different from popular GNN variants that update node features through a seque…

    Submitted 22 November, 2021; originally announced November 2021.

  8. arXiv:2012.02151  [pdf, other]

    cs.LG q-bio.MN q-bio.QM

    Dr-COVID: Graph Neural Networks for SARS-CoV-2 Drug Repurposing

    Authors: Siddhant Doshi, Sundeep Prabhakar Chepuri

    Abstract: The 2019 novel coronavirus (SARS-CoV-2) pandemic has resulted in more than a million deaths, high morbidities, and economic distress worldwide. There is an urgent need to identify medications that would treat and prevent novel diseases like the 2019 coronavirus disease (COVID-19). Drug repurposing is a promising strategy to discover new medical indications of the existing approved drugs due to sev…

    Submitted 3 December, 2020; originally announced December 2020.

  9. arXiv:1810.09786  [pdf, other]

    cs.RO

    Human-centered manipulation and navigation with Robot DE NIRO

    Authors: Fabian Falck, Sagar Doshi, Nico Smuts, John Lingi, Kim Rants, Petar Kormushev

    Abstract: Social assistance robots in health and elderly care have the potential to support and ease human lives. Given the macrosocial trends of aging and long-lived populations, robotics-based care research mainly focused on helping the elderly live independently. In this paper, we introduce Robot DE NIRO, a research platform that aims to support the supporter (the caregiver) and also offers direct human-…

    Submitted 23 October, 2018; originally announced October 2018.

    Comments: In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2018) Workshop "Towards Robots that Exhibit Manipulation Intelligence", Madrid, Spain, Oct. 1, 2018