Skip to main content

Showing 1–4 of 4 results for author: Apte, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.17604  [pdf, other

    cs.AI

    OmniScience: A Domain-Specialized LLM for Scientific Reasoning and Discovery

    Authors: Vignesh Prabhakar, Md Amirul Islam, Adam Atanas, Yao-Ting Wang, Joah Han, Aastha Jhunjhunwala, Rucha Apte, Robert Clark, Kang Xu, Zihan Wang, Kai Liu

    Abstract: Large Language Models (LLMs) have demonstrated remarkable potential in advancing scientific knowledge and addressing complex challenges. In this work, we introduce OmniScience, a specialized large reasoning model for general science, developed through three key components: (1) domain adaptive pretraining on a carefully curated corpus of scientific literature, (2) instruction tuning on a specialize… ▽ More

    Submitted 22 April, 2025; v1 submitted 21 March, 2025; originally announced March 2025.

  2. arXiv:2311.16267  [pdf, other

    cs.CL cs.SE

    Novel Preprocessing Technique for Data Embedding in Engineering Code Generation Using Large Language Model

    Authors: Yu-Chen Lin, Akhilesh Kumar, Norman Chang, Wenliang Zhang, Muhammad Zakir, Rucha Apte, Haiyang He, Chao Wang, Jyh-Shing Roger Jang

    Abstract: We present four main contributions to enhance the performance of Large Language Models (LLMs) in generating domain-specific code: (i) utilizing LLM-based data splitting and data renovation techniques to improve the semantic representation of embeddings' space; (ii) introducing the Chain of Density for Renovation Credibility (CoDRC), driven by LLMs, and the Adaptive Text Renovation (ATR) algorithm… ▽ More

    Submitted 30 January, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

  3. arXiv:2306.11075  [pdf, other

    physics.flu-dyn cs.LG

    Diffusion model based data generation for partial differential equations

    Authors: Rucha Apte, Sheel Nidhan, Rishikesh Ranade, Jay Pathak

    Abstract: In a preliminary attempt to address the problem of data scarcity in physics-based machine learning, we introduce a novel methodology for data generation in physics-based simulations. Our motivation is to overcome the limitations posed by the limited availability of numerical data. To achieve this, we leverage a diffusion model that allows us to generate synthetic data samples and test them for two… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

  4. arXiv:1908.11863  [pdf

    cs.LG cs.CV eess.IV stat.ML

    Systematic Analysis of Image Generation using GANs

    Authors: Rohan Akut, Sumukh Marathe, Rucha Apte, Ishan Joshi, Siddhivinayak Kulkarni

    Abstract: Generative Adversarial Networks have been crucial in the developments made in unsupervised learning in recent times. Exemplars of image synthesis from text or other images, these networks have shown remarkable improvements over conventional methods in terms of performance. Trained on the adversarial training philosophy, these networks aim to estimate the potential distribution from the real data a… ▽ More

    Submitted 30 August, 2019; originally announced August 2019.

    Comments: Accepted in IEEE ICMLDS 2018