Skip to main content

Showing 1–24 of 24 results for author: Dwivedi, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.16654  [pdf, ps, other

    cs.LG cs.AI cs.DB

    Relational Deep Learning: Challenges, Foundations and Next-Generation Architectures

    Authors: Vijay Prakash Dwivedi, Charilaos Kanatsoulis, Shenyang Huang, Jure Leskovec

    Abstract: Graph machine learning has led to a significant increase in the capabilities of models that learn on arbitrary graph-structured data and has been applied to molecules, social networks, recommendation systems, and transportation, among other domains. Data in multi-tabular relational databases can also be constructed as 'relational entity graphs' for Relational Deep Learning (RDL) - a new blueprint… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

  2. arXiv:2506.05725  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Large Language Models are Good Relational Learners

    Authors: Fang Wu, Vijay Prakash Dwivedi, Jure Leskovec

    Abstract: Large language models (LLMs) have demonstrated remarkable capabilities across various domains, yet their application to relational deep learning (RDL) remains underexplored. Existing approaches adapt LLMs by traversing relational links between entities in a database and converting the structured data into flat text documents. Still, this text-based serialization disregards critical relational stru… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

  3. arXiv:2505.10960  [pdf, other

    cs.LG cs.AI cs.DB

    Relational Graph Transformer

    Authors: Vijay Prakash Dwivedi, Sri Jaladi, Yangyi Shen, Federico López, Charilaos I. Kanatsoulis, Rishi Puri, Matthias Fey, Jure Leskovec

    Abstract: Relational Deep Learning (RDL) is a promising approach for building state-of-the-art predictive models on multi-table relational data by representing it as a heterogeneous temporal graph. However, commonly used Graph Neural Network models suffer from fundamental limitations in capturing complex structural patterns and long-range dependencies that are inherent in relational data. While Graph Transf… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: Code: https://github.com/snap-stanford/relgt

  4. arXiv:2503.06347  [pdf, other

    cs.LG

    Curriculum Learning-Driven PIELMs for Fluid Flow Simulations

    Authors: Vikas Dwivedi, Bruno Sixou, Monica Sigovan

    Abstract: This paper presents two novel, physics-informed extreme learning machine (PIELM)-based algorithms for solving steady and unsteady nonlinear partial differential equations (PDEs) related to fluid flow. Although single-hidden-layer PIELMs outperform deep physics-informed neural networks (PINNs) in speed and accuracy for linear and quasilinear PDEs, their extension to nonlinear problems remains chall… ▽ More

    Submitted 8 March, 2025; originally announced March 2025.

  5. arXiv:2410.13351  [pdf, other

    cs.CL cs.AI cs.LG

    Representation Learning of Structured Data for Medical Foundation Models

    Authors: Vijay Prakash Dwivedi, Viktor Schlegel, Andy T. Liu, Thanh-Tung Nguyen, Abhinav Ramesh Kashyap, Jeng Wei, Wei-Hsian Yin, Stefan Winkler, Robby T. Tan

    Abstract: Large Language Models (LLMs) have demonstrated remarkable performance across various domains, including healthcare. However, their ability to effectively represent structured non-textual data, such as the alphanumeric medical codes used in records like ICD-10 or SNOMED-CT, is limited and has been particularly exposed in recent research. This paper examines the challenges LLMs face in processing me… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: NeurIPS 2024 Workshop on Unifying Representations in Neural Models (UniReps 2024)

  6. arXiv:2408.14418  [pdf, other

    cs.CL cs.AI

    MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues

    Authors: Kuluhan Binici, Abhinav Ramesh Kashyap, Viktor Schlegel, Andy T. Liu, Vijay Prakash Dwivedi, Thanh-Tung Nguyen, Xiaoxue Gao, Nancy F. Chen, Stefan Winkler

    Abstract: Automatic Speech Recognition (ASR) systems are pivotal in transcribing speech into text, yet the errors they introduce can significantly degrade the performance of downstream tasks like summarization. This issue is particularly pronounced in clinical dialogue summarization, a low-resource domain where supervised data for fine-tuning is scarce, necessitating the use of ASR models as black-box solut… ▽ More

    Submitted 8 January, 2025; v1 submitted 26 August, 2024; originally announced August 2024.

    Comments: Accepted by the Thirty-Ninth AAAI Conference on Artificial Intelligence (AAAI-25)

  7. arXiv:2408.12095  [pdf, other

    cs.CL cs.AI cs.LG

    uMedSum: A Unified Framework for Advancing Medical Abstractive Summarization

    Authors: Aishik Nagar, Yutong Liu, Andy T. Liu, Viktor Schlegel, Vijay Prakash Dwivedi, Arun-Kumar Kaliya-Perumal, Guna Pratheep Kalanchiam, Yili Tang, Robby T. Tan

    Abstract: Medical abstractive summarization faces the challenge of balancing faithfulness and informativeness. Current methods often sacrifice key information for faithfulness or introduce confabulations when prioritizing informativeness. While recent advancements in techniques like in-context learning (ICL) and fine-tuning have improved medical summarization, they often overlook crucial aspects such as fai… ▽ More

    Submitted 25 August, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

    Comments: 12 pages

  8. arXiv:2406.03699  [pdf, other

    cs.CL

    M-QALM: A Benchmark to Assess Clinical Reading Comprehension and Knowledge Recall in Large Language Models via Question Answering

    Authors: Anand Subramanian, Viktor Schlegel, Abhinav Ramesh Kashyap, Thanh-Tung Nguyen, Vijay Prakash Dwivedi, Stefan Winkler

    Abstract: There is vivid research on adapting Large Language Models (LLMs) to perform a variety of tasks in high-stakes domains such as healthcare. Despite their popularity, there is a lack of understanding of the extent and contributing factors that allow LLMs to recall relevant knowledge and combine it with presented information in the clinical and biomedical domain: a fundamental pre-requisite for succes… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL 2024 (Findings)

  9. arXiv:2404.07395  [pdf, other

    cs.LG cs.CV physics.ao-ph

    Global versus Local: Evaluating AlexNet Architectures for Tropical Cyclone Intensity Estimation

    Authors: Vikas Dwivedi

    Abstract: Given the destructive impacts of tropical cyclones, it is critical to have a reliable system for cyclone intensity detection. Various techniques are available for this purpose, each with differing levels of accuracy. In this paper, we introduce two ensemble-based models based on AlexNet architecture to estimate tropical cyclone intensity using visible satellite images. The first model, trained on… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  10. arXiv:2312.13533  [pdf, other

    cs.CL

    Automated Clinical Coding for Outpatient Departments

    Authors: Viktor Schlegel, Abhinav Ramesh Kashyap, Thanh-Tung Nguyen, Tsung-Han Yang, Vijay Prakash Dwivedi, Wei-Hsian Yin, Jeng Wei, Stefan Winkler

    Abstract: Computerised clinical coding approaches aim to automate the process of assigning a set of codes to medical records. While there is active research pushing the state of the art on clinical coding for hospitalized patients, the outpatient setting -- where doctors tend to non-hospitalised patients -- is overlooked. Although both settings can be formalised as a multi-label classification task, they pr… ▽ More

    Submitted 24 December, 2023; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: 9 pages, preprint under review

  11. arXiv:2312.11109  [pdf, other

    cs.LG

    Graph Transformers for Large Graphs

    Authors: Vijay Prakash Dwivedi, Yozen Liu, Anh Tuan Luu, Xavier Bresson, Neil Shah, Tong Zhao

    Abstract: Transformers have recently emerged as powerful neural networks for graph learning, showcasing state-of-the-art performance on several graph property prediction tasks. However, these results have been limited to small-scale graphs, where the computational feasibility of the global attention mechanism is possible. The next goal is to scale up these architectures to handle very large graphs on the sc… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  12. PICS in Pics: Physics Informed Contour Selection for Rapid Image Segmentation

    Authors: Vikas Dwivedi, Balaji Srinivasan, Ganapathy Krishnamurthi

    Abstract: Effective training of deep image segmentation models is challenging due to the need for abundant, high-quality annotations. Generating annotations is laborious and time-consuming for human experts, especially in medical image segmentation. To facilitate image annotation, we introduce Physics Informed Contour Selection (PICS) - an interpretable, physics-informed algorithm for rapid image segmentati… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

  13. arXiv:2305.15747  [pdf, other

    cs.LG

    Union Subgraph Neural Networks

    Authors: Jiaxing Xu, Aihu Zhang, Qingtian Bian, Vijay Prakash Dwivedi, Yiping Ke

    Abstract: Graph Neural Networks (GNNs) are widely used for graph representation learning in many application domains. The expressiveness of vanilla GNNs is upper-bounded by 1-dimensional Weisfeiler-Leman (1-WL) test as they operate on rooted subtrees through iterative message passing. In this paper, we empower GNNs by injecting neighbor-connectivity information extracted from a new type of substructure. We… ▽ More

    Submitted 9 January, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

  14. arXiv:2206.08164  [pdf, other

    cs.LG

    Long Range Graph Benchmark

    Authors: Vijay Prakash Dwivedi, Ladislav Rampášek, Mikhail Galkin, Ali Parviz, Guy Wolf, Anh Tuan Luu, Dominique Beaini

    Abstract: Graph Neural Networks (GNNs) that are based on the message passing (MP) paradigm generally exchange information between 1-hop neighbors to build node representations at each layer. In principle, such networks are not able to capture long-range interactions (LRI) that may be desired or necessary for learning a given task on graphs. Recently, there has been an increasing interest in development of T… ▽ More

    Submitted 28 November, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: Added reference to Tönshoff et al., 2023 in Sec. 4.1; NeurIPS 2022 Track on D&B; Open-sourced at: https://github.com/vijaydwivedi75/lrgb

  15. arXiv:2205.12454  [pdf, other

    cs.LG

    Recipe for a General, Powerful, Scalable Graph Transformer

    Authors: Ladislav Rampášek, Mikhail Galkin, Vijay Prakash Dwivedi, Anh Tuan Luu, Guy Wolf, Dominique Beaini

    Abstract: We propose a recipe on how to build a general, powerful, scalable (GPS) graph Transformer with linear complexity and state-of-the-art results on a diverse set of benchmarks. Graph Transformers (GTs) have gained popularity in the field of graph representation learning with a variety of recent publications but they lack a common foundation about what constitutes a good positional or structural encod… ▽ More

    Submitted 15 January, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: In Proceedings of NeurIPS 2022

  16. arXiv:2111.02987  [pdf

    cs.LG

    Numerical Approximation in CFD Problems Using Physics Informed Machine Learning

    Authors: Siddharth Rout, Vikas Dwivedi, Balaji Srinivasan

    Abstract: The thesis focuses on various techniques to find an alternate approximation method that could be universally used for a wide range of CFD problems but with low computational cost and low runtime. Various techniques have been explored within the field of machine learning to gauge the utility in fulfilling the core ambition. Steady advection diffusion problem has been used as the test case to unders… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

  17. arXiv:2110.07875  [pdf, other

    cs.LG

    Graph Neural Networks with Learnable Structural and Positional Representations

    Authors: Vijay Prakash Dwivedi, Anh Tuan Luu, Thomas Laurent, Yoshua Bengio, Xavier Bresson

    Abstract: Graph neural networks (GNNs) have become the standard learning architectures for graphs. GNNs have been applied to numerous domains ranging from quantum chemistry, recommender systems to knowledge graphs and natural language processing. A major issue with arbitrary graphs is the absence of canonical positional information of nodes, which decreases the representation power of GNNs to distinguish e.… ▽ More

    Submitted 10 February, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: Code at https://github.com/vijaydwivedi75/gnn-lspe

    Journal ref: ICLR 2022 (https://openreview.net/pdf?id=wTTjnvGphYj)

  18. arXiv:2012.09699  [pdf, other

    cs.LG

    A Generalization of Transformer Networks to Graphs

    Authors: Vijay Prakash Dwivedi, Xavier Bresson

    Abstract: We propose a generalization of transformer neural network architecture for arbitrary graphs. The original transformer was designed for Natural Language Processing (NLP), which operates on fully connected graphs representing all connections between the words in a sequence. Such architecture does not leverage the graph connectivity inductive bias, and can perform poorly when the graph topology is im… ▽ More

    Submitted 24 January, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: AAAI 2021 Workshop on Deep Learning on Graphs: Methods and Applications (DLG-AAAI 2021); Code at https://github.com/graphdeeplearning/graphtransformer

  19. Impact of RF I/Q Imbalance on Interference-Limited Mixed RF/FSO TWR Systems with Non-Zero Boresight Error

    Authors: Abhijeet Upadhya, Juhi Gupta, Vivek K. Dwivedi, Mohamed-Slim Alouini

    Abstract: In this letter, we investigate a generic model assessing the effect of in-phase/quadrature-phase imbalance (IQI) on an asymmetric dual hop radio frequency/free space optical (RF/FSO) two-way relay (TWR) system in the presence of multiple co-channel interferers (CCIs) at the relay. The fading on the RF and FSO links have been modeled using K-distribution and double generalized Gamma (D-GG) turbulen… ▽ More

    Submitted 8 November, 2020; originally announced November 2020.

    Comments: 13 Pages, 03 Figures

    Journal ref: IEEE Wireless Communications Letters, 2020

  20. arXiv:2003.00982  [pdf, other

    cs.LG stat.ML

    Benchmarking Graph Neural Networks

    Authors: Vijay Prakash Dwivedi, Chaitanya K. Joshi, Anh Tuan Luu, Thomas Laurent, Yoshua Bengio, Xavier Bresson

    Abstract: In the last few years, graph neural networks (GNNs) have become the standard toolkit for analyzing and learning from data on graphs. This emerging field has witnessed an extensive growth of promising techniques that have been applied with success to computer science, mathematics, biology, physics and chemistry. But for any successful field to become mainstream and reliable, benchmarks must be deve… ▽ More

    Submitted 27 December, 2022; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: Benchmarking framework on GitHub at https://github.com/graphdeeplearning/benchmarking-gnns

    Journal ref: Journal of Machine Learning Research (JMLR), 2022

  21. arXiv:1907.08967  [pdf, other

    cs.LG physics.comp-ph stat.ML

    Distributed physics informed neural network for data-efficient solution to partial differential equations

    Authors: Vikas Dwivedi, Nishant Parashar, Balaji Srinivasan

    Abstract: The physics informed neural network (PINN) is evolving as a viable method to solve partial differential equations. In the recent past PINNs have been successfully tested and validated to find solutions to both linear and non-linear partial differential equations (PDEs). However, the literature lacks detailed investigation of PINNs in terms of their representation capability. In this work, we first… ▽ More

    Submitted 21 July, 2019; originally announced July 2019.

    Comments: 16 pages, 8 figures

    Journal ref: Neurocomputing, 420, 299-316

  22. arXiv:1907.03507  [pdf, other

    cs.LG physics.comp-ph stat.ML

    Physics Informed Extreme Learning Machine (PIELM) -- A rapid method for the numerical solution of partial differential equations

    Authors: Vikas Dwivedi, Balaji Srinivasan

    Abstract: There has been rapid progress recently on the application of deep networks to the solution of partial differential equations, collectively labelled as Physics Informed Neural Networks (PINNs). In this paper, we develop Physics Informed Extreme Learning Machine (PIELM), a rapid version of PINNs which can be applied to stationary and time dependent linear partial differential equations. We demonstra… ▽ More

    Submitted 8 July, 2019; originally announced July 2019.

    Comments: 29 pages, 30 figures

  23. arXiv:1701.04185  [pdf

    cs.MM cs.CR cs.CV

    A Watermarking Technique Using Discrete Curvelet Transform for Security of Multiple Biometric Features

    Authors: Rohit M. Thanki, Ved Vyas Dwivedi, Komal R. Borisagar

    Abstract: The robustness and security of the biometric watermarking approach can be improved by using a multiple watermarking. This multiple watermarking proposed for improving security of biometric features and data. When the imposter tries to create the spoofed biometric feature, the invisible biometric watermark features can provide appropriate protection to multimedia data. In this paper, a biometric wa… ▽ More

    Submitted 16 January, 2017; originally announced January 2017.

    Journal ref: International Journal of Information Processing,volume 10, issue 1, pp. 103 - 114 (2016)

  24. Foundations and Tools for End-User Architecting

    Authors: David Garlan, Vishal Dwivedi, Ivan Ruchkin, Bradley Schmerl

    Abstract: Within an increasing number of domains an important emerging need is the ability for technically naive users to compose computational elements into novel configurations. Examples include astronomers who create new analysis pipelines to process telescopic data, intelligence analysts who must process diverse sources of unstructured text to discover socio-technical trends, and medical researchers who… ▽ More

    Submitted 17 October, 2012; originally announced October 2012.

    ACM Class: D.2.11; H.5.2

    Journal ref: Large-Scale Complex IT Systems. Development, Operation and Management. Lecture Notes in Computer Science, 2012, Volume 7539/2012, 157-182