Skip to main content

Showing 1–30 of 30 results for author: Ramnath, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.01923  [pdf, ps, other

    cs.CV cs.AI cs.LG

    TaxaDiffusion: Progressively Trained Diffusion Model for Fine-Grained Species Generation

    Authors: Amin Karimi Monsefi, Mridul Khurana, Rajiv Ramnath, Anuj Karpatne, Wei-Lun Chao, Cheng Zhang

    Abstract: We propose TaxaDiffusion, a taxonomy-informed training framework for diffusion models to generate fine-grained animal images with high morphological and identity accuracy. Unlike standard approaches that treat each species as an independent category, TaxaDiffusion incorporates domain knowledge that many species exhibit strong visual similarities, with distinctions often residing in subtle variatio… ▽ More

    Submitted 25 June, 2025; v1 submitted 2 June, 2025; originally announced June 2025.

    Comments: Accepted to ICCV 2025

  2. arXiv:2412.00994  [pdf, other

    cs.LG cs.AI

    DSSRNN: Decomposition-Enhanced State-Space Recurrent Neural Network for Time-Series Analysis

    Authors: Ahmad Mohammadshirazi, Ali Nosratifiroozsalari, Rajiv Ramnath

    Abstract: Time series forecasting is a crucial yet challenging task in machine learning, requiring domain-specific knowledge due to its wide-ranging applications. While recent Transformer models have improved forecasting capabilities, they come with high computational costs. Linear-based models have shown better accuracy than Transformers but still fall short of ideal performance. To address these challenge… ▽ More

    Submitted 1 December, 2024; originally announced December 2024.

  3. arXiv:2412.00151  [pdf, other

    cs.CV cs.AI

    DLaVA: Document Language and Vision Assistant for Answer Localization with Enhanced Interpretability and Trustworthiness

    Authors: Ahmad Mohammadshirazi, Pinaki Prasad Guha Neogi, Ser-Nam Lim, Rajiv Ramnath

    Abstract: Document Visual Question Answering (VQA) requires models to interpret textual information within complex visual layouts and comprehend spatial relationships to answer questions based on document images. Existing approaches often lack interpretability and fail to precisely localize answers within the document, hindering users' ability to verify responses and understand the reasoning process. Moreov… ▽ More

    Submitted 29 November, 2024; originally announced December 2024.

  4. Scalable Deep Metric Learning on Attributed Graphs

    Authors: Xiang Li, Gagan Agrawal, Ruoming Jin, Rajiv Ramnath

    Abstract: We consider the problem of constructing embeddings of large attributed graphs and supporting multiple downstream learning tasks. We develop a graph embedding method, which is based on extending deep metric and unbiased contrastive learning techniques to 1) work with attributed graphs, 2) enabling a mini-batch based approach, and 3) achieving scalability. Based on a multi-class tuplet loss function… ▽ More

    Submitted 19 November, 2024; originally announced November 2024.

    Comments: This is the complete version of a published paper with appendix including detailed proofs

  5. arXiv:2411.12098  [pdf, other

    cs.LG

    Federated Contrastive Learning of Graph-Level Representations

    Authors: Xiang Li, Gagan Agrawal, Rajiv Ramnath, Ruoming Jin

    Abstract: Graph-level representations (and clustering/classification based on these representations) are required in a variety of applications. Examples include identifying malicious network traffic, prediction of protein properties, and many others. Often, data has to stay in isolated local systems (i.e., cannot be centrally shared for analysis) due to a variety of considerations like privacy concerns, lac… ▽ More

    Submitted 18 November, 2024; originally announced November 2024.

    Comments: Accepted in BigData 2024. This is a preprint

  6. arXiv:2410.01595  [pdf, other

    cs.CV cs.AI

    KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models

    Authors: Pouyan Navard, Amin Karimi Monsefi, Mengxi Zhou, Wei-Lun Chao, Alper Yilmaz, Rajiv Ramnath

    Abstract: Recent advances in diffusion models have significantly improved text-to-image (T2I) generation, but they often struggle to balance fine-grained precision with high-level control. Methods like ControlNet and T2I-Adapter excel at following sketches by seasoned artists but tend to be overly rigid, replicating unintentional flaws in sketches from novice users. Meanwhile, coarse-grained methods, such a… ▽ More

    Submitted 9 April, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

    Comments: Accepted to CVPR 2025 Workshop on CVEU

  7. arXiv:2409.10362  [pdf, other

    cs.CV

    Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning

    Authors: Amin Karimi Monsefi, Mengxi Zhou, Nastaran Karimi Monsefi, Ser-Nam Lim, Wei-Lun Chao, Rajiv Ramnath

    Abstract: We present a novel frequency-based Self-Supervised Learning (SSL) approach that significantly enhances its efficacy for pre-training. Prior work in this direction masks out pre-defined frequencies in the input image and employs a reconstruction loss to pre-train the model. While achieving promising results, such an implementation has two fundamental limitations as identified in our paper. First, u… ▽ More

    Submitted 28 March, 2025; v1 submitted 16 September, 2024; originally announced September 2024.

    Comments: Accepted to ICLR 2025

  8. arXiv:2409.06809  [pdf, other

    cs.CV

    DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks

    Authors: Amin Karimi Monsefi, Kishore Prakash Sailaja, Ali Alilooee, Ser-Nam Lim, Rajiv Ramnath

    Abstract: In this paper, we introduce DetailCLIP: A Detail-Oriented CLIP to address the limitations of contrastive learning-based vision-language models, particularly CLIP, in handling detail-oriented and fine-grained tasks like segmentation. While CLIP and its variants excel in the global alignment of image and text representations, they often struggle to capture the fine-grained details necessary for prec… ▽ More

    Submitted 31 March, 2025; v1 submitted 10 September, 2024; originally announced September 2024.

    Comments: Accepted in SSI-FM Workshop of ICLR 2025

  9. arXiv:2406.17591  [pdf, other

    cs.CV

    DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation

    Authors: Ahmad Mohammadshirazi, Ali Nosrati Firoozsalari, Mengxi Zhou, Dheeraj Kulshrestha, Rajiv Ramnath

    Abstract: Automating the annotation of scanned documents is challenging, requiring a balance between computational efficiency and accuracy. DocParseNet addresses this by combining deep learning and multi-modal learning to process both text and visual data. This model goes beyond traditional OCR and semantic segmentation, capturing the interplay between text and images to preserve contextual nuances in compl… ▽ More

    Submitted 21 July, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  10. arXiv:2406.13968  [pdf, other

    cs.LG

    Recent Advances in Traffic Accident Analysis and Prediction: A Comprehensive Review of Machine Learning Techniques

    Authors: Noushin Behboudi, Sobhan Moosavi, Rajiv Ramnath

    Abstract: Traffic accidents pose a severe global public health issue, leading to 1.19 million fatalities annually, with the greatest impact on individuals aged 5 to 29 years old. This paper addresses the critical need for advanced predictive methods in road safety by conducting a comprehensive review of recent advancements in applying machine learning (ML) techniques to traffic accident analysis and predict… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: A review paper, 26 pages

  11. arXiv:2405.13173  [pdf, other

    cs.LG

    Efficient and Interpretable Information Retrieval for Product Question Answering with Heterogeneous Data

    Authors: Biplob Biswas, Rajiv Ramnath

    Abstract: Expansion-enhanced sparse lexical representation improves information retrieval (IR) by minimizing vocabulary mismatch problems during lexical matching. In this paper, we explore the potential of jointly learning dense semantic representation and combining it with the lexical one for ranking candidate information. We present a hybrid information retrieval mechanism that maximizes lexical and seman… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 10 pages, 5 figures, ECNLP 7 @ LREC-COLING 2024

  12. Masked LoGoNet: Fast and Accurate 3D Image Analysis for Medical Domain

    Authors: Amin Karimi Monsefi, Payam Karisani, Mengxi Zhou, Stacey Choi, Nathan Doble, Heng Ji, Srinivasan Parthasarathy, Rajiv Ramnath

    Abstract: Standard modern machine-learning-based imaging methods have faced challenges in medical applications due to the high cost of dataset construction and, thereby, the limited labeled training data available. Additionally, upon deployment, these methods are usually used to process a large volume of data on a daily basis, imposing a high maintenance cost on medical facilities. In this paper, we introdu… ▽ More

    Submitted 28 March, 2025; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: Accepted to KDD 2024

  13. CrashFormer: A Multimodal Architecture to Predict the Risk of Crash

    Authors: Amin Karimi Monsefi, Pouya Shiri, Ahmad Mohammadshirazi, Nastaran Karimi Monsefi, Ron Davies, Sobhan Moosavi, Rajiv Ramnath

    Abstract: Reducing traffic accidents is a crucial global public safety concern. Accident prediction is key to improving traffic safety, enabling proactive measures to be taken before a crash occurs, and informing safety policies, regulations, and targeted interventions. Despite numerous studies on accident prediction over the past decades, many have limitations in terms of generalizability, reproducibility,… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: The paper is accepted In 1st ACM SIGSPATIAL International Workshop on Advances in Urban-AI (UrbanAI 23), November 13, 2023, Hamburg, Germany

  14. arXiv:2308.01438  [pdf, other

    cs.LG cs.AI physics.data-an

    Novel Physics-Based Machine-Learning Models for Indoor Air Quality Approximations

    Authors: Ahmad Mohammadshirazi, Aida Nadafian, Amin Karimi Monsefi, Mohammad H. Rafiei, Rajiv Ramnath

    Abstract: Cost-effective sensors are capable of real-time capturing a variety of air quality-related modalities from different pollutant concentrations to indoor/outdoor humidity and temperature. Machine learning (ML) models are capable of performing air-quality "ahead-of-time" approximations. Undoubtedly, accurate indoor air quality approximation significantly helps provide a healthy indoor environment, op… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    ACM Class: I.2.6

  15. arXiv:2305.03740  [pdf, other

    cs.LG

    Judge Me in Context: A Telematics-Based Driving Risk Prediction Framework in Presence of Weak Risk Labels

    Authors: Sobhan Moosavi, Rajiv Ramnath

    Abstract: Driving risk prediction has been a topic of much research over the past few decades to minimize driving risk and increase safety. The use of demographic information in risk prediction is a traditional solution with applications in insurance planning, however, it is difficult to capture true driving behavior via such coarse-grained factors. Therefor, the use of telematics data has gained a widespre… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: Preprint submitted for peer-review

  16. Will there be a construction? Predicting road constructions based on heterogeneous spatiotemporal data

    Authors: Amin Karimi Monsefi, Sobhan Moosavi, Rajiv Ramnath

    Abstract: Road construction projects maintain transportation infrastructures. These projects range from the short-term (e.g., resurfacing or fixing potholes) to the long-term (e.g., adding a shoulder or building a bridge). Deciding what the next construction project is and when it is to be scheduled is traditionally done through inspection by humans using special equipment. This approach is costly and diffi… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: In Proceedings of the 30th ACM SIGSPATIAL, International Conference on Advances in Geographic Information Systems (2022) [accepted as a short paper]

  17. Scalable Deep Graph Clustering with Random-walk based Self-supervised Learning

    Authors: Xiang Li, Dong Li, Ruoming Jin, Gagan Agrawal, Rajiv Ramnath

    Abstract: Web-based interactions can be frequently represented by an attributed graph, and node clustering in such graphs has received much attention lately. Multiple efforts have successfully applied Graph Convolutional Networks (GCN), though with some limits on accuracy as GCNs have been shown to suffer from over-smoothing issues. Though other methods (particularly those based on Laplacian Smoothing) have… ▽ More

    Submitted 17 January, 2023; v1 submitted 31 December, 2021; originally announced December 2021.

  18. arXiv:2102.05843  [pdf, other

    cs.CV cs.LG

    Driving Style Representation in Convolutional Recurrent Neural Network Model of Driver Identification

    Authors: Sobhan Moosavi, Pravar D. Mahajan, Srinivasan Parthasarathy, Colleen Saunders-Chukwu, Rajiv Ramnath

    Abstract: Identifying driving styles is the task of analyzing the behavior of drivers in order to capture variations that will serve to discriminate different drivers from each other. This task has become a prerequisite for a variety of applications, including usage-based insurance, driver coaching, driver action prediction, and even in designing autonomous vehicles; because driving style encodes essential… ▽ More

    Submitted 10 February, 2021; originally announced February 2021.

    Comments: 12 pages, research on driving style representation

  19. arXiv:1911.04427  [pdf, other

    cs.CL cs.IR cs.LG

    Sequence-to-Set Semantic Tagging: End-to-End Multi-label Prediction using Neural Attention for Complex Query Reformulation and Automated Text Categorization

    Authors: Manirupa Das, Juanxi Li, Eric Fosler-Lussier, Simon Lin, Soheil Moosavinasab, Steve Rust, Yungui Huang, Rajiv Ramnath

    Abstract: Novel contexts may often arise in complex querying scenarios such as in evidence-based medicine (EBM) involving biomedical literature, that may not explicitly refer to entities or canonical concept forms occurring in any fact- or rule-based knowledge source such as an ontology like the UMLS. Moreover, hidden associations between candidate concepts meaningful in the current context, may not exist w… ▽ More

    Submitted 11 November, 2019; originally announced November 2019.

    Comments: 8 pages, 4 figures, 1 table

  20. arXiv:1910.12446  [pdf, other

    cs.SI cs.CL cs.IR

    Towards Successful Social Media Advertising: Predicting the Influence of Commercial Tweets

    Authors: Renhao Cui, Gagan Agrawal, Rajiv Ramnath

    Abstract: Businesses communicate using Twitter for a variety of reasons -- to raise awareness of their brands, to market new products, to respond to community comments, and to connect with their customers and potential customers in a targeted manner. For businesses to do this effectively, they need to understand which content and structural elements about a tweet make it influential, that is, widely liked,… ▽ More

    Submitted 28 October, 2019; originally announced October 2019.

  21. arXiv:1910.08270  [pdf, other

    cs.CL cs.IR cs.LG

    Learning to Answer Subjective, Specific Product-Related Queries using Customer Reviews by Adversarial Domain Adaptation

    Authors: Manirupa Das, Zhen Wang, Evan Jaffe, Madhuja Chattopadhyay, Eric Fosler-Lussier, Rajiv Ramnath

    Abstract: Online customer reviews on large-scale e-commerce websites, represent a rich and varied source of opinion data, often providing subjective qualitative assessments of product usage that can help potential customers to discover features that meet their personal needs and preferences. Thus they have the potential to automatically answer specific queries about products, and to address the problems of… ▽ More

    Submitted 22 October, 2019; v1 submitted 18 October, 2019; originally announced October 2019.

    Comments: 8 pages, 1 figure, 6 tables, added additional references to end of section 2.1, removed graphics from referenced works, added to argument in section 2.3 corrected typos, results unchanged

  22. arXiv:1909.09638  [pdf, other

    cs.LG cs.DB stat.ML

    Accident Risk Prediction based on Heterogeneous Sparse Data: New Dataset and Insights

    Authors: Sobhan Moosavi, Mohammad Hossein Samavatian, Srinivasan Parthasarathy, Radu Teodorescu, Rajiv Ramnath

    Abstract: Reducing traffic accidents is an important public safety challenge, therefore, accident analysis and prediction has been a topic of much research over the past few decades. Using small-scale datasets with limited coverage, being dependent on extensive set of data, and being not applicable for real-time purposes are the important shortcomings of the existing studies. To address these challenges, we… ▽ More

    Submitted 19 September, 2019; originally announced September 2019.

    Comments: In Proceedings of the 27th ACM SIGSPATIAL, International Conference on Advances in Geographic Information Systems (2019). arXiv admin note: substantial text overlap with arXiv:1906.05409

  23. arXiv:1908.02551  [pdf, ps, other

    cs.SI cs.AI cs.CL

    Tweets Can Tell: Activity Recognition using Hybrid Long Short-Term Memory Model

    Authors: Renhao Cui, Gagan Agrawal, Rajiv Ramnath

    Abstract: This paper presents techniques to detect the "offline" activity a person is engaged in when she is tweeting (such as dining, shopping or entertainment), in order to create a dynamic profile of the user, for uses such as better targeting of advertisements. To this end, we propose a hybrid LSTM model for rich contextual learning, along with studies on the effects of applying and combining multiple L… ▽ More

    Submitted 9 July, 2019; originally announced August 2019.

  24. arXiv:1906.05409  [pdf, other

    cs.DB cs.CY

    A Countrywide Traffic Accident Dataset

    Authors: Sobhan Moosavi, Mohammad Hossein Samavatian, Srinivasan Parthasarathy, Rajiv Ramnath

    Abstract: Reducing traffic accidents is an important public safety challenge. However, the majority of studies on traffic accident analysis and prediction have used small-scale datasets with limited coverage, which limits their impact and applicability; and existing large-scale datasets are either private, old, or do not include important contextual information such as environmental stimuli (weather, points… ▽ More

    Submitted 12 June, 2019; originally announced June 2019.

    Comments: New preprint, 6 pages

  25. Short and Long-term Pattern Discovery Over Large-Scale Geo-Spatiotemporal Data

    Authors: Sobhan Moosavi, Mohammad Hossein Samavatian, Arnab Nandi, Srinivasan Parthasarathy, Rajiv Ramnath

    Abstract: Pattern discovery in geo-spatiotemporal data (such as traffic and weather data) is about finding patterns of collocation, co-occurrence, cascading, or cause and effect between geospatial entities. Using simplistic definitions of spatiotemporal neighborhood (a common characteristic of the existing general-purpose frameworks) is not semantically representative of geo-spatiotemporal data. We therefor… ▽ More

    Submitted 17 May, 2019; v1 submitted 13 February, 2019; originally announced February 2019.

    Comments: In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

  26. arXiv:1804.08748  [pdf, other

    cs.AI

    Discovery of Driving Patterns by Trajectory Segmentation

    Authors: Sobhan Moosavi, Arnab Nandi, Rajiv Ramnath

    Abstract: Telematics data is becoming increasingly available due to the ubiquity of devices that collect data during drives, for different purposes, such as usage based insurance (UBI), fleet management, navigation of connected vehicles, etc. Consequently, a variety of data-analytic applications have become feasible that extract valuable insights from the data. In this paper, we address the especially chall… ▽ More

    Submitted 3 April, 2020; v1 submitted 23 April, 2018; originally announced April 2018.

    Comments: Accepted in the 3rd PhD workshop, ACM SIGSPATIAL 2016

  27. arXiv:1804.00109  [pdf, other

    cs.SI cs.AI

    QDEE: Question Difficulty and Expertise Estimation in Community Question Answering Sites

    Authors: Jiankai Sun, Sobhan Moosavi, Rajiv Ramnath, Srinivasan Parthasarathy

    Abstract: In this paper, we present a framework for Question Difficulty and Expertise Estimation (QDEE) in Community Question Answering sites (CQAs) such as Yahoo! Answers and Stack Overflow, which tackles a fundamental challenge in crowdsourcing: how to appropriately route and assign questions to users with the suitable expertise. This problem domain has been the subject of much research and includes both… ▽ More

    Submitted 20 April, 2018; v1 submitted 30 March, 2018; originally announced April 2018.

    Comments: Accepted in the Proceedings of the 12th International AAAI Conference on Web and Social Media (ICWSM 2018). June 2018. Stanford, CA, USA

  28. Characterizing Driving Context from Driver Behavior

    Authors: Sobhan Moosavi, Behrooz Omidvar-Tehrani, R. Bruce Craig, Arnab Nandi, Rajiv Ramnath

    Abstract: Because of the increasing availability of spatiotemporal data, a variety of data-analytic applications have become possible. Characterizing driving context, where context may be thought of as a combination of location and time, is a new challenging application. An example of such a characterization is finding the correlation between driving behavior and traffic conditions. This contextual informat… ▽ More

    Submitted 17 November, 2017; v1 submitted 13 October, 2017; originally announced October 2017.

    Comments: Accepted to be published at The 25th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM SIGSPATIAL 2017)

  29. arXiv:1705.05219  [pdf, other

    cs.OH

    Annotation of Car Trajectories based on Driving Patterns

    Authors: Sobhan Moosavi, Behrooz Omidvar-Tehrani, R. Bruce Craig, Rajiv Ramnath

    Abstract: Nowadays, the ubiquity of various sensors enables the collection of voluminous datasets of car trajectories. Such datasets enable analysts to make sense of driving patterns and behaviors: in order to understand the behavior of drivers, one approach is to break a trajectory into its underlying patterns and then analyze that trajectory in terms of derived patterns. The process of trajectory segmenta… ▽ More

    Submitted 16 May, 2017; v1 submitted 15 May, 2017; originally announced May 2017.

    Comments: A 10 pages technical report which described the process of preparing a ground-truth dataset

  30. arXiv:1508.03348  [pdf

    cs.CY cs.SE

    Looking at Software Sustainability and Productivity Challenges from NSF

    Authors: Daniel S. Katz, Rajiv Ramnath

    Abstract: This paper is a contribution to the Computational Science & Engineering Software Sustainability and Productivity Challenges (CSESSP Challenges) Workshop (https://www.nitrd.gov/csessp/), sponsored by the Networking and Information Technology Research and Development (NITRD) Software Design and Productivity (SDP) Coordinating Group, held October 15th-16th 2015 in Washington DC, USA. It introduces th… ▽ More

    Submitted 17 August, 2015; v1 submitted 13 August, 2015; originally announced August 2015.