
Showing 1–13 of 13 results for author: Mudur, S

  1. arXiv:2505.19475  [pdf, ps, other]

    cs.CL

    Continuous Self-Improvement of Large Language Models by Test-time Training with Verifier-Driven Sample Selection

    Authors: Mohammad Mahdi Moradi, Hossam Amer, Sudhir Mudur, Weiwei Zhang, Yang Liu, Walid Ahmed

    Abstract: Learning to adapt pretrained language models to unlabeled, out-of-distribution data is a critical challenge, as models often falter on structurally novel reasoning tasks even while excelling within their training distribution. We introduce a new framework called VDS-TTT - Verifier-Driven Sample Selection for Test-Time Training to efficiently address this. We use a learned verifier to score a pool…

    Submitted 28 May, 2025; v1 submitted 25 May, 2025; originally announced May 2025.

  2. arXiv:2505.19472  [pdf, ps, other]

    cs.CL

    Balancing Computation Load and Representation Expressivity in Parallel Hybrid Neural Networks

    Authors: Mohammad Mahdi Moradi, Walid Ahmed, Shuangyue Wen, Sudhir Mudur, Weiwei Zhang, Yang Liu

    Abstract: Attention and State-Space Models (SSMs), when combined in a hybrid network in sequence or in parallel, provide complementary strengths. In a hybrid sequential pipeline they alternate between applying a transformer to the input and then feeding its output into an SSM. This results in idle periods in the individual components, increasing end-to-end latency and lowering throughput caps. In the parallel h…

    Submitted 28 May, 2025; v1 submitted 25 May, 2025; originally announced May 2025.

  3. arXiv:2505.19354  [pdf, ps, other]

    cs.CL cs.CV

    GC-KBVQA: A New Four-Stage Framework for Enhancing Knowledge Based Visual Question Answering Performance

    Authors: Mohammad Mahdi Moradi, Sudhir Mudur

    Abstract: Knowledge-Based Visual Question Answering (KB-VQA) methods focus on tasks that demand reasoning with information extending beyond the explicit content depicted in the image. Early methods relied on explicit knowledge bases to provide this auxiliary information. Recent approaches leverage Large Language Models (LLMs) as implicit knowledge sources. While KB-VQA methods have demonstrated promising re…

    Submitted 25 May, 2025; originally announced May 2025.

  4. arXiv:2311.12091  [pdf, other]

    cs.CV

    DAS: A Deformable Attention to Capture Salient Information in CNNs

    Authors: Farzad Salajegheh, Nader Asadi, Soroush Saryazdi, Sudhir Mudur

    Abstract: Convolutional Neural Networks (CNNs) excel in local spatial pattern recognition. For many vision tasks, such as object recognition and segmentation, salient information is also present outside CNN's kernel boundaries. However, CNNs struggle in capturing such relevant information due to their confined receptive fields. Self-attention can improve a model's access to global information but increases…

    Submitted 20 November, 2023; originally announced November 2023.

  5. arXiv:2310.04561  [pdf, other]

    cs.GR cs.LG

    DragD3D: Realistic Mesh Editing with Rigidity Control Driven by 2D Diffusion Priors

    Authors: Tianhao Xie, Eugene Belilovsky, Sudhir Mudur, Tiberiu Popa

    Abstract: Direct mesh editing and deformation are key components in the geometric modeling and animation pipeline. Mesh editing methods are typically framed as optimization problems combining user-specified vertex constraints with a regularizer that determines the position of the rest of the vertices. The choice of the regularizer is key to the realism and authenticity of the final result. Physics and geome…

    Submitted 2 August, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: 11 pages, 8 figures, project page: https://tianhaoxie.github.io/project/DragD3D/

  6. arXiv:2304.04858  [pdf, other]

    cs.LG cs.CV

    Simulated Annealing in Early Layers Leads to Better Generalization

    Authors: Amirmohammad Sarfi, Zahra Karimpour, Muawiz Chaudhary, Nasir M. Khalid, Mirco Ravanelli, Sudhir Mudur, Eugene Belilovsky

    Abstract: Recently, a number of iterative learning methods have been introduced to improve generalization. These typically rely on training for longer periods of time in exchange for improved generalization. LLF (later-layer-forgetting) is a state-of-the-art method in this category. It strengthens learning in early layers by periodically re-initializing the last few layers of the network. Our principal inno…

    Submitted 10 April, 2023; originally announced April 2023.

  7. arXiv:2303.14771  [pdf, other]

    cs.LG

    Prototype-Sample Relation Distillation: Towards Replay-Free Continual Learning

    Authors: Nader Asadi, MohammadReza Davari, Sudhir Mudur, Rahaf Aljundi, Eugene Belilovsky

    Abstract: In continual learning (CL), balancing effective adaptation while combating catastrophic forgetting is a central challenge. Many of the recent best-performing methods utilize various forms of prior task data, e.g. a replay buffer, to tackle the catastrophic forgetting problem. Having access to previous task data can be restrictive in many real-world scenarios, for example when task data is sensitive…

    Submitted 6 June, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

    Comments: Accepted at ICML 2023

  8. arXiv:2203.13381  [pdf, other]

    cs.LG cs.AI cs.CV

    Probing Representation Forgetting in Supervised and Unsupervised Continual Learning

    Authors: MohammadReza Davari, Nader Asadi, Sudhir Mudur, Rahaf Aljundi, Eugene Belilovsky

    Abstract: Continual Learning research typically focuses on tackling the phenomenon of catastrophic forgetting in neural networks. Catastrophic forgetting is associated with an abrupt loss of knowledge previously learned by a model when the task, or more broadly the data distribution, being trained on changes. In supervised learning problems this forgetting, resulting from a change in the model's representat…

    Submitted 5 April, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: Accepted at CVPR 2022

  9. arXiv:2203.13307  [pdf, other]

    cs.LG cs.AI

    Tackling Online One-Class Incremental Learning by Removing Negative Contrasts

    Authors: Nader Asadi, Sudhir Mudur, Eugene Belilovsky

    Abstract: Recent work studies the supervised online continual learning setting where a learner receives a stream of data whose class distribution changes over time. Distinct from other continual learning settings, the learner is presented new samples only once and must distinguish between all seen classes. A number of successful methods in this setting focus on storing and replaying a subset of samples along…

    Submitted 24 March, 2022; originally announced March 2022.

    Comments: Accepted at NeurIPS 2021 Workshop on Distribution Shifts

  10. arXiv:2101.03054  [pdf, other]

    cs.IR cs.LG

    Application of Knowledge Graphs to Provide Side Information for Improved Recommendation Accuracy

    Authors: Yuhao Mao, Serguei A. Mokhov, Sudhir P. Mudur

    Abstract: Personalized recommendations are popular in these days of Internet-driven activities, specifically shopping. Recommendation methods can be grouped into three major categories: content-based filtering, collaborative filtering, and machine-learning enhanced. Information about products and preferences of different users is primarily used to infer preferences for a specific user. Inadequate informatio…

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: 27 pages, 16 figures, 5 tables, 2 algorithms; Submitted to Science of Computer Programming

  11. arXiv:1808.00118  [pdf, other]

    cs.CV cs.CR cs.HC

    Toward Multimodal Interaction in Scalable Visual Digital Evidence Visualization Using Computer Vision Techniques and ISS

    Authors: Serguei A. Mokhov, Miao Song, Jashanjot Singh, Joey Paquet, Mourad Debbabi, Sudhir Mudur

    Abstract: Visualization requirements in Forensic Lucid have to do with different levels of case knowledge abstraction, representation, aggregation, as well as the operational aspects as the final long-term goal of this proposal. It encompasses anything from the finer detailed representation of hierarchical contexts to Forensic Lucid programs, to the documented evidence and its management, its linkage to pro…

    Submitted 31 July, 2018; originally announced August 2018.

    Comments: reformatted; ICPRAI 2018 conference proceedings, pp. 151-157, CENPARMI, Concordia University, Montreal

  12. arXiv:1710.02566  [pdf, ps, other]

    cs.CV

    CAMREP- Concordia Action and Motion Repository

    Authors: Kaustubha Mendhurwar, Qing Gu, Vladimir de la Cruz, Sudhir Mudur, Tiberiu Popa

    Abstract: Action recognition, motion classification, gait analysis and synthesis are fundamental problems in a number of fields such as computer graphics, bio-mechanics and human computer interaction that generate a large body of research. This type of data is complex because it is inherently multidimensional and has multiple modalities such as video, motion capture data, accelerometer data, etc. While some…

    Submitted 6 October, 2017; originally announced October 2017.

  13. arXiv:1709.07368  [pdf, other]

    cs.CV

    Multi-label Pixelwise Classification for Reconstruction of Large-scale Urban Areas

    Authors: Yuanlie He, Sudhir Mudur, Charalambos Poullis

    Abstract: Object classification is one of the many holy grails in computer vision and as such has resulted in a very large number of algorithms being proposed already. Specifically in recent years there has been considerable progress in this area primarily due to the increased efficiency and accessibility of deep learning techniques. In fact, for single-label object classification [i.e. only one object pres…

    Submitted 23 January, 2018; v1 submitted 21 September, 2017; originally announced September 2017.