Showing 1–12 of 12 results for author: J, M

  1. arXiv:2410.02362  [pdf, other]

    cs.CV cs.AI

    A Comprehensive Survey of Mamba Architectures for Medical Image Analysis: Classification, Segmentation, Restoration and Beyond

    Authors: Shubhi Bansal, Sreeharish A, Madhava Prasath J, Manikandan S, Sreekanth Madisetty, Mohammad Zia Ur Rehman, Chandravardhan Singh Raghaw, Gaurav Duggal, Nagendra Kumar

    Abstract: Mamba, a special case of the State Space Model, is gaining popularity as an alternative to template-based deep learning approaches in medical image analysis. While transformers are powerful architectures, they have drawbacks, including quadratic computational complexity and an inability to address long-range dependencies efficiently. This limitation affects the analysis of large and complex datase…

    Submitted 3 October, 2024; originally announced October 2024.

  2. arXiv:2405.06702  [pdf, other]

    cs.CL cs.CV

    Malayalam Sign Language Identification using Finetuned YOLOv8 and Computer Vision Techniques

    Authors: Abhinand K., Abhiram B. Nair, Dhananjay C., Hanan Hamza, Mohammed Fawaz J., Rahma Fahim K., Anoop V. S

    Abstract: Technological advancements and innovations are advancing our daily life in all the ways possible but there is a larger section of society who are deprived of accessing the benefits due to their physical inabilities. To reap the real benefits and make it accessible to society, these talented and gifted people should also use such innovations without any hurdles. Many applications developed these da…

    Submitted 8 May, 2024; originally announced May 2024.

  3. arXiv:2404.10779  [pdf, other]

    cs.SE cs.LG

    Fine Tuning LLM for Enterprise: Practical Guidelines and Recommendations

    Authors: Mathav Raj J, Kushala VM, Harikrishna Warrier, Yogesh Gupta

    Abstract: There is a compelling necessity from enterprises for fine tuning LLMs (Large Language Models) to get them trained on proprietary domain knowledge. The challenge is to imbibe the LLMs with domain specific knowledge using the most optimal resource and cost and in the best possible time. Many enterprises rely on RAG (Retrieval Augmented Generation) which does not need LLMs to be fine-tuned but they ar…

    Submitted 23 March, 2024; originally announced April 2024.

    Comments: 17 pages, 12 tables, 3 figures

  4. arXiv:2310.14654  [pdf, ps, other]

    cs.CL eess.AS

    SPRING-INX: A Multilingual Indian Language Speech Corpus by SPRING Lab, IIT Madras

    Authors: Nithya R, Malavika S, Jordan F, Arjun Gangwar, Metilda N J, S Umesh, Rithik Sarab, Akhilesh Kumar Dubey, Govind Divakaran, Samudra Vijaya K, Suryakanth V Gangashetty

    Abstract: India is home to a multitude of languages of which 22 languages are recognised by the Indian Constitution as official. Building speech based applications for the Indian population is a difficult problem owing to limited data and the number of languages and accents to accommodate. To encourage the language technology community to build speech based applications in Indian languages, we are open sour…

    Submitted 24 October, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: 3 pages, About SPRING-INX Data

  5. arXiv:2309.09807  [pdf]

    cs.CR cs.AI cs.LG

    Efficient Concept Drift Handling for Batch Android Malware Detection Models

    Authors: Molina-Coronado B., Mori U., Mendiburu A., Miguel-Alonso J

    Abstract: The rapidly evolving nature of Android apps poses a significant challenge to static batch machine learning algorithms employed in malware detection systems, as they quickly become obsolete. Despite this challenge, the existing literature pays limited attention to addressing this issue, with many advanced Android malware detection approaches, such as Drebin, DroidDet and MaMaDroid, relying on stati…

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 18 pages

  6. arXiv:2309.02617  [pdf, other]

    cs.CV cs.LG

    Compressing Vision Transformers for Low-Resource Visual Learning

    Authors: Eric Youn, Sai Mitheran J, Sanjana Prabhu, Siyuan Chen

    Abstract: Vision transformer (ViT) and its variants have swept through visual learning leaderboards and offer state-of-the-art accuracy in tasks such as image classification, object detection, and semantic segmentation by attending to different parts of the visual input and capturing long-range spatial dependencies. However, these models are large and computation-heavy. For instance, the recently proposed V…

    Submitted 5 September, 2023; originally announced September 2023.

  7. arXiv:2104.05596  [pdf]

    cs.CL

    Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages

    Authors: Gowtham Ramesh, Sumanth Doddapaneni, Aravinth Bheemaraj, Mayank Jobanputra, Raghavan AK, Ajitesh Sharma, Sujit Sahoo, Harshita Diddee, Mahalakshmi J, Divyanshu Kakwani, Navneet Kumar, Aswin Pradeep, Srihari Nagaraj, Kumar Deepak, Vivek Raghavan, Anoop Kunchukuttan, Pratyush Kumar, Mitesh Shantadevi Khapra

    Abstract: We present Samanantar, the largest publicly available parallel corpora collection for Indic languages. The collection contains a total of 49.7 million sentence pairs between English and 11 Indic languages (from two language families). Specifically, we compile 12.4 million sentence pairs from existing, publicly-available parallel corpora, and additionally mine 37.4 million sentence pairs from the w…

    Submitted 12 June, 2023; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: Accepted to the Transactions of the Association for Computational Linguistics (TACL)

  8. arXiv:2008.03247  [pdf, other]

    eess.AS cs.CV cs.SD

    Investigation of Speaker-adaptation methods in Transformer based ASR

    Authors: Vishwas M. Shetty, Metilda Sagaya Mary N J, S. Umesh

    Abstract: End-to-end models are fast replacing the conventional hybrid models in automatic speech recognition. Transformer, a sequence-to-sequence model, based on self-attention popularly used in machine translation tasks, has given promising results when used for automatic speech recognition. This paper explores different ways of incorporating speaker information at the encoder input while training a trans…

    Submitted 17 November, 2021; v1 submitted 7 August, 2020; originally announced August 2020.

    Comments: 5 pages, 6 figures

  9. arXiv:1911.03637  [pdf, ps, other]

    cs.DM math.CO

    Boundary-type Sets of Strong Product of Directed Graphs

    Authors: Prasanth G. Narasimha-Shenoi, Bijo S Anand, Mary Shalet T J

    Abstract: Let $D=(V,E)$ be a strongly connected digraph and let $u,v\in V(D)$. The maximum distance $md(u,v)$ is defined as $md(u,v)=\max\{\overrightarrow{d}(u,v), \overrightarrow{d}(v,u)\}$, where $\overrightarrow{d}(u,v)$ denotes the length of a shortest directed $u-v$ path in $D$. This is a metric. The boundary, contour, eccentric and peripheral sets of a strong digraph $D$ with respect to this metric…

    Submitted 9 November, 2019; originally announced November 2019.

  10. arXiv:1903.10641  [pdf, other]

    cs.CV cs.RO

    INFER: INtermediate representations for FuturE pRediction

    Authors: Shashank Srikanth, Junaid Ahmed Ansari, Karnik Ram R, Sarthak Sharma, Krishna Murthy J., Madhava Krishna K

    Abstract: In urban driving scenarios, forecasting future trajectories of surrounding vehicles is of paramount importance. While several approaches for the problem have been proposed, the best-performing ones tend to require extremely detailed input representations (e.g. image sequences). But such methods do not generalize to datasets they have not been trained on. We propose intermediate representations tha…

    Submitted 25 March, 2019; originally announced March 2019.

    Comments: Manuscript under review. Submitted to IROS 2019

  11. arXiv:1609.03110  [pdf, ps, other]

    cs.DM

    Directed graphs and its Boundary Vertices

    Authors: Manoj Changat, Prasanth G. Narasimha-Shenoi, Mary Shallet T. J, Ram Kumar

    Abstract: Suppose that $D=(V,E)$ is a strongly connected digraph. Let $u,v\in V(D)$. The maximum distance $md(u,v)$ is defined as $md(u,v)=\max\{\overrightarrow{d}(u,v), \overrightarrow{d}(v,u)\}$, where $\overrightarrow{d}(u,v)$ denotes the length of a shortest directed $u-v$ path in $D$. This is a metric. The boundary, contour, eccentric and peripheral sets of a strong digraph $D$ are defined with respect…

    Submitted 10 September, 2016; originally announced September 2016.

  12. arXiv:1306.5982  [pdf]

    cs.AI cs.DB

    Activity Modeling in Smart Home using High Utility Pattern Mining over Data Streams

    Authors: Menaka Gandhi. J, K. S. Gayathri

    Abstract: Smart home technology is a better choice for the people to care about security, comfort and power saving as well. It is required to develop technologies that recognize the Activities of Daily Living (ADLs) of the residents at home and detect the abnormal behavior in the individual's patterns. Data mining techniques such as Frequent pattern mining (FPM), High Utility Pattern (HUP) Mining were used…

    Submitted 25 June, 2013; originally announced June 2013.

    Comments: This research paper consists of 7 pages, 7 figures and 4 algorithms

    Journal ref: "Interactive mining of high utility patterns over data streams", Elsevier, Vol. 39, No. 15, 2012