Skip to main content

Showing 1–3 of 3 results for author: Bendale, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.06694  [pdf, other

    cs.CL cs.AI

    SUTRA: Scalable Multilingual Language Model Architecture

    Authors: Abhijit Bendale, Michael Sapienza, Steven Ripplinger, Simon Gibbs, Jaewon Lee, Pranav Mistry

    Abstract: In this paper, we introduce SUTRA, multilingual Large Language Model architecture capable of understanding, reasoning, and generating text in over 50 languages. SUTRA's design uniquely decouples core conceptual understanding from language-specific processing, which facilitates scalable and efficient multilingual alignment and learning. Employing a Mixture of Experts framework both in language and… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  2. arXiv:1511.06233  [pdf, other

    cs.CV cs.LG

    Towards Open Set Deep Networks

    Authors: Abhijit Bendale, Terrance Boult

    Abstract: Deep networks have produced significant gains for various visual recognition problems, leading to high impact academic and commercial applications. Recent work in deep networks highlighted that it is easy to generate images that humans would never classify as a particular object class, yet networks classify such images high confidence as that given class - deep network are easily fooled with image… ▽ More

    Submitted 19 November, 2015; originally announced November 2015.

  3. Towards Open World Recognition

    Authors: Abhijit Bendale, Terrance Boult

    Abstract: With the of advent rich classification models and high computational power visual recognition systems have found many operational applications. Recognition in the real world poses multiple challenges that are not apparent in controlled lab environments. The datasets are dynamic and novel categories must be continuously detected and then added. At prediction time, a trained system has to deal with… ▽ More

    Submitted 17 December, 2014; originally announced December 2014.

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015) 1893 - 1902