Skip to main content

Showing 1–6 of 6 results for author: Baeg, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.18934  [pdf, other

    cs.CL cs.LG

    Kanana: Compute-efficient Bilingual Language Models

    Authors: Kanana LLM Team, Yunju Bak, Hojin Lee, Minho Ryu, Jiyeon Ham, Seungjae Jung, Daniel Wontae Nam, Taegyeong Eo, Donghun Lee, Doohae Jung, Boseop Kim, Nayeon Kim, Jaesun Park, Hyunho Kim, Hyunwoong Ko, Changmin Lee, Kyoung-Woon On, Seulye Baeg, Junrae Cho, Sunghee Jung, Jieun Kang, EungGyun Kim, Eunhwa Kim, Byeongil Ko, Daniel Lee , et al. (4 additional authors not shown)

    Abstract: We introduce Kanana, a series of bilingual language models that demonstrate exceeding performance in Korean and competitive performance in English. The computational cost of Kanana is significantly lower than that of state-of-the-art models of similar size. The report details the techniques employed during pre-training to achieve compute-efficient yet competitive models, including high quality dat… ▽ More

    Submitted 28 February, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

    Comments: 40 pages, 15 figures

  2. arXiv:2412.12709  [pdf, other

    astro-ph.GA astro-ph.CO astro-ph.IM cs.CV cs.LG

    Accelerating lensed quasar discovery and modeling with physics-informed variational autoencoders

    Authors: Irham T. Andika, Stefan Schuldt, Sherry H. Suyu, Satadru Bag, Raoul Cañameras, Alejandra Melo, Claudio Grillo, James H. H. Chan

    Abstract: Strongly lensed quasars provide valuable insights into the rate of cosmic expansion, the distribution of dark matter in foreground deflectors, and the characteristics of quasar hosts. However, detecting them in astronomical images is difficult due to the prevalence of non-lensing objects. To address this challenge, we developed a generative deep learning model called VariLens, built upon a physics… ▽ More

    Submitted 27 January, 2025; v1 submitted 17 December, 2024; originally announced December 2024.

    Comments: Accepted for publication in the Astronomy & Astrophysics journal and updated to reflect the revised version. The paper consists of 15 main pages, 12 figures, and 1 table. We welcome feedback and comments from readers!

    Journal ref: A&A 694, A227 (2025)

  3. arXiv:1503.05294  [pdf

    cs.DB

    Easy and Fast Design and Implementation of PostgreSQL based image handling application

    Authors: Kisor Ray, Sourav Bag, Saumen Sarkar

    Abstract: In modern computing, RDBMS are great to store different types of data. To a developer, one of the major objectives is to provide a very low cost and easy to use solution to an existing problem. While commercial databases are more easy to use along with their new as well as documented features come with complicated licensing cost, free open source databases are not that straightforward under many s… ▽ More

    Submitted 18 March, 2015; originally announced March 2015.

    Comments: 05 pages, 04 figures, 02 tables, International Journal of Advanced Research in Computer Science and Software Engineering, Volume 5, Issue 2, February 2015, ISSN 2277 128X

    ACM Class: K.8.1

  4. Topographic Feature Extraction for Bengali and Hindi Character Images

    Authors: Soumen Bag, Gaurav Harit

    Abstract: Feature selection and extraction plays an important role in different classification based problems such as face recognition, signature verification, optical character recognition (OCR) etc. The performance of OCR highly depends on the proper selection and extraction of feature set. In this paper, we present novel features based on the topography of a character as visible from different viewing di… ▽ More

    Submitted 14 July, 2011; originally announced July 2011.

    Journal ref: Signal & Image Processing : An International Journal (SIPIJ), vol.2, no.2, pp. 181-196, June 2011

  5. arXiv:1104.1237  [pdf

    cs.CV

    A Statistical Nonparametric Approach of Face Recognition: Combination of Eigenface & Modified k-Means Clustering

    Authors: Soumen Bag, Soumen Barik, Prithwiraj Sen, Gautam Sanyal

    Abstract: Facial expressions convey non-verbal cues, which play an important role in interpersonal relations. Automatic recognition of human face based on facial expression can be an important component of natural human-machine interface. It may also be used in behavioural science. Although human can recognize the face practically without any effort, but reliable face recognition by machine is a challenge.… ▽ More

    Submitted 6 April, 2011; originally announced April 2011.

    Comments: 7 pages, 2 figures. In proceedings of the Second International Conference on Information Processing (ICIP), pp. 198-204, Bangalore, India, 2008

  6. arXiv:1103.0738  [pdf, ps, other

    cs.CV cs.DL

    A Medial Axis Based Thinning Strategy for Character Images

    Authors: Soumen Bag, Gaurav Harit

    Abstract: Thinning of character images is a big challenge. Removal of strokes or deformities in thinning is a difficult problem. In this paper, we have proposed a medial axis based thinning strategy used for performing skeletonization of printed and handwritten character images. In this method, we have used shape characteristics of text to get skeleton of nearly same as the true character shape. This approa… ▽ More

    Submitted 3 March, 2011; originally announced March 2011.

    Comments: 6 pages, 5 figures. In proceedings of the second National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG), pp. 67-72, Jaipur, India, 2010