Skip to main content

Showing 1–16 of 16 results for author: Hirata, N S T

.
  1. arXiv:2505.05681  [pdf, ps, other

    cs.CV

    Fine-Tuning Video-Text Contrastive Model for Primate Behavior Retrieval from Unlabeled Raw Videos

    Authors: Giulio Cesare Mastrocinque Santo, Patrícia Izar, Irene Delval, Victor de Napole Gregolin, Nina S. T. Hirata

    Abstract: Video recordings of nonhuman primates in their natural habitat are a common source for studying their behavior in the wild. We fine-tune pre-trained video-text foundational models for the specific domain of capuchin monkeys, with the goal of developing useful computational models to help researchers to retrieve useful clips from videos. We focus on the challenging problem of training a model based… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

  2. arXiv:2502.04602  [pdf, other

    cs.CL cs.AI

    Extracting and Understanding the Superficial Knowledge in Alignment

    Authors: Runjin Chen, Gabriel Jacob Perin, Xuxi Chen, Xilun Chen, Yan Han, Nina S. T. Hirata, Junyuan Hong, Bhavya Kailkhura

    Abstract: Alignment of large language models (LLMs) with human values and preferences, often achieved through fine-tuning based on human feedback, is essential for ensuring safe and responsible AI behaviors. However, the process typically requires substantial data and computation resources. Recent studies have revealed that alignment might be attainable at lower costs through simpler methods, such as in-con… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

  3. arXiv:2501.04750  [pdf, other

    cs.CV cs.LG

    Efficient License Plate Recognition in Videos Using Visual Rhythm and Accumulative Line Analysis

    Authors: Victor Nascimento Ribeiro, Nina S. T. Hirata

    Abstract: Video-based Automatic License Plate Recognition (ALPR) involves extracting vehicle license plate text information from video captures. Traditional systems typically rely heavily on high-end computing resources and utilize multiple frames to recognize license plates, leading to increased computational overhead. In this paper, we propose two methods capable of efficiently extracting exactly one fram… ▽ More

    Submitted 8 January, 2025; originally announced January 2025.

    Comments: Accepted for presentation at the Conference on Graphics, Patterns and Images (SIBGRAPI) 2024

  4. Combining YOLO and Visual Rhythm for Vehicle Counting

    Authors: Victor Nascimento Ribeiro, Nina S. T. Hirata

    Abstract: Video-based vehicle detection and counting play a critical role in managing transport infrastructure. Traditional image-based counting methods usually involve two main steps: initial detection and subsequent tracking, which are applied to all video frames, leading to a significant increase in computational complexity. To address this issue, this work presents an alternative and more efficient meth… ▽ More

    Submitted 8 January, 2025; originally announced January 2025.

    Comments: Accepted for presentation at the Conference on Graphics, Patterns and Images (SIBGRAPI) 2023

  5. arXiv:2501.02270  [pdf, other

    cs.CV cs.LG eess.IV

    Efficient Video-Based ALPR System Using YOLO and Visual Rhythm

    Authors: Victor Nascimento Ribeiro, Nina S. T. Hirata

    Abstract: Automatic License Plate Recognition (ALPR) involves extracting vehicle license plate information from image or a video capture. These systems have gained popularity due to the wide availability of low-cost surveillance cameras and advances in Deep Learning. Typically, video-based ALPR systems rely on multiple frames to detect the vehicle and recognize the license plates. Therefore, we propose a sy… ▽ More

    Submitted 8 January, 2025; v1 submitted 4 January, 2025; originally announced January 2025.

    Comments: Accepted to CVPR 2024

  6. Understanding attention-based encoder-decoder networks: a case study with chess scoresheet recognition

    Authors: Sergio Y. Hayashi, Nina S. T. Hirata

    Abstract: Deep neural networks are largely used for complex prediction tasks. There is plenty of empirical evidence of their successful end-to-end training for a diversity of tasks. Success is often measured based solely on the final performance of the trained network, and explanations on when, why and how they work are less emphasized. In this paper we study encoder-decoder recurrent neural networks with a… ▽ More

    Submitted 23 April, 2024; originally announced June 2024.

    Comments: This work was accepted and published in the 2022 26th International Conference on Pattern Recognition (ICPR)

    Journal ref: 2022 26th International Conference on Pattern Recognition (ICPR)

  7. arXiv:2404.09925  [pdf, other

    astro-ph.CO astro-ph.IM

    The Quasar Catalogue for S-PLUS DR4 (QuCatS) and the estimation of photometric redshifts

    Authors: L. Nakazono, R. R. Valença, G. Soares, R. Izbicki, Ž. Ivezić, E. V. R. Lima, N. S. T. Hirata, L. Sodré Jr., R. Overzier, F. Almeida-Fernandes, G. B. Oliveira Schwarz, W. Schoenell, A. Kanaan, T. Ribeiro, C. Mendes de Oliveira

    Abstract: The advent of massive broad-band photometric surveys enabled photometric redshift estimates for unprecedented numbers of galaxies and quasars. These estimates can be improved using better algorithms or by obtaining complementary data such as narrow-band photometry, and broad-band photometry over an extended wavelength range. We investigate the impact of both approaches on photometric redshifts for… ▽ More

    Submitted 23 August, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Journal ref: Monthly Notices of the Royal Astronomical Society, 2024, 531, 327-339

  8. arXiv:2202.12941  [pdf, other

    eess.SP cs.LG nucl-ex physics.comp-ph physics.data-an physics.ins-det

    Digital Signal Analysis based on Convolutional Neural Networks for Active Target Time Projection Chambers

    Authors: G. F. Fortino, J. C. Zamora, L. E. Tamayose, N. S. T. Hirata, V. Guimaraes

    Abstract: An algorithm for digital signal analysis using convolutional neural networks (CNN) was developed in this work. The main objective of this algorithm is to make the analysis of experiments with active target time projection chambers more efficient. The code is divided in three steps: baseline correction, signal deconvolution and peak detection and integration. The CNNs were able to learn the signal… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

  9. arXiv:2106.11986  [pdf, other

    astro-ph.GA astro-ph.CO astro-ph.IM astro-ph.SR

    On the discovery of stars, quasars, and galaxies in the Southern Hemisphere with S-PLUS DR2

    Authors: L. Nakazono, C. Mendes de Oliveira, N. S. T. Hirata, S. Jeram, C. Queiroz, Stephen S. Eikenberry, A. H. Gonzalez, R. Abramo, R. Overzier, M. Espadoto, A. Martinazzo, L. Sampedro, F. R. Herpich, F. Almeida-Fernandes, A. Werle, C. E. Barbosa, L. Sodré Jr., E. V. Lima, M. L. Buzzo, A. Cortesi, K. Menéndez-Delmestre, S. Akras, Alvaro Alvarez-Candal, A. R. Lopes, E. Telles , et al. (3 additional authors not shown)

    Abstract: This paper provides a catalogue of stars, quasars, and galaxies for the Southern Photometric Local Universe Survey Data Release 2 (S-PLUS DR2) in the Stripe 82 region. We show that a 12-band filter system (5 Sloan-like and 7 narrow bands) allows better performance for object classification than the usual analysis based solely on broad bands (regardless of infrared information). Moreover, we show t… ▽ More

    Submitted 4 November, 2021; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: 27 pages, 22 figures. Updated to reflect the published version. Data products are available in https://splus.cloud/ website

    Journal ref: Monthly Notices of the Royal Astronomical Society, 2021, 507, 5847-5868

  10. arXiv:2004.11336  [pdf, other

    cs.CV

    Self-supervised Learning for Astronomical Image Classification

    Authors: Ana Martinazzo, Mateus Espadoto, Nina S. T. Hirata

    Abstract: In Astronomy, a huge amount of image data is generated daily by photometric surveys, which scan the sky to collect data from stars, galaxies and other celestial objects. In this paper, we propose a technique to leverage unlabeled astronomical images to pre-train deep convolutional neural networks, in order to learn a domain-specific feature extractor which improves the results of machine learning… ▽ More

    Submitted 25 June, 2020; v1 submitted 23 April, 2020; originally announced April 2020.

    Comments: Accepted for ICPR 2020

  11. arXiv:1912.06199  [pdf, other

    cs.CV cs.LG

    Greenery Segmentation In Urban Images By Deep Learning

    Authors: Artur A. M. Oliveira, Nina S. T. Hirata, Roberto Hirata Jr

    Abstract: Vegetation is a relevant feature in the urban scenery and its awareness can be measured in an image by the Green View Index (GVI). Previous approaches to estimate the GVI were based upon heuristics image processing approaches and recently by deep learning networks (DLN). By leveraging some recent DLN architectures tuned to the image segmentation problem and exploiting a weighting strategy in the l… ▽ More

    Submitted 12 December, 2019; originally announced December 2019.

    Comments: Supplemental material can be found at http://greenery_data.arturao.org/

    MSC Class: I.4.6; I.5.4 ACM Class: I.4.6; I.5.4

  12. arXiv:1907.01567  [pdf, other

    astro-ph.GA astro-ph.CO astro-ph.HE astro-ph.IM astro-ph.SR

    The Southern Photometric Local Universe Survey (S-PLUS): improved SEDs, morphologies and redshifts with 12 optical filters

    Authors: C. Mendes de Oliveira, T. Ribeiro, W. Schoenell, A. Kanaan, R. A. Overzier, A. Molino, L. Sampedro, P. Coelho, C. E. Barbosa, A. Cortesi, M. V. Costa-Duarte, F. R. Herpich, J. A. Hernandez-Jimenez, V. M. Placco, H. S. Xavier, L. R. Abramo, R. K. Saito, A. L. Chies-Santos, A. Ederoclite, R. Lopes de Oliveira, D. R. Gonçalves, S. Akras, L. A. Almeida, F. Almeida-Fernandes, T. C. Beers , et al. (120 additional authors not shown)

    Abstract: The Southern Photometric Local Universe Survey (S-PLUS) is imaging ~9300 deg^2 of the celestial sphere in twelve optical bands using a dedicated 0.8 m robotic telescope, the T80-South, at the Cerro Tololo Inter-American Observatory, Chile. The telescope is equipped with a 9.2k by 9.2k e2v detector with 10 um pixels, resulting in a field-of-view of 2 deg^2 with a plate scale of 0.55"/pixel. The sur… ▽ More

    Submitted 2 September, 2019; v1 submitted 2 July, 2019; originally announced July 2019.

    Comments: Updated to reflect the published version (MNRAS, 489, 241). For a short introductory video of the S-PLUS project, see https://youtu.be/yc5kHrHU9Jk - The S-PLUS Data Release 1 is available at http://datalab.noao.edu/splus

  13. arXiv:1902.07958  [pdf, other

    cs.LG stat.ML

    Deep Learning Multidimensional Projections

    Authors: Mateus Espadoto, Nina S. T. Hirata, Alexandru C. Telea

    Abstract: Dimensionality reduction methods, also known as projections, are frequently used for exploring multidimensional data in machine learning, data science, and information visualization. Among these, t-SNE and its variants have become very popular for their ability to visually separate distinct data clusters. However, such methods are computationally expensive for large datasets, suffer from stability… ▽ More

    Submitted 21 February, 2019; originally announced February 2019.

  14. arXiv:1712.04833  [pdf, other

    cs.CV

    Symbol detection in online handwritten graphics using Faster R-CNN

    Authors: Frank D. Julca-Aguilar, Nina S. T. Hirata

    Abstract: Symbol detection techniques in online handwritten graphics (e.g. diagrams and mathematical expressions) consist of methods specifically designed for a single graphic type. In this work, we evaluate the Faster R-CNN object detection algorithm as a general method for detection of symbols in handwritten graphics. We evaluate different configurations of the Faster R-CNN method, and point out issues re… ▽ More

    Submitted 13 December, 2017; originally announced December 2017.

    Comments: Submitted to DAS-2018

  15. arXiv:1709.06476  [pdf, other

    cs.CV

    Image operator learning coupled with CNN classification and its application to staff line removal

    Authors: Frank D. Julca-Aguilar, Nina S. T. Hirata

    Abstract: Many image transformations can be modeled by image operators that are characterized by pixel-wise local functions defined on a finite support window. In image operator learning, these functions are estimated from training data using machine learning techniques. Input size is usually a critical issue when using learning algorithms, and it limits the size of practicable windows. We propose the use o… ▽ More

    Submitted 19 September, 2017; originally announced September 2017.

    Comments: To appear in ICDAR 2017

  16. arXiv:1709.06389  [pdf, other

    cs.CV

    A General Framework for the Recognition of Online Handwritten Graphics

    Authors: Frank Julca-Aguilar, Harold Mouchère, Christian Viard-Gaudin, Nina S. T. Hirata

    Abstract: We propose a new framework for the recognition of online handwritten graphics. Three main features of the framework are its ability to treat symbol and structural level information in an integrated way, its flexibility with respect to different families of graphics, and means to control the tradeoff between recognition effectiveness and computational cost. We model a graphic as a labeled graph gen… ▽ More

    Submitted 19 September, 2017; originally announced September 2017.

    Comments: Submitted to TPAMI