Showing 1–26 of 26 results for author: Latecki, L J

Searching in archive cs.

  1. arXiv:2504.08166  [pdf, other]

    cs.CV

    Learning Object Focused Attention

    Authors: Vivek Trivedy, Amani Almalki, Longin Jan Latecki

    Abstract: We propose an adaptation to the training of Vision Transformers (ViTs) that allows for an explicit modeling of objects during the attention computation. This is achieved by adding a new branch to selected attention layers that computes an auxiliary loss which we call the object-focused attention (OFA) loss. We restrict the attention to image patches that belong to the same object class, which allo…

    Submitted 10 April, 2025; originally announced April 2025.

  2. arXiv:2504.04616  [pdf, other]

    cs.CL

    DynClean: Training Dynamics-based Label Cleaning for Distantly-Supervised Named Entity Recognition

    Authors: Qi Zhang, Huitong Pan, Zhijia Chen, Longin Jan Latecki, Cornelia Caragea, Eduard Dragut

    Abstract: Distantly Supervised Named Entity Recognition (DS-NER) has attracted attention due to its scalability and ability to automatically generate labeled data. However, distant annotation introduces many mislabeled instances, limiting its performance. Most of the existing work attempt to solve this problem by developing intricate models to learn from the noisy labels. An alternative approach is to attem…

    Submitted 6 April, 2025; originally announced April 2025.

    Comments: Accepted to NAACL2025-Findings

  3. arXiv:2410.21155  [pdf, other]

    cs.CL

    SciER: An Entity and Relation Extraction Dataset for Datasets, Methods, and Tasks in Scientific Documents

    Authors: Qi Zhang, Zhijia Chen, Huitong Pan, Cornelia Caragea, Longin Jan Latecki, Eduard Dragut

    Abstract: Scientific information extraction (SciIE) is critical for converting unstructured knowledge from scholarly articles into structured data (entities and relations). Several datasets have been proposed for training and validating SciIE models. However, due to the high complexity and cost of annotating scientific texts, those datasets restrict their annotations to specific parts of paper, such as abst…

    Submitted 28 October, 2024; originally announced October 2024.

    Comments: EMNLP2024 Main

  4. arXiv:2407.05183  [pdf, other]

    cs.CV cs.AI

    FlowLearn: Evaluating Large Vision-Language Models on Flowchart Understanding

    Authors: Huitong Pan, Qi Zhang, Cornelia Caragea, Eduard Dragut, Longin Jan Latecki

    Abstract: Flowcharts are graphical tools for representing complex concepts in concise visual representations. This paper introduces the FlowLearn dataset, a resource tailored to enhance the understanding of flowcharts. FlowLearn contains complex scientific flowcharts and simulated flowcharts. The scientific subset contains 3,858 flowcharts sourced from scientific literature and the simulated subset contains…

    Submitted 9 July, 2024; v1 submitted 6 July, 2024; originally announced July 2024.

    Comments: ECAI 2024

  5. arXiv:2406.14756  [pdf, other]

    cs.AI

    SciDMT: A Large-Scale Corpus for Detecting Scientific Mentions

    Authors: Huitong Pan, Qi Zhang, Cornelia Caragea, Eduard Dragut, Longin Jan Latecki

    Abstract: We present SciDMT, an enhanced and expanded corpus for scientific mention detection, offering a significant advancement over existing related resources. SciDMT contains annotated scientific documents for datasets (D), methods (M), and tasks (T). The corpus consists of two components: 1) the SciDMT main corpus, which includes 48 thousand scientific articles with over 1.8 million weakly annotated me…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: LREC/COLING 2024

    MSC Class: I.2.7

    Journal ref: LREC-COLING. (2024) 14407-14417

  6. arXiv:2403.14559  [pdf, other]

    cs.CV

    VAPO: Visibility-Aware Keypoint Localization for Efficient 6DoF Object Pose Estimation

    Authors: Ruyi Lian, Yuewei Lin, Longin Jan Latecki, Haibin Ling

    Abstract: Localizing predefined 3D keypoints in a 2D image is an effective way to establish 3D-2D correspondences for 6DoF object pose estimation. However, unreliable localization results of invisible keypoints degrade the quality of correspondences. In this paper, we address this issue by localizing the important keypoints in terms of visibility. Since keypoint visibility information is currently missing i…

    Submitted 18 February, 2025; v1 submitted 21 March, 2024; originally announced March 2024.

  7. arXiv:2306.10623  [pdf, other]

    cs.CV

    Enhanced Masked Image Modeling for Analysis of Dental Panoramic Radiographs

    Authors: Amani Almalki, Longin Jan Latecki

    Abstract: The computer-assisted radiologic informative report has received increasing research attention to facilitate diagnosis and treatment planning for dental care providers. However, manual interpretation of dental images is limited, expensive, and time-consuming. Another barrier in dental imaging is the limited number of available images for training, which is a challenge in the era of deep learning.…

    Submitted 18 June, 2023; originally announced June 2023.

  8. DMDD: A Large-Scale Dataset for Dataset Mentions Detection

    Authors: Huitong Pan, Qi Zhang, Eduard Dragut, Cornelia Caragea, Longin Jan Latecki

    Abstract: The recognition of dataset names is a critical task for automatic information extraction in scientific literature, enabling researchers to understand and identify research opportunities. However, existing corpora for dataset mention detection are limited in size and naming diversity. In this paper, we introduce the Dataset Mentions Detection Dataset (DMDD), the largest publicly available corpus fo…

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: Pre-MIT Press publication version. Submitted to TACL

    ACM Class: I.2.7

    Journal ref: Transactions of the Association for Computational Linguistics. 11 (2023) 1132-1146

  9. Graph Convolutional Networks based on Manifold Learning for Semi-Supervised Image Classification

    Authors: Lucas Pascotti Valem, Daniel Carlos Guimarães Pedronette, Longin Jan Latecki

    Abstract: Due to a huge volume of information in many domains, the need for classification methods is imperious. In spite of many advances, most of the approaches require a large amount of labeled data, which is often not available, due to costs and difficulties of manual labeling processes. In this scenario, unsupervised and semi-supervised approaches have been gaining increasing attention. The GCNs (Graph…

    Submitted 24 April, 2023; originally announced April 2023.

  10. arXiv:2304.12448  [pdf, other]

    cs.CV cs.IR cs.LG

    Rank Flow Embedding for Unsupervised and Semi-Supervised Manifold Learning

    Authors: Lucas Pascotti Valem, Daniel Carlos Guimarães Pedronette, Longin Jan Latecki

    Abstract: Impressive advances in acquisition and sharing technologies have made the growth of multimedia collections and their applications almost unlimited. However, the opposite is true for the availability of labeled data, which is needed for supervised training, since such data is often expensive and time-consuming to obtain. While there is a pressing need for the development of effective retrieval and…

    Submitted 24 April, 2023; originally announced April 2023.

  11. arXiv:2212.00847  [pdf, other]

    cs.CV cs.AI

    Weakly Supervised Annotations for Multi-modal Greeting Cards Dataset

    Authors: Sidra Hanif, Longin Jan Latecki

    Abstract: In recent years, there is a growing number of pre-trained models trained on a large corpus of data and yielding good performance on various tasks such as classifying multimodal datasets. These models have shown good performance on natural images but are not fully explored for scarce abstract concepts in images. In this work, we introduce an image/text-based dataset called Greeting Cards. Dataset (…

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: Accepted for poster presentation at Pretrain@WACV 2023

  12. arXiv:2210.11404  [pdf, other]

    cs.CV

    Self-Supervised Learning with Masked Image Modeling for Teeth Numbering, Detection of Dental Restorations, and Instance Segmentation in Dental Panoramic Radiographs

    Authors: Amani Almalki, Longin Jan Latecki

    Abstract: The computer-assisted radiologic informative report is currently emerging in dental practice to facilitate dental care and reduce time consumption in manual panoramic radiographic interpretation. However, the amount of dental radiographs for training is very limited, particularly from the point of view of deep learning. This study aims to utilize recent self-supervised learning methods like SimMIM…

    Submitted 20 October, 2022; originally announced October 2022.

  13. arXiv:2209.13933  [pdf, other]

    cs.CV

    DPNet: Dual-Path Network for Real-time Object Detection with Lightweight Attention

    Authors: Quan Zhou, Huimin Shi, Weikang Xiang, Bin Kang, Xiaofu Wu, Longin Jan Latecki

    Abstract: The recent advances of compressing high-accuracy convolution neural networks (CNNs) have witnessed remarkable progress for real-time object detection. To accelerate detection speed, lightweight detectors always have few convolution layers using single-path backbone. Single-path architecture, however, involves continuous pooling and downsampling operations, always resulting in coarse and inaccurate…

    Submitted 28 September, 2022; originally announced September 2022.

  14. arXiv:2111.00509  [pdf, other]

    cs.CV

    DRBANET: A Lightweight Dual-Resolution Network for Semantic Segmentation with Boundary Auxiliary

    Authors: Linjie Wang, Quan Zhou, Chenfeng Jiang, Xiaofu Wu, Longin Jan Latecki

    Abstract: Due to the powerful ability to encode image details and semantics, many lightweight dual-resolution networks have been proposed in recent years. However, most of them ignore the benefit of boundary information. This paper introduces a lightweight dual-resolution network, called DRBANet, aiming to refine semantic segmentation results with the aid of boundary information. DRBANet adopts dual paralle…

    Submitted 31 October, 2021; originally announced November 2021.

  15. arXiv:2111.00500  [pdf, other]

    cs.CV

    DPNET: Dual-Path Network for Efficient Object Detection with Lightweight Self-Attention

    Authors: Huimin Shi, Quan Zhou, Yinghao Ni, Xiaofu Wu, Longin Jan Latecki

    Abstract: Object detection often costs a considerable amount of computation to get satisfied performance, which is unfriendly to be deployed in edge devices. To address the trade-off between computational cost and detection accuracy, this paper presents a dual path network, named DPNet, for efficient object detection with lightweight self-attention. In backbone, a single input/output lightweight self-attent…

    Submitted 31 October, 2021; originally announced November 2021.

  16. arXiv:2011.01163  [pdf, other]

    cs.CV cs.RO

    Pushing the Envelope of Rotation Averaging for Visual SLAM

    Authors: Xinyi Li, Lin Yuan, Longin Jan Latecki, Haibin Ling

    Abstract: As an essential part of structure from motion (SfM) and Simultaneous Localization and Mapping (SLAM) systems, motion averaging has been extensively studied in the past years and continues to attract surging research attention. While canonical approaches such as bundle adjustment are predominantly inherited in most of state-of-the-art SLAM systems to estimate and update the trajectory in the robot…

    Submitted 2 November, 2020; originally announced November 2020.

  17. arXiv:2003.09669  [pdf, other]

    cs.CV eess.IV

    BiCANet: Bi-directional Contextual Aggregating Network for Image Semantic Segmentation

    Authors: Quan Zhou, Dechun Cong, Bin Kang, Xiaofu Wu, Baoyu Zheng, Huimin Lu, Longin Jan Latecki

    Abstract: Exploring contextual information in convolution neural networks (CNNs) has gained substantial attention in recent years for semantic segmentation. This paper introduces a Bi-directional Contextual Aggregating Network, called BiCANet, for semantic segmentation. Unlike previous approaches that encode context in feature space, BiCANet aggregates contextual cues from a categorical perspective, which i…

    Submitted 21 March, 2020; originally announced March 2020.

  18. arXiv:2002.01690  [pdf, other]

    cs.LG cs.CV stat.ML

    Entropy Minimization vs. Diversity Maximization for Domain Adaptation

    Authors: Xiaofu Wu, Suofei Zhang, Quan Zhou, Zhen Yang, Chunming Zhao, Longin Jan Latecki

    Abstract: Entropy minimization has been widely used in unsupervised domain adaptation (UDA). However, existing works reveal that entropy minimization only may result into collapsed trivial solutions. In this paper, we propose to avoid trivial solutions by further introducing diversity maximization. In order to achieve the possible minimum target risk for UDA, we show that diversity maximization should be el…

    Submitted 5 February, 2020; originally announced February 2020.

    Comments: submitted to IEEE T-IP

  19. arXiv:1912.03730  [pdf, other]

    cs.CV

    Dually Supervised Feature Pyramid for Object Detection and Segmentation

    Authors: Fan Yang, Cheng Lu, Yandong Guo, Longin Jan Latecki, Haibin Ling

    Abstract: Feature pyramid architecture has been broadly adopted in object detection and segmentation to deal with multi-scale problem. However, in this paper we show that the capacity of the architecture has not been fully explored due to the inadequate utilization of the supervision information. Such insufficient utilization is caused by the supervision signal degradation in back propagation. Thus inspired…

    Submitted 13 December, 2019; v1 submitted 8 December, 2019; originally announced December 2019.

  20. arXiv:1905.02423  [pdf, other]

    cs.CV

    LEDNet: A Lightweight Encoder-Decoder Network for Real-Time Semantic Segmentation

    Authors: Yu Wang, Quan Zhou, Jia Liu, Jian Xiong, Guangwei Gao, Xiaofu Wu, Longin Jan Latecki

    Abstract: The extensive computational burden limits the usage of CNNs in mobile devices for dense estimation tasks. In this paper, we present a lightweight network to address this problem, namely LEDNet, which employs an asymmetric encoder-decoder architecture for the task of real-time semantic segmentation. More specifically, the encoder adopts a ResNet as backbone network, where two new operations, channel…

    Submitted 13 May, 2019; v1 submitted 7 May, 2019; originally announced May 2019.

    Comments: 5 pages, 3 figures, 3 tables, accepted in IEEE ICIP 2019

  21. arXiv:1811.04778  [pdf, other]

    cs.CV

    Scene Parsing via Dense Recurrent Neural Networks with Attentional Selection

    Authors: Heng Fan, Peng Chu, Longin Jan Latecki, Haibin Ling

    Abstract: Recurrent neural networks (RNNs) have shown the ability to improve scene parsing through capturing long-range dependencies among image units. In this paper, we propose dense RNNs for scene labeling by exploring various long-range semantic dependencies among image units. Different from existing RNN based approaches, our dense RNNs are able to capture richer contextual dependencies for each image un…

    Submitted 8 November, 2018; originally announced November 2018.

    Comments: 10 pages. arXiv admin note: substantial text overlap with arXiv:1801.06831

  22. arXiv:1804.08187  [pdf, ps, other]

    cs.AI

    Advancing Tabu and Restart in Local Search for Maximum Weight Cliques

    Authors: Yi Fan, Nan Li, Chengqian Li, Zongjie Ma, Longin Jan Latecki, Kaile Su

    Abstract: The tabu and restart are two fundamental strategies for local search. In this paper, we improve the local search algorithms for solving the Maximum Weight Clique (MWC) problem by introducing new tabu and restart strategies. Both the tabu and restart strategies proposed are based on the notion of a local search scenario, which involves not only a candidate solution but also the tabu status and unlo…

    Submitted 22 April, 2018; originally announced April 2018.

  23. arXiv:1605.00286  [pdf, other]

    cs.CV

    Multidimensional Scaling on Multiple Input Distance Matrices

    Authors: Song Bai, Xiang Bai, Longin Jan Latecki, Qi Tian

    Abstract: Multidimensional Scaling (MDS) is a classic technique that seeks vectorial representations for data points, given the pairwise distances between them. However, in recent years, data are usually collected from diverse sources or have multiple heterogeneous representations. How to do multidimensional scaling on multiple input distance matrices is still unsolved to our best knowledge. In this paper,…

    Submitted 25 August, 2017; v1 submitted 1 May, 2016; originally announced May 2016.

  24. arXiv:1604.01879  [pdf, other]

    cs.CV

    GIFT: A Real-time and Scalable 3D Shape Search Engine

    Authors: Song Bai, Xiang Bai, Zhichao Zhou, Zhaoxiang Zhang, Longin Jan Latecki

    Abstract: Projective analysis is an important solution for 3D shape retrieval, since human visual perceptions of 3D shapes rely on various 2D observations from different view points. Although multiple informative and discriminative views are utilized, most projection-based retrieval systems suffer from heavy computational cost, thus cannot satisfy the basic requirement of scalability for search engines. In…

    Submitted 31 March, 2017; v1 submitted 7 April, 2016; originally announced April 2016.

    Comments: accepted by CVPR16, achieved the first place in Shrec2016 competition: Large-Scale 3D Shape Retrieval under the perturbed case

  25. arXiv:1303.2643  [pdf, ps, other]

    cs.LG cs.GT

    Revealing Cluster Structure of Graph by Path Following Replicator Dynamic

    Authors: Hairong Liu, Longin Jan Latecki, Shuicheng Yan

    Abstract: In this paper, we propose a path following replicator dynamic, and investigate its potentials in uncovering the underlying cluster structure of a graph. The proposed dynamic is a generalization of the discrete replicator dynamic. The replicator dynamic has been successfully used to extract dense clusters of graphs; however, it is often sensitive to the degree distribution of a graph, and usually b…

    Submitted 11 March, 2013; originally announced March 2013.

  26. arXiv:1109.2361  [pdf, other]

    cs.CG cs.CC math.NA math.OC

    Spherical coverage verification

    Authors: Marko D. Petkovic, Dragoljub Pokrajac, Longin Jan Latecki

    Abstract: We consider the problem of covering hypersphere by a set of spherical hypercaps. This sort of problem has numerous practical applications such as error correcting codes and reverse k-nearest neighbor problem. Using the reduction of non degenerated concave quadratic programming (QP) problem, we demonstrate that spherical coverage verification is NP hard. We propose a recursive algorithm based on re…

    Submitted 11 September, 2011; originally announced September 2011.