
Showing 1–13 of 13 results for author: Rosa, K

  1. arXiv:2504.06272  [pdf, other]

    cs.IR cs.AI

    RAVEN: An Agentic Framework for Multimodal Entity Discovery from Large-Scale Video Collections

    Authors: Kevin Dela Rosa

    Abstract: We present RAVEN, an adaptive AI agent framework designed for multimodal entity discovery and retrieval in large-scale video collections. Synthesizing information across visual, audio, and textual modalities, RAVEN autonomously processes video data to produce structured, actionable representations for downstream tasks. Key contributions include (1) a category understanding step to infer video theme…

    Submitted 3 March, 2025; originally announced April 2025.

    Comments: Presented at AI Agent for Information Retrieval: Generating and Ranking (Agent4IR) @ AAAI 2025 [https://sites.google.com/view/ai4ir/aaai-2025]

  2. arXiv:2501.00290  [pdf, ps, other]

    math.FA

    Zero-dilation indices and numerical ranges

    Authors: Kennett L. Dela Rosa

    Abstract: The zero-dilation index $d(A)$ of a matrix $A$ is the largest integer $k$ for which $\begin{bmatrix}0_k& *\\ * & *\end{bmatrix}$ is unitarily similar to $A$. In this study, the zero-dilation indices of certain block matrices are considered, namely, the block matrix analogues of companion matrices and upper triangular KMS matrices, respectively shown as \[\mathcal{C}=\begin{bmatrix} 0& \bigoplus_{…

    Submitted 31 December, 2024; originally announced January 2025.

    Comments: 25 pages

    MSC Class: 15A45; 15A60; 15B99; 47A12; 47A20

  3. arXiv:2409.13339  [pdf, ps, other]

    math.RA

    On commutators of unipotent matrices of index 2

    Authors: Kennett L. Dela Rosa, Juan Paolo C. Santos

    Abstract: A commutator of unipotent matrices of index 2 is a matrix of the form $XYX^{-1}Y^{-1}$, where $X$ and $Y$ are unipotent matrices of index 2, that is, $X\ne I_n$, $Y\ne I_n$, and $(X-I_n)^2=(Y-I_n)^2=0_n$. If $n>2$ and $\mathbb F$ is a field with $|\mathbb F|\geq 4$, then it is shown that every $n\times n$ matrix over $\mathbb F$ with determinant 1 is a product of at most four commutators of unipot…

    Submitted 20 September, 2024; originally announced September 2024.

    Comments: 23 pages

    MSC Class: 15A21; 15A23; 15B33; 15B99; 20H20

  4. arXiv:2407.14625  [pdf, other]

    eess.SP

    Benchmarking deep learning models for bearing fault diagnosis using the CWRU dataset: A multi-label approach

    Authors: Rodrigo Kobashikawa Rosa, Danilo Braga, Danilo Silva

    Abstract: This paper proposes a novel approach for modeling the problem of fault diagnosis using the Case Western Reserve University (CWRU) bearing fault dataset. Although the dataset is considered a standard reference for testing new algorithms, the typical dataset division suffers from data leakage, as shown by Hendriks et al. (2022) and Abburi et al. (2023), leading to papers reporting over-optimistic re…

    Submitted 19 July, 2024; originally announced July 2024.

  5. arXiv:2405.17706  [pdf, other]

    cs.AI cs.CV cs.IR

    Video Enriched Retrieval Augmented Generation Using Aligned Video Captions

    Authors: Kevin Dela Rosa

    Abstract: In this work, we propose the use of "aligned visual captions" as a mechanism for integrating information contained within videos into retrieval augmented generation (RAG) based chat assistant systems. These captions are able to describe the visual and audio content of videos in a large corpus while having the advantage of being in a textual format that is both easy to reason about & incorporate in…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: SIGIR 2024 Workshop on Multimodal Representation and Retrieval (MRR 2024)

  6. arXiv:2312.01671  [pdf, other]

    cs.CV

    Multimodality-guided Image Style Transfer using Cross-modal GAN Inversion

    Authors: Hanyu Wang, Pengxiang Wu, Kevin Dela Rosa, Chen Wang, Abhinav Shrivastava

    Abstract: Image Style Transfer (IST) is an interdisciplinary topic of computer vision and art that continuously attracts researchers' interests. Different from traditional Image-guided Image Style Transfer (IIST) methods that require a style reference image as input to define the desired style, recent works start to tackle the problem in a text-guided manner, i.e., Text-guided Image Style Transfer (TIST). C…

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: WACV 2024. Project website: https://hywang66.github.io/mmist/

  7. arXiv:2309.16249  [pdf, other]

    cs.CV

    FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding

    Authors: Pengxiang Wu, Siman Wang, Kevin Dela Rosa, Derek Hao Hu

    Abstract: Image retrieval is a fundamental task in computer vision. Despite recent advances in this field, many techniques have been evaluated on a limited number of domains, with a small number of instance categories. Notably, most existing works only consider domains like 3D landmarks, making it difficult to generalize the conclusions made by these works to other domains, e.g., logo and other 2D flat obje…

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: NeurIPS 2023 Datasets and Benchmarks Track

  8. arXiv:2011.10678  [pdf, other]

    cs.CV cs.AI cs.LG

    Open-Vocabulary Object Detection Using Captions

    Authors: Alireza Zareian, Kevin Dela Rosa, Derek Hao Hu, Shih-Fu Chang

    Abstract: Despite the remarkable accuracy of deep neural networks in object detection, they are costly to train and scale due to supervision requirements. Particularly, learning more object categories typically requires proportionally more bounding box annotations. Weakly supervised and zero-shot learning techniques have been explored to scale object detectors to more categories with less supervision, but t…

    Submitted 14 March, 2021; v1 submitted 20 November, 2020; originally announced November 2020.

    Comments: To be presented at CVPR 2021 (oral paper)

  9. arXiv:2004.05288  [pdf, other]

    math.FA math.CO

    Location of Ritz values in the numerical range of normal matrices

    Authors: Kennett L. Dela Rosa, Hugo J. Woerdeman

    Abstract: Let $μ_1$ be a complex number in the numerical range $W(A)$ of a normal matrix $A$. In the case when no eigenvalues of $A$ lie in the interior of $W(A)$, we identify the smallest convex region containing all possible complex numbers $μ_2$ for which $\begin{bmatrix}μ_1& *\\0& μ_2\end{bmatrix}$ is a $2$-by-$2$ compression of $A$.

    Submitted 9 May, 2020; v1 submitted 10 April, 2020; originally announced April 2020.

    Comments: 32 pages

    MSC Class: 15A18; 15A29; 15A60; 47A12; 47A20

  10. arXiv:2002.05069  [pdf]

    q-bio.PE q-bio.QM

    Real-time forecasts of the 2019-nCoV epidemic in China from February 5th to February 24th, 2020

    Authors: K. Roosa, Y. Lee, R. Luo, A. Kirpich, R. Rothenberg, J. M. Hyman, P. Yan, G. Chowell

    Abstract: The initial cluster of severe pneumonia cases that triggered the 2019-nCoV epidemic was identified in Wuhan, China in December 2019. While early cases of the disease were linked to a wet market, human-to-human transmission has driven the rapid spread of the virus throughout China. The ongoing outbreak presents a challenge for modelers, as limited data are available on the early growth trajectory,…

    Submitted 12 February, 2020; originally announced February 2020.

    Comments: 6 figures

  11. arXiv:1806.00941  [pdf, ps, other]

    math.GR

    Bounds for Finite Semiprimitive Permutation Groups: Order, Base Size, and Minimal Degree

    Authors: Luke Morgan, Cheryl E. Praeger, Kyle Rosa

    Abstract: In this paper we study finite semiprimitive permutation groups, that is, groups in which each normal subgroup is transitive or semiregular. We give bounds on the order, base size, minimal degree, fixity, and chief length of an arbitrary finite semiprimitive group in terms of its degree. To establish these bounds, we classify finite semiprimitive groups that induce the alternating or symmetric grou…

    Submitted 3 June, 2018; originally announced June 2018.

    MSC Class: 20B15; 20H30; 20B05

  12. arXiv:1712.05520  [pdf, ps, other]

    math.GR

    Bounding the composition length of primitive permutation groups and completely reducible linear groups

    Authors: S. P. Glasby, Cheryl E. Praeger, Kyle Rosa, Gabriel Verret

    Abstract: We obtain upper bounds on the composition length of a finite permutation group in terms of the degree and the number of orbits, and analogous bounds for primitive, quasiprimitive and semiprimitive groups. Similarly, we obtain upper bounds on the composition length of a finite completely reducible linear group in terms of some of its parameters. In almost all cases we show that the bounds are sharp…

    Submitted 14 March, 2018; v1 submitted 14 December, 2017; originally announced December 2017.

    Comments: 23 pages; a few minor corrections following the referee's comments

    MSC Class: 20B15; 20H30; 20B05

  13. The Advection of Supergranules by the Sun's Axisymmetric Flows

    Authors: David H. Hathaway, Peter E. Williams, Kevin Dela Rosa, Manfred Cuntz

    Abstract: We show that the motions of supergranules are consistent with a model in which they are simply advected by the axisymmetric flows in the Sun's surface shear layer. We produce a 10-day series of simulated Doppler images at a 15-minute cadence that reproduces most spatial and temporal characteristics seen in the SOHO/MDI Doppler data. Our simulated data have a spectrum of cellular flows with just tw…

    Submitted 25 August, 2010; originally announced August 2010.

    Comments: 15 pages, 8 figures