Skip to main content

Showing 1–1 of 1 results for author: Hsu, T E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.19353  [pdf, other

    cs.CL cs.AI cs.CV

    Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SciCap Challenge 2023

    Authors: Ting-Yao E. Hsu, Yi-Li Hsu, Shaurya Rohatgi, Chieh-Yang Huang, Ho Yin Sam Ng, Ryan Rossi, Sungchul Kim, Tong Yu, Lun-Wei Ku, C. Lee Giles, Ting-Hao K. Huang

    Abstract: Since the SciCap datasets launch in 2021, the research community has made significant progress in generating captions for scientific figures in scholarly articles. In 2023, the first SciCap Challenge took place, inviting global teams to use an expanded SciCap dataset to develop models for captioning diverse figure types across various academic fields. At the same time, text generation models advan… ▽ More

    Submitted 18 February, 2025; v1 submitted 31 January, 2025; originally announced January 2025.

    Comments: Accepted to TACL 2025