Skip to main content

Showing 1–2 of 2 results for author: Andecy, V P d

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.01054  [pdf, other

    cs.DB cs.IR

    CHIC: Corporate Document for Visual question Answering

    Authors: Ibrahim Souleiman Mahamoud, Mickael Coustaty, Aurelie Joseph, Vincent Poulain d Andecy, Jean-Marc Ogier

    Abstract: The massive use of digital documents due to the substantial trend of paperless initiatives confronted some companies to find ways to process thousands of documents per day automatically. To achieve this, they use automatic information retrieval (IR) allowing them to extract useful information from large datasets quickly. In order to have effective IR methods, it is first necessary to have an adequ… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

  2. arXiv:2007.07547  [pdf, other

    cs.CV cs.LG

    Evaluation of Neural Network Classification Systems on Document Stream

    Authors: Joris Voerman, Aurelie Joseph, Mickael Coustaty, Vincent Poulain d Andecy, Jean-Marc Ogier

    Abstract: One major drawback of state of the art Neural Networks (NN)-based approaches for document classification purposes is the large number of training samples required to obtain an efficient classification. The minimum required number is around one thousand annotated documents for each class. In many cases it is very difficult, if not impossible, to gather this number of samples in real industrial proc… ▽ More

    Submitted 15 July, 2020; originally announced July 2020.

    Comments: 15 pages, 3 figures and submitted to DAS conferences 2020

    ACM Class: I.7.1; J.1