Skip to main content

Showing 1–7 of 7 results for author: Baudiš, P

Searching in archive cs. Search in all archives.
.
  1. Table understanding in structured documents

    Authors: Martin Holeček, Antonín Hoskovec, Petr Baudiš, Pavel Klinger

    Abstract: Abstract--- Table detection and extraction has been studied in the context of documents like reports, where tables are clearly outlined and stand out from the document structure visually. We study this topic in a rather more challenging domain of layout-heavy business documents, particularly invoices. Invoices present the novel challenges of tables being often without outlines - either in the form… ▽ More

    Submitted 9 July, 2019; v1 submitted 22 March, 2019; originally announced April 2019.

    Comments: Changed from previous version based on icdar2019 feedback to include 6 pages, 2 figures. Slightly changed paper name and abstract to be less misleading. Corrected grammar and shortened content heavily, corrected misleading information and readability. Currently in review for icdar2019-wml subconference/workshop

    Journal ref: 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW), 2019, pp. 158-164

  2. arXiv:1605.04655  [pdf, ps, other

    cs.CL cs.LG cs.NE

    Joint Learning of Sentence Embeddings for Relevance and Entailment

    Authors: Petr Baudis, Silvestr Stanko, Jan Sedivy

    Abstract: We consider the problem of Recognizing Textual Entailment within an Information Retrieval context, where we must simultaneously determine the relevancy as well as degree of entailment for individual pieces of evidence to determine a yes/no answer to a binary natural language question. We compare several variants of neural networks for sentence embeddings in a setting of decision-making based on… ▽ More

    Submitted 22 June, 2016; v1 submitted 16 May, 2016; originally announced May 2016.

    Comments: repl4nlp workshop at ACL Berlin 2016

  3. arXiv:1603.06127  [pdf, ps, other

    cs.CL cs.AI cs.LG cs.NE

    Sentence Pair Scoring: Towards Unified Framework for Text Comprehension

    Authors: Petr Baudiš, Jan Pichl, Tomáš Vyskočil, Jan Šedivý

    Abstract: We review the task of Sentence Pair Scoring, popular in the literature in various forms - viewed as Answer Sentence Selection, Semantic Text Scoring, Next Utterance Ranking, Recognizing Textual Entailment, Paraphrasing or e.g. a component of Memory Networks. We argue that all such tasks are similar from the model perspective and propose new baselines by comparing the performance of common IR met… ▽ More

    Submitted 17 May, 2016; v1 submitted 19 March, 2016; originally announced March 2016.

    Comments: submitted as paper to CoNLL 2016

  4. Evaluating Go Game Records for Prediction of Player Attributes

    Authors: Josef Moudřík, Petr Baudiš, Roman Neruda

    Abstract: We propose a way of extracting and aggregating per-move evaluations from sets of Go game records. The evaluations capture different aspects of the games such as played patterns or statistic of sente/gote sequences. Using machine learning algorithms, the evaluations can be utilized to predict different relevant target variables. We apply this methodology to predict the strength and playing style of… ▽ More

    Submitted 30 December, 2015; originally announced December 2015.

    Journal ref: Computational Intelligence and Games (CIG), 2015 IEEE Conference on , vol., no., pp.162-168, Aug. 31 2015-Sept. 2 2015

  5. arXiv:1405.3496  [pdf, other

    cs.SE

    Current Concepts in Version Control Systems

    Authors: Petr Baudiš

    Abstract: We give the reader a comprehensive overview of the state of the Version Control software engineering field, describing and analysing the concepts, architectural approaches and methods researched and included in the currently widely used version control systems and propose some possible future research directions.

    Submitted 14 May, 2014; originally announced May 2014.

    Comments: Written in 2009

    ACM Class: D.2.7

  6. arXiv:1405.3487  [pdf, ps, other

    cs.AI

    COCOpf: An Algorithm Portfolio Framework

    Authors: Petr Baudiš

    Abstract: Algorithm portfolios represent a strategy of composing multiple heuristic algorithms, each suited to a different class of problems, within a single general solver that will choose the best suited algorithm for each input. This approach recently gained popularity especially for solving combinatoric problems, but optimization applications are still emerging. The COCO platform of the BBOB workshop se… ▽ More

    Submitted 14 May, 2014; originally announced May 2014.

    Comments: POSTER2014. arXiv admin note: text overlap with arXiv:1206.5780 by other authors without attribution

    ACM Class: G.1.6

    Journal ref: Poster 2014 --- the 18th International Student Conference on Electrical Engineering. Czech Technical University, Prague, Czech Republic (2014)

  7. arXiv:1209.5251  [pdf, ps, other

    cs.AI cs.LG

    On Move Pattern Trends in a Large Go Games Corpus

    Authors: Petr Baudiš, Josef Moudřík

    Abstract: We process a large corpus of game records of the board game of Go and propose a way of extracting summary information on played moves. We then apply several basic data-mining methods on the summary information to identify the most differentiating features within the summary information, and discuss their correspondence with traditional Go knowledge. We show statistically significant mappings of th… ▽ More

    Submitted 24 September, 2012; originally announced September 2012.