Showing 1–4 of 4 results for author: van Zaanen, M

  1. arXiv:cs/0205025  [pdf, ps, other]

    cs.LG cs.CL

    Bootstrapping Structure into Language: Alignment-Based Learning

    Authors: Menno M. van Zaanen

    Abstract: This thesis introduces a new unsupervised learning framework, called Alignment-Based Learning, which is based on the alignment of sentences and Harris's (1951) notion of substitutability. Instances of the framework can be applied to an untagged, unstructured corpus of natural language sentences, resulting in a labelled, bracketed version of that corpus. Firstly, the framework aligns all senten…

    Submitted 16 May, 2002; originally announced May 2002.

    Comments: 148 pages

    ACM Class: I.2; I.2.6; I.2.7

  2. arXiv:cs/0104007  [pdf, ps, other]

    cs.LG cs.CL

    Bootstrapping Syntax and Recursion using Alignment-Based Learning

    Authors: Menno van Zaanen

    Abstract: This paper introduces a new type of unsupervised learning algorithm, based on the alignment of sentences and Harris's (1951) notion of interchangeability. The algorithm is applied to an untagged, unstructured corpus of natural language sentences, resulting in a labelled, bracketed version of the corpus. Firstly, the algorithm aligns all sentences in the corpus in pairs, resulting in a partition…

    Submitted 3 April, 2001; originally announced April 2001.

    Comments: 8 pages

    ACM Class: I.2; I.2.6; I.2.7

    Journal ref: Proceedings of the Seventeenth International Conference on Machine Learning. pages 1063-1070

  3. arXiv:cs/0104006  [pdf, ps, other]

    cs.LG cs.CL

    ABL: Alignment-Based Learning

    Authors: Menno van Zaanen

    Abstract: This paper introduces a new type of grammar learning algorithm, inspired by string edit distance (Wagner and Fischer, 1974). The algorithm takes a corpus of flat sentences as input and returns a corpus of labelled, bracketed sentences. The method works on pairs of unstructured sentences that have one or more words in common. When two sentences are divided into parts that are the same in both sen…

    Submitted 3 April, 2001; originally announced April 2001.

    Comments: 7 pages

    ACM Class: I.2; I.2.6; I.2.7

    Journal ref: Proceedings of the 18th International Conference on Computational Linguistics (COLING); Saarbrucken, Germany. pages 961-967

  4. arXiv:cs/0104005  [pdf, ps, other]

    cs.LG cs.CL

    Bootstrapping Structure using Similarity

    Authors: Menno van Zaanen

    Abstract: In this paper a new similarity-based learning algorithm, inspired by string edit-distance (Wagner and Fischer, 1974), is applied to the problem of bootstrapping structure from scratch. The algorithm takes a corpus of unannotated sentences as input and returns a corpus of bracketed sentences. The method works on pairs of unstructured sentences or sentences partially bracketed by the algorithm tha…

    Submitted 3 April, 2001; originally announced April 2001.

    Comments: 11 pages

    ACM Class: I.2; I.2.6; I.2.7

    Journal ref: Computational Linguistics in the Netherlands 1999 - Selected Papers from the Tenth CLIN Meeting, pages 235-245