Skip to main content

Showing 1–37 of 37 results for author: Steedman, M

.
  1. arXiv:2505.21011  [pdf, ps, other

    cs.CL

    LLMs are Frequency Pattern Learners in Natural Language Inference

    Authors: Liang Cheng, Zhaowei Wang, Mark Steedman

    Abstract: While fine-tuning LLMs on NLI corpora improves their inferential performance, the underlying mechanisms driving this improvement remain largely opaque. In this work, we conduct a series of experiments to investigate what LLMs actually learn during fine-tuning. We begin by analyzing predicate frequencies in premises and hypotheses across NLI datasets and identify a consistent frequency bias, where… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: 9 pages

  2. arXiv:2505.20097  [pdf, ps, other

    cs.CL

    S2LPP: Small-to-Large Prompt Prediction across LLMs

    Authors: Liang Cheng, Tianyi LI, Zhaowei Wang, Mark Steedman

    Abstract: The performance of pre-trained Large Language Models (LLMs) is often sensitive to nuances in prompt templates, requiring careful prompt engineering, adding costs in terms of computing and human effort. In this study, we present experiments encompassing multiple LLMs variants of varying sizes aimed at probing their preference with different prompts. Through experiments on Question Answering, we sho… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 15 pages

  3. arXiv:2505.10610  [pdf, other

    cs.CV cs.CL

    MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

    Authors: Zhaowei Wang, Wenhao Yu, Xiyu Ren, Jipeng Zhang, Yu Zhao, Rohit Saxena, Liang Cheng, Ginny Wong, Simon See, Pasquale Minervini, Yangqiu Song, Mark Steedman

    Abstract: The rapid extension of context windows in large vision-language models has given rise to long-context vision-language models (LCVLMs), which are capable of handling hundreds of images with interleaved text tokens in a single forward pass. In this work, we introduce MMLongBench, the first benchmark covering a diverse set of long-context vision-language tasks, to evaluate LCVLMs effectively and thor… ▽ More

    Submitted 26 May, 2025; v1 submitted 15 May, 2025; originally announced May 2025.

    Comments: Work in progress

  4. arXiv:2503.12832  [pdf, other

    cs.CL

    Modelling Child Learning and Parsing of Long-range Syntactic Dependencies

    Authors: Louis Mahon, Mark Johnson, Mark Steedman

    Abstract: This work develops a probabilistic child language acquisition model to learn a range of linguistic phenonmena, most notably long-range syntactic dependencies of the sort found in object wh-questions, among other constructions. The model is trained on a corpus of real child-directed speech, where each utterance is paired with a logical form as a meaning representation. It then learns both word mean… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

  5. arXiv:2503.11614  [pdf, other

    cs.CL

    Neutralizing Bias in LLM Reasoning using Entailment Graphs

    Authors: Liang Cheng, Tianyi Li, Zhaowei Wang, Tianyang Liu, Mark Steedman

    Abstract: LLMs are often claimed to be capable of Natural Language Inference (NLI), which is widely regarded as a cornerstone of more complex forms of reasoning. However, recent works show that LLMs still suffer from hallucinations in NLI due to attestation bias, where LLMs overly rely on propositional memory to build shortcuts. To solve the issue, we design an unsupervised framework to construct counterfac… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

    Comments: 17 pages, 7 figures

  6. arXiv:2410.12040  [pdf, other

    cs.CL cs.AI

    Concept-Reversed Winograd Schema Challenge: Evaluating and Improving Robust Reasoning in Large Language Models via Abstraction

    Authors: Kaiqiao Han, Tianqing Fang, Zhaowei Wang, Yangqiu Song, Mark Steedman

    Abstract: While Large Language Models (LLMs) have showcased remarkable proficiency in reasoning, there is still a concern about hallucinations and unreliable reasoning issues due to semantic associations and superficial logical chains. To evaluate the extent to which LLMs perform robust reasoning instead of relying on superficial logical chains, we propose a new evaluation dataset, the Concept-Reversed Wino… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  7. arXiv:2408.14467  [pdf, other

    cs.CL

    Explicit Inductive Inference using Large Language Models

    Authors: Tianyang Liu, Tianyi Li, Liang Cheng, Mark Steedman

    Abstract: Large Language Models (LLMs) are reported to hold undesirable attestation bias on inference tasks: when asked to predict if a premise P entails a hypothesis H, instead of considering H's conditional truthfulness entailed by P, LLMs tend to use the out-of-context truth label of H as a fragile proxy. In this paper, we propose a pipeline that exploits this bias to do explicit inductive inference. Our… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

  8. arXiv:2408.12254  [pdf, other

    cs.CL cs.AI

    A Language-agnostic Model of Child Language Acquisition

    Authors: Louis Mahon, Omri Abend, Uri Berger, Katherine Demuth, Mark Johnson, Mark Steedman

    Abstract: This work reimplements a recent semantic bootstrapping child-language acquisition model, which was originally designed for English, and trains it to learn a new language: Hebrew. The model learns from pairs of utterances and logical forms as meaning representations, and acquires both syntax and word meanings simultaneously. The results show that the model mostly transfers to Hebrew, but that a num… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

  9. arXiv:2402.14901  [pdf, other

    cs.CL cs.AI

    A Usage-centric Take on Intent Understanding in E-Commerce

    Authors: Wendi Zhou, Tianyi Li, Pavlos Vougiouklis, Mark Steedman, Jeff Z. Pan

    Abstract: Identifying and understanding user intents is a pivotal task for E-Commerce. Despite its essential role in product recommendation and business user profiling analysis, intent understanding has not been consistently defined or accurately benchmarked. In this paper, we focus on predicative user intents as "how a customer uses a product", and pose intent understanding as a natural language reasoning… ▽ More

    Submitted 7 October, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: Acepted by EMNLP 2024 main

  10. arXiv:2401.16313  [pdf, other

    cs.CL

    Machine Translation Meta Evaluation through Translation Accuracy Challenge Sets

    Authors: Nikita Moghe, Arnisa Fazla, Chantal Amrhein, Tom Kocmi, Mark Steedman, Alexandra Birch, Rico Sennrich, Liane Guillou

    Abstract: Recent machine translation (MT) metrics calibrate their effectiveness by correlating with human judgement but without any insights about their behaviour across different error types. Challenge sets are used to probe specific dimensions of metric behaviour but there are very few such datasets and they either focus on a limited number of phenomena or a limited number of language pairs. We introduce… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2210.15615

  11. arXiv:2305.16947  [pdf, other

    cs.CL

    Sentence-Incremental Neural Coreference Resolution

    Authors: Matt Grenander, Shay B. Cohen, Mark Steedman

    Abstract: We propose a sentence-incremental neural coreference resolution system which incrementally builds clusters after marking mention boundaries in a shift-reduce method. The system is aimed at bridging two recent approaches at coreference resolution: (1) state-of-the-art non-incremental models that incur quadratic complexity in document length with high computational cost, and (2) memory network-based… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted at EMNLP 2022

  12. arXiv:2305.14552  [pdf, other

    cs.CL cs.AI

    Sources of Hallucination by Large Language Models on Inference Tasks

    Authors: Nick McKenna, Tianyi Li, Liang Cheng, Mohammad Javad Hosseini, Mark Johnson, Mark Steedman

    Abstract: Large Language Models (LLMs) are claimed to be capable of Natural Language Inference (NLI), necessary for applied tasks like question answering and summarization. We present a series of behavioral studies on several LLM families (LLaMA, GPT-3.5, and PaLM) which probe their behavior using controlled experiments. We establish two biases originating from pretraining which predict much of their behavi… ▽ More

    Submitted 22 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Findings of EMNLP 2023

  13. arXiv:2302.12165  [pdf, other

    cs.CL

    Prosodic features improve sentence segmentation and parsing

    Authors: Elizabeth Nielsen, Sharon Goldwater, Mark Steedman

    Abstract: Parsing spoken dialogue presents challenges that parsing text does not, including a lack of clear sentence boundaries. We know from previous work that prosody helps in parsing single sentences (Tran et al. 2018), but we want to show the effect of prosody on parsing speech that isn't segmented into sentences. In experiments on the English Switchboard corpus, we find prosody helps our model both wit… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2105.12667

  14. arXiv:2212.10297  [pdf, other

    cs.CL cs.AI

    Extrinsic Evaluation of Machine Translation Metrics

    Authors: Nikita Moghe, Tom Sherborne, Mark Steedman, Alexandra Birch

    Abstract: Automatic machine translation (MT) metrics are widely used to distinguish the translation qualities of machine translation systems across relatively large test sets (system-level evaluation). However, it is unclear if automatic metrics are reliable at distinguishing good translations from bad translations at the sentence level (segment-level evaluation). In this paper, we investigate how useful MT… ▽ More

    Submitted 18 June, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: ACL 2023 Camera Ready

  15. arXiv:2210.16147  [pdf, other

    cs.CL

    Modeling structure-building in the brain with CCG parsing and large language models

    Authors: Miloš Stanojević, Jonathan R. Brennan, Donald Dunagan, Mark Steedman, John T. Hale

    Abstract: To model behavioral and neural correlates of language comprehension in naturalistic environments researchers have turned to broad-coverage tools from natural-language processing and machine learning. Where syntactic structure is explicitly modeled, prior work has relied predominantly on context-free grammars (CFG), yet such formalisms are not sufficiently expressive for human languages. Combinator… ▽ More

    Submitted 16 April, 2023; v1 submitted 28 October, 2022; originally announced October 2022.

  16. arXiv:2210.04695  [pdf, other

    cs.CL

    Language Models Are Poor Learners of Directional Inference

    Authors: Tianyi Li, Mohammad Javad Hosseini, Sabine Weber, Mark Steedman

    Abstract: We examine LMs' competence of directional predicate entailments by supervised fine-tuning with prompts. Our analysis shows that contrary to their apparent success on standard NLI, LMs show limited ability to learn such directional inference; moreover, existing datasets fail to test directionality, and/or are infested by artefacts that can be learnt as proxy for entailments, yielding over-optimisti… ▽ More

    Submitted 14 October, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: Findings of EMNLP 2022

  17. arXiv:2208.01006  [pdf, other

    cs.CL

    Multi-Document Summarization with Centroid-Based Pretraining

    Authors: Ratish Puduppully, Parag Jain, Nancy F. Chen, Mark Steedman

    Abstract: In Multi-Document Summarization (MDS), the input can be modeled as a set of documents, and the output is its summary. In this paper, we focus on pretraining objectives for MDS. Specifically, we introduce a novel pretraining objective, which involves selecting the ROUGE-based centroid of each document cluster as a proxy for its summary. Our objective thus does not require human written summaries an… ▽ More

    Submitted 31 May, 2023; v1 submitted 1 August, 2022; originally announced August 2022.

    Comments: ACL 2023 camera-ready

  18. arXiv:2208.00318  [pdf, other

    cs.CL

    Smoothing Entailment Graphs with Language Models

    Authors: Nick McKenna, Tianyi Li, Mark Johnson, Mark Steedman

    Abstract: The diversity and Zipfian frequency distribution of natural language predicates in corpora leads to sparsity in Entailment Graphs (EGs) built by Open Relation Extraction (ORE). EGs are computationally efficient and explainable models of natural language inference, but as symbolic models, they fail if a novel premise or hypothesis vertex is missing at test-time. We present theory and methodology fo… ▽ More

    Submitted 21 September, 2023; v1 submitted 30 July, 2022; originally announced August 2022.

    Comments: Published at AACL 2023

  19. arXiv:2207.02356  [pdf, other

    cs.CL

    Zero-shot Cross-Linguistic Learning of Event Semantics

    Authors: Malihe Alikhani, Thomas Kober, Bashar Alhafni, Yue Chen, Mert Inan, Elizabeth Nielsen, Shahab Raji, Mark Steedman, Matthew Stone

    Abstract: Typologically diverse languages offer systems of lexical and grammatical aspect that allow speakers to focus on facets of event structure in ways that comport with the specific communicative setting and discourse constraints they face. In this paper, we look specifically at captions of images across Arabic, Chinese, Farsi, German, Russian, and Turkish and describe a computational model for predict… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: Accepted at INLG 2022

  20. arXiv:2203.06264  [pdf, other

    cs.CL

    Cross-lingual Inference with A Chinese Entailment Graph

    Authors: Tianyi Li, Sabine Weber, Mohammad Javad Hosseini, Liane Guillou, Mark Steedman

    Abstract: Predicate entailment detection is a crucial task for question-answering from text, where previous work has explored unsupervised learning of entailment graphs from typed open relation triples. In this paper, we present the first pipeline for building Chinese entailment graphs, which involves a novel high-recall open relation extraction (ORE) method and the first Chinese fine-grained entity typing… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: Accepted to Findings of ACL 2022

  21. arXiv:2109.13620  [pdf, other

    cs.CL

    Cross-lingual Intermediate Fine-tuning improves Dialogue State Tracking

    Authors: Nikita Moghe, Mark Steedman, Alexandra Birch

    Abstract: Recent progress in task-oriented neural dialogue systems is largely focused on a handful of languages, as annotation of training data is tedious and expensive. Machine translation has been used to make systems multilingual, but this can introduce a pipeline of errors. Another promising solution is using cross-lingual transfer learning through pretrained multilingual models. Existing methods train… ▽ More

    Submitted 28 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021 Camera Ready

  22. arXiv:2109.10952  [pdf, other

    cs.CL

    Cross-linguistically Consistent Semantic and Syntactic Annotation of Child-directed Speech

    Authors: Ida Szubert, Omri Abend, Nathan Schneider, Samuel Gibbon, Louis Mahon, Sharon Goldwater, Mark Steedman

    Abstract: This paper proposes a methodology for constructing such corpora of child directed speech (CDS) paired with sentential logical forms, and uses this method to create two such corpora, in English and Hebrew. The approach enforces a cross-linguistically consistent representation, building on recent advances in dependency representation and semantic parsing. Specifically, the approach involves two step… ▽ More

    Submitted 14 March, 2024; v1 submitted 22 September, 2021; originally announced September 2021.

  23. arXiv:2109.10227  [pdf, other

    cs.CL

    Blindness to Modality Helps Entailment Graph Mining

    Authors: Liane Guillou, Sander Bijl de Vroe, Mark Johnson, Mark Steedman

    Abstract: Understanding linguistic modality is widely seen as important for downstream tasks such as Question Answering and Knowledge Graph Population. Entailment Graph learning might also be expected to benefit from attention to modality. We build Entailment Graphs using a news corpus filtered with a modality parser, and show that stripping modal modifiers from predicates in fact increases performance. Thi… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: To appear at the Workshop on Insights from Negative Results in NLP at EMNLP 2021

  24. arXiv:2109.09412  [pdf, other

    cs.CL

    Incorporating Temporal Information in Entailment Graph Mining

    Authors: Liane Guillou, Sander Bijl de Vroe, Mohammad Javad Hosseini, Mark Johnson, Mark Steedman

    Abstract: We present a novel method for injecting temporality into entailment graphs to address the problem of spurious entailments, which may arise from similar but temporally distinct events involving the same pair of entities. We focus on the sports domain in which the same pairs of teams play on different occasions, with different outcomes. We present an unsupervised model that aims to learn entailments… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: L. Guillou, S. Bijl de Vroe, M.J. Hosseini, M. Johnson, and M. Steedman. 2020. Incorporating temporal information in entailment graph mining. In Proceedings of the Graph-based Methods for Natural Language Processing (TextGraphs), pages 60-71, Barcelona, Spain (Online). Association for Computational Linguistics

    Journal ref: In Proceedings of TextGraphs 2020, pages 60-71, Barcelona, Spain (Online)

  25. Modality and Negation in Event Extraction

    Authors: Sander Bijl de Vroe, Liane Guillou, Miloš Stanojević, Nick McKenna, Mark Steedman

    Abstract: Language provides speakers with a rich system of modality for expressing thoughts about events, without being committed to their actual occurrence. Modality is commonly used in the political news domain, where both actual and possible courses of events are discussed. NLP systems struggle with these semantic phenomena, often incorrectly extracting events which did not happen, which can lead to issu… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: S. Bijl de Vroe, L. Guillou, M. Stanojević, N. McKenna, and M. Steedman. 2021. Modality and Negation in Event Extraction. In Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021), pages 31-42, online. Association for Computational Linguistics

    Journal ref: In Proceedings of CASE 2021, pages 31-42, online. Association for Computational Linguistics

  26. arXiv:2105.12667   

    cs.CL

    Prosodic segmentation for parsing spoken dialogue

    Authors: Elizabeth Nielsen, Mark Steedman, Sharon Goldwater

    Abstract: Parsing spoken dialogue poses unique difficulties, including disfluencies and unmarked boundaries between sentence-like units. Previous work has shown that prosody can help with parsing disfluent speech (Tran et al. 2018), but has assumed that the input to the parser is already segmented into sentence-like units (SUs), which isn't true in existing speech applications. We investigate how prosody af… ▽ More

    Submitted 12 October, 2021; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: This paper has been retracted -- do not cite. An error occurred in the preprocessing of the pitch and intensity features that this model used. This error means that it can no longer be concluded that prosody is as helpful for finding sentence boundaries and parsing as asserted in this paper

  27. arXiv:2104.07846  [pdf, other

    cs.CL

    Multivalent Entailment Graphs for Question Answering

    Authors: Nick McKenna, Liane Guillou, Mohammad Javad Hosseini, Sander Bijl de Vroe, Mark Johnson, Mark Steedman

    Abstract: Drawing inferences between open-domain natural language predicates is a necessity for true language understanding. There has been much progress in unsupervised learning of entailment graphs for this purpose. We make three contributions: (1) we reinterpret the Distributional Inclusion Hypothesis to model entailment between predicates of different valencies, like DEFEAT(Biden, Trump) entails WIN(Bid… ▽ More

    Submitted 19 September, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: Accepted to EMNLP 2021

  28. arXiv:2011.00345  [pdf, other

    cs.CL cs.AI

    Aspectuality Across Genre: A Distributional Semantics Approach

    Authors: Thomas Kober, Malihe Alikhani, Matthew Stone, Mark Steedman

    Abstract: The interpretation of the lexical aspect of verbs in English plays a crucial role for recognizing textual entailment and learning discourse-level inferences. We show that two elementary dimensions of aspectual class, states vs. events, and telic vs. atelic events, can be modelled effectively with distributional semantics. We find that a verb's local context is most indicative of its aspectual clas… ▽ More

    Submitted 31 October, 2020; originally announced November 2020.

    Comments: to appear at Coling 2020 in oh so lovely virtual Barcelona :)

  29. arXiv:2004.14846  [pdf, other

    cs.CL cs.SD eess.AS

    The role of context in neural pitch accent detection in English

    Authors: Elizabeth Nielsen, Mark Steedman, Sharon Goldwater

    Abstract: Prosody is a rich information source in natural language, serving as a marker for phenomena such as contrast. In order to make this information available to downstream tasks, we need a way to detect prosodic events in speech. We propose a new model for pitch accent detection, inspired by the work of Stehwien et al. (2018), who presented a CNN-based model for this task. Our model makes greater use… ▽ More

    Submitted 12 October, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

    Journal ref: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing

  30. arXiv:1908.08672  [pdf, other

    cs.CL

    Jointly Modeling Hierarchical and Horizontal Features for Relational Triple Extraction

    Authors: Zhepei Wei, Yantao Jia, Yuan Tian, Mohammad Javad Hosseini, Sujian Li, Mark Steedman, Yi Chang

    Abstract: Recent works on relational triple extraction have shown the superiority of jointly extracting entities and relations over the pipelined extraction manner. However, most existing joint models fail to balance the modeling of entity features and the joint decoding strategy, and thus the interactions between the entity level and triple level are not fully investigated. In this work, we first introduce… ▽ More

    Submitted 3 May, 2022; v1 submitted 23 August, 2019; originally announced August 2019.

    Comments: 20 pages, 5 figures

  31. arXiv:1904.01297  [pdf, other

    cs.CL

    Temporal and Aspectual Entailment

    Authors: Thomas Kober, Sander Bijl de Vroe, Mark Steedman

    Abstract: Inferences regarding "Jane's arrival in London" from predications such as "Jane is going to London" or "Jane has gone to London" depend on tense and aspect of the predications. Tense determines the temporal location of the predication in the past, present or future of the time of utterance. The aspectual auxiliaries on the other hand specify the internal constituency of the event, i.e. whether the… ▽ More

    Submitted 2 April, 2019; originally announced April 2019.

    Comments: accepted at IWCS 2019

  32. arXiv:1903.09460  [pdf, other

    cs.CL

    Data Augmentation via Dependency Tree Morphing for Low-Resource Languages

    Authors: Gözde Gül Şahin, Mark Steedman

    Abstract: Neural NLP systems achieve high scores in the presence of sizable training dataset. Lack of such datasets leads to poor system performances in the case low-resource languages. We present two simple text augmentation techniques using dependency trees, inspired from image processing. We crop sentences by removing dependency links, and we rotate sentences by moving the tree fragments around the root.… ▽ More

    Submitted 22 March, 2019; originally announced March 2019.

  33. arXiv:1805.11937  [pdf, other

    cs.CL

    Character-Level Models versus Morphology in Semantic Role Labeling

    Authors: Gözde Gül Şahin, Mark Steedman

    Abstract: Character-level models have become a popular approach specially for their accessibility and ability to handle unseen data. However, little is known on their ability to reveal the underlying morphological structure of a word, which is a crucial skill for high-level semantic analysis tasks, such as semantic role labeling (SRL). In this work, we train various types of SRL models that use word, charac… ▽ More

    Submitted 30 May, 2018; originally announced May 2018.

    Comments: Accepted for publication at the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018)

  34. arXiv:1702.03196  [pdf, other

    cs.CL

    Universal Semantic Parsing

    Authors: Siva Reddy, Oscar Täckström, Slav Petrov, Mark Steedman, Mirella Lapata

    Abstract: Universal Dependencies (UD) offer a uniform cross-lingual syntactic representation, with the aim of advancing multilingual applications. Recent work shows that semantic parsing can be accomplished by transforming syntactic dependencies to logical forms. However, this work is limited to English, and cannot process dependency graphs, which allow handling complex phenomena such as control. In this wo… ▽ More

    Submitted 28 August, 2017; v1 submitted 10 February, 2017; originally announced February 2017.

    Comments: EMNLP 2017

  35. arXiv:1609.09405  [pdf, other

    cs.CL cs.AI

    Evaluating Induced CCG Parsers on Grounded Semantic Parsing

    Authors: Yonatan Bisk, Siva Reddy, John Blitzer, Julia Hockenmaier, Mark Steedman

    Abstract: We compare the effectiveness of four different syntactic CCG parsers for a semantic slot-filling task to explore how much syntactic supervision is required for downstream semantic analysis. This extrinsic, task-based evaluation provides a unique window to explore the strengths and weaknesses of semantics captured by unsupervised grammar induction systems. We release a new Freebase semantic parsing… ▽ More

    Submitted 31 January, 2017; v1 submitted 29 September, 2016; originally announced September 2016.

    Comments: EMNLP 2016, Table 2 erratum, Code and Freebase Semantic Parsing data URL

  36. arXiv:1210.4889  [pdf

    cs.LG cs.AI stat.ML

    Learning STRIPS Operators from Noisy and Incomplete Observations

    Authors: Kira Mourao, Luke S. Zettlemoyer, Ronald P. A. Petrick, Mark Steedman

    Abstract: Agents learning to act autonomously in real-world domains must acquire a model of the dynamics of the domain in which they operate. Learning domain dynamics can be challenging, especially where an agent only has partial access to the world state, and/or noisy external sensors. Even in standard STRIPS domains, existing approaches cannot learn from noisy, incomplete observations typical of real-worl… ▽ More

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-614-623

  37. Specifying Intonation from Context for Speech Synthesis

    Authors: Scott Prevost, Mark Steedman

    Abstract: This paper presents a theory and a computational implementation for generating prosodically appropriate synthetic speech in response to database queries. Proper distinctions of contrast and emphasis are expressed in an intonation contour that is synthesized by rule under the control of a grammar, a discourse model, and a knowledge base. The theory is based on Combinatory Categorial Grammar, a fo… ▽ More

    Submitted 18 July, 1994; originally announced July 1994.

    Comments: 18 pages

    Report number: MS-CIS-94-37/LINC LAB 273