Skip to main content

Showing 1–6 of 6 results for author: Sakota, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.14901  [pdf, ps, other

    cs.CL

    Combining Constrained and Unconstrained Decoding via Boosting: BoostCD and Its Application to Information Extraction

    Authors: Marija Šakota, Robert West

    Abstract: Many recent approaches to structured NLP tasks use an autoregressive language model $M$ to map unstructured input text $x$ to output text $y$ representing structured objects (such as tuples, lists, trees, code, etc.), where the desired output structure is enforced via constrained decoding. During training, these approaches do not require the model to be aware of the constraints, which are merely i… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

  2. arXiv:2404.03428  [pdf, other

    cs.CL

    Edisum: Summarizing and Explaining Wikipedia Edits at Scale

    Authors: Marija Šakota, Isaac Johnson, Guosheng Feng, Robert West

    Abstract: An edit summary is a succinct comment written by a Wikipedia editor explaining the nature of, and reasons for, an edit to a Wikipedia page. Edit summaries are crucial for maintaining the encyclopedia: they are the first thing seen by content moderators and they help them decide whether to accept or reject an edit. Additionally, edit summaries constitute a valuable data source for researchers. Unfo… ▽ More

    Submitted 18 August, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

  3. Fly-Swat or Cannon? Cost-Effective Language Model Choice via Meta-Modeling

    Authors: Marija Šakota, Maxime Peyrard, Robert West

    Abstract: Generative language models (LMs) have become omnipresent across data science. For a wide variety of tasks, inputs can be phrased as natural language prompts for an LM, from whose output the solution can then be extracted. LM performance has consistently been increasing with model size - but so has the monetary cost of querying the ever larger models. Importantly, however, not all inputs are equall… ▽ More

    Submitted 18 December, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

  4. arXiv:2303.04132  [pdf, other

    cs.CL cs.AI cs.LG

    Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction

    Authors: Martin Josifoski, Marija Sakota, Maxime Peyrard, Robert West

    Abstract: Large language models (LLMs) have great potential for synthetic data generation. This work shows that useful data can be synthetically generated even for tasks that cannot be solved directly by LLMs: for problems with structured outputs, it is possible to prompt an LLM to perform the task in the reverse direction, by generating plausible input text for a target output structure. Leveraging this as… ▽ More

    Submitted 29 October, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

    Comments: Accepted at EMNLP 2023

  5. Descartes: Generating Short Descriptions of Wikipedia Articles

    Authors: Marija Sakota, Maxime Peyrard, Robert West

    Abstract: Wikipedia is one of the richest knowledge sources on the Web today. In order to facilitate navigating, searching, and maintaining its content, Wikipedia's guidelines state that all articles should be annotated with a so-called short description indicating the article's topic (e.g., the short description of beer is "Alcoholic drink made from fermented cereal grains"). Nonetheless, a large fraction… ▽ More

    Submitted 17 February, 2023; v1 submitted 20 May, 2022; originally announced May 2022.

  6. Putting Ridesharing to the Test: Efficient and Scalable Solutions and the Power of Dynamic Vehicle Relocation

    Authors: Panayiotis Danassis, Marija Sakota, Aris Filos-Ratsikas, Boi Faltings

    Abstract: We study the optimization of large-scale, real-time ridesharing systems and propose a modular design methodology, Component Algorithms for Ridesharing (CAR). We evaluate a diverse set of CARs (14 in total), focusing on the key algorithmic components of ridesharing. We take a multi-objective approach, evaluating 12 metrics related to global efficiency, complexity, passenger, driver, and platform in… ▽ More

    Submitted 20 June, 2022; v1 submitted 17 December, 2019; originally announced December 2019.

    Comments: A version of this paper has been published in the Artificial Intelligence Review (https://doi.org/10.1007/s10462-022-10145-0)

    Journal ref: Artificial Intelligence Review (2022)