Showing 1–17 of 17 results for author: Polozov, O

Searching in archive cs.
  1. arXiv:2205.14318  [pdf, other]

    cs.LG cs.PL

    Learning Math Reasoning from Self-Sampled Correct and Partially-Correct Solutions

    Authors: Ansong Ni, Jeevana Priya Inala, Chenglong Wang, Oleksandr Polozov, Christopher Meek, Dragomir Radev, Jianfeng Gao

    Abstract: Pretrained language models have shown superior performance on many natural language processing tasks, yet they still struggle at multi-step formal reasoning tasks like grade school math problems. One key challenge of finetuning them to solve such math reasoning problems is that many existing datasets only contain one reference solution for each problem, despite the fact that there are often altern…

    Submitted 17 February, 2023; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Accepted to ICLR 2023

  2. arXiv:2204.02311  [pdf, other]

    cs.CL

    PaLM: Scaling Language Modeling with Pathways

    Authors: Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin, et al. (42 additional authors not shown)

    Abstract: Large language models have been shown to achieve remarkable performance across a variety of natural language tasks using few-shot learning, which drastically reduces the number of task-specific training examples needed to adapt the model to a particular application. To further our understanding of the impact of scale on few-shot learning, we trained a 540-billion parameter, densely activated, Tran…

    Submitted 5 October, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

  3. arXiv:2201.11227  [pdf, other]

    cs.LG cs.PL

    Synchromesh: Reliable code generation from pre-trained language models

    Authors: Gabriel Poesia, Oleksandr Polozov, Vu Le, Ashish Tiwari, Gustavo Soares, Christopher Meek, Sumit Gulwani

    Abstract: Large pre-trained language models have been used to generate code, providing a flexible interface for synthesizing programs from natural language specifications. However, they often violate syntactic and semantic rules of their output language, limiting their practical usability. In this paper, we propose Synchromesh: a framework for substantially improving the reliability of pre-trained models for…

    Submitted 26 January, 2022; originally announced January 2022.

    Comments: 10 pages, 9 additional pages of Appendix

  4. arXiv:2106.11455  [pdf, other]

    cs.CL cs.AI cs.DB cs.PL

    KaggleDBQA: Realistic Evaluation of Text-to-SQL Parsers

    Authors: Chia-Hsuan Lee, Oleksandr Polozov, Matthew Richardson

    Abstract: The goal of database question answering is to enable natural language querying of real-life relational databases in diverse application domains. Recently, large-scale datasets such as Spider and WikiSQL facilitated novel modeling techniques for text-to-SQL parsing, improving zero-shot generalization to unseen databases. In this work, we examine the challenges that still prevent these techniques fr…

    Submitted 21 June, 2021; originally announced June 2021.

    Comments: Published as a conference paper at ACL-IJCNLP 2021

  5. arXiv:2106.05784  [pdf, other]

    cs.LG cs.AI cs.CL cs.PL cs.SE

    Programming Puzzles

    Authors: Tal Schuster, Ashwin Kalyan, Oleksandr Polozov, Adam Tauman Kalai

    Abstract: We introduce a new type of programming challenge called programming puzzles, as an objective and comprehensive evaluation of program synthesis, and release an open-source dataset of Python Programming Puzzles (P3). Each puzzle is defined by a short Python program $f$, and the goal is to find an input which makes $f$ return True. The puzzles are objective in that each one is specified entirely by t…

    Submitted 6 November, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 (Datasets and Benchmarks Track). Puzzles repository: https://github.com/microsoft/PythonProgrammingPuzzles

  6. Structure-Grounded Pretraining for Text-to-SQL

    Authors: Xiang Deng, Ahmed Hassan Awadallah, Christopher Meek, Oleksandr Polozov, Huan Sun, Matthew Richardson

    Abstract: Learning to capture text-table alignment is essential for tasks like text-to-SQL. A model needs to correctly recognize natural language references to columns and values and to ground them in the given database schema. In this paper, we present a novel weakly supervised Structure-Grounded pretraining framework (StruG) for text-to-SQL that can effectively learn to capture text-table alignment based…

    Submitted 30 August, 2022; v1 submitted 24 October, 2020; originally announced October 2020.

    Comments: Accepted to NAACL 2021. The Spider-Realistic dataset is available at https://doi.org/10.5281/zenodo.5205322

  7. arXiv:2006.11524  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE cs.SC stat.ML

    Neuro-Symbolic Visual Reasoning: Disentangling "Visual" from "Reasoning"

    Authors: Saeed Amizadeh, Hamid Palangi, Oleksandr Polozov, Yichen Huang, Kazuhito Koishida

    Abstract: Visual reasoning tasks such as visual question answering (VQA) require an interplay of visual perception with reasoning about the question semantics grounded in perception. However, recent advances in this area are still primarily driven by perception improvements (e.g. scene graph generation) rather than reasoning. Neuro-symbolic models such as Neural Module Networks bring the benefits of composi…

    Submitted 25 August, 2020; v1 submitted 20 June, 2020; originally announced June 2020.

    Comments: Published in Proceedings of the 37th International Conference on Machine Learning (ICML), Online, PMLR 119, 2020

  8. arXiv:1911.04942  [pdf, other]

    cs.CL cs.AI

    RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers

    Authors: Bailin Wang, Richard Shin, Xiaodong Liu, Oleksandr Polozov, Matthew Richardson

    Abstract: When translating natural language questions into SQL queries to answer questions from a database, contemporary semantic parsing models struggle to generalize to unseen database schemas. The generalization challenge lies in (a) encoding the database relations in an accessible way for the semantic parser, and (b) modeling alignment between database columns and their mentions in a given query. We pre…

    Submitted 24 August, 2021; v1 submitted 10 November, 2019; originally announced November 2019.

    Comments: Fix some errors of ACL 2020 camera-ready; 12 pages, 5 figures, 7 tables. arXiv admin note: text overlap with arXiv:1906.11790

  9. arXiv:1906.10816  [pdf, other]

    cs.LG cs.AI cs.CL cs.PL stat.ML

    Program Synthesis and Semantic Parsing with Learned Code Idioms

    Authors: Richard Shin, Miltiadis Allamanis, Marc Brockschmidt, Oleksandr Polozov

    Abstract: Program synthesis of general-purpose source code from natural language specifications is challenging due to the need to reason about high-level patterns in the target program and low-level implementation details at the same time. In this work, we present PATOIS, a system that allows a neural program synthesizer to explicitly interleave high-level and low-level reasoning at every generation step. I…

    Submitted 4 November, 2019; v1 submitted 25 June, 2019; originally announced June 2019.

    Comments: 33rd Conference on Neural Information Processing Systems (NeurIPS) 2019. 13 pages total, 9 pages of main text

  10. arXiv:1903.06089  [pdf, ps, other]

    cs.SE

    Are My Invariants Valid? A Learning Approach

    Authors: Vincent J. Hellendoorn, Premkumar T. Devanbu, Oleksandr Polozov, Mark Marron

    Abstract: Ensuring that a program operates correctly is a difficult task in large, complex systems. Enshrining invariants -- desired properties of correct execution -- in code or comments can support maintainability and help sustain correctness. Tools that can automatically infer and recommend invariants can thus be very beneficial. However, current invariant-suggesting tools, such as Daikon, suffer from hi…

    Submitted 15 March, 2019; v1 submitted 14 March, 2019; originally announced March 2019.

    Comments: 10 pages

  11. arXiv:1809.05054  [pdf, other]

    cs.CL cs.AI cs.DB cs.LG

    IncSQL: Training Incremental Text-to-SQL Parsers with Non-Deterministic Oracles

    Authors: Tianze Shi, Kedar Tatwawadi, Kaushik Chakrabarti, Yi Mao, Oleksandr Polozov, Weizhu Chen

    Abstract: We present a sequence-to-action parsing approach for the natural language to SQL task that incrementally fills the slots of a SQL query with feasible actions from a pre-defined inventory. To account for the fact that typically there are multiple correct SQL queries with the same or very similar semantics, we draw inspiration from syntactic parsing techniques and propose to train our sequence-to-ac…

    Submitted 1 October, 2018; v1 submitted 13 September, 2018; originally announced September 2018.

  12. arXiv:1807.03100  [pdf, other]

    cs.CL cs.AI cs.DB cs.LG cs.PL

    Robust Text-to-SQL Generation with Execution-Guided Decoding

    Authors: Chenglong Wang, Kedar Tatwawadi, Marc Brockschmidt, Po-Sen Huang, Yi Mao, Oleksandr Polozov, Rishabh Singh

    Abstract: We consider the problem of neural semantic parsing, which translates natural language questions into executable SQL queries. We introduce a new mechanism, execution guidance, to leverage the semantics of SQL. It detects and excludes faulty programs during the decoding procedure by conditioning on the execution of partially generated program. The mechanism can be used with any autoregressive genera…

    Submitted 12 September, 2018; v1 submitted 9 July, 2018; originally announced July 2018.

  13. arXiv:1805.08490  [pdf, other]

    cs.LG cs.PL stat.ML

    Generative Code Modeling with Graphs

    Authors: Marc Brockschmidt, Miltiadis Allamanis, Alexander L. Gaunt, Oleksandr Polozov

    Abstract: Generative models for source code are an interesting structured prediction problem, requiring to reason about both hard syntactic and semantic constraints as well as about natural, likely programs. We present a novel model for this problem that uses a graph to represent the intermediate state of the generated output. The generative procedure interleaves grammar-driven expansion steps with graph au…

    Submitted 16 April, 2019; v1 submitted 22 May, 2018; originally announced May 2018.

  14. arXiv:1804.01186  [pdf, ps, other]

    cs.AI cs.LG cs.PL

    Neural-Guided Deductive Search for Real-Time Program Synthesis from Examples

    Authors: Ashwin Kalyan, Abhishek Mohta, Oleksandr Polozov, Dhruv Batra, Prateek Jain, Sumit Gulwani

    Abstract: Synthesizing user-intended programs from a small number of input-output examples is a challenging problem with several important applications like spreadsheet manipulation, data wrangling and code refactoring. Existing synthesis systems either completely rely on deductive logic techniques that are extensively hand-engineered or on purely statistical models that need massive amounts of data, and in…

    Submitted 9 September, 2018; v1 submitted 3 April, 2018; originally announced April 2018.

    Comments: Published in ICLR 2018, International Conference on Learning Representations (2018)

  15. FlashProfile: A Framework for Synthesizing Data Profiles

    Authors: Saswat Padhi, Prateek Jain, Daniel Perelman, Oleksandr Polozov, Sumit Gulwani, Todd Millstein

    Abstract: We address the problem of learning a syntactic profile for a collection of strings, i.e. a set of regex-like patterns that succinctly describe the syntactic variations in the strings. Real-world datasets, typically curated from multiple sources, often contain data in various syntactic formats. Thus, any data processing task is preceded by the critical step of data format identification. However, m…

    Submitted 16 April, 2019; v1 submitted 17 September, 2017; originally announced September 2017.

    Comments: 28 pages, SPLASH (OOPSLA) 2018

    Journal ref: Proc. ACM Program. Lang. 2, OOPSLA, Article 150 (November 2018) 150:1-150:28

  16. arXiv:1703.03539  [pdf, other]

    cs.PL

    Interactive Program Synthesis

    Authors: Vu Le, Daniel Perelman, Oleksandr Polozov, Mohammad Raza, Abhishek Udupa, Sumit Gulwani

    Abstract: Program synthesis from incomplete specifications (e.g. input-output examples) has gained popularity and found real-world applications, primarily due to its ease-of-use. Since this technology is often used in an interactive setting, efficiency and correctness are often the key user expectations from a system based on such technologies. Ensuring efficiency is challenging since the highly combinatori…

    Submitted 9 March, 2017; originally announced March 2017.

  17. arXiv:1608.09000  [pdf, other]

    cs.SE cs.LG cs.PL

    Learning Syntactic Program Transformations from Examples

    Authors: Reudismam Rolim, Gustavo Soares, Loris D'Antoni, Oleksandr Polozov, Sumit Gulwani, Rohit Gheyi, Ryo Suzuki, Bjoern Hartmann

    Abstract: IDEs, such as Visual Studio, automate common transformations, such as Rename and Extract Method refactorings. However, extending these catalogs of transformations is complex and time-consuming. A similar phenomenon appears in intelligent tutoring systems where instructors have to write cumbersome code transformations that describe "common faults" to fix similar student submissions to programming a…

    Submitted 31 August, 2016; originally announced August 2016.