Skip to main content

Showing 1–29 of 29 results for author: Svyatkovskiy, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.06261  [pdf, ps, other

    cs.CL cs.AI

    Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

    Authors: Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, Luke Marris, Sam Petulla, Colin Gaffney, Asaf Aharoni, Nathan Lintz, Tiago Cardal Pais, Henrik Jacobsson, Idan Szpektor, Nan-Jiang Jiang, Krishna Haridasan, Ahmed Omran, Nikunj Saunshi, Dara Bahri, Gaurav Mishra, Eric Chu , et al. (3278 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal unde… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: 72 pages, 17 figures

  2. arXiv:2412.14308   

    cs.SE cs.LG

    Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation

    Authors: Benjamin Steenhoek, Michele Tufano, Neel Sundaresan, Alexey Svyatkovskiy

    Abstract: Software testing is a crucial but time-consuming aspect of software development, and recently, Large Language Models (LLMs) have gained popularity for automated test case generation. However, because LLMs are trained on vast amounts of open-source code, they often generate test cases that do not adhere to best practices and may even contain test smells (anti-patterns). To address this issue, we pr… ▽ More

    Submitted 6 January, 2025; v1 submitted 18 December, 2024; originally announced December 2024.

    Comments: This work was intended as a replacement of arXiv:2310.02368 and any subsequent updates will appear there

  3. arXiv:2310.04610  [pdf, other

    cs.AI cs.LG

    DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies

    Authors: Shuaiwen Leon Song, Bonnie Kruft, Minjia Zhang, Conglong Li, Shiyang Chen, Chengming Zhang, Masahiro Tanaka, Xiaoxia Wu, Jeff Rasley, Ammar Ahmad Awan, Connor Holmes, Martin Cai, Adam Ghanem, Zhongzhu Zhou, Yuxiong He, Pete Luferenko, Divya Kumar, Jonathan Weyn, Ruixiong Zhang, Sylwester Klocek, Volodymyr Vragov, Mohammed AlQuraishi, Gustaf Ahdritz, Christina Floristean, Cristina Negri , et al. (67 additional authors not shown)

    Abstract: In the upcoming decade, deep learning may revolutionize the natural sciences, enhancing our capacity to model and predict natural occurrences. This could herald a new era of scientific exploration, bringing significant advancements across sectors from drug development to renewable energy. To answer this call, we present DeepSpeed4Science initiative (deepspeed4science.ai) which aims to build unique… ▽ More

    Submitted 11 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

  4. arXiv:2310.02368  [pdf, other

    cs.SE cs.LG

    Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation

    Authors: Benjamin Steenhoek, Michele Tufano, Neel Sundaresan, Alexey Svyatkovskiy

    Abstract: Software testing is a crucial aspect of software development, and the creation of high-quality tests that adhere to best practices is essential for effective maintenance. Recently, Large Language Models (LLMs) have gained popularity for code generation, including the automated creation of test cases. However, these LLMs are often trained on vast amounts of publicly available code, which may includ… ▽ More

    Submitted 6 January, 2025; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Accepted to DeepTest 2025 (ICSE Workshop). Previously this version appeared as arXiv:2412.14308 which was submitted as a new work by accident

  5. arXiv:2305.05383  [pdf, other

    cs.PL cs.AI cs.CL cs.SE

    Code Execution with Pre-trained Language Models

    Authors: Chenxiao Liu, Shuai Lu, Weizhu Chen, Daxin Jiang, Alexey Svyatkovskiy, Shengyu Fu, Neel Sundaresan, Nan Duan

    Abstract: Code execution is a fundamental aspect of programming language semantics that reflects the exact behavior of the code. However, most pre-trained models for code intelligence ignore the execution trace and only rely on source code and syntactic structures. In this paper, we investigate how well pre-trained models can understand and perform code execution. We develop a mutation-based data augmentati… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted to the Findings of ACL 2023

  6. arXiv:2303.07263  [pdf, other

    cs.SE

    InferFix: End-to-End Program Repair with LLMs

    Authors: Matthew Jin, Syed Shahriar, Michele Tufano, Xin Shi, Shuai Lu, Neel Sundaresan, Alexey Svyatkovskiy

    Abstract: Software development life cycle is profoundly influenced by bugs: their introduction, identification, and eventual resolution account for a significant portion of software cost. This has motivated software engineering researchers and practitioners to propose different approaches for automating the identification and repair of software defects. Large language models have been adapted to the program… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

  7. arXiv:2208.13928  [pdf, other

    cs.SE cs.CL cs.LG

    Exploring and Evaluating Personalized Models for Code Generation

    Authors: Andrei Zlotchevski, Dawn Drain, Alexey Svyatkovskiy, Colin Clement, Neel Sundaresan, Michele Tufano

    Abstract: Large Transformer models achieved the state-of-the-art status for Natural Language Understanding tasks and are increasingly becoming the baseline model architecture for modeling source code. Transformers are usually pre-trained on large unsupervised corpora, learning token representations and transformations relevant to modeling generally available text, and are then fine-tuned on a particular dow… ▽ More

    Submitted 19 September, 2022; v1 submitted 29 August, 2022; originally announced August 2022.

    Comments: Accepted to the ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2022), Industry Track - Singapore, November 14-18, 2022, to appear 9 pages

  8. arXiv:2205.11023  [pdf, other

    cs.SE cs.CL

    AdaptivePaste: Code Adaptation through Learning Semantics-aware Variable Usage Representations

    Authors: Xiaoyu Liu, Jinu Jang, Neel Sundaresan, Miltiadis Allamanis, Alexey Svyatkovskiy

    Abstract: In software development, it is common for programmers to copy-paste or port code snippets and then adapt them to their use case. This scenario motivates the code adaptation task -- a variant of program repair which aims to adapt variable identifiers in a pasted snippet of code to the surrounding, preexisting source code. However, no existing approach has been shown to effectively address this task… ▽ More

    Submitted 6 October, 2023; v1 submitted 22 May, 2022; originally announced May 2022.

  9. Methods2Test: A dataset of focal methods mapped to test cases

    Authors: Michele Tufano, Shao Kun Deng, Neel Sundaresan, Alexey Svyatkovskiy

    Abstract: Unit testing is an essential part of the software development process, which helps to identify issues with source code in early stages of development and prevent regressions. Machine learning has emerged as viable approach to help software developers generate automated unit tests. However, generating reliable unit test cases that are semantically correct and capable of catching software bugs or un… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: Accepted for publication in the proceedings of The 2022 Mining Software Repositories Conference (MSR 2022) - Data and Tool track

  10. arXiv:2203.09095  [pdf, other

    cs.SE cs.AI

    Automating Code Review Activities by Large-Scale Pre-training

    Authors: Zhiyu Li, Shuai Lu, Daya Guo, Nan Duan, Shailesh Jannu, Grant Jenks, Deep Majumder, Jared Green, Alexey Svyatkovskiy, Shengyu Fu, Neel Sundaresan

    Abstract: Code review is an essential part to software development lifecycle since it aims at guaranteeing the quality of codes. Modern code review activities necessitate developers viewing, understanding and even running the programs to assess logic, functionality, latency, style and other factors. It turns out that developers have to spend far too much time reviewing the code of their peers. Accordingly,… ▽ More

    Submitted 11 October, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

    Comments: ESEC/FSE 2022, camera-ready version

  11. arXiv:2203.07722  [pdf, other

    cs.SE cs.AI cs.CL

    ReACC: A Retrieval-Augmented Code Completion Framework

    Authors: Shuai Lu, Nan Duan, Hojae Han, Daya Guo, Seung-won Hwang, Alexey Svyatkovskiy

    Abstract: Code completion, which aims to predict the following code token(s) according to the code context, can improve the productivity of software development. Recent work has proved that statistical language modeling with transformers can greatly improve the performance in the code completion task via learning from large-scale source code datasets. However, current approaches focus only on code context w… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

    Comments: Published in ACL 2022

  12. arXiv:2109.08780  [pdf, other

    cs.LG cs.SE

    Long-Range Modeling of Source Code Files with eWASH: Extended Window Access by Syntax Hierarchy

    Authors: Colin B. Clement, Shuai Lu, Xiaoyu Liu, Michele Tufano, Dawn Drain, Nan Duan, Neel Sundaresan, Alexey Svyatkovskiy

    Abstract: Statistical language modeling and translation with transformers have found many successful applications in program understanding and generation tasks, setting high benchmarks for tools in modern software development environments. The finite context window of these neural models means, however, that they will be unable to leverage the entire relevant context of large files and packages for any give… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021 camera ready

  13. Program Merge Conflict Resolution via Neural Transformers

    Authors: Alexey Svyatkovskiy, Sarah Fakhoury, Negar Ghorbani, Todd Mytkowicz, Elizabeth Dinella, Christian Bird, Jinu Jang, Neel Sundaresan, Shuvendu Lahiri

    Abstract: Collaborative software development is an integral part of the modern software development life cycle, essential to the success of large-scale software projects. When multiple developers make concurrent changes around the same lines of code, a merge conflict may occur. Such conflicts stall pull requests and continuous integration pipelines for hours to several days, seriously hurting developer prod… ▽ More

    Submitted 29 November, 2022; v1 submitted 31 August, 2021; originally announced September 2021.

    Comments: ESEC/FSE '22 camera ready version. 12 pages, 4 figures, online appendix

  14. arXiv:2106.10158  [pdf, other

    cs.LG cs.SE

    Learning to Complete Code with Sketches

    Authors: Daya Guo, Alexey Svyatkovskiy, Jian Yin, Nan Duan, Marc Brockschmidt, Miltiadis Allamanis

    Abstract: Code completion is usually cast as a language modelling problem, i.e., continuing an input in a left-to-right fashion. However, in practice, some parts of the completion (e.g., string literals) may be very hard to predict, whereas subsequent parts directly follow from the context. To handle this, we instead consider the scenario of generating code completions with "holes" inserted in places where… ▽ More

    Submitted 23 January, 2022; v1 submitted 18 June, 2021; originally announced June 2021.

    Comments: Published in ICLR 2022

  15. arXiv:2105.07569  [pdf, other

    cs.SE

    DeepMerge: Learning to Merge Programs

    Authors: Elizabeth Dinella, Todd Mytkowicz, Alexey Svyatkovskiy, Christian Bird, Mayur Naik, Shuvendu K. Lahiri

    Abstract: In collaborative software development, program merging is the mechanism to integrate changes from multiple programmers. Merge algorithms in modern version control systems report a conflict when changes interfere textually. Merge conflicts require manual intervention and frequently stall modern continuous integration pipelines. Prior work found that, although costly, a large majority of resolutions… ▽ More

    Submitted 6 September, 2021; v1 submitted 16 May, 2021; originally announced May 2021.

    Comments: 11 pages

  16. Generating Bug-Fixes Using Pretrained Transformers

    Authors: Dawn Drain, Chen Wu, Alexey Svyatkovskiy, Neel Sundaresan

    Abstract: Detecting and fixing bugs are two of the most important yet frustrating parts of the software development cycle. Existing bug detection tools are based mainly on static analyzers, which rely on mathematical logic and symbolic reasoning about the program execution to detect common types of bugs. Fixing bugs is typically left out to the developer. In this work we introduce DeepDebug: a data-driven p… ▽ More

    Submitted 28 April, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

  17. arXiv:2102.04664  [pdf, other

    cs.SE cs.CL

    CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation

    Authors: Shuai Lu, Daya Guo, Shuo Ren, Junjie Huang, Alexey Svyatkovskiy, Ambrosio Blanco, Colin Clement, Dawn Drain, Daxin Jiang, Duyu Tang, Ge Li, Lidong Zhou, Linjun Shou, Long Zhou, Michele Tufano, Ming Gong, Ming Zhou, Nan Duan, Neel Sundaresan, Shao Kun Deng, Shengyu Fu, Shujie Liu

    Abstract: Benchmark datasets have a significant impact on accelerating research in programming language tasks. In this paper, we introduce CodeXGLUE, a benchmark dataset to foster machine learning research for program understanding and generation. CodeXGLUE includes a collection of 10 tasks across 14 datasets and a platform for model evaluation and comparison. CodeXGLUE also features three baseline systems,… ▽ More

    Submitted 16 March, 2021; v1 submitted 9 February, 2021; originally announced February 2021.

    Comments: 14 pages; Revise CodeBLEU scores for all models on text-to-code task

  18. arXiv:2010.03150  [pdf, other

    cs.LG cs.SE

    PyMT5: multi-mode translation of natural language and Python code with transformers

    Authors: Colin B. Clement, Dawn Drain, Jonathan Timcheck, Alexey Svyatkovskiy, Neel Sundaresan

    Abstract: Simultaneously modeling source code and natural language has many exciting applications in automated software development and understanding. Pursuant to achieving such technology, we introduce PyMT5, the Python method text-to-text transfer transformer, which is trained to translate between all pairs of Python method feature combinations: a single model that can both predict whole methods from natu… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: 14 pages, 7 figures, 5 tables, EMNLP 2020 camera ready version

  19. arXiv:2009.08366  [pdf, other

    cs.SE cs.CL

    GraphCodeBERT: Pre-training Code Representations with Data Flow

    Authors: Daya Guo, Shuo Ren, Shuai Lu, Zhangyin Feng, Duyu Tang, Shujie Liu, Long Zhou, Nan Duan, Alexey Svyatkovskiy, Shengyu Fu, Michele Tufano, Shao Kun Deng, Colin Clement, Dawn Drain, Neel Sundaresan, Jian Yin, Daxin Jiang, Ming Zhou

    Abstract: Pre-trained models for programming language have achieved dramatic empirical improvements on a variety of code-related tasks such as code search, code completion, code summarization, etc. However, existing pre-trained models regard a code snippet as a sequence of tokens, while ignoring the inherent structure of code, which provides crucial code semantics and would enhance the code understanding pr… ▽ More

    Submitted 13 September, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Accepted by ICLR2021

  20. Generating Accurate Assert Statements for Unit Test Cases using Pretrained Transformers

    Authors: Michele Tufano, Dawn Drain, Alexey Svyatkovskiy, Neel Sundaresan

    Abstract: Unit testing represents the foundational basis of the software testing pyramid, beneath integration and end-to-end testing. Automated software testing researchers have proposed a variety of techniques to assist developers in this time-consuming task. In this paper we present an approach to support developers in writing unit test cases by generating accurate and useful assert statements. Our approa… ▽ More

    Submitted 11 September, 2020; originally announced September 2020.

  21. arXiv:2009.05617  [pdf, other

    cs.SE cs.CL cs.LG

    Unit Test Case Generation with Transformers and Focal Context

    Authors: Michele Tufano, Dawn Drain, Alexey Svyatkovskiy, Shao Kun Deng, Neel Sundaresan

    Abstract: Automated unit test case generation tools facilitate test-driven development and support developers by suggesting tests intended to identify flaws in their code. Existing approaches are usually guided by the test coverage criteria, generating synthetic test cases that are often difficult for developers to read or understand. In this paper we propose AthenaTest, an approach that aims to generate un… ▽ More

    Submitted 20 May, 2021; v1 submitted 11 September, 2020; originally announced September 2020.

  22. arXiv:2005.08025  [pdf, other

    cs.CL cs.SE

    IntelliCode Compose: Code Generation Using Transformer

    Authors: Alexey Svyatkovskiy, Shao Kun Deng, Shengyu Fu, Neel Sundaresan

    Abstract: In software development through integrated development environments (IDEs), code completion is one of the most widely used features. Nevertheless, majority of integrated development environments only support completion of methods and APIs, or arguments. In this paper, we introduce IntelliCode Compose $-$ a general-purpose multilingual code completion tool which is capable of predicting sequences… ▽ More

    Submitted 29 October, 2020; v1 submitted 16 May, 2020; originally announced May 2020.

    Comments: Accepted for publication at ESEC/FSE conference

  23. arXiv:2004.13651  [pdf, other

    cs.SE cs.LG

    Fast and Memory-Efficient Neural Code Completion

    Authors: Alexey Svyatkovskiy, Sebastian Lee, Anna Hadjitofi, Maik Riechert, Juliana Franco, Miltiadis Allamanis

    Abstract: Code completion is one of the most widely used features of modern integrated development environments (IDEs). While deep learning has made significant progress in the statistical prediction of source code, state-of-the-art neural network models consume hundreds of megabytes of memory, bloating the development environment. We address this in two steps: first we present a modular neural framework fo… ▽ More

    Submitted 16 March, 2021; v1 submitted 28 April, 2020; originally announced April 2020.

    Journal ref: Published at Mining Software Repositories 2021

  24. Pythia: AI-assisted Code Completion System

    Authors: Alexey Svyatkovskiy, Ying Zhao, Shengyu Fu, Neel Sundaresan

    Abstract: In this paper, we propose a novel end-to-end approach for AI-assisted code completion called Pythia. It generates ranked lists of method and API recommendations which can be used by software developers at edit time. The system is currently deployed as part of Intellicode extension in Visual Studio Code IDE. Pythia exploits state-of-the-art large-scale deep learning models trained on code contexts… ▽ More

    Submitted 28 November, 2019; originally announced December 2019.

    Comments: Published in Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD '19)

  25. arXiv:1912.00547  [pdf, other

    cs.CL cs.DC cs.LG

    Large-scale text processing pipeline with Apache Spark

    Authors: Alexey Svyatkovskiy, Kosuke Imai, Mary Kroeger, Yuki Shiraito

    Abstract: In this paper, we evaluate Apache Spark for a data-intensive machine learning problem. Our use case focuses on policy diffusion detection across the state legislatures in the United States over time. Previous work on policy diffusion has been unable to make an all-pairs comparison between bills due to computational intensity. As a substitute, scholars have studied single topic areas. We provide… ▽ More

    Submitted 1 December, 2019; originally announced December 2019.

    Journal ref: Published in Proceedings of Big NLP workshop at the IEEE Big Data Conference 2016

  26. Training Distributed Deep Recurrent Neural Networks with Mixed Precision on GPU Clusters

    Authors: Alexey Svyatkovskiy, Julian Kates-Harbeck, William Tang

    Abstract: In this paper, we evaluate training of deep recurrent neural networks with half-precision floats. We implement a distributed, data-parallel, synchronous training algorithm by integrating TensorFlow and CUDA-aware MPI to enable execution across multiple GPU nodes and making use of high-speed interconnects. We introduce a learning rate schedule facilitating neural network convergence at up to… ▽ More

    Submitted 30 November, 2019; originally announced December 2019.

    Journal ref: Published in Proceedings of the Machine Learning on HPC Environments (MLHPC) at the Supercomputing conference 2017

  27. Using Big Data Technologies for HEP Analysis

    Authors: Matteo Cremonesi, Claudio Bellini, Bianny Bian, Luca Canali, Vasileios Dimakopoulos, Peter Elmer, Ian Fisk, Maria Girone, Oliver Gutsche, Siew-Yan Hoh, Bo Jayatilaka, Viktor Khristenko, Andrea Luiselli, Andrew Melo, Evangelos Evangelos, Dominick Olivito, Jacopo Pazzini, Jim Pivarski, Alexey Svyatkovskiy, Marco Zanetti

    Abstract: The HEP community is approaching an era were the excellent performances of the particle accelerators in delivering collision at high rate will force the experiments to record a large amount of information. The growing size of the datasets could potentially become a limiting factor in the capability to produce scientific results timely and efficiently. Recently, new technologies and new approaches… ▽ More

    Submitted 21 January, 2019; originally announced January 2019.

  28. arXiv:1711.00375  [pdf, other

    cs.DC

    CMS Analysis and Data Reduction with Apache Spark

    Authors: Oliver Gutsche, Luca Canali, Illia Cremer, Matteo Cremonesi, Peter Elmer, Ian Fisk, Maria Girone, Bo Jayatilaka, Jim Kowalkowski, Viktor Khristenko, Evangelos Motesnitsalis, Jim Pivarski, Saba Sehrish, Kacper Surdy, Alexey Svyatkovskiy

    Abstract: Experimental Particle Physics has been at the forefront of analyzing the world's largest datasets for decades. The HEP community was among the first to develop suitable software and computing tools for this task. In recent times, new toolkits and systems for distributed data processing, collectively called "Big Data" technologies have emerged from industry and open source projects to support the a… ▽ More

    Submitted 31 October, 2017; originally announced November 2017.

    Comments: Proceedings for 18th International Workshop on Advanced Computing and Analysis Techniques in Physics Research (ACAT 2017). arXiv admin note: text overlap with arXiv:1703.04171

  29. Big Data in HEP: A comprehensive use case study

    Authors: Oliver Gutsche, Matteo Cremonesi, Peter Elmer, Bo Jayatilaka, Jim Kowalkowski, Jim Pivarski, Saba Sehrish, Cristina Mantilla Surez, Alexey Svyatkovskiy, Nhan Tran

    Abstract: Experimental Particle Physics has been at the forefront of analyzing the worlds largest datasets for decades. The HEP community was the first to develop suitable software and computing tools for this task. In recent times, new toolkits and systems collectively called Big Data technologies have emerged to support the analysis of Petabyte and Exabyte datasets in industry. While the principles of dat… ▽ More

    Submitted 12 March, 2017; originally announced March 2017.

    Comments: Proceedings for 22nd International Conference on Computing in High Energy and Nuclear Physics (CHEP 2016)