Skip to main content

Showing 1–15 of 15 results for author: Podkopaev, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.22846  [pdf, ps, other

    cs.LG cs.AI cs.LO cs.SE

    RocqStar: Leveraging Similarity-driven Retrieval and Agentic Systems for Rocq generation

    Authors: Nikita Khramov, Andrei Kozyrev, Gleb Solovev, Anton Podkopaev

    Abstract: Interactive Theorem Proving was repeatedly shown to be fruitful combined with Generative Artificial Intelligence. This paper assesses multiple approaches to Rocq generation and illuminates potential avenues for improvement. We highlight the importance of thorough premise selection for generating Rocq proofs and propose a novel approach, leveraging retrieval via a self-attentive embedder model. The… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  2. arXiv:2503.14183  [pdf, other

    cs.SE cs.AI cs.PL

    Can LLMs Enable Verification in Mainstream Programming?

    Authors: Aleksandr Shefer, Igor Engel, Stanislav Alekseev, Daniil Berezun, Ekaterina Verbitskaia, Anton Podkopaev

    Abstract: Although formal methods are capable of producing reliable software, they have seen minimal adoption in everyday programming. Automatic code generation using large language models is becoming increasingly widespread, but it rarely considers producing strong correctness guarantees. In this study, we explore the ability of LLMs to produce verified code in three verification languages (Dafny, Nagini,… ▽ More

    Submitted 18 March, 2025; originally announced March 2025.

  3. LitmusKt: Concurrency Stress Testing for Kotlin

    Authors: Denis Lochmelis, Evgenii Moiseenko, Yaroslav Golubev, Anton Podkopaev

    Abstract: We present LitmusKt - the first tool for litmus testing concurrent programs in Kotlin. The tool's novelty also lies in the fact that Kotlin is a multiplatform language, i.e., it compiles into multiple platforms, which means that the concurrency has to be tested on several of them. Our tool allows writing litmus tests in a single custom DSL, and these tests are then run in Kotlin/Native and Kotlin/… ▽ More

    Submitted 22 April, 2025; v1 submitted 13 January, 2025; originally announced January 2025.

    Comments: Accepted to FSE'25 Demonstrations, 5 pages, 3 figures

  4. arXiv:2412.19318  [pdf, other

    stat.ML cs.LG

    Adaptive Conformal Inference by Betting

    Authors: Aleksandr Podkopaev, Darren Xu, Kuang-Chih Lee

    Abstract: Conformal prediction is a valuable tool for quantifying predictive uncertainty of machine learning models. However, its applicability relies on the assumption of data exchangeability, a condition which is often not met in real-world scenarios. In this paper, we consider the problem of adaptive conformal inference without any assumptions about the data generating process. Existing approaches for ad… ▽ More

    Submitted 26 December, 2024; originally announced December 2024.

  5. arXiv:2410.19605  [pdf, other

    cs.SE cs.AI cs.LO

    CoqPilot, a plugin for LLM-based generation of proofs

    Authors: Andrei Kozyrev, Gleb Solovev, Nikita Khramov, Anton Podkopaev

    Abstract: We present CoqPilot, a VS Code extension designed to help automate writing of Coq proofs. The plugin collects the parts of proofs marked with the admit tactic in a Coq file, i.e., proof holes, and combines LLMs along with non-machine-learning methods to generate proof candidates for the holes. Then, CoqPilot checks if each proof candidate solves the given subgoal and, if successful, replaces the h… ▽ More

    Submitted 25 October, 2024; originally announced October 2024.

    Comments: Published in the proceedings of the ASE'24 Tool Demonstrations Track

  6. arXiv:2305.00143  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Sequential Predictive Two-Sample and Independence Testing

    Authors: Aleksandr Podkopaev, Aaditya Ramdas

    Abstract: We study the problems of sequential nonparametric two-sample and independence testing. Sequential tests process data online and allow using observed data to decide whether to stop and reject the null hypothesis or to collect more data, while maintaining type I error control. We build upon the principle of (nonparametric) testing by betting, where a gambler places bets on future observations and th… ▽ More

    Submitted 19 July, 2023; v1 submitted 28 April, 2023; originally announced May 2023.

  7. arXiv:2212.07383  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Sequential Kernelized Independence Testing

    Authors: Aleksandr Podkopaev, Patrick Blöbaum, Shiva Prasad Kasiviswanathan, Aaditya Ramdas

    Abstract: Independence testing is a classical statistical problem that has been extensively studied in the batch setting when one fixes the sample size before collecting data. However, practitioners often prefer procedures that adapt to the complexity of a problem at hand instead of setting sample size in advance. Ideally, such procedures should (a) stop earlier on easy tasks (and later on harder tasks), he… ▽ More

    Submitted 20 May, 2025; v1 submitted 14 December, 2022; originally announced December 2022.

    Comments: ICML 2023

  8. arXiv:2110.06177  [pdf, other

    stat.ML cs.LG

    Tracking the risk of a deployed model and detecting harmful distribution shifts

    Authors: Aleksandr Podkopaev, Aaditya Ramdas

    Abstract: When deployed in the real world, machine learning models inevitably encounter changes in the data distribution, and certain -- but not all -- distribution shifts could result in significant performance degradation. In practice, it may make sense to ignore benign shifts, under which the performance of a deployed model does not degrade substantially, making interventions by a human expert (or model… ▽ More

    Submitted 5 May, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: Accepted as a conference paper at ICLR 2022

  9. arXiv:2103.03323  [pdf, other

    stat.ML cs.LG

    Distribution-free uncertainty quantification for classification under label shift

    Authors: Aleksandr Podkopaev, Aaditya Ramdas

    Abstract: Trustworthy deployment of ML models requires a proper measure of uncertainty, especially in safety-critical applications. We focus on uncertainty quantification (UQ) for classification problems via two avenues -- prediction sets using conformal prediction and calibration of probabilistic predictors by post-hoc binning -- since these possess distribution-free guarantees for i.i.d. data. Two common… ▽ More

    Submitted 7 July, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

  10. arXiv:2012.01067  [pdf, ps, other

    cs.PL

    Making Weak Memory Models Fair

    Authors: Ori Lahav, Egor Namakonov, Jonas Oberhauser, Anton Podkopaev, Viktor Vafeiadis

    Abstract: Liveness properties, such as termination, of even the simplest shared-memory concurrent programs under sequential consistency typically require some fairness assumptions about the scheduler. Under weak memory models, we observe that the standard notions of thread fairness are insufficient, and an additional fairness property, which we call memory fairness, is needed. In this paper, we propose a un… ▽ More

    Submitted 9 September, 2021; v1 submitted 2 December, 2020; originally announced December 2020.

    Comments: 43 pages, 2 figures

    ACM Class: D.3.1; F.3.2

  11. arXiv:2006.10564  [pdf, other

    stat.ML cs.AI cs.LG math.ST stat.ME

    Distribution-free binary classification: prediction sets, confidence intervals and calibration

    Authors: Chirag Gupta, Aleksandr Podkopaev, Aaditya Ramdas

    Abstract: We study three notions of uncertainty quantification -- calibration, confidence intervals and prediction sets -- for binary classification in the distribution-free setting, that is without making any distributional assumptions on the data. With a focus towards calibration, we establish a 'tripod' of theorems that connect these three notions for score-based classifiers. A direct implication is that… ▽ More

    Submitted 16 February, 2022; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: 34 pages; significant updates from previous version (unambiguous notation, better exposition, and cleaner results); originally appeared as a spotlight at Neural Information Processing Systems (NeurIPS) '20

  12. Repairing and Mechanising the JavaScript Relaxed Memory Model

    Authors: Conrad Watt, Christopher Pulte, Anton Podkopaev, Guillaume Barbier, Stephen Dolan, Shaked Flur, Jean Pichon-Pharabod, Shu-yu Guo

    Abstract: Modern JavaScript includes the SharedArrayBuffer feature, which provides access to true shared memory concurrency. SharedArrayBuffers are simple linear buffers of bytes, and the JavaScript specification defines an axiomatic relaxed memory model to describe their behaviour. While this model is heavily based on the C/C++11 model, it diverges in some key areas. JavaScript chooses to give a well-defin… ▽ More

    Submitted 21 May, 2020; v1 submitted 21 May, 2020; originally announced May 2020.

    Comments: 16 pages, 13 figiures

  13. arXiv:1911.06567  [pdf, other

    cs.PL

    Reconciling Event Structures with Modern Multiprocessors

    Authors: Evgenii Moiseenko, Anton Podkopaev, Ori Lahav, Orestis Melkonian, Viktor Vafeiadis

    Abstract: Weakestmo is a recently proposed memory consistency model that uses event structures to resolve the infamous "out-of-thin-air" problem. Although it has been shown to have important benefits over other memory models, its established compilation schemes are suboptimal in that they add more fences than necessary. In this paper, we prove the correctness in Coq of the intended compilation schemes for W… ▽ More

    Submitted 28 May, 2020; v1 submitted 15 November, 2019; originally announced November 2019.

  14. Bridging the Gap between Programming Languages and Hardware Weak Memory Models

    Authors: Anton Podkopaev, Ori Lahav, Viktor Vafeiadis

    Abstract: We develop a new intermediate weak memory model, IMM, as a way of modularizing the proofs of correctness of compilation from concurrent programming languages with weak memory consistency semantics to mainstream multi-core architectures, such as POWER and ARM. We use IMM to prove the correctness of compilation from the promising semantics of Kang et al. to POWER (thereby correcting and improving th… ▽ More

    Submitted 9 November, 2018; v1 submitted 20 July, 2018; originally announced July 2018.

  15. arXiv:1606.01400  [pdf, other

    cs.PL

    Operational Aspects of C/C++ Concurrency

    Authors: Anton Podkopaev, Ilya Sergey, Aleksandar Nanevski

    Abstract: In this work, we present a family of operational semantics that gradually approximates the realistic program behaviors in the C/C++11 memory model. Each semantics in our framework is built by elaborating and combining two simple ingredients: viewfronts and operation buffers. Viewfronts allow us to express the spatial aspect of thread interaction, i.e., which values a thread can read, while operati… ▽ More

    Submitted 9 July, 2016; v1 submitted 4 June, 2016; originally announced June 2016.