
Showing 1–50 of 51 results for author: Gordon, S

Searching in archive cs.
  1. arXiv:2506.16596  [pdf, ps, other]

    cs.AI

    A Community-driven vision for a new Knowledge Resource for AI

    Authors: Vinay K Chaudhri, Chaitan Baru, Brandon Bennett, Mehul Bhatt, Darion Cassel, Anthony G Cohn, Rina Dechter, Esra Erdem, Dave Ferrucci, Ken Forbus, Gregory Gelfond, Michael Genesereth, Andrew S. Gordon, Benjamin Grosof, Gopal Gupta, Jim Hendler, Sharat Israni, Tyler R. Josephson, Patrick Kyllonen, Yuliya Lierler, Vladimir Lifschitz, Clifton McFate, Hande K. McGinty, Leora Morgenstern, Alessandro Oltramari , et al. (7 additional authors not shown)

    Abstract: The long-standing goal of creating a comprehensive, multi-purpose knowledge resource, reminiscent of the 1984 Cyc project, still persists in AI. Despite the success of knowledge resources like WordNet, ConceptNet, Wolfram|Alpha and other commercial knowledge graphs, verifiable, general-purpose widely available sources of knowledge remain a critical deficiency in AI infrastructure. Large language m…

    Submitted 19 June, 2025; originally announced June 2025.

    Comments: 17 pages

  2. arXiv:2506.01719  [pdf, ps, other]

    cs.LO

    Modal Verification Patterns for Systems

    Authors: Ismail Kuru, Colin S. Gordon

    Abstract: Although they differ in the functionality they offer, low-level systems exhibit certain patterns of design and utilization of computing resources. In this paper, we argue the position that modalities, in the sense of modal logic, should be a go-to approach when specifying and verifying low-level systems code. We explain how the concept of a resource context helps guide the design of new modalities…

    Submitted 25 June, 2025; v1 submitted 2 June, 2025; originally announced June 2025.

    ACM Class: D.2.1; D.2.4; D.3.1; E.1

  3. arXiv:2502.21300  [pdf]

    cs.HC

    Hybrid Team Tetris: A New Platform For Hybrid Multi-Agent, Multi-Human Teaming

    Authors: Kaleb Mcdowell, Nick Waytowich, Javier Garcia, Stephen Gordon, Bryce Bartlett, Jeremy Gaston

    Abstract: Metcalfe et al (1) argue that the greatest potential for human-AI partnerships lies in their application to highly complex problem spaces. Herein, we discuss three different forms of hybrid team intelligence and posit that across all three forms, the hybridization of man and machine intelligence can be effective under the right conditions. We foresee two significant research and development (R&D)…

    Submitted 28 February, 2025; originally announced February 2025.

  4. arXiv:2502.07958  [pdf, other]

    cs.PL cs.DC

    Actor Capabilities for Message Ordering (Extended Version)

    Authors: Colin S. Gordon

    Abstract: Actor systems are a flexible model of concurrent and distributed programming, which are efficiently implementable, and avoid many classic concurrency bugs by construction. However actor systems must still deal with the challenge of messages arriving in unexpected orderings. We describe an approach to restricting the orders in which actors send messages to each other, by equipping actor reference…

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: Extended version (with proof sketches) of a draft chapter for Gul Agha's festschrift

  5. arXiv:2501.14651  [pdf, other]

    cs.CR

    Data-NoMAD: A Tool for Boosting Confidence in the Integrity of Social Science Survey Data

    Authors: Sanford C. Gordon, Cyrus Samii, Zhihao Su

    Abstract: To safeguard against data fabrication and enhance trust in quantitative social science, we present Data Non-Manipulation Authentication Digest (Data-NoMAD). Data-NoMAD is a tool that allows researchers to certify, and others to verify, that a dataset has not been inappropriately manipulated between the point of data collection and the point at which a replication archive is made publicly available…

    Submitted 24 January, 2025; originally announced January 2025.

    Comments: 18 pages, 2 figures

    ACM Class: E.1; E.2

  6. arXiv:2411.10107  [pdf, ps, other]

    cs.CC

    Monotone Contractions

    Authors: Eleni Batziou, John Fearnley, Spencer Gordon, Ruta Mehta, Rahul Savani

    Abstract: We study functions $f : [0, 1]^d \rightarrow [0, 1]^d$ that are both monotone and contracting, and we consider the problem of finding an $\varepsilon$-approximate fixed point of $f$. We show that the problem lies in the complexity class UEOPL. We give an algorithm that finds an $\varepsilon$-approximate fixed point of a three-dimensional monotone contraction using $O(\log (1/\varepsilon))$ queries…

    Submitted 27 March, 2025; v1 submitted 15 November, 2024; originally announced November 2024.

    Comments: To appear in STOC'25

  7. arXiv:2410.14897  [pdf, other]

    cs.CL cs.AI

    From Test-Taking to Test-Making: Examining LLM Authoring of Commonsense Assessment Items

    Authors: Melissa Roemmele, Andrew S. Gordon

    Abstract: LLMs can now perform a variety of complex writing tasks. They also excel in answering questions pertaining to natural language inference and commonsense reasoning. Composing these questions is itself a skilled writing task, so in this paper we consider LLMs as authors of commonsense assessment items. We prompt LLMs to generate items in the style of a prominent benchmark for commonsense reasoning,…

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: Accepted at Findings of EMNLP 2024

  8. arXiv:2406.07775  [pdf, other]

    cs.LG

    Self-attention-based non-linear basis transformations for compact latent space modelling of dynamic optical fibre transmission matrices

    Authors: Yijie Zheng, Robert J. Kilpatrick, David B. Phillips, George S. D. Gordon

    Abstract: Multimode optical fibres are hair-thin strands of glass that efficiently transport light. They promise next-generation medical endoscopes that provide unprecedented sub-cellular image resolution deep inside the body. However, confining light to such fibres means that images are inherently scrambled in transit. Conventionally, this scrambling has been compensated by pre-calibrating how a specific f…

    Submitted 11 June, 2024; originally announced June 2024.

  9. arXiv:2403.17992  [pdf, other]

    q-bio.QM cs.AI cs.LG eess.IV eess.SP

    Interpretable cancer cell detection with phonon microscopy using multi-task conditional neural networks for inter-batch calibration

    Authors: Yijie Zheng, Rafael Fuentes-Dominguez, Matt Clark, George S. D. Gordon, Fernando Perez-Cota

    Abstract: Advances in artificial intelligence (AI) show great potential in revealing underlying information from phonon microscopy (high-frequency ultrasound) data to identify cancerous cells. However, this technology suffers from the 'batch effect' that comes from unavoidable technical variations between each experiment, creating confounding variables that the AI model may inadvertently learn. We therefore…

    Submitted 26 March, 2024; originally announced March 2024.

  10. Characterizing Role Models in Software Practitioners' Career: An Interview Study

    Authors: Mary Sánchez-Gordón, Ricardo Colomo-Palacios, Alex Sanchez Gordon

    Abstract: A role model is a person who serves as an example for others to follow, especially in terms of values, behavior, achievements, and personal characteristics. In this paper, authors study how role models influence software practitioners' careers, an aspect not studied in the literature before. By means of this study, authors aim to understand if there are any salient role model archetypes and what ch…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 6 pages, 2 Tables. To appear in CHASE 2024: Proceedings of the 17th International Conference on Cooperative and Human Aspects of Software Engineering, April 14-15, 2024, Lisbon, Portugal

    Journal ref: In Proceedings of the 17th International Conference on Cooperative and Human Aspects of Software Engineering (CHASE 2024). Association for Computing Machinery, New York, NY, USA

  11. arXiv:2402.07683  [pdf, other]

    cs.CC cs.CG math.OC

    Two Choices are Enough for P-LCPs, USOs, and Colorful Tangents

    Authors: Michaela Borzechowski, John Fearnley, Spencer Gordon, Rahul Savani, Patrick Schnider, Simon Weber

    Abstract: We provide polynomial-time reductions between three search problems from three distinct areas: the P-matrix linear complementarity problem (P-LCP), finding the sink of a unique sink orientation (USO), and a variant of the $α$-Ham Sandwich problem. For all three settings, we show that "two choices are enough", meaning that the general non-binary version of the problem can be reduced in polynomial t…

    Submitted 21 May, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: 29 pages, 9 figures

  12. arXiv:2311.07454  [pdf, other]

    cs.LG cs.CC math.ST

    Causal Discovery under Latent Class Confounding

    Authors: Bijan Mazaheri, Spencer Gordon, Yuval Rabani, Leonard Schulman

    Abstract: An acyclic causal structure can be described with a directed acyclic graph (DAG), where arrows indicate the possibility of direct causation. The task of learning this structure from data is known as "causal discovery." Diverse populations or changing environments can sometimes give rise to data that is heterogeneous in the following sense: each population/environment is a "source" which idiosyncrati…

    Submitted 16 October, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

  13. arXiv:2310.09397  [pdf, other]

    cs.LG math.AG math.ST

    Identifiability of Product of Experts Models

    Authors: Spencer L. Gordon, Manav Kant, Eric Ma, Leonard J. Schulman, Andrei Staicu

    Abstract: Product of experts (PoE) are layered networks in which the value at each node is an AND (or product) of the values (possibly negated) at its inputs. These were introduced as a neural network architecture that can efficiently learn to generate high-dimensional data which satisfy many low-dimensional constraints -- thereby allowing each individual expert to perform a simple task. PoEs have found a v…

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 24 pages, 2 figures

    MSC Class: 62E10; 62F99; 68T05 ACM Class: I.2.6

  14. Trustworthy Formal Natural Language Specifications

    Authors: Colin S. Gordon, Sergey Matskevich

    Abstract: Interactive proof assistants are computer programs carefully constructed to check a human-designed proof of a mathematical claim with high confidence in the implementation. However, this only validates truth of a formal claim, which may have been mistranslated from a claim made in natural language. This is especially problematic when using proof assistants to formally verify the correctness of sof…

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2205.07811

    Journal ref: Proceedings of the 2023 ACM SIGPLAN International Symposium on New Ideas, New Paradigms, and Reflections on Programming and Software (Onward! '23)

  15. arXiv:2309.13993  [pdf, ps, other]

    cs.LG cs.DS eess.SP stat.ML

    Identification of Mixtures of Discrete Product Distributions in Near-Optimal Sample and Time Complexity

    Authors: Spencer L. Gordon, Erik Jahn, Bijan Mazaheri, Yuval Rabani, Leonard J. Schulman

    Abstract: We consider the problem of identifying, from statistics, a distribution of discrete random variables $X_1,\ldots,X_n$ that is a mixture of $k$ product distributions. The best previous sample complexity for $n \in O(k)$ was $(1/ζ)^{O(k^2 \log k)}$ (under a mild separation assumption parameterized by $ζ$). The best known lower bound was $\exp(Ω(k))$. It is known that $n\geq 2k-1$ is necessary and su…

    Submitted 25 September, 2023; originally announced September 2023.

  16. arXiv:2307.15777  [pdf, other]

    cs.PL

    Error Localization for Sequential Effect Systems (Extended Version)

    Authors: Colin S. Gordon, Chaewon Yun

    Abstract: We describe a new concrete approach to giving predictable error locations for sequential (flow-sensitive) effect systems. Prior implementations of sequential effect systems rely on either computing a bottom-up effect and comparing it to a declaration (e.g., method annotation) or leaning on constraint-based type inference. These approaches do not necessarily report program locations that precisely…

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: Extended report of upcoming Static Analysis Symposium 2023 paper

  17. arXiv:2307.14471  [pdf, other]

    cs.PL

    Modal Abstractions for Virtualizing Memory Addresses

    Authors: Ismail Kuru, Colin S. Gordon

    Abstract: Operating system kernels employ virtual memory subsystems, which use a CPU's memory management units (MMUs) to virtualize the addresses of memory regions. Operating systems manipulate these virtualized memory mappings to isolate untrusted processes, restrict which memory is accessible to different processes, hide memory limits from user programs, ensure process isolation, implement demand-paging an…

    Submitted 14 September, 2024; v1 submitted 26 July, 2023; originally announced July 2023.

    ACM Class: D.3; D.4; F.3

  18. arXiv:2304.09050  [pdf]

    q-bio.NC cs.LG stat.ML

    Decoding Neural Activity to Assess Individual Latent State in Ecologically Valid Contexts

    Authors: Stephen M. Gordon, Jonathan R. McDaniel, Kevin W. King, Vernon J. Lawhern, Jonathan Touryan

    Abstract: There exist very few ways to isolate cognitive processes, historically defined via highly controlled laboratory studies, in more ecologically valid contexts. Specifically, it remains unclear as to what extent patterns of neural activity observed under such constraints actually manifest outside the laboratory in a manner that can be used to make an accurate inference about the latent state, associa…

    Submitted 18 April, 2023; originally announced April 2023.

    Journal ref: Journal of Neural Engineering, vol. 20(4), 2023

  19. arXiv:2208.11235  [pdf, other]

    cs.SE cs.LG

    Preprocessing Source Code Comments for Linguistic Models

    Authors: Sergey Matskevich, Colin S. Gordon

    Abstract: Comments are an important part of the source code and are a primary source of documentation. This has driven interest in using large bodies of comments to train or evaluate tools that consume or produce them -- such as generating oracles or even code from comments, or automatically generating code summaries. Most of this work makes strong assumptions about the structure and quality of comments, su…

    Submitted 26 August, 2022; v1 submitted 23 August, 2022; originally announced August 2022.

    Comments: Correcting author name

  20. arXiv:2206.00485  [pdf, other]

    cs.CY cs.HC

    Co-creation and ownership for AI radio

    Authors: Skylar Gordon, Robert Mahari, Manaswi Mishra, Ziv Epstein

    Abstract: Recent breakthroughs in AI-generated music open the door for new forms for co-creation and co-creativity. We present Artificial$.\!$fm, a proof-of-concept casual creator that blends AI-music generation, subjective ratings, and personalized recommendation for the creation and curation of AI-generated music. Listeners can rate emergent songs to steer the evolution of future music. They can also pers…

    Submitted 1 June, 2022; originally announced June 2022.

  21. arXiv:2205.07811  [pdf, other]

    cs.PL cs.CL

    Natural Language Specifications in Proof Assistants

    Authors: Colin S. Gordon, Sergey Matskevich

    Abstract: Interactive proof assistants are computer programs carefully constructed to check a human-designed proof of a mathematical claim with high confidence in the implementation. However, this only validates truth of a formal claim, which may have been mistranslated from a claim made in natural language. This is especially problematic when using proof assistants to formally verify the correctness of sof…

    Submitted 16 May, 2022; originally announced May 2022.

  22. arXiv:2202.12950  [pdf, other]

    eess.SP cs.AI cs.LG

    2021 BEETL Competition: Advancing Transfer Learning for Subject Independence & Heterogenous EEG Data Sets

    Authors: Xiaoxi Wei, A. Aldo Faisal, Moritz Grosse-Wentrup, Alexandre Gramfort, Sylvain Chevallier, Vinay Jayaram, Camille Jeunet, Stylianos Bakas, Siegfried Ludwig, Konstantinos Barmpas, Mehdi Bahri, Yannis Panagakis, Nikolaos Laskaris, Dimitrios A. Adamos, Stefanos Zafeiriou, William C. Duong, Stephen M. Gordon, Vernon J. Lawhern, Maciej Śliwowski, Vincent Rouanne, Piotr Tempczyk

    Abstract: Transfer learning and meta-learning offer some of the most promising avenues to unlock the scalability of healthcare and consumer technologies driven by biosignal data. This is because current methods cannot generalise well across human subjects' data and handle learning from different heterogeneously collected data sets, thus limiting the scale of training data. On the other side, developments in…

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: PrePrint of the NeurIPS2021 BEETL Competition Submitted to Proceedings of Machine Learning Research (PMLR)

  23. Towards Property-Based Tests in Natural Language

    Authors: Colin S. Gordon

    Abstract: We consider a new approach to generate tests from natural language. Rather than relying on machine learning or templated extraction from structured comments, we propose to apply classic ideas from linguistics to translate natural-language sentences into executable tests. This paper explores the application of combinatory categorial grammars (CCGs) to generating property-based tests. Our prototype…

    Submitted 7 February, 2022; originally announced February 2022.

    Journal ref: International Conference on Software Engineering: New Ideas and Emerging Results, ICSE (NIER) 2022

  24. arXiv:2112.11602  [pdf, ps, other]

    cs.LG cs.DS eess.SP stat.ML

    Causal Inference Despite Limited Global Confounding via Mixture Models

    Authors: Spencer L. Gordon, Bijan Mazaheri, Yuval Rabani, Leonard J. Schulman

    Abstract: A Bayesian Network is a directed acyclic graph (DAG) on a set of $n$ random variables (the vertices); a Bayesian Network Distribution (BND) is a probability distribution on the random variables that is Markovian on the graph. A finite $k$-mixture of such models is graphically represented by a larger graph which has an additional ``hidden'' (or ``latent'') random variable $U$, ranging in…

    Submitted 31 May, 2023; v1 submitted 21 December, 2021; originally announced December 2021.

    Comments: Published in CleaR 2023

    MSC Class: 68W40; 62F99; 62-09 ACM Class: F.2; G.3

    Journal ref: Proceedings of Machine Learning Research vol 213:1-27, 2023

  25. arXiv:2110.05644  [pdf, ps, other]

    cs.DM

    On the computational equivalence of co-NP refutations of a matrix being a P-matrix

    Authors: Spencer Gordon, Kevin Shu

    Abstract: A P-matrix is a square matrix $X$ such that all principal submatrices of $X$ have positive determinant. Such matrices appear naturally in instances of the linear complementarity problem, where these are precisely the matrices for which the corresponding linear complementarity problem has a unique solution for any input vector. Testing whether or not a square matrix is a P-matrix is co-NP compl…

    Submitted 11 October, 2021; originally announced October 2021.

  26. arXiv:2109.07708  [pdf, other]

    cs.CR cs.CC

    Compressed Oblivious Encoding for Homomorphically Encrypted Search

    Authors: Seung Geol Choi, Dana Dachman-Soled, S. Dov Gordon, Linsheng Liu, Arkady Yerukhimovich

    Abstract: Fully homomorphic encryption (FHE) enables a simple, attractive framework for secure search. Compared to other secure search systems, no costly setup procedure is necessary; it is sufficient for the client merely to upload the encrypted database to the server. Confidentiality is provided because the server works only on the encrypted query and records. While the search functionality is enabled by…

    Submitted 16 September, 2021; originally announced September 2021.

  27. arXiv:2107.04140  [pdf, other]

    cs.AR

    First-Generation Inference Accelerator Deployment at Facebook

    Authors: Michael Anderson, Benny Chen, Stephen Chen, Summer Deng, Jordan Fix, Michael Gschwind, Aravind Kalaiah, Changkyu Kim, Jaewon Lee, Jason Liang, Haixin Liu, Yinghai Lu, Jack Montgomery, Arun Moorthy, Satish Nadathur, Sam Naghshineh, Avinash Nayak, Jongsoo Park, Chris Petersen, Martin Schatz, Narayanan Sundaram, Bangsheng Tang, Peter Tang, Amy Yang, Jiecao Yu , et al. (90 additional authors not shown)

    Abstract: In this paper, we provide a deep dive into the deployment of inference accelerators at Facebook. Many of our ML workloads have unique characteristics, such as sparse memory accesses, large model sizes, as well as high compute, memory and network bandwidth requirements. We co-designed a high-performance, energy-efficient inference accelerator platform based on these requirements. We describe the in…

    Submitted 4 August, 2021; v1 submitted 8 July, 2021; originally announced July 2021.

  28. arXiv:2103.04349  [pdf]

    cs.LG eess.SY

    Markov Cricket: Using Forward and Inverse Reinforcement Learning to Model, Predict And Optimize Batting Performance in One-Day International Cricket

    Authors: Manohar Vohra, George S. D. Gordon

    Abstract: In this paper, we model one-day international cricket games as Markov processes, applying forward and inverse Reinforcement Learning (RL) to develop three novel tools for the game. First, we apply Monte-Carlo learning to fit a nonlinear approximation of the value function for each state of the game using a score-based reward model. We show that, when used as a proxy for remaining scoring resources…

    Submitted 7 March, 2021; originally announced March 2021.

    Comments: 15 pages, 8 Figures

  29. arXiv:2101.11688  [pdf, other]

    cs.LG eess.SP stat.ML

    Hadamard Extensions and the Identification of Mixtures of Product Distributions

    Authors: Spencer L. Gordon, Leonard J. Schulman

    Abstract: The Hadamard Extension of a matrix is the matrix consisting of all Hadamard products of subsets of its rows. This construction arises in the context of identifying a mixture of product distributions on binary random variables: full column rank of such extensions is a necessary ingredient of identification algorithms. We provide several results concerning when a Hadamard Extension has full column r…

    Submitted 12 February, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: V2: re-titled and slight edits

    MSC Class: 68W40; 62F99 ACM Class: F.2; G.3

  30. arXiv:2012.14540  [pdf, ps, other]

    cs.LG cs.DS eess.SP stat.ML

    Source Identification for Mixtures of Product Distributions

    Authors: Spencer L. Gordon, Bijan Mazaheri, Yuval Rabani, Leonard J. Schulman

    Abstract: We give an algorithm for source identification of a mixture of $k$ product distributions on $n$ bits. This is a fundamental problem in machine learning with many applications. Our algorithm identifies the source parameters of an identifiable mixture, given, as input, approximate values of multilinear moments (derived, for instance, from a sufficiently large sample), using $2^{O(k^2)} n^{O(k)}$ ari…

    Submitted 28 December, 2020; originally announced December 2020.

    MSC Class: 68W40; 62F99 ACM Class: F.2; G.3

  31. arXiv:2007.11119  [pdf, other]

    cs.HC cs.AI cs.LG cs.MM

    Interpolating GANs to Scaffold Autotelic Creativity

    Authors: Ziv Epstein, Océane Boulais, Skylar Gordon, Matt Groh

    Abstract: The latent space modeled by generative adversarial networks (GANs) represents a large possibility space. By interpolating categories generated by GANs, it is possible to create novel hybrid images. We present "Meet the Ganimals," a casual creator built on interpolations of BigGAN that can generate novel, hybrid animals called ganimals by efficiently searching this possibility space. Like tradition…

    Submitted 21 July, 2020; originally announced July 2020.

  32. arXiv:2007.08101  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    The Sparse Hausdorff Moment Problem, with Application to Topic Models

    Authors: Spencer Gordon, Bijan Mazaheri, Leonard J. Schulman, Yuval Rabani

    Abstract: We consider the problem of identifying, from its first $m$ noisy moments, a probability distribution on $[0,1]$ of support $k<\infty$. This is equivalent to the problem of learning a distribution on $m$ observable binary random variables $X_1,X_2,\dots,X_m$ that are iid conditional on a hidden random variable $U$ taking values in $\{1,2,\dots,k\}$. Our focus is on accomplishing this with $m=2k$, w…

    Submitted 7 September, 2020; v1 submitted 16 July, 2020; originally announced July 2020.

  33. Designing with Static Capabilities and Effects: Use, Mention, and Invariants

    Authors: Colin S. Gordon

    Abstract: Capabilities (whether object or reference capabilities) are fundamentally tools to restrict effects. Thus static capabilities (object or reference) and effect systems take different technical machinery to the same core problem of statically restricting or reasoning about effects in programs. Any time two approaches can in principle address the same sets of problems, it becomes important to underst…

    Submitted 26 May, 2020; v1 submitted 22 May, 2020; originally announced May 2020.

    Comments: Preprint of ECOOP 2020 paper

  34. arXiv:1811.12285  [pdf, other]

    cs.PL

    Sequential Effect Systems with Control Operators

    Authors: Colin S. Gordon

    Abstract: Sequential effect systems are a class of effect system that exploits information about program order, rather than discarding it as traditional commutative effect systems do. This extra expressive power allows effect systems to reason about behavior over time, capturing properties such as atomicity, unstructured lock ownership, or even general safety properties. While we now understand the essentia…

    Submitted 15 May, 2020; v1 submitted 29 November, 2018; originally announced November 2018.

    Comments: Extended technical report corresponding to ECOOP 2020 paper "Lifting Sequential Effects to Control Operators"

    ACM Class: F.3.2

  35. arXiv:1811.11853  [pdf, ps, other]

    cs.PL

    Safe Deferred Memory Reclamation with Types

    Authors: Ismail Kuru, Colin S. Gordon

    Abstract: Memory management in lock-free data structures remains a major challenge in concurrent programming. Design techniques including read-copy-update (RCU) and hazard pointers provide workable solutions, and are widely used to great effect. These techniques rely on the concept of a grace period: nodes that should be freed are not deallocated immediately, and all threads obey a protocol to ensure that t…

    Submitted 18 February, 2019; v1 submitted 28 November, 2018; originally announced November 2018.

    ACM Class: D.2.4; D.1.3

  36. arXiv:1811.03841  [pdf, ps, other]

    cs.CC cs.DS

    Unique End of Potential Line

    Authors: John Fearnley, Spencer Gordon, Ruta Mehta, Rahul Savani

    Abstract: This paper studies the complexity of problems in PPAD $\cap$ PLS that have unique solutions. Three well-known examples of such problems are the problem of finding a fixpoint of a contraction map, finding the unique sink of a Unique Sink Orientation (USO), and solving the P-matrix Linear Complementarity Problem (P-LCP). Each of these are promise-problems, and when the promise holds, they always pos…

    Submitted 9 November, 2018; originally announced November 2018.

    Comments: This paper substantially revises and extends the work described in our previous preprint "End of Potential Line" (arXiv:1804.03450). The abstract has been shortened to meet the arXiv character limit

  37. arXiv:1810.06600  [pdf, ps, other]

    cs.PL

    Synthesizing Program-Specific Static Analyses

    Authors: Colin S. Gordon

    Abstract: Designing a static analysis is generally a substantial undertaking, requiring significant expertise in both program analysis and the domain of the program analysis, and significant development resources. As a result, most program analyses target properties that are universally of interest (e.g., absence of null pointer dereference) or nearly so (e.g., deadlock freedom). However, many interesting…

    Submitted 15 October, 2018; originally announced October 2018.

    Comments: Late archiving of OBT'18 abstract

  38. Generating Comments From Source Code with CCGs

    Authors: Sergey Matskevich, Colin S. Gordon

    Abstract: Good comments help developers understand software faster and provide better maintenance. However, comments are often missing, generally inaccurate, or out of date. Many of these problems can be avoided by automatic comment generation. This paper presents a method to generate informative comments directly from the source code using general-purpose techniques from natural language processing. We gen…

    Submitted 15 October, 2018; originally announced October 2018.

    Comments: NL4SE 2018 preprint

  39. arXiv:1808.02010  [pdf, other]

    cs.PL cs.LO

    Polymorphic Iterable Sequential Effect Systems

    Authors: Colin S. Gordon

    Abstract: Effect systems are lightweight extensions to type systems that can verify a wide range of important properties with modest developer burden. But our general understanding of effect systems is limited primarily to systems where the order of effects is irrelevant. Understanding such systems in terms of a semilattice of effects grounds understanding of the essential issues, and provides guidance when…

    Submitted 15 July, 2021; v1 submitted 6 August, 2018; originally announced August 2018.

    Comments: Extended journal version of ECOOP 2017 paper (preprint at arXiv:1705.02264) generalizing the iteration operator for behavioral effect systems, strengthening existence results, strengthening proof, and adding to examples and comparison to related work (more details in paper). Final author version

    Journal ref: ACM Transactions on Programming Languages and Systems (TOPLAS), 2021

  40. arXiv:1805.04740  [pdf, ps, other]

    cs.LG cs.HC stat.ML

    Agreement Rate Initialized Maximum Likelihood Estimator for Ensemble Classifier Aggregation and Its Application in Brain-Computer Interface

    Authors: Dongrui Wu, Vernon J. Lawhern, Stephen Gordon, Brent J. Lance, Chin-Teng Lin

    Abstract: Ensemble learning is a powerful approach to construct a strong learner from multiple base learners. The most popular way to aggregate an ensemble of classifiers is majority voting, which assigns a sample to the class that most base classifiers vote for. However, improved performance can be obtained by assigning weights to the base classifiers according to their accuracy. This paper proposes an agr…

    Submitted 12 May, 2018; originally announced May 2018.

    Journal ref: IEEE Int'l. Conf. on Systems, Man and Cybernetics, pp. 724-729, Budapest, Hungary, 2016

  41. arXiv:1805.04737  [pdf, ps, other]

    cs.LG cs.HC stat.ML

    Offline EEG-Based Driver Drowsiness Estimation Using Enhanced Batch-Mode Active Learning (EBMAL) for Regression

    Authors: Dongrui Wu, Vernon J. Lawhern, Stephen Gordon, Brent J. Lance, Chin-Teng Lin

    Abstract: There are many important regression problems in real-world brain-computer interface (BCI) applications, e.g., driver drowsiness estimation from EEG signals. This paper considers offline analysis: given a pool of unlabeled EEG epochs recorded during driving, how do we optimally select a small number of them to label so that an accurate regression model can be built from them to label the rest? Acti…

    Submitted 12 May, 2018; originally announced May 2018.

    Journal ref: IEEE Int'l. Conf. on Systems, Man and Cybernetics, pp. 730-736, Budapest, Hungary, 2016

  42. arXiv:1804.03450  [pdf, ps, other]

    cs.CC cs.GT

    End of Potential Line

    Authors: John Fearnley, Spencer Gordon, Ruta Mehta, Rahul Savani

    Abstract: We introduce the problem EndOfPotentialLine and the corresponding complexity class EOPL of all problems that can be reduced to it in polynomial time. This class captures problems that admit a single combinatorial proof of their joint membership in the complexity classes PPAD of fixpoint problems and PLS of local search problems. EOPL is a combinatorially-defined alternative to the class CLS (for C…

    Submitted 18 April, 2018; v1 submitted 10 April, 2018; originally announced April 2018.

    Comments: v2 includes runtimes for P-LCP algorithms based on USOs in related work

  43. An efficient MPI/OpenMP parallelization of the Hartree-Fock method for the second generation of Intel Xeon Phi processor

    Authors: Vladimir Mironov, Yuri Alexeev, Kristopher Keipert, Michael D'mello, Alexander Moskovsky, Mark S. Gordon

    Abstract: Modern OpenMP threading techniques are used to convert the MPI-only Hartree-Fock code in the GAMESS program to a hybrid MPI/OpenMP algorithm. Two separate implementations that differ by the sharing or replication of key data structures among threads are considered, density and Fock matrices. All implementations are benchmarked on a super-computer of 3,000 Intel Xeon Phi processors. With 64 cores p…

    Submitted 14 August, 2017; v1 submitted 31 July, 2017; originally announced August 2017.

    Comments: SC17 conference paper, 12 pages, 7 figures

    ACM Class: D.1.3; J.2

  44. arXiv:1705.02264  [pdf, other]

    cs.PL

    A Generic Approach to Flow-Sensitive Polymorphic Effects (Extended Version)

    Authors: Colin S. Gordon

    Abstract: Effect systems are lightweight extensions to type systems that can verify a wide range of important properties with modest developer burden. But our general understanding of effect systems is limited primarily to systems where the order of effects is irrelevant. Understanding such systems in terms of a lattice of effects grounds understanding of the essential issues, and provides guidance when des…

    Submitted 5 May, 2017; originally announced May 2017.

    Comments: Extended version with proofs for a paper to appear at ECOOP 2017

    ACM Class: F.3.2

  45. arXiv:1704.08533  [pdf, ps, other]

    cs.HC cs.LG

    EEG-Based User Reaction Time Estimation Using Riemannian Geometry Features

    Authors: Dongrui Wu, Brent J. Lance, Vernon J. Lawhern, Stephen Gordon, Tzyy-Ping Jung, Chin-Teng Lin

    Abstract: Riemannian geometry has been successfully used in many brain-computer interface (BCI) classification problems and demonstrated superior performance. In this paper, for the first time, it is applied to BCI regression problems, an important category of BCI applications. More specifically, we propose a new feature extraction approach for Electroencephalogram (EEG) based BCI regression problems: a spa…

    Submitted 27 April, 2017; originally announced April 2017.

    Comments: arXiv admin note: text overlap with arXiv:1702.02914

    Journal ref: IEEE Trans. on Neural Systems and Rehabilitation Engineering, 25(11), pp. 2157-2168, 2017

  46. arXiv:1702.06017  [pdf, other]

    cs.CC

    CLS: New Problems and Completeness

    Authors: John Fearnley, Spencer Gordon, Ruta Mehta, Rahul Savani

    Abstract: The complexity class CLS was introduced by Daskalakis and Papadimitriou with the goal of capturing the complexity of some well-known problems in PPAD$~\cap~$PLS that have resisted, in some cases for decades, attempts to put them in polynomial time. No complete problem was known for CLS, and in previous work, the problems ContractionMap, i.e., the problem of finding an approximate fixpoint of a con…

    Submitted 7 April, 2017; v1 submitted 20 February, 2017; originally announced February 2017.

  47. Driver Drowsiness Estimation from EEG Signals Using Online Weighted Adaptation Regularization for Regression (OwARR)

    Authors: Dongrui Wu, Vernon J. Lawhern, Stephen Gordon, Brent J. Lance, Chin-Teng Lin

    Abstract: One big challenge that hinders the transition of brain-computer interfaces (BCIs) from laboratory settings to real-life applications is the availability of high-performance and robust learning algorithms that can effectively handle individual differences, i.e., algorithms that can be applied to a new subject with zero or very little subject-specific calibration data. Transfer learning and domain a…

    Submitted 9 February, 2017; originally announced February 2017.

    Comments: in press

    Journal ref: IEEE Trans. on Fuzzy Systems, 25(6), pp. 1522-1535, 2017

  48. arXiv:1611.08024  [pdf, other]

    cs.LG q-bio.NC stat.ML

    EEGNet: A Compact Convolutional Network for EEG-based Brain-Computer Interfaces

    Authors: Vernon J. Lawhern, Amelia J. Solon, Nicholas R. Waytowich, Stephen M. Gordon, Chou P. Hung, Brent J. Lance

    Abstract: Brain computer interfaces (BCI) enable direct communication with a computer, using neural activity as the control signal. This neural signal is generally chosen from a variety of well-studied electroencephalogram (EEG) signals. For a given BCI paradigm, feature extractors and classifiers are tailored to the distinct characteristics of its expected EEG control signal, limiting its application to th…

    Submitted 15 May, 2018; v1 submitted 23 November, 2016; originally announced November 2016.

    Comments: 30 pages, 10 figures. Added additional feature relevance analyses. Minor change to EEGNet architecture. Source code can be found at https://github.com/vlawhern/arl-eegmodels

  49. arXiv:1608.07261  [pdf, other]

    cs.PL

    Type Inference for Static Compilation of JavaScript (Extended Version)

    Authors: Satish Chandra, Colin S. Gordon, Jean-Baptiste Jeannin, Cole Schlesinger, Manu Sridharan, Frank Tip, Youngil Choi

    Abstract: We present a type system and inference algorithm for a rich subset of JavaScript equipped with objects, structural subtyping, prototype inheritance, and first-class methods. The type system supports abstract and recursive objects, and is expressive enough to accommodate several standard benchmarks with only minor workarounds. The invariants enforced by the types enable an ahead-of-time compiler to…

    Submitted 18 October, 2016; v1 submitted 25 August, 2016; originally announced August 2016.

    Comments: Extended version of OOPSLA 2016 paper of the same name

  50. arXiv:1607.05818  [pdf, ps, other]

    cs.CL

    An Adaptation of Topic Modeling to Sentences

    Authors: Ruey-Cheng Chen, Reid Swanson, Andrew S. Gordon

    Abstract: Advances in topic modeling have yielded effective methods for characterizing the latent semantics of textual data. However, applying standard topic modeling approaches to sentence-level tasks introduces a number of challenges. In this paper, we adapt the approach of latent-Dirichlet allocation to include an additional layer for incorporating information about the sentence boundaries in documents.…

    Submitted 20 July, 2016; originally announced July 2016.

    Comments: 8 pages, 2010, unpublished