Showing 1–15 of 15 results for author: John, P

Searching in archive cs.
  1. arXiv:2412.10751  [pdf, ps, other]

    cs.LG cs.GT

    p-Mean Regret for Stochastic Bandits

    Authors: Anand Krishna, Philips George John, Adarsh Barik, Vincent Y. F. Tan

    Abstract: In this work, we extend the concept of the $p$-mean welfare objective from social choice theory (Moulin 2004) to study $p$-mean regret in stochastic multi-armed bandit problems. The $p$-mean regret, defined as the difference between the optimal mean among the arms and the $p$-mean of the expected rewards, offers a flexible framework for evaluating bandit algorithms, enabling algorithm designers to…

    Submitted 14 December, 2024; originally announced December 2024.

    Report number: Accepted to AAAI 2025

  2. arXiv:2411.12700  [pdf, other]

    cs.LG cs.DS cs.IT stat.ML

    Learning multivariate Gaussians with imperfect advice

    Authors: Arnab Bhattacharyya, Davin Choo, Philips George John, Themis Gouleakis

    Abstract: We revisit the problem of distribution learning within the framework of learning-augmented algorithms. In this setting, we explore the scenario where a probability distribution is provided as potentially inaccurate advice on the true, unknown distribution. Our objective is to develop learning algorithms whose sample complexity decreases as the quality of the advice improves, thereby surpassing sta…

    Submitted 31 January, 2025; v1 submitted 19 November, 2024; originally announced November 2024.

  3. arXiv:2411.10906  [pdf, other]

    cs.LG cs.DS

    Efficient, Low-Regret, Online Reinforcement Learning for Linear MDPs

    Authors: Philips George John, Arnab Bhattacharyya, Silviu Maniu, Dimitrios Myrisiotis, Zhenan Wu

    Abstract: Reinforcement learning algorithms are usually stated without theoretical guarantees regarding their performance. Recently, Jin, Yang, Wang, and Jordan (COLT 2020) showed a polynomial-time reinforcement learning algorithm (namely, LSVI-UCB) for the setting of linear Markov decision processes, and provided theoretical guarantees regarding its running time and regret. In real-world scenarios, however…

    Submitted 16 November, 2024; originally announced November 2024.

    Comments: 27 pages, 9 figures

  4. arXiv:2411.10548  [pdf, ps, other]

    cs.LG q-bio.BM

    BioNeMo Framework: a modular, high-performance library for AI model development in drug discovery

    Authors: Peter St. John, Dejun Lin, Polina Binder, Malcolm Greaves, Vega Shah, John St. John, Adrian Lange, Patrick Hsu, Rajesh Illango, Arvind Ramanathan, Anima Anandkumar, David H Brookes, Akosua Busia, Abhishaike Mahajan, Stephen Malina, Neha Prasad, Sam Sinai, Lindsay Edwards, Thomas Gaudelet, Cristian Regep, Martin Steinegger, Burkhard Rost, Alexander Brace, Kyle Hippe, Luca Naef, et al. (68 additional authors not shown)

    Abstract: Artificial Intelligence models encoding biology and chemistry are opening new routes to high-throughput and high-quality in-silico drug development. However, their training increasingly relies on computational scale, with recent protein language models (pLM) training on hundreds of graphical processing units (GPUs). We introduce the BioNeMo Framework to facilitate the training of computational bio…

    Submitted 12 June, 2025; v1 submitted 15 November, 2024; originally announced November 2024.

  5. arXiv:2405.07914  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Distribution Learning Meets Graph Structure Sampling

    Authors: Arnab Bhattacharyya, Sutanu Gayen, Philips George John, Sayantan Sen, N. V. Vinodchandran

    Abstract: This work establishes a novel link between the problem of PAC-learning high-dimensional graphical models and the task of (efficient) counting and sampling of graph structures, using an online learning framework. We observe that if we apply the exponentially weighted average (EWA) or randomized weighted majority (RWM) forecasters on a sequence of samples from a distribution P using the log loss f…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 48 pages, 2 figures. Shortened abstract as per arXiv criteria

  6. Plug & Play Directed Evolution of Proteins with Gradient-based Discrete MCMC

    Authors: Patrick Emami, Aidan Perreault, Jeffrey Law, David Biagioni, Peter C. St. John

    Abstract: A long-standing goal of machine-learning-based protein engineering is to accelerate the discovery of novel mutations that improve the function of a known protein. We introduce a sampling framework for evolving proteins in silico that supports mixing and matching a variety of unsupervised models, such as protein language models, and supervised models that predict protein function from sequence. By…

    Submitted 6 April, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: 31 pages, 8 figures. To appear in the Machine Learning: Science & Technology (ML:S&T) journal. Code is available at https://github.com/pemami4911/ppde. A short version of this work appeared at the NeurIPS 2022 Machine Learning in Structural Biology Workshop

  7. arXiv:2006.11737  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    Verifying Individual Fairness in Machine Learning Models

    Authors: Philips George John, Deepak Vijaykeerthy, Diptikalyan Saha

    Abstract: We consider the problem of whether a given decision model, working with structured data, has individual fairness. Following the work of Dwork, a model is individually biased (or unfair) if there is a pair of valid inputs which are close to each other (according to an appropriate metric) but are treated differently by the model (different class label, or large difference in output), and it is unbia…

    Submitted 21 June, 2020; originally announced June 2020.

    Comments: An extended version of the paper accepted at UAI 2020, 12 pages, code is available at https://github.com/philips-george/ifv-uai-2020

  8. arXiv:1905.11612

    cs.CC cs.DM cs.DS

    Average Bias and Polynomial Sources

    Authors: Arnab Bhattacharyya, Philips George John, Suprovat Ghoshal, Raghu Meka

    Abstract: We identify a new notion of pseudorandomness for randomness sources, which we call the average bias. Given a distribution $Z$ over $\{0,1\}^n$, its average bias is: $b_{\text{av}}(Z) =2^{-n} \sum_{c \in \{0,1\}^n} |\mathbb{E}_{z \sim Z}(-1)^{\langle c, z\rangle}|$. A source with average bias at most $2^{-k}$ has min-entropy at least $k$, and so low average bias is a stronger condition than high mi…

    Submitted 30 May, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: We found out one of the main results has a much easier and direct proof

  9. arXiv:1808.09037  [pdf]

    cs.CY cs.CL cs.IT physics.soc-ph

    Measuring the Volatility of the Political agenda in Public Opinion and News Media

    Authors: Chico Q. Camargo, Scott A. Hale, Peter John, Helen Z. Margetts

    Abstract: Recent election surprises, regime changes, and political shocks indicate that political agendas have become more fast-moving and volatile. The ability to measure the complex dynamics of agenda change and capture the nature and extent of volatility in political systems is therefore more crucial than ever before. This study proposes a definition and operationalization of volatility that combines ins…

    Submitted 19 September, 2021; v1 submitted 27 August, 2018; originally announced August 2018.

    Comments: Copyright is held by the authors, and the article is published by Oxford University Press

    Journal ref: Public Opinion Quarterly, nfab032, 2021

  10. arXiv:1807.10363  [pdf, other]

    physics.comp-ph cs.LG stat.ML

    Message-passing neural networks for high-throughput polymer screening

    Authors: Peter C. St. John, Caleb Phillips, Travis W. Kemper, A. Nolan Wilson, Michael F. Crowley, Mark R. Nimlos, Ross E. Larsen

    Abstract: Machine learning methods have shown promise in predicting molecular properties, and given sufficient training data machine learning approaches can enable rapid high-throughput virtual screening of large libraries of compounds. Graph-based neural network architectures have emerged in recent years as the most successful approach for predictions based on molecular structure, and have consistently ach…

    Submitted 5 April, 2019; v1 submitted 26 July, 2018; originally announced July 2018.

    Comments: 7 pages, 3 figures

  11. arXiv:1806.00793  [pdf, other]

    cs.CL cs.CY

    Transfer Topic Labeling with Domain-Specific Knowledge Base: An Analysis of UK House of Commons Speeches 1935-2014

    Authors: Alexander Herzog, Peter John, Slava Jankin Mikhaylov

    Abstract: Topic models are widely used in natural language processing, allowing researchers to estimate the underlying themes in a collection of documents. Most topic models use unsupervised methods and hence require the additional step of attaching meaningful labels to estimated topics. This process of manual labeling is not scalable and suffers from human bias. We present a semi-automatic transfer topic l…

    Submitted 27 August, 2018; v1 submitted 3 June, 2018; originally announced June 2018.

  12. arXiv:1609.00475  [pdf, other]

    cs.NI

    868 MHz Wireless Sensor Network - A Study

    Authors: Pushpam Aji John, Rudolf Agren, Yu-Jung Chen, Christian Rohner, Edith Ngai

    Abstract: Today 2.4 GHz based wireless sensor networks are increasing at a tremendous pace, and are seen in widespread applications. Product innovation and support by many vendors in 2.4 GHz makes it a preferred choice, but the networks are prone to issues like interference, and range issues. On the other hand, the less popular 868 MHz in the ISM band has not seen significant usage. In this paper we explore…

    Submitted 2 September, 2016; originally announced September 2016.

    Comments: 11th Swedish National Computer Networking Workshop SNCNW 2015

  13. arXiv:1510.03797  [pdf, other]

    physics.soc-ph cs.CL cs.SI

    Complex Politics: A Quantitative Semantic and Topological Analysis of UK House of Commons Debates

    Authors: Stefano Gurciullo, Michael Smallegan, María Pereda, Federico Battiston, Alice Patania, Sebastian Poledna, Daniel Hedblom, Bahattin Tolga Oztan, Alexander Herzog, Peter John, Slava Mikhaylov

    Abstract: This study is a first, exploratory attempt to use quantitative semantics techniques and topological analysis to analyze systemic patterns arising in a complex political system. In particular, we use a rich data set covering all speeches and debates in the UK House of Commons between 1975 and 2014. By the use of dynamic topic modeling (DTM) and topological data analysis (TDA) we show that both memb…

    Submitted 13 October, 2015; originally announced October 2015.

    MSC Class: 91F10

  14. arXiv:1408.3562  [pdf]

    physics.soc-ph cs.CY cs.SI physics.data-an

    Investigating Political Participation and Social Information Using Big Data and a Natural Experiment

    Authors: Scott A. Hale, Peter John, Helen Margetts, Taha Yasseri

    Abstract: Social information is particularly prominent in digital settings where the design of platforms can more easily give real-time information about the behaviour of peers and reference groups and thereby stimulate political activity. Changes to these platforms can generate natural experiments allowing an assessment of the impact of changes in social information and design on participation. This paper…

    Submitted 15 August, 2014; originally announced August 2014.

    Comments: Prepared for delivery at the 2014 Annual Meeting of the American Political Science Association, August 28-31, 2014

    Journal ref: PLOS ONE 13(4): e0196068 (2018)

  15. Leadership without Leaders? Starters and Followers in Online Collective Action

    Authors: Helen Z. Margetts, Peter John, Scott A. Hale, Stéphane Reissfelder

    Abstract: The Internet has been ascribed a prominent role in collective action, particularly with widespread use of social media. But most mobilisations fail. We investigate the characteristics of those few mobilisations that succeed and hypothesise that the presence of 'starters' with low thresholds for joining will determine whether a mobilisation achieves success, as suggested by threshold models. We use…

    Submitted 1 August, 2013; originally announced August 2013.