Skip to main content

Showing 1–11 of 11 results for author: Carranza, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.08537  [pdf, other

    cs.LG stat.ML

    Robust Offline Policy Learning with Observational Data from Multiple Sources

    Authors: Aldo Gael Carranza, Susan Athey

    Abstract: We consider the problem of using observational bandit feedback data from multiple heterogeneous data sources to learn a personalized decision policy that robustly generalizes across diverse target settings. To achieve this, we propose a minimax regret optimization objective to ensure uniformly low regret under general mixtures of the source distributions. We develop a policy learning algorithm tai… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2305.12407

  2. arXiv:2406.09366  [pdf, other

    cs.LG cs.CV q-bio.NC

    Towards an Improved Understanding and Utilization of Maximum Manifold Capacity Representations

    Authors: Rylan Schaeffer, Victor Lecomte, Dhruv Bhandarkar Pai, Andres Carranza, Berivan Isik, Alyssa Unell, Mikail Khona, Thomas Yerxa, Yann LeCun, SueYeon Chung, Andrey Gromov, Ravid Shwartz-Ziv, Sanmi Koyejo

    Abstract: Maximum Manifold Capacity Representations (MMCR) is a recent multi-view self-supervised learning (MVSSL) method that matches or surpasses other leading MVSSL methods. MMCR is intriguing because it does not fit neatly into any of the commonplace MVSSL lineages, instead originating from a statistical mechanical perspective on the linear separability of data manifolds. In this paper, we seek to impro… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2402.10202  [pdf, other

    cs.LG

    Bridging Associative Memory and Probabilistic Modeling

    Authors: Rylan Schaeffer, Nika Zahedi, Mikail Khona, Dhruv Pai, Sang Truong, Yilun Du, Mitchell Ostrow, Sarthak Chandra, Andres Carranza, Ila Rani Fiete, Andrey Gromov, Sanmi Koyejo

    Abstract: Associative memory and probabilistic modeling are two fundamental topics in artificial intelligence. The first studies recurrent neural networks designed to denoise, complete and retrieve data, whereas the second studies learning and sampling from probability distributions. Based on the observation that associative memory's energy functions can be seen as probabilistic modeling's negative log like… ▽ More

    Submitted 13 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  4. arXiv:2307.10569  [pdf, ps, other

    cs.LG cs.AI

    Deceptive Alignment Monitoring

    Authors: Andres Carranza, Dhruv Pai, Rylan Schaeffer, Arnuv Tandon, Sanmi Koyejo

    Abstract: As the capabilities of large machine learning models continue to grow, and as the autonomy afforded to such models continues to expand, the spectre of a new adversary looms: the models themselves. The threat that a model might behave in a seemingly reasonable manner, while secretly and subtly modifying its behavior for ulterior reasons is often referred to as deceptive alignment in the AI Safety &… ▽ More

    Submitted 25 July, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: Accepted as BlueSky Oral to 2023 ICML AdvML Workshop

  5. arXiv:2307.10563  [pdf, other

    cs.LG cs.AI

    FACADE: A Framework for Adversarial Circuit Anomaly Detection and Evaluation

    Authors: Dhruv Pai, Andres Carranza, Rylan Schaeffer, Arnuv Tandon, Sanmi Koyejo

    Abstract: We present FACADE, a novel probabilistic and geometric framework designed for unsupervised mechanistic anomaly detection in deep neural networks. Its primary goal is advancing the understanding and mitigation of adversarial attacks. FACADE aims to generate probabilistic distributions over circuits, which provide critical insights to their contribution to changes in the manifold properties of pseud… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: Accepted as BlueSky Poster at 2023 ICML AdvML Workshop

  6. arXiv:2305.12407  [pdf, other

    cs.LG cs.DC econ.EM stat.ML

    Federated Offline Policy Learning

    Authors: Aldo Gael Carranza, Susan Athey

    Abstract: We consider the problem of learning personalized decision policies from observational bandit feedback data across multiple heterogeneous data sources. In our approach, we introduce a novel regret analysis that establishes finite-sample upper bounds on distinguishing notions of global regret for all data sources on aggregate and of local regret for any given data source. We characterize these regre… ▽ More

    Submitted 11 October, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

  7. arXiv:2305.05973  [pdf, other

    cs.CL cs.CR cs.IR

    Synthetic Query Generation for Privacy-Preserving Deep Retrieval Systems using Differentially Private Language Models

    Authors: Aldo Gael Carranza, Rezsa Farahani, Natalia Ponomareva, Alex Kurakin, Matthew Jagielski, Milad Nasr

    Abstract: We address the challenge of ensuring differential privacy (DP) guarantees in training deep retrieval systems. Training these systems often involves the use of contrastive-style losses, which are typically non-per-example decomposable, making them difficult to directly DP-train with since common techniques require per-example gradients. To address this issue, we propose an approach that prioritizes… ▽ More

    Submitted 23 May, 2024; v1 submitted 10 May, 2023; originally announced May 2023.

    Comments: Accepted to NAACL 2024

  8. arXiv:2203.16668  [pdf, other

    cs.LG math.ST stat.ME stat.ML

    Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles

    Authors: Aldo Gael Carranza, Sanath Kumar Krishnamurthy, Susan Athey

    Abstract: Contextual bandit algorithms often estimate reward models to inform decision-making. However, true rewards can contain action-independent redundancies that are not relevant for decision-making. We show it is more data-efficient to estimate any function that explains the reward differences between actions, that is, the treatment effects. Motivated by this observation, building on recent work on ora… ▽ More

    Submitted 24 February, 2023; v1 submitted 30 March, 2022; originally announced March 2022.

  9. arXiv:2010.14058  [pdf, other

    cs.SI cs.DS cs.LG

    Heterogeneous Graphlets

    Authors: Ryan A. Rossi, Nesreen K. Ahmed, Aldo Carranza, David Arbour, Anup Rao, Sungchul Kim, Eunyee Koh

    Abstract: In this paper, we introduce a generalization of graphlets to heterogeneous networks called typed graphlets. Informally, typed graphlets are small typed induced subgraphs. Typed graphlets generalize graphlets to rich heterogeneous networks as they explicitly capture the higher-order typed connectivity patterns in such networks. To address this problem, we describe a general framework for counting t… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1901.10026

  10. arXiv:1901.10026  [pdf, other

    cs.SI cs.DM cs.DS cs.LG

    Heterogeneous Network Motifs

    Authors: Ryan A. Rossi, Nesreen K. Ahmed, Aldo Carranza, David Arbour, Anup Rao, Sungchul Kim, Eunyee Koh

    Abstract: Many real-world applications give rise to large heterogeneous networks where nodes and edges can be of any arbitrary type (e.g., user, web page, location). Special cases of such heterogeneous graphs include homogeneous graphs, bipartite, k-partite, signed, labeled graphs, among many others. In this work, we generalize the notion of network motifs to heterogeneous networks. In particular, small ind… ▽ More

    Submitted 10 May, 2019; v1 submitted 28 January, 2019; originally announced January 2019.

  11. arXiv:1810.02959  [pdf, other

    cs.SI cs.LG math.SP

    Higher-order Spectral Clustering for Heterogeneous Graphs

    Authors: Aldo G. Carranza, Ryan A. Rossi, Anup Rao, Eunyee Koh

    Abstract: Higher-order connectivity patterns such as small induced sub-graphs called graphlets (network motifs) are vital to understand the important components (modules/functional units) governing the configuration and behavior of complex networks. Existing work in higher-order clustering has focused on simple homogeneous graphs with a single node/edge type. However, heterogeneous graphs consisting of node… ▽ More

    Submitted 6 October, 2018; originally announced October 2018.