Skip to main content

Showing 1–4 of 4 results for author: Bhaskar, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2410.08847  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization

    Authors: Noam Razin, Sadhika Malladi, Adithya Bhaskar, Danqi Chen, Sanjeev Arora, Boris Hanin

    Abstract: Direct Preference Optimization (DPO) and its variants are increasingly used for aligning language models with human preferences. Although these methods are designed to teach a model to generate preferred responses more frequently relative to dispreferred responses, prior work has observed that the likelihood of preferred responses often decreases during training. The current work sheds light on th… ▽ More

    Submitted 27 April, 2025; v1 submitted 11 October, 2024; originally announced October 2024.

    Comments: Accepted to ICLR 2025; Code available at https://github.com/princeton-nlp/unintentional-unalignment

  2. arXiv:1610.07306  [pdf, other

    q-bio.PE stat.ME

    Novel probabilistic models of spatial genetic ancestry with applications to stratification correction in genome-wide association studies

    Authors: Anand Bhaskar, Adel Javanmard, Thomas A. Courtade, David Tse

    Abstract: Genetic variation in human populations is influenced by geographic ancestry due to spatial locality in historical mating and migration patterns. Spatial population structure in genetic datasets has been traditionally analyzed using either model-free algorithms, such as principal components analysis (PCA) and multidimensional scaling, or using explicit spatial probabilistic models of allele frequen… ▽ More

    Submitted 25 October, 2016; v1 submitted 24 October, 2016; originally announced October 2016.

    Comments: Supplementary information included to the main text

  3. arXiv:1310.1068  [pdf, ps, other

    q-bio.PE math.FA stat.AP stat.ME

    A novel spectral method for inferring general diploid selection from time series genetic data

    Authors: Matthias Steinrücken, Anand Bhaskar, Yun S. Song

    Abstract: The increased availability of time series genetic variation data from experimental evolution studies and ancient DNA samples has created new opportunities to identify genomic regions under selective pressure and to estimate their associated fitness parameters. However, it is a challenging problem to compute the likelihood of nonneutral models for the population allele frequency dynamics, given the… ▽ More

    Submitted 26 January, 2015; v1 submitted 3 October, 2013; originally announced October 2013.

    Comments: Published in at http://dx.doi.org/10.1214/14-AOAS764 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS764

    Journal ref: Annals of Applied Statistics 2014, Vol. 8, No. 4, 2203-2222

  4. arXiv:1309.5056  [pdf, ps, other

    q-bio.PE math.ST stat.AP

    Descartes' rule of signs and the identifiability of population demographic models from genomic variation data

    Authors: Anand Bhaskar, Yun S. Song

    Abstract: The sample frequency spectrum (SFS) is a widely-used summary statistic of genomic variation in a sample of homologous DNA sequences. It provides a highly efficient dimensional reduction of large-scale population genomic data and its mathematical dependence on the underlying population demography is well understood, thus enabling the development of efficient inference algorithms. However, it has be… ▽ More

    Submitted 1 December, 2014; v1 submitted 19 September, 2013; originally announced September 2013.

    Comments: Published in at http://dx.doi.org/10.1214/14-AOS1264 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1264

    Journal ref: Annals of Statistics 2014, Vol. 42, No. 6, 2469-2493