Skip to main content

Showing 1–2 of 2 results for author: Liu, A B

.
  1. arXiv:2403.03218  [pdf, other

    cs.LG cs.AI cs.CL cs.CY

    The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

    Authors: Nathaniel Li, Alexander Pan, Anjali Gopal, Summer Yue, Daniel Berrios, Alice Gatti, Justin D. Li, Ann-Kathrin Dombrowski, Shashwat Goel, Long Phan, Gabriel Mukobi, Nathan Helm-Burger, Rassin Lababidi, Lennart Justen, Andrew B. Liu, Michael Chen, Isabelle Barrass, Oliver Zhang, Xiaoyuan Zhu, Rishub Tamirisa, Bhrugu Bharathi, Adam Khoja, Zhenqi Zhao, Ariel Herbert-Voss, Cort B. Breuer , et al. (32 additional authors not shown)

    Abstract: The White House Executive Order on Artificial Intelligence highlights the risks of large language models (LLMs) empowering malicious actors in developing biological, cyber, and chemical weapons. To measure these risks of malicious use, government institutions and major AI labs are developing evaluations for hazardous capabilities in LLMs. However, current evaluations are private, preventing furthe… ▽ More

    Submitted 15 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: See the project page at https://wmdp.ai

  2. Refined height pairing

    Authors: Bruno Kahn, with an appendix by Qing Liu

    Abstract: For a $d$-dimensional smooth projective variety $X$ over the function field of a smooth variety $B$ over a field $k$ and for $i\ge 0$, we define a subgroup $CH^i(X)^{(0)}$ of $CH^i(X)$ and construct a "refined height pairing" \[CH^i(X)^{(0)}\times CH^{d+1-i}(X)^{(0)}\to CH^1(B)\] in the category of abelian groups modulo isogeny. For $i=1,d$, $CH^i(X)^{(0)}$ is the group of cycles numerically equiv… ▽ More

    Submitted 6 December, 2023; v1 submitted 1 September, 2020; originally announced September 2020.

    Comments: To appear in Alg. & Number theory. Added after Def. 2.2: Even if it is not apparent anymore, this definition was inspired by [8, Assumption 2] and [5, 1.2]

    Journal ref: Alg. Number Th. 18 (2024) 1039-1079