
Showing 1–23 of 23 results for author: Karp, S

  1. arXiv:2506.20792

    math.CO math.AG

    Richardson tableaux and components of Springer fibers equal to Richardson varieties

    Authors: Steven N. Karp, Martha E. Precup

    Abstract: Motivated by the study of Springer fibers and their totally nonnegative counterparts, we define a new subset of standard tableaux called Richardson tableaux. We characterize Richardson tableaux combinatorially using evacuation as well as in terms of a pair of associated reading words. We also characterize Richardson tableaux geometrically, proving that a tableau is Richardson if and only if the co…

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: 58 pages

    MSC Class: 14M15; 05A15; 05E10; 14B05; 14N15; 15B48

  2. arXiv:2503.20020

    cs.RO

    Gemini Robotics: Bringing AI into the Physical World

    Authors: Gemini Robotics Team, Saminda Abeyruwan, Joshua Ainslie, Jean-Baptiste Alayrac, Montserrat Gonzalez Arenas, Travis Armstrong, Ashwin Balakrishna, Robert Baruch, Maria Bauza, Michiel Blokzijl, Steven Bohez, Konstantinos Bousmalis, Anthony Brohan, Thomas Buschmann, Arunkumar Byravan, Serkan Cabi, Ken Caluwaerts, Federico Casarini, Oscar Chang, Jose Enrique Chen, Xi Chen, Hao-Tien Lewis Chiang, Krzysztof Choromanski, David D'Ambrosio, Sudeep Dasari, et al. (93 additional authors not shown)

    Abstract: Recent advancements in large multimodal models have led to the emergence of remarkable generalist capabilities in digital domains, yet their translation to physical agents such as robots remains a significant challenge. This report introduces a new family of AI models purposefully designed for robotics and built upon the foundation of Gemini 2.0. We present Gemini Robotics, an advanced Vision-Lang…

    Submitted 25 March, 2025; originally announced March 2025.

  3. arXiv:2409.19044

    cs.CL cs.AI cs.LG

    On the Inductive Bias of Stacking Towards Improving Reasoning

    Authors: Nikunj Saunshi, Stefani Karp, Shankar Krishnan, Sobhan Miryoosefi, Sashank J. Reddi, Sanjiv Kumar

    Abstract: Given the increasing scale of model sizes, novel training strategies like gradual stacking [Gong et al., 2019, Reddi et al., 2023] have garnered interest. Stacking enables efficient training by gradually growing the depth of a model in stages and using layers from a smaller model in an earlier stage to initialize the next stage. Although efficient for training, the model biases induced by such gro…

    Submitted 27 September, 2024; originally announced September 2024.

    Comments: Accepted at NeurIPS 2024

  4. arXiv:2406.02469

    cs.LG cs.CL

    Landscape-Aware Growing: The Power of a Little LAG

    Authors: Stefani Karp, Nikunj Saunshi, Sobhan Miryoosefi, Sashank J. Reddi, Sanjiv Kumar

    Abstract: Recently, there has been increasing interest in efficient pretraining paradigms for training Transformer-based models. Several recent approaches use smaller models to initialize larger models in order to save computation (e.g., stacking and fusion). In this work, we study the fundamental question of how to select the best growing strategy from a given pool of growing strategies. Prior works have e…

    Submitted 4 June, 2024; originally announced June 2024.

  5. arXiv:2405.20229

    math.CV math-ph math.RT

    Positivity and universal Plücker coordinates for spaces of quasi-exponentials

    Authors: Steven N. Karp, Evgeny Mukhin, Vitaly Tarasov

    Abstract: A quasi-exponential is an entire function of the form $e^{cu}p(u)$, where $p(u)$ is a polynomial and $c \in \mathbb{C}$. Let $V = \langle e^{h_1u}p_1(u), \dots, e^{h_Nu}p_N(u) \rangle$ be a vector space with a basis of quasi-exponentials. We show that if $h_1, \dots, h_N$ are nonnegative and all of the complex zeros of the Wronskian $\operatorname{Wr}(V)$ are real, then $V$ is totally nonnegative…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 24 pages

    MSC Class: 82B23; 15B48; 05E05; 14M15; 30C15

  6. arXiv:2403.15707

    cs.LG cs.AI cs.CV stat.ML

    Role of Locality and Weight Sharing in Image-Based Tasks: A Sample Complexity Separation between CNNs, LCNs, and FCNs

    Authors: Aakash Lahoti, Stefani Karp, Ezra Winston, Aarti Singh, Yuanzhi Li

    Abstract: Vision tasks are characterized by the properties of locality and translation invariance. The superior performance of convolutional neural networks (CNNs) on these tasks is widely attributed to the inductive bias of locality and weight sharing baked into their architecture. Existing attempts to quantify the statistical benefits of these biases in CNNs over locally connected convolutional neural net…

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 40 pages, 4 figures, Accepted to ICLR 2024, Spotlight

  7. arXiv:2311.15404

    cs.LG cond-mat.dis-nn stat.ML

    Applying statistical learning theory to deep learning

    Authors: Cédric Gerbelot, Avetik Karagulyan, Stefani Karp, Kavya Ravichandran, Menachem Stern, Nathan Srebro

    Abstract: Although statistical learning theory provides a robust framework to understand supervised learning, many theoretical aspects of deep learning remain unclear, in particular how different architectures may lead to inductive bias when trained using gradient based methods. The goal of these lectures is to provide an overview of some of the main questions that arise when attempting to understand deep l…

    Submitted 25 March, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

    Comments: 66 pages, 20 figures

  8. arXiv:2309.04645

    math.RT math.AG math.CO math.QA

    Universal Plücker coordinates for the Wronski map and positivity in real Schubert calculus

    Authors: Steven N. Karp, Kevin Purbhoo

    Abstract: Given a $d$-dimensional vector space $V \subset \mathbb{C}[u]$ of polynomials, its Wronskian is the polynomial $(u + z_1) \cdots (u + z_n)$ whose zeros $-z_i$ are the points of $\mathbb{C}$ such that $V$ contains a nonzero polynomial with a zero of order at least $d$ at $-z_i$. Equivalently, $V$ is a solution to the Schubert problem defined by osculating planes to the moment curve at…

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: 70 pages

    MSC Class: 14M15; 20C30; 81R05 (Primary) 05E05; 15B48 (Secondary)

  9. arXiv:2304.10697

    nlin.SI math.CO math.DS

    Symmetric Toda, gradient flows, and tridiagonalization

    Authors: Anthony M. Bloch, Steven N. Karp

    Abstract: The Toda lattice (1967) is a Hamiltonian system given by $n$ points on a line governed by an exponential potential. Flaschka (1974) showed that the Toda lattice is integrable by interpreting it as a flow on the space of symmetric tridiagonal $n\times n$ matrices, while Moser (1975) showed that it is a gradient flow on a projective space. The symmetric Toda flow of Deift, Li, Nanda, and Tomei (1986…

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 21 pages

    Journal ref: Phys. D 450 (2023), Paper No. 133766, 10 pages

  10. Anisotropic Satellite Galaxy Quenching: A Unique Signature of Energetic Feedback by Supermassive Black Holes?

    Authors: Juliana S. M. Karp, Johannes U. Lange, Risa H. Wechsler

    Abstract: The quenched fraction of satellite galaxies is aligned with the orientation of the halo's central galaxy, such that on average, satellites form stars at a lower rate along the major axis of the central. This effect, called anisotropic satellite galaxy quenching (ASGQ), has been found in observational data and cosmological simulations. Analyzing the IllustrisTNG simulation, Martín-Navarro et al. (2…

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 7 pages, 4 figures; Submitted to ApJL; Comments welcome!

  11. arXiv:2207.12590

    math.CO math.PR math.RT

    q-Whittaker functions, finite fields, and Jordan forms

    Authors: Steven N. Karp, Hugh Thomas

    Abstract: The $q$-Whittaker function $W_\lambda(\mathbf{x};q)$ associated to a partition $\lambda$ is a $q$-analogue of the Schur function $s_\lambda(\mathbf{x})$, and is defined as the $t=0$ specialization of the Macdonald polynomial $P_\lambda(\mathbf{x};q,t)$. We show combinatorially how to expand $W_\lambda(\mathbf{x};q)$ in terms of partial flags compatible with a nilpotent endomorphism over the finite field of size $1/q$. This yie…

    Submitted 10 February, 2025; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: 72 pages. v2: Added Remark 5.11. v3: Revised Section 7.2, other minor changes

    MSC Class: 05E05; 60C05; 14M15; 15B33; 16G20; 05A30

  12. arXiv:2206.05806

    math.CO math.AG math.RT

    On two notions of total positivity for partial flag varieties

    Authors: Anthony M. Bloch, Steven N. Karp

    Abstract: Given integers $1 \le k_1 < \cdots < k_l \le n-1$, let $\text{Fl}_{k_1,\dots,k_l;n}$ denote the type $A$ partial flag variety consisting of all chains of subspaces $(V_{k_1}\subset\cdots\subset V_{k_l})$ inside $\mathbb{R}^n$, where each $V_k$ has dimension $k$. Lusztig (1994, 1998) introduced the totally positive part $\text{Fl}_{k_1,\dots,k_l;n}^{>0}$ as the subset of partial flags which can be…

    Submitted 10 October, 2022; v1 submitted 12 June, 2022; originally announced June 2022.

    Comments: 21 pages. v2: Minor changes

    MSC Class: 15B48; 14M15; 52B40; 81T60

    Journal ref: Adv. Math. 414 (2023), Paper No. 108855, 24 pages

  13. arXiv:2201.13419

    cs.LG math.OC stat.ML

    Agnostic Learnability of Halfspaces via Logistic Loss

    Authors: Ziwei Ji, Kwangjun Ahn, Pranjal Awasthi, Satyen Kale, Stefani Karp

    Abstract: We investigate approximation guarantees provided by logistic regression for the fundamental problem of agnostic learning of homogeneous halfspaces. Previously, for a certain broad class of "well-behaved" distributions on the examples, Diakonikolas et al. (2020) proved an $\tilde{\Omega}(\textrm{OPT})$ lower bound, while Frei et al. (2021) proved an $\tilde{O}(\sqrt{\textrm{OPT}})$ upper bound, where…

    Submitted 31 January, 2022; originally announced January 2022.

  14. Wronskians, total positivity, and real Schubert calculus

    Authors: Steven N. Karp

    Abstract: A complete flag in $\mathbb{R}^n$ is a sequence of nested subspaces $V_1 \subset \cdots \subset V_{n-1}$ such that each $V_k$ has dimension $k$. It is called totally nonnegative if all its Plücker coordinates are nonnegative. We may view each $V_k$ as a subspace of polynomials in $\mathbb{R}[x]$ of degree at most $n-1$, by associating a vector $(a_1, \dots, a_n)$ in $\mathbb{R}^n$ to the polynomia…

    Submitted 1 September, 2023; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: 25 pages. v2: Updated references. v3: Clarified references to the secant conjecture

    MSC Class: 14M15; 15B48; 14N15; 14P05; 41A50; 34C10

    Journal ref: Selecta Math. (N.S.) 30 (2024), no. 1, Paper No. 1, 28 pages

  15. arXiv:2109.04558

    math.CO math-ph math.DG math.DS

    Gradient flows, adjoint orbits, and the topology of totally nonnegative flag varieties

    Authors: Anthony M. Bloch, Steven N. Karp

    Abstract: One can view a partial flag variety in $\mathbb{C}^n$ as an adjoint orbit $\mathcal{O}_\lambda$ inside the Lie algebra of $n \times n$ skew-Hermitian matrices. We use the orbit context to study the totally nonnegative part of a partial flag variety from an algebraic, geometric, and dynamical perspective. The paper has three main parts: (1) We introduce the totally nonnegative part of $\mathcal{O}_\lambda$,…

    Submitted 22 November, 2021; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: 79 pages. v2: Updated references. v3: Minor changes

    MSC Class: 15B48; 14M15; 20G20; 17B45; 81T60; 37J35

    Journal ref: Comm. Math. Phys. 398 (2023), no. 3, 1213-1289

  16. Shelling the m=1 amplituhedron

    Authors: Steven N. Karp, John Machacek

    Abstract: The amplituhedron $\mathcal{A}_{n,k,m}$ was introduced by Arkani-Hamed and Trnka (2014) in order to give a geometric basis for calculating scattering amplitudes in planar $\mathcal{N}=4$ supersymmetric Yang-Mills theory. It is a projection inside the Grassmannian $\text{Gr}_{k,k+m}$ of the totally nonnegative part of $\text{Gr}_{k,n}$. Karp and Williams (2019) studied the $m=1$ amplituhedron…

    Submitted 10 October, 2022; v1 submitted 6 April, 2021; originally announced April 2021.

    Comments: 20 pages. v2: Minor changes

    MSC Class: 06A07; 14M15; 81T60; 05A19

    Journal ref: Comb. Theory 3 (2023), no. 1, Paper No. 6, 22 pages

  17. arXiv:1904.00527

    math.CO math.AG math.GT math.RT

    Regularity theorem for totally nonnegative flag varieties

    Authors: Pavel Galashin, Steven N. Karp, Thomas Lam

    Abstract: We show that the totally nonnegative part of a partial flag variety $G/P$ (in the sense of Lusztig) is a regular CW complex, confirming a conjecture of Williams. In particular, the closure of each positroid cell inside the totally nonnegative Grassmannian is homeomorphic to a ball, confirming a conjecture of Postnikov.

    Submitted 12 April, 2021; v1 submitted 31 March, 2019; originally announced April 2019.

    Comments: 63 pages, 2 figures; v2: Minor changes; v3: Final version to appear in J. Amer. Math. Soc.

    MSC Class: Primary: 14M15. Secondary: 05E45; 15B48; 20G20

    Journal ref: J. Amer. Math. Soc. 35 (2022), no. 2, 513-579

  18. arXiv:1805.06004

    math.CO math.AG

    Moment curves and cyclic symmetry for positive Grassmannians

    Authors: Steven N. Karp

    Abstract: We show that for each k and n, the cyclic shift map on the complex Grassmannian Gr(k,n) has exactly $\binom{n}{k}$ fixed points. There is a unique totally nonnegative fixed point, given by taking n equally spaced points on the trigonometric moment curve (if k is odd) or the symmetric moment curve (if k is even). We introduce a parameter q, and show that the fixed points of a q-deformation of the c…

    Submitted 10 July, 2019; v1 submitted 15 May, 2018; originally announced May 2018.

    Comments: 18 pages. v2: Minor changes

    MSC Class: 14M15; 14N35; 15B48; 52Bxx

    Journal ref: Bull. Lond. Math. Soc. 51 (2019), no. 5, 900-916

  19. arXiv:1801.08953

    math.RT math.AG math.CO

    The totally nonnegative part of G/P is a ball

    Authors: Pavel Galashin, Steven N. Karp, Thomas Lam

    Abstract: We show that the totally nonnegative part of a partial flag variety (in the sense of Lusztig) is homeomorphic to a closed ball.

    Submitted 9 April, 2019; v1 submitted 26 January, 2018; originally announced January 2018.

    Comments: 6 pages. v2: Proof of Lemma 1 moved to arXiv:1707.02010. v3: Minor changes

    MSC Class: 14M15; 15B48; 20Gxx

    Journal ref: Adv. Math. 351 (2019), 614-620

  20. arXiv:1708.09525

    math.CO hep-th

    Decompositions of amplituhedra

    Authors: Steven N. Karp, Lauren K. Williams, Yan X Zhang

    Abstract: The (tree) amplituhedron A(n,k,m) is the image in the Grassmannian Gr(k,k+m) of the totally nonnegative part of Gr(k,n), under a (map induced by a) linear map which is totally positive. It was introduced by Arkani-Hamed and Trnka in 2013 in order to give a geometric basis for the computation of scattering amplitudes in N=4 supersymmetric Yang-Mills theory. In the case relevant to physics (m=4), th…

    Submitted 30 August, 2017; originally announced August 2017.

    Comments: 46 pages; appendix written with Hugh Thomas

    Journal ref: Ann. Inst. Henri Poincaré D 7 (2020), no. 3, 303-363

  21. arXiv:1707.02010

    math.CO hep-th math.GT

    The totally nonnegative Grassmannian is a ball

    Authors: Pavel Galashin, Steven N. Karp, Thomas Lam

    Abstract: We prove that three spaces of importance in topological combinatorics are homeomorphic to closed balls: the totally nonnegative Grassmannian, the compactification of the space of electrical networks, and the cyclically symmetric amplituhedron.

    Submitted 7 July, 2021; v1 submitted 6 July, 2017; originally announced July 2017.

    Comments: 19 pages. v2: Exposition improved in many places. v3: Final version

    MSC Class: 05E45; 14M15; 15B48; 52Bxx

    Journal ref: Adv. Math. 397 (2022), Paper No. 108123, 23 pages

  22. arXiv:1608.08288

    math.CO hep-th

    The m=1 amplituhedron and cyclic hyperplane arrangements

    Authors: Steven N. Karp, Lauren K. Williams

    Abstract: The (tree) amplituhedron A(n,k,m) is the image in the Grassmannian Gr(k,k+m) of the totally nonnegative part of Gr(k,n), under a (map induced by a) linear map which is totally positive. It was introduced by Arkani-Hamed and Trnka in 2013 in order to give a geometric basis for the computation of scattering amplitudes in N=4 supersymmetric Yang-Mills theory. When k+m=n, the amplituhedron is isomorph…

    Submitted 9 April, 2019; v1 submitted 29 August, 2016; originally announced August 2016.

    Comments: 50 pages. v2: Final version

    Journal ref: Int. Math. Res. Not. IMRN (2019), no. 5, 1401-1462

  23. arXiv:1503.05622

    math.CO hep-th math.CA

    Sign variation, the Grassmannian, and total positivity

    Authors: Steven N. Karp

    Abstract: The totally nonnegative Grassmannian is the set of k-dimensional subspaces V of R^n whose nonzero Plücker coordinates all have the same sign. Gantmakher and Krein (1950) and Schoenberg and Whitney (1951) independently showed that V is totally nonnegative iff every vector in V, when viewed as a sequence of n numbers and ignoring any zeros, changes sign at most k-1 times. We generalize this result…

    Submitted 10 August, 2016; v1 submitted 18 March, 2015; originally announced March 2015.

    Comments: 28 pages. v2: We characterize when a generalized amplituhedron construction is well defined, in new Section 4 (the previous Section 4 is now Section 5); v3: Final version to appear in J. Combin. Theory Ser. A

    Journal ref: J. Combin. Theory Ser. A 145 (2017), 308-339