Skip to main content

Showing 1–48 of 48 results for author: Segal, E

.
  1. arXiv:2505.16490  [pdf, ps, other

    eess.AS

    HPP-Voice: A Large-Scale Evaluation of Speech Embeddings for Multi-Phenotypic Classification

    Authors: David Krongauz, Hido Pinto, Sarah Kohn, Yanir Marmor, Eran Segal

    Abstract: Human speech contains paralinguistic cues that reflect a speaker's physiological and neurological state, potentially enabling non-invasive detection of various medical phenotypes. We introduce the Human Phenotype Project Voice corpus (HPP-Voice): a dataset of 7,188 recordings in which Hebrew-speaking adults count for 30 seconds, with each speaker linked to up to 15 potentially voice-related phenot… ▽ More

    Submitted 25 May, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

    Comments: supplementary figures added; typos corrected

  2. arXiv:2505.00949  [pdf, other

    cs.CL cs.AI cs.LG

    Llama-Nemotron: Efficient Reasoning Models

    Authors: Akhiad Bercovich, Itay Levy, Izik Golan, Mohammad Dabbah, Ran El-Yaniv, Omri Puny, Ido Galil, Zach Moshe, Tomer Ronen, Najeeb Nabwani, Ido Shahaf, Oren Tropp, Ehud Karpas, Ran Zilberstein, Jiaqi Zeng, Soumye Singhal, Alexander Bukharin, Yian Zhang, Tugrul Konuk, Gerald Shen, Ameya Sunil Mahabaleshwarkar, Bilal Kartal, Yoshi Suhara, Olivier Delalleau, Zijia Chen , et al. (109 additional authors not shown)

    Abstract: We introduce the Llama-Nemotron series of models, an open family of heterogeneous reasoning models that deliver exceptional reasoning capabilities, inference efficiency, and an open license for enterprise use. The family comes in three sizes -- Nano (8B), Super (49B), and Ultra (253B) -- and performs competitively with state-of-the-art reasoning models such as DeepSeek-R1 while offering superior i… ▽ More

    Submitted 14 May, 2025; v1 submitted 1 May, 2025; originally announced May 2025.

  3. arXiv:2504.03624  [pdf, other

    cs.CL cs.AI cs.LG

    Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

    Authors: NVIDIA, :, Aaron Blakeman, Aarti Basant, Abhinav Khattar, Adithya Renduchintala, Akhiad Bercovich, Aleksander Ficek, Alexis Bjorlin, Ali Taghibakhshi, Amala Sanjay Deshmukh, Ameya Sunil Mahabaleshwarkar, Andrew Tao, Anna Shors, Ashwath Aithal, Ashwin Poojary, Ayush Dattagupta, Balaram Buddharaju, Bobby Chen, Boris Ginsburg, Boxin Wang, Brandon Norick, Brian Butterfield, Bryan Catanzaro, Carlo del Mundo , et al. (176 additional authors not shown)

    Abstract: As inference-time scaling becomes critical for enhanced reasoning capabilities, it is increasingly becoming important to build models that are efficient to infer. We introduce Nemotron-H, a family of 8B and 56B/47B hybrid Mamba-Transformer models designed to reduce inference cost for a given accuracy level. To achieve this goal, we replace the majority of self-attention layers in the common Transf… ▽ More

    Submitted 15 April, 2025; v1 submitted 4 April, 2025; originally announced April 2025.

  4. arXiv:2504.00036  [pdf, other

    q-bio.QM cs.AI cs.LG

    Improving Diseases Predictions Utilizing External Bio-Banks

    Authors: Hido Pinto, Eran Segal

    Abstract: Machine learning has been successfully used in critical domains, such as medicine. However, extracting meaningful insights from biomedical data is often constrained by the lack of their available disease labels. In this research, we demonstrate how machine learning can be leveraged to enhance explainability and uncover biologically meaningful associations, even when predictive improvements in dise… ▽ More

    Submitted 30 March, 2025; originally announced April 2025.

  5. arXiv:2503.18908  [pdf, other

    cs.LG

    FFN Fusion: Rethinking Sequential Computation in Large Language Models

    Authors: Akhiad Bercovich, Mohammad Dabbah, Omri Puny, Ido Galil, Amnon Geifman, Yonatan Geifman, Izhak Golan, Ehud Karpas, Itay Levy, Zach Moshe, Najeeb Nabwani, Tomer Ronen, Itamar Schen, Elad Segal, Ido Shahaf, Oren Tropp, Ran Zilberstein, Ran El-Yaniv

    Abstract: We introduce FFN Fusion, an architectural optimization technique that reduces sequential computation in large language models by identifying and exploiting natural opportunities for parallelization. Our key insight is that sequences of Feed-Forward Network (FFN) layers, particularly those remaining after the removal of specific attention layers, can often be parallelized with minimal accuracy impa… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

  6. arXiv:2412.16276  [pdf, other

    q-bio.QM cs.LG

    SGAC: A Graph Neural Network Framework for Imbalanced and Structure-Aware AMP Classification

    Authors: Yingxu Wang, Victor Liang, Nan Yin, Siwei Liu, Eran Segal

    Abstract: Classifying antimicrobial peptides(AMPs) from the vast array of peptides mined from metagenomic sequencing data is a significant approach to addressing the issue of antibiotic resistance. However, current AMP classification methods, primarily relying on sequence-based data, neglect the spatial structure of peptides, thereby limiting the accurate classification of AMPs. Additionally, the number of… ▽ More

    Submitted 20 December, 2024; originally announced December 2024.

  7. arXiv:2412.14748  [pdf, other

    math.AG

    A short guide to GKZ

    Authors: Ed Segal

    Abstract: These notes are a brief summary of the main results from the book `Discriminants, Resultants and Multidimensional Determinants' by Gelfand-Kapranov-Zelevinsky. We sketch the key ideas involved in the proofs, using as little technical background as possible.

    Submitted 19 December, 2024; originally announced December 2024.

    Comments: 21 pages

    MSC Class: 14-01

  8. arXiv:2412.06993  [pdf, other

    cs.AI cs.LG q-bio.QM

    Toward AI-Driven Digital Organism: Multiscale Foundation Models for Predicting, Simulating and Programming Biology at All Levels

    Authors: Le Song, Eran Segal, Eric Xing

    Abstract: We present an approach of using AI to model and simulate biology and life. Why is it important? Because at the core of medicine, pharmacy, public health, longevity, agriculture and food security, environmental protection, and clean energy, it is biology at work. Biology in the physical world is too complex to manipulate and always expensive and risky to tamper with. In this perspective, we layout… ▽ More

    Submitted 9 December, 2024; originally announced December 2024.

  9. arXiv:2411.06518  [pdf, other

    cs.LG q-bio.QM stat.ME

    Causal Representation Learning from Multimodal Biomedical Observations

    Authors: Yuewen Sun, Lingjing Kong, Guangyi Chen, Loka Li, Gongxu Luo, Zijian Li, Yixuan Zhang, Yujia Zheng, Mengyue Yang, Petar Stojanov, Eran Segal, Eric P. Xing, Kun Zhang

    Abstract: Prevalent in biomedical applications (e.g., human phenotype research), multimodal datasets can provide valuable insights into the underlying physiological mechanisms. However, current machine learning (ML) models designed to analyze these datasets often lack interpretability and identifiability guarantees, which are essential for biomedical research. Recent advances in causal representation learni… ▽ More

    Submitted 16 March, 2025; v1 submitted 10 November, 2024; originally announced November 2024.

  10. arXiv:2408.17421  [pdf, other

    eess.IV cs.CV

    Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes

    Authors: Li Zhang, Basu Jindal, Ahmed Alaa, Robert Weinreb, David Wilson, Eran Segal, James Zou, Pengtao Xie

    Abstract: Semantic segmentation of medical images is pivotal in applications like disease diagnosis and treatment planning. While deep learning has excelled in automating this task, a major hurdle is the need for numerous annotated segmentation masks, which are resource-intensive to produce due to the required expertise and time. This scenario often leads to ultra low-data regimes, where annotated images ar… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

  11. arXiv:2408.11876  [pdf

    q-bio.QM cs.AI cs.LG

    From Glucose Patterns to Health Outcomes: A Generalizable Foundation Model for Continuous Glucose Monitor Data Analysis

    Authors: Guy Lutsker, Gal Sapir, Smadar Shilo, Jordi Merino, Anastasia Godneva, Jerry R Greenfield, Dorit Samocha-Bonet, Raja Dhir, Francisco Gude, Shie Mannor, Eli Meirom, Gal Chechik, Hagai Rossman, Eran Segal

    Abstract: Recent advances in SSL enabled novel medical AI models, known as foundation models, offer great potential for better characterizing health from diverse biomedical data. CGM provides rich, temporal data on glycemic patterns, but its full potential for predicting broader health outcomes remains underutilized. Here, we present GluFormer, a generative foundation model for CGM data that learns nuanced… ▽ More

    Submitted 7 January, 2025; v1 submitted 20 August, 2024; originally announced August 2024.

  12. arXiv:2404.11087  [pdf, other

    q-bio.QM

    FrackyFrac: A Standalone UniFrac Calculator

    Authors: Amit Lavon, Smadar Shilo, Ayya Keshet, Eran Segal

    Abstract: UniFrac is a family of distance metrics over microbial abundances, that take taxonomic relatedness into account. Current tools and libraries for calculating UniFrac have specific requirements regarding the user's technical expertise, operating system, and pre-installed software, which might exclude potential users. FrackyFrac is a native command-line tool that can run on any platform and has no re… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  13. arXiv:2403.09672  [pdf, other

    cs.CV cs.LG

    COMPRER: A Multimodal Multi-Objective Pretraining Framework for Enhanced Medical Image Representation

    Authors: Guy Lutsker, Hagai Rossman, Nastya Godiva, Eran Segal

    Abstract: Substantial advances in multi-modal Artificial Intelligence (AI) facilitate the combination of diverse medical modalities to achieve holistic health assessments. We present COMPRER , a novel multi-modal, multi-objective pretraining framework which enhances medical-image representation, diagnostic inferences, and prognosis of diseases. COMPRER employs a multi-objective training framework, where eac… ▽ More

    Submitted 4 February, 2024; originally announced March 2024.

  14. arXiv:2402.05763  [pdf, ps, other

    math.AG

    The McKay correspondence in type $D_4$ via VGIT

    Authors: Tarig Abdelgadir, Ed Segal

    Abstract: We present an explicit GIT construction which produces both the minimal resolution of the type $D_4$ surface singularity, and also the orbifold resolution. Our construction is based on a Tannakian approach which is in principle applicable to arbitrary quotient singularities.

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 12 pages, comments welcome

    MSC Class: 14J17; 14L24; 14D23

  15. arXiv:2312.07160  [pdf, other

    cs.IR

    Audience Prospecting for Dynamic-Product-Ads in Native Advertising

    Authors: Eliran Abutbul, Yohay Kaplan, Naama Krasne, Oren Somekh, Or David, Omer Duvdevany, Evgeny Segal

    Abstract: With yearly revenue exceeding one billion USD, Yahoo Gemini native advertising marketplace serves more than two billion impressions daily to hundreds of millions of unique users. One of the fastest growing segments of Gemini native is dynamic-product-ads (DPA), where major advertisers, such as Amazon and Walmart, provide catalogs with millions of products for the system to choose from and present… ▽ More

    Submitted 13 December, 2023; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: In Proc. IeeeBigData'2023 (Industry and Government Program)

  16. arXiv:2311.08979  [pdf, other

    cs.LG eess.SP

    A Multimodal Dataset of 21,412 Recorded Nights for Sleep and Respiratory Research

    Authors: Alon Diament, Maria Gorodetski, Adam Jankelow, Ayya Keshet, Tal Shor, Daphna Weissglas-Volkov, Hagai Rossman, Eran Segal

    Abstract: This study introduces a novel, rich dataset obtained from home sleep apnea tests using the FDA-approved WatchPAT-300 device, collected from 7,077 participants over 21,412 nights. The dataset comprises three levels of sleep data: raw multi-channel time-series from sensors, annotated sleep events, and computed summary statistics, which include 447 features related to sleep architecture, sleep apnea,… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10th, 2023, New Orleans, United States, 14 pages

  17. arXiv:2309.01724  [pdf, other

    astro-ph.GA stat.AP

    Neural network-based emulation of interstellar medium models

    Authors: Pierre Palud, Lucas Einig, Franck Le Petit, Emeric Bron, Pierre Chainais, Jocelyn Chanussot, Jérôme Pety, Pierre-Antoine Thouvenin, David Languignon, Ivana Bešlić, Miriam G. Santa-Maria, Jan H. Orkisz, Léontine E. Ségal, Antoine Zakardjian, Sébastien Bardeau, Maryvonne Gerin, Javier R. Goicoechea, Pierre Gratier, Viviana V. Guzman, Annie Hughes, François Levrier, Harvey S. Liszt, Jacques Le Bourlot, Antoine Roueff, Albrecht Sievers

    Abstract: The interpretation of observations of atomic and molecular tracers in the galactic and extragalactic interstellar medium (ISM) requires comparisons with state-of-the-art astrophysical models to infer some physical conditions. Usually, ISM models are too time-consuming for such inference procedures, as they call for numerous model evaluations. As a result, they are often replaced by an interpolatio… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

    Journal ref: A&A 678, A198 (2023)

  18. arXiv:2306.04971  [pdf, other

    cs.NE cs.LG

    A Melting Pot of Evolution and Learning

    Authors: Moshe Sipper, Achiya Elyasaf, Tomer Halperin, Zvika Haramaty, Raz Lapid, Eyal Segal, Itai Tzruia, Snir Vitrack Tamam

    Abstract: We survey eight recent works by our group, involving the successful blending of evolutionary algorithms with machine learning and deep learning: 1. Binary and Multinomial Classification through Evolutionary Symbolic Regression, 2. Classy Ensemble: A Novel Ensemble Algorithm for Classification, 3. EC-KitY: Evolutionary Computation Tool Kit in Python, 4. Evolution of Activation Functions for Deep Le… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: To Appear in Proceedings of Genetic Programming Theory & Practice XX, 2023

  19. arXiv:2304.10969  [pdf, ps, other

    math.SG math.AG

    Equivariant Fukaya categories at singular values

    Authors: Yanki Lekili, Ed Segal

    Abstract: Given a Hamiltonian torus action on a symplectic manifold, Teleman and Fukaya have proposed that the Fukaya category of each symplectic quotient should be equivalent to an equivariant Fukaya category of the original manifold. We lay out new conjectures that extend this story - in certain situations - to singular values of the moment map. These include a proposal for how, in some cases, we can reco… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

    MSC Class: 53D37; 14J33

  20. arXiv:2211.00262  [pdf, other

    cs.CL cs.CV

    Training Vision-Language Models with Less Bimodal Supervision

    Authors: Elad Segal, Ben Bogin, Jonathan Berant

    Abstract: Standard practice in pretraining multimodal models, such as vision-language models, is to rely on pairs of aligned inputs from both modalities, for example, aligned image-text pairs. However, such pairs can be difficult to obtain in low-resource settings and for some modality pairs (e.g., structured tables and images). In this work, we investigate the extent to which we can reduce the reliance on… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: AKBC 2022

  21. arXiv:2209.03618  [pdf, other

    cs.NE cs.MA

    Adaptive Combination of a Genetic Algorithm and Novelty Search for Deep Neuroevolution

    Authors: Eyal Segal, Moshe Sipper

    Abstract: Evolutionary Computation (EC) has been shown to be able to quickly train Deep Artificial Neural Networks (DNNs) to solve Reinforcement Learning (RL) problems. While a Genetic Algorithm (GA) is well-suited for exploiting reward functions that are neither deceptive nor sparse, it struggles when the reward function is either of those. To that end, Novelty Search (NS) has been shown to be able to outp… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

    Journal ref: Proceedings of the 14th International Joint Conference on Computational Intelligence (IJCCI 2022)

  22. arXiv:2208.08895  [pdf

    cond-mat.soft physics.bio-ph

    Self-Assembled Fatty Acid Crystalline Coatings Display Non-Toxic Superhydrophobic Antimicrobial Properties

    Authors: Elena Prudnikov, Iryna Polishchuk, Andy Sand, Hanan Abu Hamad, Naama Massad-Ivanir, Ester Segal, Boaz Pokroy

    Abstract: Superhydrophobcity is a well-known wetting phenomenon found in numerous plants and insects. It is achieved by the combination of the surfaces chemical properties and its surface roughness. Inspired by nature, numerous synthetic superhydrophobic surfaces have been developed for various applications. Designated surface coating is one of the fabrication routes to achieve the superhydrophobicity. Yet,… ▽ More

    Submitted 10 August, 2022; originally announced August 2022.

  23. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  24. arXiv:2205.04793  [pdf, ps, other

    math.AG

    Serre functors of residual categories via hybrid models

    Authors: Federico Barbacovi, Ed Segal

    Abstract: In this short note we observe that the Serre functor on the residual category of a complete intersection can be easily described in the framework of hybrid models. Using this description we recover some recent results of Kuznetsov and Perry.

    Submitted 23 May, 2023; v1 submitted 10 May, 2022; originally announced May 2022.

    Comments: v1:9 pages. v2: We thank the referee for pointing out that our Prop. 2.1.2 had already appeared in a paper of Favero-Kelly, with an identical proof. To appear in Bull. LMS

    MSC Class: 14F08

  25. arXiv:2201.03533  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    SCROLLS: Standardized CompaRison Over Long Language Sequences

    Authors: Uri Shaham, Elad Segal, Maor Ivgi, Avia Efrat, Ori Yoran, Adi Haviv, Ankit Gupta, Wenhan Xiong, Mor Geva, Jonathan Berant, Omer Levy

    Abstract: NLP benchmarks have largely focused on short texts, such as sentences and paragraphs, even though long texts comprise a considerable amount of natural language in the wild. We introduce SCROLLS, a suite of tasks that require reasoning over long texts. We examine existing long-text datasets, and handpick ones where the text is naturally long, while prioritizing tasks that involve synthesizing infor… ▽ More

    Submitted 11 October, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: EMNLP 2022

  26. arXiv:2106.05745  [pdf, other

    math.AG

    Line fields on punctured surfaces and twisted derived categories

    Authors: Ed Segal

    Abstract: The Fukaya category of a punctured surface can be reconstructed from a pair-of-pants decomposition using a formal construction that attaches a category to a trivalent graph. We extend this formal construction to include a choice of line field on the surface, this requires a certain decoration on the graph. On the mirror side we show that this leads to a kind of twisted derived category which has n… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: 47 pages

    MSC Class: 14F08; 14J33

  27. Signal Processing Techniques to Reduce the Limit of Detection for Thin Film Biosensors

    Authors: Simon J. Ward, Rabeb Layouni, Sofia Arshavsky-Graham, Ester Segal, Sharon M. Weiss

    Abstract: The ultimate detection limit of optical biosensors is often limited by various noise sources, including those introduced by the optical measurement setup. While sophisticated modifications to instrumentation may reduce noise, a simpler approach that can benefit all sensor platforms is the application of signal processing to minimize the deleterious effects of noise. In this work, we show that appl… ▽ More

    Submitted 12 March, 2021; originally announced March 2021.

    Comments: 11 pages, 3 Figures, 2 Tables

    Journal ref: ACS Sensors 6, 2967-2978 (2021)

  28. Discriminants and semi-orthogonal decompositions

    Authors: Alex Kite, Ed Segal

    Abstract: The derived categories of toric varieties admit semi-orthogonal decompositions coming from wall-crossing in GIT. We prove that these decompositions satisfy a Jordan-Holder property: the subcategories that appear, and their multiplicities, are independent of the choices made. For Calabi-Yau toric varieties wall-crossing instead gives derived equivalences and autoequivalences, and mirror symmetry… ▽ More

    Submitted 2 February, 2022; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: v1: 21 pages. v2: minor revisions. Published in Comm. Math. Phys

    MSC Class: 14F08; 14J33

  29. arXiv:2101.02235  [pdf, other

    cs.CL

    Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies

    Authors: Mor Geva, Daniel Khashabi, Elad Segal, Tushar Khot, Dan Roth, Jonathan Berant

    Abstract: A key limitation in current datasets for multi-hop reasoning is that the required steps for answering the question are mentioned in it explicitly. In this work, we introduce StrategyQA, a question answering (QA) benchmark where the required reasoning steps are implicit in the question, and should be inferred using a strategy. A fundamental challenge in this setup is how to elicit such creative que… ▽ More

    Submitted 6 January, 2021; originally announced January 2021.

    Comments: Accepted for publication in Transactions of the Association for Computational Linguistics (TACL), 2021. Author's final version

  30. arXiv:1909.13375  [pdf, other

    cs.CL

    A Simple and Effective Model for Answering Multi-span Questions

    Authors: Elad Segal, Avia Efrat, Mor Shoham, Amir Globerson, Jonathan Berant

    Abstract: Models for reading comprehension (RC) commonly restrict their output space to the set of all single contiguous spans from the input, in order to alleviate the learning problem and avoid the need for a model that generates text explicitly. However, forcing an answer to be a single span can be restrictive, and some recent datasets also include multi-span questions, i.e., questions whose answer is a… ▽ More

    Submitted 5 October, 2020; v1 submitted 29 September, 2019; originally announced September 2019.

    Comments: EMNLP 2020

  31. arXiv:1805.08691  [pdf, other

    cs.CV

    Highly Efficient 8-bit Low Precision Inference of Convolutional Neural Networks with IntelCaffe

    Authors: Jiong Gong, Haihao Shen, Guoming Zhang, Xiaoli Liu, Shane Li, Ge Jin, Niharika Maheshwari, Evarist Fomenko, Eden Segal

    Abstract: High throughput and low latency inference of deep neural networks are critical for the deployment of deep learning applications. This paper presents the efficient inference techniques of IntelCaffe, the first Intel optimized deep learning framework that supports efficient 8-bit low precision inference and model optimization techniques of convolutional neural networks on Intel Xeon Scalable Process… ▽ More

    Submitted 4 May, 2018; originally announced May 2018.

    Comments: 1st Reproducible Tournament on Pareto-efficient Image Classification, co-held with ASPLOS 2018

  32. arXiv:1805.06440  [pdf, other

    stat.ML cs.LG

    Regularization Learning Networks: Deep Learning for Tabular Datasets

    Authors: Ira Shavitt, Eran Segal

    Abstract: Despite their impressive performance, Deep Neural Networks (DNNs) typically underperform Gradient Boosting Trees (GBTs) on many tabular-dataset learning tasks. We propose that applying a different regularization coefficient to each weight might boost the performance of DNNs by allowing them to make more use of the more relevant inputs. However, this will lead to an intractable number of hyperparam… ▽ More

    Submitted 23 October, 2018; v1 submitted 16 May, 2018; originally announced May 2018.

    Comments: Accepted to the 32nd Conference on Neural Information Processing Systems (NIPS 2018), Montreal, Canada

  33. arXiv:1705.01366  [pdf, ps, other

    math.AG math.RA

    A non-commutative Bertini theorem

    Authors: Jørgen Vold Rennemo, Ed Segal, Michel Van den Bergh

    Abstract: We prove a version of the classical 'generic smoothness' theorem with smooth varieties replaced by non-commutative resolutions of singular varieties. This in particular implies a non-commutative version of the Bertini theorem.

    Submitted 22 June, 2020; v1 submitted 3 May, 2017; originally announced May 2017.

    Comments: 6 pages. v2: added funder acknowledgement. Published in J. Noncommutative Geometry

    MSC Class: 14A22 (Primary); 14E15; 16S38 (Secondary)

    Journal ref: J. Noncommutative Geometry 13 (2019), no. 2, 609-616

  34. Hori-mological projective duality

    Authors: Jørgen Vold Rennemo, Ed Segal

    Abstract: Kuznetsov has conjectured that Pfaffian varieties should admit non-commutative crepant resolutions which satisfy his Homological Projective Duality. We prove half the cases of this conjecture, by interpreting and proving a duality of non-abelian gauged linear sigma models proposed by Hori.

    Submitted 22 June, 2020; v1 submitted 13 September, 2016; originally announced September 2016.

    Comments: 55 pages. V2: slightly rewritten to take advantage of the `non-commutative Bertini theorem' recently proved by the authors and Van den Bergh. V3: lots of changes in exposition following referees' comments. Section 5 has been mostly cut because it was boring. To appear in Duke Math. J. V3: added funder acknowledgement

    MSC Class: 14F05; 81T30; 16E35; 16S38

    Journal ref: Duke Math. J. 168, no. 11 (2019), 2127-2205

  35. arXiv:1603.06717  [pdf, ps, other

    math.AG math.CT math.RA

    All autoequivalences are spherical twists

    Authors: Ed Segal

    Abstract: In this short note we observe that, for purely formal reasons, any autoequivalence can be constructed as a twist around a spherical functor. As an example, we show how the P-twists constructed by Huybrechts and Thomas can be formulated as spherical twists.

    Submitted 13 October, 2020; v1 submitted 22 March, 2016; originally announced March 2016.

    Comments: 11 pages at a relaxed pace. V2: Added funder acknowledgement. Published in IMRN. V3: `Proposition' 3.11 (which was only sketched) is wrong - added a counterexample due to Merlin Christ

    MSC Class: 14F05; 18E30

    Journal ref: Int. Math. Res. Not. 2018 (2018), no. 10, 3137-3154

  36. A new 5-fold flop and derived equivalence

    Authors: Ed Segal

    Abstract: We describe a new example of a flop in 5-dimensions, due to Roland Abuaf, with the nice feature that the contracting loci on either side are not isomorphic. We prove that the two sides are derived equivalent.

    Submitted 27 January, 2016; v1 submitted 23 June, 2015; originally announced June 2015.

    Comments: v1. It may well be that this example has appeared before - references welcome! v2. Minor changes. Final version, to appear in Bull. London Math. Soc

    MSC Class: 14E05; 13D09

  37. arXiv:1410.6829  [pdf, ps, other

    math.AG hep-th

    Quintic threefolds and Fano elevenfolds

    Authors: Ed Segal, Richard P. Thomas

    Abstract: The derived category of coherent sheaves on a general quintic threefold is a central object in mirror symmetry. We show that it can be embedded into the derived category of a certain Fano elevenfold. Our proof also generates related examples in different dimensions.

    Submitted 17 November, 2015; v1 submitted 24 October, 2014; originally announced October 2014.

    Comments: V1: 12 pages. V2: added reference to work of Iliev and Manivel. V3: persistent sign error corrected. Other minor changes following referee's suggestions. To appear in Crelle

    MSC Class: 14F05; 14J33

  38. K-Theoretic and Categorical Properties of Toric Deligne--Mumford Stacks

    Authors: Tom Coates, Hiroshi Iritani, Yunfeng Jiang, Ed Segal

    Abstract: We prove the following results for toric Deligne-Mumford stacks, under minimal compactness hypotheses: the Localization Theorem in equivariant K-theory; the equivariant Hirzebruch-Riemann-Roch theorem; the Fourier--Mukai transformation associated to a crepant toric wall-crossing gives an equivariant derived equivalence.

    Submitted 25 August, 2016; v1 submitted 30 September, 2014; originally announced October 2014.

    Comments: 14 pages, no figures. v2: references updated, v3: minor revision, final version

    MSC Class: 14A20 (Primary); 19L47; 14F05 (Secondary)

    Journal ref: Pure and Applied Mathematics Quarterly, Vol. 11, No. 2 (2015), pp. 239-266

  39. arXiv:1401.3661  [pdf, ps, other

    math.AG hep-th

    The Pfaffian-Grassmannian equivalence revisited

    Authors: Nicolas Addington, Will Donovan, Ed Segal

    Abstract: We give a new proof of the 'Pfaffian-Grassmannian' derived equivalence between certain pairs of non-birational Calabi-Yau threefolds. Our proof follows the physical constructions of Hori and Tong, and we factor the equivalence into three steps by passing through some intermediate categories of (global) matrix factorizations. The first step is global Knoerrer periodicity, the second comes from a bi… ▽ More

    Submitted 25 November, 2014; v1 submitted 15 January, 2014; originally announced January 2014.

    Comments: Improved exposition, minor corrections. 32 pages

    MSC Class: Primary 14F05; 14J32; 18E30; 81T30; Secondary 14M15

    Journal ref: Alg. Geom. 2(3):332-364, 2015

  40. arXiv:1310.7877  [pdf, ps, other

    math.AG hep-th math.RT

    Mixed braid group actions from deformations of surface singularities

    Authors: Will Donovan, Ed Segal

    Abstract: We consider a set of toric Calabi-Yau varieties which arise as deformations of the small resolutions of type A surface singularities. By careful analysis of the heuristics of B-brane transport in the associated GLSMs, we predict the existence of a mixed braid group action on the derived category of each variety, and then prove that this action does indeed exist. This generalizes the braid group ac… ▽ More

    Submitted 29 October, 2013; originally announced October 2013.

    Comments: 37 pages, including many figures and examples

    MSC Class: Primary 14F05; 18E30; Secondary 14J33; 20F36

  41. arXiv:1301.2289  [pdf

    cs.AI

    Exact Inference in Networks with Discrete Children of Continuous Parents

    Authors: Uri Lerner, Eran Segal, Daphne Koller

    Abstract: Many real life domains contain a mixture of discrete and continuous variables and can be modeled as hybrid Bayesian Networks. Animportant subclass of hybrid BNs are conditional linear Gaussian (CLG) networks, where the conditional distribution of the continuous variables given an assignment to the discrete variables is a multivariate Gaussian. Lauritzen's extension to the clique tree algorithm can… ▽ More

    Submitted 10 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (UAI2001)

    Report number: UAI-P-2001-PG-319-328

  42. arXiv:1212.2517  [pdf

    cs.LG cs.CE stat.ML

    Learning Module Networks

    Authors: Eran Segal, Dana Pe'er, Aviv Regev, Daphne Koller, Nir Friedman

    Abstract: Methods for learning Bayesian network structure can discover dependency structure between observed variables, and have been shown to be useful in many applications. However, in domains that involve a large number of variables, the space of possible network structures is enormous, making it difficult, for both computational and statistical reasons, to identify a good model. In this… ▽ More

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-525-534

  43. D-brane probes, branched double covers, and noncommutative resolutions

    Authors: Nicolas Addington, Edward Segal, Eric Sharpe

    Abstract: This paper describes D-brane probes of theories arising in abelian gauged linear sigma models (GLSMs) describing branched double covers and noncommutative resolutions thereof, via nonperturbative effects rather than as the critical locus of a superpotential. As these theories can be described as IR limits of Landau-Ginzburg models, technically this paper is an exercise in utilizing (sheafy) matrix… ▽ More

    Submitted 11 November, 2012; originally announced November 2012.

    Comments: 61 pages, LaTeX

    Journal ref: Adv. Theor. Math. Phys. 18(6):1369-1436, 2014

  44. Window shifts, flop equivalences and Grassmannian twists

    Authors: Will Donovan, Ed Segal

    Abstract: We introduce a new class of autoequivalences that act on the derived categories of certain vector bundles over Grassmannians. These autoequivalences arise from Grassmannian flops: they generalize Seidel-Thomas spherical twists, which can be seen as arising from standard flops. We first give a simple algebraic construction, which is well-suited to explicit computations. We then give a geometric con… ▽ More

    Submitted 2 October, 2012; v1 submitted 1 June, 2012; originally announced June 2012.

    Comments: Improved structure and formatting. Minor edits to some explanations. Added acknowledgements and addresses. 38 pages, 7 figures

    MSC Class: 14F05; 18E30 (Primary) 14M15 (Secondary)

    Journal ref: Compositio Math. 150 (2014) 942-978

  45. Equivalences between GIT quotients of Landau-Ginzburg B-models

    Authors: Ed Segal

    Abstract: We define the category of B-branes in a (not necessarily affine) Landau-Ginzburg B-model, incorporating the notion of R-charge. Our definition is a direct generalization of the category of perfect complexes. We then consider pairs of Landau-Ginzburg B-models that arise as different GIT quotients of a vector space by a one-dimensional torus, and show that for each such pair the two categories of B-… ▽ More

    Submitted 24 November, 2010; v1 submitted 29 October, 2009; originally announced October 2009.

    Comments: v3: Added two references. Final version, to appear in Comm. Math. Phys

    Journal ref: Commun.Math.Phys.304:411-432,2011

  46. arXiv:0904.1339  [pdf, other

    math.AG hep-th math.QA

    The closed state space of affine Landau-Ginzburg B-models

    Authors: Ed Segal

    Abstract: We study the category of perfect cdg-modules over a curved algebra, and in particular the category of B-branes in an affine Landau-Ginzburg model. We construct an explicit chain map from the Hochschild complex of the category to the closed state space of the model, and prove that this is a quasi-isomorphism from the Borel-Moore Hochschild complex. Using the lowest-order term of our map we derive K… ▽ More

    Submitted 19 April, 2011; v1 submitted 8 April, 2009; originally announced April 2009.

    Comments: Completely rewritten due to errors in the first version

  47. arXiv:0902.3239  [pdf, ps, other

    math.DG

    Gauge Theory in higher dimensions, II

    Authors: Simon Donaldson, Ed Segal

    Abstract: The main aim of the paper is to develop the "Floer theory" associated to Calabi-Yau 3-folds, exending the analogy of Thomas' "holomorphic Casson invariant". The treatment in the body of the paper is largely formal, assuming appropriate compactness properties of moduli spaces of $G_{2}$-instantons, but in the last section we make some remarks about these compactness isssues. Section 3 of the pape… ▽ More

    Submitted 18 February, 2009; originally announced February 2009.

  48. arXiv:math/0702539  [pdf, ps, other

    math.AG hep-th math.RA

    The A-infinity Deformation Theory of a Point and the Derived Categories of Local Calabi-Yaus

    Authors: Ed Segal

    Abstract: Let A be an augmented algebra over a semi-simple algebra S. We show that the Ext algebra of S as an A-module, enriched with its natural A-infinity structure, can be used to reconstruct the completion of A at the augmentation ideal. We use this technical result to justify a calculation in the physics literature describing algebras that are derived equivalent to certain non-compact Calabi-Yau thre… ▽ More

    Submitted 11 July, 2008; v1 submitted 19 February, 2007; originally announced February 2007.

    Comments: Final version, to be published in J. Algebra