Showing 1–50 of 191 results for author: Cohen, N

  1. arXiv:2506.03931  [pdf, ps, other]

    cs.LG stat.ML

    Do Neural Networks Need Gradient Descent to Generalize? A Theoretical Study

    Authors: Yotam Alexander, Yonatan Slutzky, Yuval Ran-Milo, Nadav Cohen

    Abstract: Conventional wisdom attributes the mysterious generalization abilities of overparameterized neural networks to gradient descent (and its variants). The recent volume hypothesis challenges this view: it posits that these generalization abilities persist even when gradient descent is replaced by Guess & Check (G&C), i.e., by drawing weight settings until one that fits the training data is found. The…

    Submitted 4 June, 2025; originally announced June 2025.

  2. arXiv:2506.00458  [pdf, other]

    cs.LG cs.AI cs.GT cs.MA

    Reinforcement Learning for Hanabi

    Authors: Nina Cohen, Kordel K. France

    Abstract: Hanabi has become a popular game for research when it comes to reinforcement learning (RL) as it is one of the few cooperative card games where you have incomplete knowledge of the entire environment, thus presenting a challenge for an RL agent. We explored different tabular and deep reinforcement learning algorithms to see which had the best performance both against an agent of the same type and a…

    Submitted 31 May, 2025; originally announced June 2025.

  3. arXiv:2505.17013  [pdf, ps, other]

    cs.LG cs.CV

    When Are Concepts Erased From Diffusion Models?

    Authors: Kevin Lu, Nicky Kriplani, Rohit Gandikota, Minh Pham, David Bau, Chinmay Hegde, Niv Cohen

    Abstract: Concept erasure, the ability to selectively prevent a model from generating specific concepts, has attracted growing interest, with various approaches emerging to address the challenge. However, it remains unclear how thoroughly these methods erase the target concept. We begin by proposing two conceptual models for the erasure mechanism in diffusion models: (i) reducing the likelihood of generatin…

    Submitted 30 May, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

    Comments: Project Page: https://nyu-dice-lab.github.io/when-are-concepts-erased/

  4. arXiv:2505.03040  [pdf]

    astro-ph.EP physics.ao-ph physics.geo-ph

    Multi-parameter constraints on empirical infrasound period-yield relations for bolides and implications for planetary defense

    Authors: Elizabeth A. Silber, Josep M. Trigo-Rodríguez, Iyare Oseghae, Eloy Peña Asensio, Mark Boslough, Rodney Whitaker, Christoph Pilger, Philip Lubin, Vedant Sawal, Claus Hetzer, Randy Longenbaugh, Peter Jenniskens, Brin Bailey, Esther Mas Sanz, Patrick Hupe, Alexander N. Cohen, Thom R. Edwards, Sasha Egan, Reynold E. Silber, Summer Czarnowski, Miro Ronac Giannone

    Abstract: How effective are methods for estimating bolide energies from infrasound signal period-yield relationships? A single global period-energy relation can obscure significant variability introduced by parameters such as the atmospheric Doppler wind profile and the bolide's energy deposition profile as a function of altitude. Bolide speed, entry angle, burst altitude, and multi-episode fragmentation al…

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: 37 pages, 14 figures, 3 tables, Appendix A, Appendix B

    Report number: SAND2025-05462O

    Journal ref: The Astronomical Journal (2025)

  5. arXiv:2504.20111  [pdf, ps, other]

    cs.CV

    Forging and Removing Latent-Noise Diffusion Watermarks Using a Single Image

    Authors: Anubhav Jain, Yuya Kobayashi, Naoki Murata, Yuhta Takida, Takashi Shibuya, Yuki Mitsufuji, Niv Cohen, Nasir Memon, Julian Togelius

    Abstract: Watermarking techniques are vital for protecting intellectual property and preventing fraudulent use of media. Most previous watermarking schemes designed for diffusion models embed a secret key in the initial noise. The resulting pattern is often considered hard to remove and forge into unrelated images. In this paper, we propose a black-box adversarial attack without presuming access to the diff…

    Submitted 27 April, 2025; originally announced April 2025.

  6. arXiv:2504.15315  [pdf, other]

    cs.LG cs.AI

    Diffusion-Driven Inertial Generated Data for Smartphone Location Classification

    Authors: Noa Cohen, Rotem Dror, Itzik Klein

    Abstract: Despite the crucial role of inertial measurements in motion tracking and navigation systems, the time-consuming and resource-intensive nature of collecting extensive inertial data has hindered the development of robust machine learning models in this field. In recent years, diffusion models have emerged as a revolutionary class of generative models, reshaping the landscape of artificial data gener…

    Submitted 20 April, 2025; originally announced April 2025.

  7. arXiv:2504.07697  [pdf, other]

    cs.RO

    Transformer-Based Robust Underwater Inertial Navigation in Prolonged Doppler Velocity Log Outages

    Authors: Zeev Yampolsky, Nadav Cohen, Itzik Klein

    Abstract: Autonomous underwater vehicles (AUV) have a wide variety of applications in the marine domain, including exploration, surveying, and mapping. Their navigation systems rely heavily on fusing data from inertial sensors and a Doppler velocity log (DVL), typically via nonlinear filtering. The DVL estimates the AUV's velocity vector by transmitting acoustic beams to the seabed and analyzing the Doppler…

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: Eight pages, 7 Figures, 4 Tables

  8. arXiv:2503.21727  [pdf, other]

    cs.RO

    Enhancing Underwater Navigation through Cross-Correlation-Aware Deep INS/DVL Fusion

    Authors: Nadav Cohen, Itzik Klein

    Abstract: The accurate navigation of autonomous underwater vehicles critically depends on the precision of Doppler velocity log (DVL) velocity measurements. Recent advancements in deep learning have demonstrated significant potential in improving DVL outputs by leveraging spatiotemporal dependencies across multiple sensor modalities. However, integrating these estimates into model-based filters, such as the…

    Submitted 27 March, 2025; originally announced March 2025.

  9. arXiv:2503.12172  [pdf, other]

    cs.LG cs.CR cs.CV

    SEAL: Semantic Aware Image Watermarking

    Authors: Kasra Arabi, R. Teal Witter, Chinmay Hegde, Niv Cohen

    Abstract: Generative models have rapidly evolved to generate realistic outputs. However, their synthetic outputs increasingly challenge the clear distinction between natural and AI-generated content, necessitating robust watermarking techniques. Watermarks are typically expected to preserve the integrity of the target image, withstand removal attempts, and prevent unauthorized replication onto unrelated ima…

    Submitted 9 April, 2025; v1 submitted 15 March, 2025; originally announced March 2025.

  10. arXiv:2503.05309  [pdf, other]

    eess.SP

    Performance Analysis of Spatial and Temporal Learning Networks in the Presence of DVL Noise

    Authors: Rajini Makam, Nadav Cohen, Sumukh Shadakshari, Srinivasa Puranika Bhatta, Itzik Klein, Suresh Sundaram

    Abstract: Navigation is a critical aspect of autonomous underwater vehicles (AUVs) operating in complex underwater environments. Since global navigation satellite system (GNSS) signals are unavailable underwater, navigation relies on inertial sensing, which tends to accumulate errors over time. To mitigate this, the Doppler velocity log (DVL) plays a crucial role in determining navigation accuracy. In this…

    Submitted 7 March, 2025; originally announced March 2025.

    Comments: OCEANS 2025

  11. arXiv:2503.00592  [pdf, other]

    cs.LG

    SolidMark: Evaluating Image Memorization in Generative Models

    Authors: Nicky Kriplani, Minh Pham, Gowthami Somepalli, Chinmay Hegde, Niv Cohen

    Abstract: Recent works have shown that diffusion models are able to memorize training images and emit them at generation time. However, the metrics used to evaluate memorization and its mitigation techniques suffer from dataset-dependent biases and struggle to detect whether a given specific image has been memorized or not. This paper begins with a comprehensive exploration of issues surrounding memorizat…

    Submitted 1 March, 2025; originally announced March 2025.

  12. arXiv:2502.16510  [pdf, other]

    cs.RO cs.AI eess.SP eess.SY

    Gaussian Process Regression for Improved Underwater Navigation

    Authors: Nadav Cohen, Itzik Klein

    Abstract: Accurate underwater navigation is a challenging task due to the absence of global navigation satellite system signals and the reliance on inertial navigation systems that suffer from drift over time. Doppler velocity logs (DVLs) are typically used to mitigate this drift through velocity measurements, which are commonly estimated using a parameter estimation approach such as least squares (LS). How…

    Submitted 23 February, 2025; originally announced February 2025.

  13. arXiv:2502.04385  [pdf, other]

    cs.CV cs.AI

    TexLiDAR: Automated Text Understanding for Panoramic LiDAR Data

    Authors: Naor Cohen, Roy Orfaig, Ben-Zion Bobrovsky

    Abstract: Efforts to connect LiDAR data with text, such as LidarCLIP, have primarily focused on embedding 3D point clouds into CLIP text-image space. However, these approaches rely on 3D point clouds, which present challenges in encoding efficiency and neural network processing. With the advent of advanced LiDAR sensors like Ouster OS1, which, in addition to 3D point clouds, produce fixed resolution depth,…

    Submitted 21 February, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

  14. arXiv:2501.14249  [pdf, other]

    cs.LG cs.AI cs.CL

    Humanity's Last Exam

    Authors: Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, Sean Shi, Michael Choi, Anish Agrawal, Arnav Chopra, Adam Khoja, Ryan Kim, Richard Ren, Jason Hausenloy, Oliver Zhang, Mantas Mazeika, Dmitry Dodonov, Tung Nguyen, Jaeho Lee, Daron Anderson, Mikhail Doroshenko, Alun Cennyth Stokes, et al. (1084 additional authors not shown)

    Abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of…

    Submitted 19 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 29 pages, 6 figures

  15. arXiv:2412.19853  [pdf, other]

    cs.CV cs.GR cs.LG

    Conditional Balance: Improving Multi-Conditioning Trade-Offs in Image Generation

    Authors: Nadav Z. Cohen, Oron Nir, Ariel Shamir

    Abstract: Balancing content fidelity and artistic style is a pivotal challenge in image generation. While traditional style transfer methods and modern Denoising Diffusion Probabilistic Models (DDPMs) strive to achieve this balance, they often struggle to do so without sacrificing either style, content, or sometimes both. This work addresses this challenge by analyzing the ability of DDPMs to maintain conte…

    Submitted 25 December, 2024; originally announced December 2024.

  16. arXiv:2412.18377  [pdf, other]

    cs.CL cs.AI cs.LG

    ChaI-TeA: A Benchmark for Evaluating Autocompletion of Interactions with LLM-based Chatbots

    Authors: Shani Goren, Oren Kalinsky, Tomer Stav, Yuri Rapoport, Yaron Fairstein, Ram Yazdi, Nachshon Cohen, Alexander Libov, Guy Kushilevitz

    Abstract: The rise of LLMs has deflected a growing portion of human-computer interactions towards LLM-based chatbots. The remarkable abilities of these models allow users to interact using long, diverse natural language text covering a wide range of topics and styles. Phrasing these messages is a time and effort consuming task, calling for an autocomplete solution to assist users. We introduce the task of c…

    Submitted 5 March, 2025; v1 submitted 24 December, 2024; originally announced December 2024.

  17. arXiv:2412.04653  [pdf, other]

    cs.CV cs.AI cs.LG

    Hidden in the Noise: Two-Stage Robust Watermarking for Images

    Authors: Kasra Arabi, Benjamin Feuer, R. Teal Witter, Chinmay Hegde, Niv Cohen

    Abstract: As the quality of image generators continues to improve, deepfakes become a topic of considerable societal debate. Image watermarking allows responsible model owners to detect and label their AI-generated content, which can mitigate the harm. Yet, current state-of-the-art methods in image watermarking remain vulnerable to forgery and removal attacks. This vulnerability occurs in part because water…

    Submitted 27 April, 2025; v1 submitted 5 December, 2024; originally announced December 2024.

  18. arXiv:2411.17430  [pdf, other]

    cs.RO eess.SP

    Snake-Inspired Mobile Robot Positioning with Hybrid Learning

    Authors: Aviad Etzion, Nadav Cohen, Orzion Levy, Zeev Yampolsky, Itzik Klein

    Abstract: Mobile robots are used in various fields, from deliveries to search and rescue applications. Different types of sensors are mounted on the robot to provide accurate navigation and, thus, allow successful completion of its task. In real-world scenarios, due to environmental constraints, the robot frequently relies only on its inertial sensors. Therefore, due to noises and other error terms associat…

    Submitted 1 December, 2024; v1 submitted 26 November, 2024; originally announced November 2024.

  19. arXiv:2410.17128  [pdf, other]

    stat.ML cs.LG math.FA

    Understanding Transfer Learning via Mean-field Analysis

    Authors: Gholamali Aminian, Łukasz Szpruch, Samuel N. Cohen

    Abstract: We propose a novel framework for exploring generalization errors of transfer learning through the lens of differential calculus on the space of probability measures. In particular, we consider two main transfer learning scenarios, $α$-ERM and fine-tuning with the KL-regularized empirical risk minimization and establish generic conditions under which the generalization error and the population risk…

    Submitted 23 October, 2024; v1 submitted 22 October, 2024; originally announced October 2024.

    Comments: Under review

  20. arXiv:2410.17051  [pdf, other]

    cs.CL

    Data-driven Coreference-based Ontology Building

    Authors: Shir Ashury-Tahan, Amir David Nissan Cohen, Nadav Cohen, Yoram Louzoun, Yoav Goldberg

    Abstract: While coreference resolution is traditionally used as a component in individual document understanding, in this work we take a more global view and explore what we can learn about a domain from the set of all document-level coreference relations that are present in a large corpus. We derive coreference chains from a corpus of 30 million biomedical abstracts and construct a graph based on the strin…

    Submitted 22 October, 2024; originally announced October 2024.

    Journal ref: EMNLP 2024

  21. arXiv:2410.14067  [pdf, ps, other]

    cs.LG cs.AI cs.NE

    Provable Benefits of Complex Parameterizations for Structured State Space Models

    Authors: Yuval Ran-Milo, Eden Lumbroso, Edo Cohen-Karlik, Raja Giryes, Amir Globerson, Nadav Cohen

    Abstract: Structured state space models (SSMs), the core engine behind prominent neural networks such as S4 and Mamba, are linear dynamical systems adhering to a specified structure, most notably diagonal. In contrast to typical neural network modules, whose parameterizations are real, SSMs often use complex parameterizations. Theoretically explaining the benefits of complex parameterizations for SSMs is an…

    Submitted 31 October, 2024; v1 submitted 17 October, 2024; originally announced October 2024.

    Comments: 12 pages. Accepted to NeurIPS 2024

  22. arXiv:2410.10473  [pdf, other]

    cs.LG stat.ML

    The Implicit Bias of Structured State Space Models Can Be Poisoned With Clean Labels

    Authors: Yonatan Slutzky, Yotam Alexander, Noam Razin, Nadav Cohen

    Abstract: Neural networks are powered by an implicit bias: a tendency of gradient descent to fit training data in a way that generalizes to unseen data. A recent class of neural network models gaining increasing popularity is structured state space models (SSMs), regarded as an efficient alternative to transformers. Prior work argued that the implicit bias of SSMs leads to generalization in a setting where…

    Submitted 6 February, 2025; v1 submitted 14 October, 2024; originally announced October 2024.

  23. arXiv:2410.05057  [pdf, other]

    cs.CV cs.LG

    SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification

    Authors: Benjamin Feuer, Jiawei Xu, Niv Cohen, Patrick Yubeaton, Govind Mittal, Chinmay Hegde

    Abstract: Data curation is the problem of how to collect and organize samples into a dataset that supports efficient learning. Despite the centrality of the task, little work has been devoted towards a large-scale, systematic comparison of various curation methods. In this work, we take steps towards a formal evaluation of data curation strategies and introduce SELECT, the first large-scale benchmark of cur…

    Submitted 7 October, 2024; originally announced October 2024.

    Comments: NeurIPS 2024, Datasets and Benchmarks Track

  24. arXiv:2409.19431  [pdf, ps, other]

    stat.ML cs.IT cs.LG

    Generalization and Robustness of the Tilted Empirical Risk

    Authors: Gholamali Aminian, Amir R. Asadi, Tian Li, Ahmad Beirami, Gesine Reinert, Samuel N. Cohen

    Abstract: The generalization error (risk) of a supervised statistical learning algorithm quantifies its prediction ability on previously unseen data. Inspired by exponential tilting, Li et al. (2020) proposed the tilted empirical risk (TER) as a non-linear risk metric for machine learning applications such as classification and regression problems. In this work, we examine the generalization error…

    Submitted 7 June, 2025; v1 submitted 28 September, 2024; originally announced September 2024.

    Comments: Accepted in ICML 2025

  25. arXiv:2409.16341  [pdf, other]

    cs.LG cs.CL cs.SE

    Quality Matters: Evaluating Synthetic Data for Tool-Using LLMs

    Authors: Shadi Iskander, Nachshon Cohen, Zohar Karnin, Ori Shapira, Sofia Tolmach

    Abstract: Training large language models (LLMs) for external tool usage is a rapidly expanding field, with recent research focusing on generating synthetic data to address the shortage of available data. However, the absence of systematic data quality checks poses complications for properly training and testing models. To that end, we propose two approaches for assessing the reliability of data for training…

    Submitted 26 September, 2024; v1 submitted 24 September, 2024; originally announced September 2024.

  26. arXiv:2408.13767  [pdf, other]

    cs.LG cs.AI stat.ML

    Lecture Notes on Linear Neural Networks: A Tale of Optimization and Generalization in Deep Learning

    Authors: Nadav Cohen, Noam Razin

    Abstract: These notes are based on a lecture delivered by NC in March 2021, as part of an advanced course at Princeton University on the mathematical understanding of deep learning. They present a theory (developed by NC, NR and collaborators) of linear neural networks -- a fundamental model in the study of optimization and generalization in deep learning. Practical applications born from the presented theo…

    Submitted 6 November, 2024; v1 submitted 25 August, 2024; originally announced August 2024.

    Comments: Lecture notes

  27. Deep Learning Assisted Inertial Dead Reckoning and Fusion

    Authors: Dror Hurwitz, Nadav Cohen, Itzik Klein

    Abstract: The interest in mobile platforms across a variety of applications has increased significantly in recent years. One of the reasons is the ability to achieve accurate navigation by using low-cost sensors. To this end, inertial sensors are fused with global navigation satellite systems (GNSS) signals. GNSS outages during platform operation can result in pure inertial navigation, causing the navigatio…

    Submitted 23 July, 2024; originally announced July 2024.

  28. arXiv:2406.16048  [pdf, other]

    cs.IR

    Evaluating D-MERIT of Partial-annotation on Information Retrieval

    Authors: Royi Rassin, Yaron Fairstein, Oren Kalinsky, Guy Kushilevitz, Nachshon Cohen, Alexander Libov, Yoav Goldberg

    Abstract: Retrieval models are often evaluated on partially-annotated datasets. Each query is mapped to a few relevant texts and the remaining corpus is assumed to be irrelevant. As a result, models that successfully retrieve false negatives are punished in evaluation. Unfortunately, completely annotating all texts for every query is not resource efficient. In this work, we show that using partially-annotat…

    Submitted 13 October, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

    Comments: Accepted to EMNLP 2024 main track. Our dataset can be downloaded from https://D-MERIT.github.io

  29. arXiv:2406.14528  [pdf, other]

    cs.LG cs.AI

    DeciMamba: Exploring the Length Extrapolation Potential of Mamba

    Authors: Assaf Ben-Kish, Itamar Zimerman, Shady Abu-Hussein, Nadav Cohen, Amir Globerson, Lior Wolf, Raja Giryes

    Abstract: Long-range sequence processing poses a significant challenge for Transformers due to their quadratic complexity in input length. A promising alternative is Mamba, which demonstrates high performance and achieves Transformer-level capabilities while requiring substantially fewer computational resources. In this paper we explore the length-generalization capabilities of Mamba, which we find to be re…

    Submitted 9 April, 2025; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Official Implementation: https://github.com/assafbk/DeciMamba

  30. arXiv:2406.14027  [pdf, other]

    cs.AI

    How to design a dataset compliant with an ML-based system ODD?

    Authors: Cyril Cappi, Noémie Cohen, Mélanie Ducoffe, Christophe Gabreau, Laurent Gardes, Adrien Gauffriau, Jean-Brice Ginestet, Franck Mamalet, Vincent Mussot, Claire Pagetti, David Vigouroux

    Abstract: This paper focuses on a Vision-based Landing task and presents the design and the validation of a dataset that would comply with the Operational Design Domain (ODD) of a Machine-Learning (ML) system. Relying on emerging certification standards, we describe the process for establishing ODDs at both the system and image levels. In the process, we present the translation of high-level system constrai…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 12th European Congress on Embedded Real Time Software and Systems, Jun 2024, Toulouse, France. arXiv admin note: text overlap with arXiv:2304.09938

  31. arXiv:2406.07954  [pdf, other]

    cs.CR cs.AI

    Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition

    Authors: Edoardo Debenedetti, Javier Rando, Daniel Paleka, Silaghi Fineas Florin, Dragos Albastroiu, Niv Cohen, Yuval Lemberg, Reshmi Ghosh, Rui Wen, Ahmed Salem, Giovanni Cherubin, Santiago Zanella-Beguelin, Robin Schmid, Victor Klemm, Takahiro Miki, Chenhao Li, Stefan Kraft, Mario Fritz, Florian Tramèr, Sahar Abdelnabi, Lea Schönherr

    Abstract: Large language model systems face important security risks from maliciously crafted messages that aim to overwrite the system's original instructions or leak private data. To study this problem, we organized a capture-the-flag competition at IEEE SaTML 2024, where the flag is a secret string in the LLM system prompt. The competition was organized in two phases. In the first phase, teams developed…

    Submitted 12 June, 2024; originally announced June 2024.

  32. arXiv:2406.05904  [pdf, ps, other]

    cs.DC cs.CR

    Aegis: Tethering a Blockchain with Primary-Chain Stake

    Authors: Yogev Bar-On, Roi Bar-Zur, Omer Ben-Porat, Nimrod Cohen, Ittay Eyal, Matan Sitbon

    Abstract: Blockchains implement decentralized monetary systems and applications. Recent advancements enable what we call tethering a blockchain to a primary blockchain, securing the tethered chain by nodes that post primary-chain tokens as collateral. The collateral ensures nodes behave as intended, until they withdraw it. Unlike a Proof of Stake blockchain which uses its own token as collateral, using prim…

    Submitted 24 April, 2025; v1 submitted 9 June, 2024; originally announced June 2024.

  33. arXiv:2405.12211  [pdf, other]

    cs.CV

    Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices

    Authors: Nathaniel Cohen, Vladimir Kulikov, Matan Kleiner, Inbar Huberman-Spiegelglas, Tomer Michaeli

    Abstract: Text-to-image (T2I) diffusion models achieve state-of-the-art results in image synthesis and editing. However, leveraging such pretrained models for video editing is considered a major challenge. Many existing works attempt to enforce temporal consistency in the edited video through explicit correspondence mechanisms, either in pixel space or between deep features. These methods, however, struggle…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: ICML 2024. Code and examples are available at https://matankleiner.github.io/slicedit/

  34. arXiv:2404.13742  [pdf, other]

    cs.RO cs.AI eess.SY

    Seamless Underwater Navigation with Limited Doppler Velocity Log Measurements

    Authors: Nadav Cohen, Itzik Klein

    Abstract: Autonomous Underwater Vehicles (AUVs) commonly utilize an inertial navigation system (INS) and a Doppler velocity log (DVL) for underwater navigation. To that end, their measurements are integrated through a nonlinear filter such as the extended Kalman filter (EKF). The DVL velocity vector estimate depends on retrieving reflections from the seabed, ensuring that at least three out of its four tran…

    Submitted 21 April, 2024; originally announced April 2024.

  35. arXiv:2404.06017  [pdf, other]

    cs.CL

    Identifying Shopping Intent in Product QA for Proactive Recommendations

    Authors: Besnik Fetahu, Nachshon Cohen, Elad Haramaty, Liane Lewin-Eytan, Oleg Rokhlenko, Shervin Malmasi

    Abstract: Voice assistants have become ubiquitous in smart devices allowing users to instantly access information via voice questions. While extensive research has been conducted in question answering for voice search, little attention has been paid to how to enable proactive recommendations from a voice assistant to its users. This is a highly challenging problem that often leads to user friction, mainly d…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Accepted at IronGraphs@ECIR'2024

  36. arXiv:2404.03631  [pdf, other]

    cs.CV

    Robust Concept Erasure Using Task Vectors

    Authors: Minh Pham, Kelly O. Marshall, Chinmay Hegde, Niv Cohen

    Abstract: With the rapid growth of text-to-image models, a variety of techniques have been suggested to prevent undesirable image generations. Yet, these methods often only protect against specific user prompts and have been shown to allow unsafe generations with other inputs. Here we focus on unconditionally erasing a concept from a text-to-image model rather than conditioning the erasure on the user's pro…

    Submitted 19 February, 2025; v1 submitted 4 April, 2024; originally announced April 2024.

  37. arXiv:2403.08788  [pdf, other]

    cs.CV cs.AI cs.NE

    Verification for Object Detection -- IBP IoU

    Authors: Noémie Cohen, Mélanie Ducoffe, Ryma Boumazouza, Christophe Gabreau, Claire Pagetti, Xavier Pucel, Audrey Galametz

    Abstract: We introduce a novel Interval Bound Propagation (IBP) approach for the formal verification of object detection models, specifically targeting the Intersection over Union (IoU) metric. The approach has been implemented in an open source code, named IBP IoU, compatible with popular abstract interpretation based verification tools. The resulting verifier is evaluated on landing approach runway detect…

    Submitted 30 January, 2024; originally announced March 2024.

  38. arXiv:2402.11137  [pdf, other]

    cs.LG

    TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks

    Authors: Benjamin Feuer, Robin Tibor Schirrmeister, Valeriia Cherepanova, Chinmay Hegde, Frank Hutter, Micah Goldblum, Niv Cohen, Colin White

    Abstract: While tabular classification has traditionally relied on from-scratch training, a recent breakthrough called prior-data fitted networks (PFNs) challenges this approach. Similar to large language models, PFNs make use of pretraining and in-context learning to achieve strong performance on new tasks in a single forward pass. However, current PFNs have limitations that prohibit their widespread adopt…

    Submitted 21 October, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: NeurIPS 2024 Poster

  39. arXiv:2402.07875  [pdf, other]

    cs.LG cs.AI eess.SY stat.ML

    Implicit Bias of Policy Gradient in Linear Quadratic Control: Extrapolation to Unseen Initial States

    Authors: Noam Razin, Yotam Alexander, Edo Cohen-Karlik, Raja Giryes, Amir Globerson, Nadav Cohen

    Abstract: In modern machine learning, models can often fit training data in numerous ways, some of which perform well on unseen (test) data, while others do not. Remarkably, in such cases gradient descent frequently exhibits an implicit bias that leads to excellent performance on unseen data. This implicit bias was extensively studied in supervised learning, but is far less understood in optimal control (re…

    Submitted 1 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted to ICML 2024

  40. arXiv:2402.07025  [pdf, other]

    stat.ML cs.IT cs.LG

    Generalization Error of Graph Neural Networks in the Mean-field Regime

    Authors: Gholamali Aminian, Yixuan He, Gesine Reinert, Łukasz Szpruch, Samuel N. Cohen

    Abstract: This work provides a theoretical framework for assessing the generalization error of graph neural networks in the over-parameterized regime, where the number of parameters surpasses the quantity of data points. We explore two widely utilized types of graph neural networks: graph convolutional neural networks and message passing graph neural networks. Prior to this study, existing bounds on the gen…

    Submitted 1 July, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

    Comments: Accepted in ICML 2024

  41. arXiv:2402.05934  [pdf, other]

    cs.LG cs.SI

    Classifying Nodes in Graphs without GNNs

    Authors: Daniel Winter, Niv Cohen, Yedid Hoshen

    Abstract: Graph neural networks (GNNs) are the dominant paradigm for classifying nodes in a graph, but they have several undesirable attributes stemming from their message passing architecture. Recently, distillation methods succeeded in eliminating the use of GNNs at test time but they still require them during training. We perform a careful analysis of the role that GNNs play in distillation methods. This…

    Submitted 8 February, 2024; originally announced February 2024.

  42. arXiv:2402.00035  [pdf, other]

    cs.CV cs.LG cs.LO

    Robustness Assessment of a Runway Object Classifier for Safe Aircraft Taxiing

    Authors: Yizhak Elboher, Raya Elsaleh, Omri Isac, Mélanie Ducoffe, Audrey Galametz, Guillaume Povéda, Ryma Boumazouza, Noémie Cohen, Guy Katz

    Abstract: As deep neural networks (DNNs) are becoming the prominent solution for many computational problems, the aviation industry seeks to explore their potential in alleviating pilot workload and in improving operational safety. However, the use of DNNs in this type of safety-critical applications requires a thorough certification process. This need can be addressed through formal verification, which pro…

    Submitted 6 August, 2024; v1 submitted 8 January, 2024; originally announced February 2024.

    Comments: This is a preprint version of the paper in the proceedings of 43rd Digital Avionics Systems Conference (DASC)

  43. arXiv:2401.15620  [pdf, other]

    cs.RO cs.AI eess.SP eess.SY

    Data-Driven Strategies for Coping with Incomplete DVL Measurements

    Authors: Nadav Cohen, Itzik Klein

    Abstract: Autonomous underwater vehicles are specialized platforms engineered for deep underwater operations. Critical to their functionality is autonomous navigation, typically relying on an inertial navigation system and a Doppler velocity log. In real-world scenarios, incomplete Doppler velocity log measurements occur, resulting in positioning errors and mission aborts. To cope with such situations, a mo…

    Submitted 28 January, 2024; originally announced January 2024.

  44. Adaptive Kalman-Informed Transformer

    Authors: Nadav Cohen, Itzik Klein

    Abstract: The extended Kalman filter (EKF) is a widely adopted method for sensor fusion in navigation applications. A crucial aspect of the EKF is the online determination of the process noise covariance matrix reflecting the model uncertainty. While common EKF implementation assumes a constant process noise, in real-world scenarios, the process noise varies, leading to inaccuracies in the estimated state a…

    Submitted 7 March, 2025; v1 submitted 18 January, 2024; originally announced January 2024.

  45. arXiv:2311.14773  [pdf, other]

    cs.CV cs.LG

    Set Features for Anomaly Detection

    Authors: Niv Cohen, Issar Tzachor, Yedid Hoshen

    Abstract: This paper proposes to use set features for detecting anomalies in samples that consist of unusual combinations of normal elements. Many leading methods discover anomalies by detecting an unusual part of a sample. For example, state-of-the-art segmentation-based approaches, first classify each element of the sample (e.g., image patch) as normal or anomalous and then classify the entire sample as a…

    Submitted 18 March, 2025; v1 submitted 24 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2302.12245

  46. arXiv:2311.10609  [pdf, other]

    cs.LG cs.DB

    Scaling TabPFN: Sketching and Feature Selection for Tabular Prior-Data Fitted Networks

    Authors: Benjamin Feuer, Chinmay Hegde, Niv Cohen

    Abstract: Tabular classification has traditionally relied on supervised algorithms, which estimate the parameters of a prediction model using its training data. Recently, Prior-Data Fitted Networks (PFNs) such as TabPFN have successfully learned to classify tabular data in-context: the model parameters are designed to classify new samples based on labelled training samples given after the model training. Wh…

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: 2nd Table Representation Learning Workshop: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  47. arXiv:2310.16047  [pdf, other]

    cs.CV cs.LG eess.IV

    From Posterior Sampling to Meaningful Diversity in Image Restoration

    Authors: Noa Cohen, Hila Manor, Yuval Bahat, Tomer Michaeli

    Abstract: Image restoration problems are typically ill-posed in the sense that each degraded image can be restored in infinitely many valid ways. To accommodate this, many works generate a diverse set of outputs by attempting to randomly sample from the posterior distribution of natural images given the degraded input. Here we argue that this strategy is commonly of limited practical value because of the he…

    Submitted 11 March, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted for ICLR 2024. Code and examples are available at https://noa-cohen.github.io/MeaningfulDiversityInIR

  48. arXiv:2310.13112  [pdf, other]

    astro-ph.EP

    Asteroid 2023 NT1: A Cautionary Tale

    Authors: Brin K. Bailey, Alexander N. Cohen, Sasha Egan, Philip Lubin, Ruitao Xu, Mark Boslough, Darrel Robertson, Elizabeth Silber, Irina Sagert, Oleg Korobkin, Glenn Sjoden

    Abstract: We investigate various short-warning mitigation scenarios via fragmentation for a hypothetical impact of asteroid 2023 NT1, a Near-Earth Object (NEO) that was discovered on July 15, 2023, two days after its closest approach to Earth on July 13. The asteroid passed by Earth within ~0.25 lunar distances, with a closest approach of ~1$\times10^5$ km and velocity of 11.27 km/s. Its size remains largel…

    Submitted 16 January, 2025; v1 submitted 19 October, 2023; originally announced October 2023.

  49. arXiv:2309.14568  [pdf, other]

    cs.CL

    Introducing DictaLM -- A Large Generative Language Model for Modern Hebrew

    Authors: Shaltiel Shmidman, Avi Shmidman, Amir David Nissan Cohen, Moshe Koppel

    Abstract: We present DictaLM, a large-scale language model tailored for Modern Hebrew. Boasting 7B parameters, this model is predominantly trained on Hebrew-centric data. As a commitment to promoting research and development in the Hebrew language, we release both the foundation model and the instruct-tuned model under a Creative Commons license. Concurrently, we introduce DictaLM-Rab, another foundation mo…

    Submitted 25 September, 2023; originally announced September 2023.

  50. arXiv:2309.07091  [pdf, ps, other]

    math.OC math.AP math.PR

    Optimal adaptive control with separable drift uncertainty

    Authors: Samuel N. Cohen, Christoph Knochenhauer, Alexander Merkel

    Abstract: We consider a problem of stochastic optimal control with separable drift uncertainty in strong formulation on a finite horizon. The drift coefficient of the state $Y^{u}$ is multiplicatively influenced by an unknown random variable $λ$, while admissible controls $u$ are required to be adapted to the observation filtration. Choosing a control actively influences the state and information acquisitio…

    Submitted 10 November, 2023; v1 submitted 13 September, 2023; originally announced September 2023.