Skip to main content

Showing 1–50 of 123 results for author: Cohen, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.06261  [pdf, ps, other

    cs.CL cs.AI

    Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

    Authors: Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, Luke Marris, Sam Petulla, Colin Gaffney, Asaf Aharoni, Nathan Lintz, Tiago Cardal Pais, Henrik Jacobsson, Idan Szpektor, Nan-Jiang Jiang, Krishna Haridasan, Ahmed Omran, Nikunj Saunshi, Dara Bahri, Gaurav Mishra, Eric Chu , et al. (3278 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal unde… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: 72 pages, 17 figures

  2. arXiv:2507.03786  [pdf, ps, other

    cs.DS

    Bicriteria approximation for $k$-edge-connectivity

    Authors: Zeev Nutov, Reut Cohen

    Abstract: In the $k$-Edge Connected Spanning Subgraph ($k$-ECSS) problem we are given a (multi-)graph $G=(V,E)$ with edge costs and an integer $k$, and seek a min-cost $k$-edge-connected spanning subgraph of $G$. The problem admits a $2$-approximation algorithm and no better approximation ratio is known. Recently, Hershkowitz, Klein, and Zenklusen [STOC 24] gave a bicriteria $(1,k-10)$-approximation algorit… ▽ More

    Submitted 4 July, 2025; originally announced July 2025.

  3. arXiv:2505.21218  [pdf, ps, other

    cs.CL cs.AI

    Pretrained LLMs Learn Multiple Types of Uncertainty

    Authors: Roi Cohen, Omri Fahn, Gerard de Melo

    Abstract: Large Language Models are known to capture real-world knowledge, allowing them to excel in many downstream tasks. Despite recent advances, these models are still prone to what are commonly known as hallucinations, causing them to emit unwanted and factually incorrect text. In this work, we study how well LLMs capture uncertainty, without explicitly being trained for that. We show that, if consider… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  4. arXiv:2505.20487  [pdf, other

    cs.CL cs.AI

    InFact: Informativeness Alignment for Improved LLM Factuality

    Authors: Roi Cohen, Russa Biswas, Gerard de Melo

    Abstract: Factual completeness is a general term that captures how detailed and informative a factually correct text is. For instance, the factual sentence ``Barack Obama was born in the United States'' is factually correct, though less informative than the factual sentence ``Barack Obama was born in Honolulu, Hawaii, United States''. Despite the known fact that LLMs tend to hallucinate and generate factual… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  5. arXiv:2504.01481  [pdf, other

    cs.CR cs.LG stat.ML

    Identifying Obfuscated Code through Graph-Based Semantic Analysis of Binary Code

    Authors: Roxane Cohen, Robin David, Florian Yger, Fabrice Rossi

    Abstract: Protecting sensitive program content is a critical issue in various situations, ranging from legitimate use cases to unethical contexts. Obfuscation is one of the most used techniques to ensure such protection. Consequently, attackers must first detect and characterize obfuscation before launching any attack against it. This paper investigates the problem of function-level obfuscation detection us… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: The 13th International Conference on Complex Networks and their Applications, Dec 2024, Istabul, Turkey

  6. arXiv:2503.20429  [pdf, other

    cs.CV

    Latent Beam Diffusion Models for Decoding Image Sequences

    Authors: Guilherme Fernandes, Vasco Ramos, Regev Cohen, Idan Szpektor, João Magalhães

    Abstract: While diffusion models excel at generating high-quality images from text prompts, they struggle with visual consistency in image sequences. Existing methods generate each image independently, leading to disjointed narratives - a challenge further exacerbated in non-linear storytelling, where scenes must connect beyond adjacent frames. We introduce a novel beam search strategy for latent space expl… ▽ More

    Submitted 28 May, 2025; v1 submitted 26 March, 2025; originally announced March 2025.

  7. arXiv:2502.00012  [pdf, ps, other

    cs.CY

    Lessons from complexity theory for AI governance

    Authors: Noam Kolt, Michal Shur-Ofry, Reuven Cohen

    Abstract: The study of complex adaptive systems, pioneered in physics, biology, and the social sciences, offers important lessons for AI governance. Contemporary AI systems and the environments in which they operate exhibit many of the properties characteristic of complex systems, including nonlinear growth patterns, emergent phenomena, and cascading effects that can lead to tail risks. Complexity theory ca… ▽ More

    Submitted 3 March, 2025; v1 submitted 7 January, 2025; originally announced February 2025.

  8. arXiv:2412.06676  [pdf, other

    cs.LG cs.CL

    I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token

    Authors: Roi Cohen, Konstantin Dobler, Eden Biran, Gerard de Melo

    Abstract: Large Language Models are known to capture real-world knowledge, allowing them to excel in many downstream tasks. Despite recent advances, these models are still prone to what are commonly known as hallucinations, causing them to emit unwanted and factually incorrect text. In this work, we propose a novel calibration method that can be used to combat hallucinations. We add a special [IDK] ("I don'… ▽ More

    Submitted 9 December, 2024; originally announced December 2024.

    Comments: Published at NeurIPS 2024

  9. arXiv:2411.11869  [pdf, other

    eess.SP cs.AI cs.LG

    A Multi-Modal Unsupervised Machine Learning Approach for Biomedical Signal Processing in CPR

    Authors: Saidul Islam, Jamal Bentahar, Robin Cohen, Gaith Rjoub

    Abstract: Cardiopulmonary resuscitation (CPR) is a critical, life-saving intervention aimed at restoring blood circulation and breathing in individuals experiencing cardiac arrest or respiratory failure. Accurate and real-time analysis of biomedical signals during CPR is essential for monitoring and decision-making, from the pre-hospital stage to the intensive care unit (ICU). However, CPR signals are often… ▽ More

    Submitted 3 November, 2024; originally announced November 2024.

  10. arXiv:2411.03131  [pdf, other

    cs.LG cs.AI

    Machine Learning Innovations in CPR: A Comprehensive Survey on Enhanced Resuscitation Techniques

    Authors: Saidul Islam, Gaith Rjoub, Hanae Elmekki, Jamal Bentahar, Witold Pedrycz, Robin Cohen

    Abstract: This survey paper explores the transformative role of Machine Learning (ML) and Artificial Intelligence (AI) in Cardiopulmonary Resuscitation (CPR). It examines the evolution from traditional CPR methods to innovative ML-driven approaches, highlighting the impact of predictive modeling, AI-enhanced devices, and real-time data analysis in improving resuscitation outcomes. The paper provides a compr… ▽ More

    Submitted 3 November, 2024; originally announced November 2024.

  11. arXiv:2410.19791  [pdf, other

    eess.SP cs.CV cs.LG cs.NI

    Data-Driven Cellular Network Selector for Vehicle Teleoperations

    Authors: Barak Gahtan, Reuven Cohen, Alex M. Bronstein, Eli Shapira

    Abstract: Remote control of robotic systems, also known as teleoperation, is crucial for the development of autonomous vehicle (AV) technology. It allows a remote operator to view live video from AVs and, in some cases, to make real-time decisions. The effectiveness of video-based teleoperation systems is heavily influenced by the quality of the cellular network and, in particular, its packet loss rate and… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: IEEE Network of Future 2024

  12. arXiv:2410.06140  [pdf, other

    cs.LG cs.CV cs.NI

    Estimating the Number of HTTP/3 Responses in QUIC Using Deep Learning

    Authors: Barak Gahtan, Robert J. Shahla, Reuven Cohen, Alex M. Bronstein

    Abstract: QUIC, a new and increasingly used transport protocol, enhances TCP by offering improved security, performance, and stream multiplexing. These features, however, also impose challenges for network middle-boxes that need to monitor and analyze web traffic. This paper proposes a novel method to estimate the number of HTTP/3 responses in a given QUIC connection by an observer. This estimation reveals… ▽ More

    Submitted 28 April, 2025; v1 submitted 8 October, 2024; originally announced October 2024.

  13. arXiv:2410.03728  [pdf, other

    cs.NI cs.AI cs.CV cs.LG

    Exploring QUIC Dynamics: A Large-Scale Dataset for Encrypted Traffic Analysis

    Authors: Barak Gahtan, Robert J. Shahla, Alex M. Bronstein, Reuven Cohen

    Abstract: The increasing adoption of the QUIC transport protocol has transformed encrypted web traffic, necessitating new methodologies for network analysis. However, existing datasets lack the scope, metadata, and decryption capabilities required for robust benchmarking in encrypted traffic research. We introduce VisQUIC, a large-scale dataset of 100,000 labeled QUIC traces from over 44,000 websites, colle… ▽ More

    Submitted 24 May, 2025; v1 submitted 30 September, 2024; originally announced October 2024.

    Comments: The dataset and the supplementary material can be provided upon request

  14. arXiv:2410.02914  [pdf, other

    cs.IR cs.AI cs.LG

    Streamlining Conformal Information Retrieval via Score Refinement

    Authors: Yotam Intrator, Ori Kelner, Regev Cohen, Roman Goldenberg, Ehud Rivlin, Daniel Freedman

    Abstract: Information retrieval (IR) methods, like retrieval augmented generation, are fundamental to modern applications but often lack statistical guarantees. Conformal prediction addresses this by retrieving sets guaranteed to include relevant information, yet existing approaches produce large-sized sets, incurring high computational costs and slow response times. In this work, we introduce a score refin… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: 6 pages

  15. arXiv:2408.12570  [pdf, other

    cs.CL cs.LG

    Jamba-1.5: Hybrid Transformer-Mamba Models at Scale

    Authors: Jamba Team, Barak Lenz, Alan Arazi, Amir Bergman, Avshalom Manevich, Barak Peleg, Ben Aviram, Chen Almagor, Clara Fridman, Dan Padnos, Daniel Gissin, Daniel Jannai, Dor Muhlgay, Dor Zimberg, Edden M Gerber, Elad Dolev, Eran Krakovsky, Erez Safahi, Erez Schwartz, Gal Cohen, Gal Shachaf, Haim Rozenblum, Hofit Bata, Ido Blass, Inbal Magar , et al. (36 additional authors not shown)

    Abstract: We present Jamba-1.5, new instruction-tuned large language models based on our Jamba architecture. Jamba is a hybrid Transformer-Mamba mixture of experts architecture, providing high throughput and low memory usage across context lengths, while retaining the same or better quality as Transformer models. We release two model sizes: Jamba-1.5-Large, with 94B active parameters, and Jamba-1.5-Mini, wi… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: Webpage: https://www.ai21.com/jamba

  16. arXiv:2407.15153  [pdf, other

    cs.CV

    Anchored Diffusion for Video Face Reenactment

    Authors: Idan Kligvasser, Regev Cohen, George Leifman, Ehud Rivlin, Michael Elad

    Abstract: Video generation has drawn significant interest recently, pushing the development of large-scale models capable of producing realistic videos with coherent motion. Due to memory constraints, these models typically generate short video segments that are then combined into long videos. The merging process poses a significant challenge, as it requires ensuring smooth transitions and overall consisten… ▽ More

    Submitted 21 July, 2024; originally announced July 2024.

  17. arXiv:2406.07024  [pdf, other

    cs.GT

    Plant-and-Steal: Truthful Fair Allocations via Predictions

    Authors: Ilan Reuven Cohen, Alon Eden, Talya Eden, Arsen Vasilyan

    Abstract: We study truthful mechanisms for approximating the Maximin-Share (MMS) allocation of agents with additive valuations for indivisible goods. Algorithmically, constant factor approximations exist for the problem for any number of agents. When adding incentives to the mix, a jarring result by Amanatidis, Birmpas, Christodoulou, and Markakis [EC 2017] shows that the best possible approximation for two… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  18. arXiv:2405.16475  [pdf, other

    cs.LG cs.AI cs.CV eess.IV

    Looks Too Good To Be True: An Information-Theoretic Analysis of Hallucinations in Generative Restoration Models

    Authors: Regev Cohen, Idan Kligvasser, Ehud Rivlin, Daniel Freedman

    Abstract: The pursuit of high perceptual quality in image restoration has driven the development of revolutionary generative models, capable of producing results often visually indistinguishable from real data. However, as their perceptual quality continues to improve, these models also exhibit a growing tendency to generate hallucinations - realistic-looking details that do not exist in the ground truth im… ▽ More

    Submitted 25 October, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

  19. arXiv:2405.11566  [pdf, other

    cs.LG

    Uncertainty-Aware PPG-2-ECG for Enhanced Cardiovascular Diagnosis using Diffusion Models

    Authors: Omer Belhasin, Idan Kligvasser, George Leifman, Regev Cohen, Erin Rainaldi, Li-Fang Cheng, Nishant Verma, Paul Varghese, Ehud Rivlin, Michael Elad

    Abstract: Analyzing the cardiovascular system condition via Electrocardiography (ECG) is a common and highly effective approach, and it has been practiced and perfected over many decades. ECG sensing is non-invasive and relatively easy to acquire, and yet it is still cumbersome for holter monitoring tests that may span over hours and even days. A possible alternative in this context is Photoplethysmography… ▽ More

    Submitted 20 April, 2025; v1 submitted 19 May, 2024; originally announced May 2024.

  20. arXiv:2404.16859  [pdf, other

    cs.CL cs.SI

    Rumour Evaluation with Very Large Language Models

    Authors: Dahlia Shehata, Robin Cohen, Charles Clarke

    Abstract: Conversational prompt-engineering-based large language models (LLMs) have enabled targeted control over the output creation, enhancing versatility, adaptability and adhoc retrieval. From another perspective, digital misinformation has reached alarming levels. The anonymity, availability and reach of social media offer fertile ground for rumours to propagate. This work proposes to leverage the adva… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  21. arXiv:2402.12423  [pdf, other

    cs.SD cs.CL cs.LG eess.AS

    On the Semantic Latent Space of Diffusion-Based Text-to-Speech Models

    Authors: Miri Varshavsky-Hassid, Roy Hirsch, Regev Cohen, Tomer Golany, Daniel Freedman, Ehud Rivlin

    Abstract: The incorporation of Denoising Diffusion Models (DDMs) in the Text-to-Speech (TTS) domain is rising, providing great value in synthesizing high quality speech. Although they exhibit impressive audio quality, the extent of their semantic capabilities is unknown, and controlling their synthesized speech's vocal properties remains a challenge. Inspired by recent advances in image synthesis, we explor… ▽ More

    Submitted 4 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL 2024

  22. arXiv:2402.00857  [pdf, other

    cs.LG stat.ML

    Early Time Classification with Accumulated Accuracy Gap Control

    Authors: Liran Ringel, Regev Cohen, Daniel Freedman, Michael Elad, Yaniv Romano

    Abstract: Early time classification algorithms aim to label a stream of features without processing the full input stream, while maintaining accuracy comparable to that achieved by applying the classifier to the entire input. In this paper, we introduce a statistical framework that can be applied to any sequential classifier, formulating a calibrated stopping rule. This data-driven rule attains finite-sampl… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  23. arXiv:2401.02501  [pdf, other

    cs.CV cs.LG

    A Kolmogorov metric embedding for live cell microscopy signaling patterns

    Authors: Layton Aho, Mark Winter, Marc DeCarlo, Agne Frismantiene, Yannick Blum, Paolo Armando Gagliardi, Olivier Pertz, Andrew R. Cohen

    Abstract: We present a metric embedding that captures spatiotemporal patterns of cell signaling dynamics in 5-D $(x,y,z,channel,time)$ live cell microscopy movies. The embedding uses a metric distance called the normalized information distance (NID) based on Kolmogorov complexity theory, an absolute measure of information content between digital objects. The NID uses statistics of lossless compression to co… ▽ More

    Submitted 5 February, 2025; v1 submitted 4 January, 2024; originally announced January 2024.

  24. arXiv:2312.14506  [pdf, other

    cs.CR cs.DC

    Concurrent Asynchronous Byzantine Agreement in Expected-Constant Rounds, Revisited

    Authors: Ran Cohen, Pouyan Forghani, Juan Garay, Rutvik Patel, Vassilis Zikas

    Abstract: It is well known that without randomization, Byzantine agreement (BA) requires a linear number of rounds in the synchronous setting, while it is flat out impossible in the asynchronous setting. The primitive which allows to bypass the above limitation is known as oblivious common coin (OCC). It allows parties to agree with constant probability on a random coin, where agreement is oblivious, i.e.,… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: A preliminary version of this work appeared in TCC 2023

  25. arXiv:2310.17209  [pdf, other

    cs.CV cs.LG

    Weakly-Supervised Surgical Phase Recognition

    Authors: Roy Hirsch, Regev Cohen, Mathilde Caron, Tomer Golany, Daniel Freedman, Ehud Rivlin

    Abstract: A key element of computer-assisted surgery systems is phase recognition of surgical videos. Existing phase recognition algorithms require frame-wise annotation of a large number of videos, which is time and money consuming. In this work we join concepts of graph segmentation with self-supervised learning to derive a random-walk solution for per-frame phase prediction. Furthermore, we utilize withi… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

  26. arXiv:2309.01466  [pdf, ps, other

    cs.CR cs.DC

    Communication Lower Bounds for Cryptographic Broadcast Protocols

    Authors: Erica Blum, Elette Boyle, Ran Cohen, Chen-Da Liu-Zhang

    Abstract: Broadcast protocols enable a set of $n$ parties to agree on the input of a designated sender, even facing attacks by malicious parties. In the honest-majority setting, randomization and cryptography were harnessed to achieve low-communication broadcast with sub-quadratic total communication and balanced sub-linear cost per party. However, comparatively little is known in the dishonest-majority set… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: A preliminary version of this work appeared in DISC 2023

  27. Development and validation of an interpretable machine learning-based calculator for predicting 5-year weight trajectories after bariatric surgery: a multinational retrospective cohort SOPHIA study

    Authors: Patrick Saux, Pierre Bauvin, Violeta Raverdy, Julien Teigny, Hélène Verkindt, Tomy Soumphonphakdy, Maxence Debert, Anne Jacobs, Daan Jacobs, Valerie Monpellier, Phong Ching Lee, Chin Hong Lim, Johanna C Andersson-Assarsson, Lena Carlsson, Per-Arne Svensson, Florence Galtier, Guelareh Dezfoulian, Mihaela Moldovanu, Severine Andrieux, Julien Couster, Marie Lepage, Erminia Lembo, Ornella Verrastro, Maud Robert, Paulina Salminen , et al. (9 additional authors not shown)

    Abstract: Background Weight loss trajectories after bariatric surgery vary widely between individuals, and predicting weight loss before the operation remains challenging. We aimed to develop a model using machine learning to provide individual preoperative prediction of 5-year weight loss trajectories after surgery. Methods In this multinational retrospective observational study we enrolled adult participa… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: The Lancet Digital Health, 2023

  28. Self-Supervised Learning for Endoscopic Video Analysis

    Authors: Roy Hirsch, Mathilde Caron, Regev Cohen, Amir Livne, Ron Shapiro, Tomer Golany, Roman Goldenberg, Daniel Freedman, Ehud Rivlin

    Abstract: Self-supervised learning (SSL) has led to important breakthroughs in computer vision by allowing learning from large amounts of unlabeled data. As such, it might have a pivotal role to play in biomedicine where annotating data requires a highly specialized expertise. Yet, there are many healthcare domains for which SSL has not been extensively explored. One such domain is endoscopy, minimally inva… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: Accepted to MICCAI 2023

    Journal ref: MICCAI 2023

  29. arXiv:2308.11033  [pdf, other

    cs.DM math.CO

    Constructing cost-effective infrastructure networks

    Authors: Rotem Brand, Reuven Cohen, Baruch Barzel, Simi Haber

    Abstract: The need for reliable and low-cost infrastructure is crucial in today's world. However, achieving both at the same time is often challenging. Traditionally, infrastructure networks are designed with a radial topology lacking redundancy, which makes them vulnerable to disruptions. As a result, network topologies have evolved towards a ring topology with only one redundant edge and, from there, to m… ▽ More

    Submitted 30 July, 2023; originally announced August 2023.

  30. arXiv:2307.12976  [pdf, other

    cs.CL

    Evaluating the Ripple Effects of Knowledge Editing in Language Models

    Authors: Roi Cohen, Eden Biran, Ori Yoran, Amir Globerson, Mor Geva

    Abstract: Modern language models capture a large body of factual knowledge. However, some facts can be incorrectly induced or become obsolete over time, resulting in factually incorrect generations. This has led to the development of various editing methods that allow updating facts encoded by the model. Evaluation of these methods has primarily focused on testing whether an individual fact has been success… ▽ More

    Submitted 20 December, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: Accepted for publication in Transactions of the Association for Computational Linguistics (TACL), 2024. Author's final version

  31. arXiv:2307.09312  [pdf, other

    cs.CL cs.LG cs.MM cs.SI

    Multi-Modal Discussion Transformer: Integrating Text, Images and Graph Transformers to Detect Hate Speech on Social Media

    Authors: Liam Hebert, Gaurav Sahu, Yuxuan Guo, Nanda Kishore Sreenivas, Lukasz Golab, Robin Cohen

    Abstract: We present the Multi-Modal Discussion Transformer (mDT), a novel methodfor detecting hate speech in online social networks such as Reddit discussions. In contrast to traditional comment-only methods, our approach to labelling a comment as hate speech involves a holistic analysis of text and images grounded in the discussion context. This is done by leveraging graph transformers to capture the cont… ▽ More

    Submitted 22 February, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: Accepted to AAAI 2024 (AI for Social Impact Track)

  32. arXiv:2305.18861  [pdf, ps, other

    cs.GT cs.DS

    A General Framework for Learning-Augmented Online Allocation

    Authors: Ilan Reuven Cohen, Debmalya Panigrahi

    Abstract: Online allocation is a broad class of problems where items arriving online have to be allocated to agents who have a fixed utility/cost for each assigned item so to maximize/minimize some objective. This framework captures a broad range of fundamental problems such as the Santa Claus problem (maximizing minimum utility), Nash welfare maximization (maximizing geometric mean of utilities), makespan… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

  33. arXiv:2305.13281  [pdf, other

    cs.CL

    LM vs LM: Detecting Factual Errors via Cross Examination

    Authors: Roi Cohen, May Hamri, Mor Geva, Amir Globerson

    Abstract: A prominent weakness of modern language models (LMs) is their tendency to generate factually incorrect text, which hinders their usability. A natural question is whether such factual errors can be detected automatically. Inspired by truth-seeking mechanisms in law, we propose a factuality evaluation framework for LMs that is based on cross-examination. Our key idea is that an incorrect claim is li… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  34. arXiv:2305.12737  [pdf, other

    cs.CL cs.AI

    The Best of Both Worlds: Combining Human and Machine Translations for Multilingual Semantic Parsing with Active Learning

    Authors: Zhuang Li, Lizhen Qu, Philip R. Cohen, Raj V. Tumuluri, Gholamreza Haffari

    Abstract: Multilingual semantic parsing aims to leverage the knowledge from the high-resource languages to improve low-resource semantic parsing, yet commonly suffers from the data imbalance problem. Prior works propose to utilize the translations by either humans or machines to alleviate such issues. However, human translations are expensive, while machine translations are cheap but prone to error and bias… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  35. Must the Communication Graph of MPC Protocols be an Expander?

    Authors: Elette Boyle, Ran Cohen, Deepesh Data, Pavel Hubáček

    Abstract: Secure multiparty computation (MPC) on incomplete communication networks has been studied within two primary models: (1) Where a partial network is fixed a priori, and thus corruptions can occur dependent on its structure, and (2) Where edges in the communication graph are determined dynamically as part of the protocol. Whereas a rich literature has succeeded in mapping out the feasibility and lim… ▽ More

    Submitted 21 June, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Journal ref: Journal of Cryptology 36, 20 (2023)

  36. A Survey on Explainable Artificial Intelligence for Cybersecurity

    Authors: Gaith Rjoub, Jamal Bentahar, Omar Abdel Wahab, Rabeb Mizouni, Alyssa Song, Robin Cohen, Hadi Otrok, Azzam Mourad

    Abstract: The black-box nature of artificial intelligence (AI) models has been the source of many concerns in their use for critical applications. Explainable Artificial Intelligence (XAI) is a rapidly growing research field that aims to create machine learning models that can provide clear and interpretable explanations for their decisions and actions. In the field of network cybersecurity, XAI has the pot… ▽ More

    Submitted 11 June, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

  37. arXiv:2302.09646  [pdf

    cs.AI

    An Explainable Collaborative Dialogue System using a Theory of Mind

    Authors: Philip R. Cohen, Lucian Galescu, Maayan Shvo

    Abstract: Eva is a neuro-symbolic domain-independent multimodal collaborative dialogue system that takes seriously that the purpose of task-oriented dialogue is to assist the user. To do this, the system collaborates by inferring their intentions and plans, detects obstacles to success, finds plans to overcome them or to achieve higher-level goals, and plans its actions, including speech acts, to help users… ▽ More

    Submitted 20 June, 2024; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: 46 pages, 7 figures, 2 appendices

    ACM Class: I.2.7; I.2.8; I.2.4; I.2.3; I.2.11

  38. arXiv:2301.12810  [pdf, other

    cs.CL cs.AI

    Crawling the Internal Knowledge-Base of Language Models

    Authors: Roi Cohen, Mor Geva, Jonathan Berant, Amir Globerson

    Abstract: Language models are trained on large volumes of text, and as a result their parameters might contain a significant body of factual knowledge. Any downstream task performed by these models implicitly builds on these facts, and thus it is highly desirable to have means for representing this body of knowledge in an interpretable way. However, there is currently no mechanism for such a representation.… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: To be published in EACL 2023 (Findings)

  39. arXiv:2301.10871  [pdf, other

    cs.LG cs.CL cs.SI

    Qualitative Analysis of a Graph Transformer Approach to Addressing Hate Speech: Adapting to Dynamically Changing Content

    Authors: Liam Hebert, Hong Yi Chen, Robin Cohen, Lukasz Golab

    Abstract: Our work advances an approach for predicting hate speech in social media, drawing out the critical need to consider the discussions that follow a post to successfully detect when hateful discourse may arise. Using graph transformer networks, coupled with modelling attention and BERT-level natural language processing, our approach can capture context and anticipate upcoming anti-social behaviour. I… ▽ More

    Submitted 30 April, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: Accepted at AAAI 2023 AI for Social Good

  40. arXiv:2301.04248  [pdf, other

    cs.CL cs.LG cs.SI

    Predicting Hateful Discussions on Reddit using Graph Transformer Networks and Communal Context

    Authors: Liam Hebert, Lukasz Golab, Robin Cohen

    Abstract: We propose a system to predict harmful discussions on social media platforms. Our solution uses contextual deep language models and proposes the novel idea of integrating state-of-the-art Graph Transformer Networks to analyze all conversations that follow an initial post. This framework also supports adapting to future comments as the conversation unfolds. In addition, we study whether a community… ▽ More

    Submitted 10 January, 2023; originally announced January 2023.

    Comments: Accepted and Presented at WI-IAT 22

  41. arXiv:2211.15211  [pdf, other

    cs.CV cs.LG

    What's Behind the Mask: Estimating Uncertainty in Image-to-Image Problems

    Authors: Gilad Kutiel, Regev Cohen, Michael Elad, Daniel Freedman

    Abstract: Estimating uncertainty in image-to-image networks is an important task, particularly as such networks are being increasingly deployed in the biological and medical imaging realms. In this paper, we introduce a new approach to this problem based on masking. Given an existing image-to-image network, our approach computes a mask such that the distance between the masked reconstructed image and the ma… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  42. arXiv:2210.01423  [pdf, other

    cs.NI cs.LG

    Using Deep Reinforcement Learning for mmWave Real-Time Scheduling

    Authors: Barak Gahtan, Reuven Cohen, Alex M. Bronstein, Gil Kedar

    Abstract: We study the problem of real-time scheduling in a multi-hop millimeter-wave (mmWave) mesh. We develop a model-free deep reinforcement learning algorithm called Adaptive Activator RL (AARL), which determines the subset of mmWave links that should be activated during each time slot and the power level for each link. The most important property of AARL is its ability to make scheduling decisions with… ▽ More

    Submitted 18 February, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

  43. arXiv:2206.02848  [pdf

    cs.CY

    Plagiarism deterrence for introductory programming

    Authors: Simon J. Cohen, Michael J. Martin, Chance A. Shipley, Abhishek Kumar, Andrew R. Cohen

    Abstract: Plagiarism in introductory programming courses is an enormous challenge for both students and institutions. For students, relying on the work of others too early in their academic development can make it impossible to acquire necessary skills for independent success in the future. For institutions, widespread student cheating can dilute the quality of the educational experience being offered. Curr… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

  44. arXiv:2205.13697  [pdf, other

    cs.LG cs.AI cs.MA

    FedFormer: Contextual Federation with Attention in Reinforcement Learning

    Authors: Liam Hebert, Lukasz Golab, Pascal Poupart, Robin Cohen

    Abstract: A core issue in multi-agent federated reinforcement learning is defining how to aggregate insights from multiple agents. This is commonly done by taking the average of each participating agent's model weights into one common model (FedAvg). We instead propose FedFormer, a novel federation strategy that utilizes Transformer Attention to contextually aggregate embeddings from models originating from… ▽ More

    Submitted 2 March, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: Our source code can be found at https://github.com/liamhebert/FedFormer. Accepted at AAMAS 2023

  45. Evolution is Driven by Natural Autoencoding: Reframing Species, Interaction Codes, Cooperation, and Sexual Reproduction

    Authors: Irun R. Cohen, Assaf Marron

    Abstract: The continuity of life and its evolution, we proposed, emerge from an interactive group process manifested in networks of interaction. We term this process \textit{survival-of-the-fitted}. Here, we reason that survival of the fitted results from a natural computational process we term \textit{natural autoencoding}. Natural autoencoding works by retaining repeating biological interactions while non… ▽ More

    Submitted 3 February, 2023; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: In version 7 we added various clarifications including to terminology, definitions, and to the reference to the second law of thermodynamics

    Journal ref: Proc. Roy. Soc. B v.290 no.1994 (2023)

  46. arXiv:2203.09016  [pdf, other

    cs.HC cs.CL

    Natural Language Communication with a Teachable Agent

    Authors: Rachel Love, Edith Law, Philip R. Cohen, Dana Kulić

    Abstract: Conversational teachable agents offer a promising platform to support learning, both in the classroom and in remote settings. In this context, the agent takes the role of the novice, while the student takes on the role of teacher. This framing is significant for its ability to elicit the Protégé effect in the student-teacher, a pedagogical phenomenon known to increase engagement in the teaching ta… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: This work has been submitted to the IEEE for possible publication

  47. arXiv:2201.04807  [pdf, other

    stat.ML cs.LG

    Active Learning-Based Multistage Sequential Decision-Making Model with Application on Common Bile Duct Stone Evaluation

    Authors: Hongzhen Tian, Reuven Zev Cohen, Chuck Zhang, Yajun Mei

    Abstract: Multistage sequential decision-making scenarios are commonly seen in the healthcare diagnosis process. In this paper, an active learning-based method is developed to actively collect only the necessary patient data in a sequential manner. There are two novelties in the proposed method. First, unlike the existing ordinal logistic regression model which only models a single stage, we estimate the pa… ▽ More

    Submitted 13 January, 2022; originally announced January 2022.

  48. arXiv:2201.01222  [pdf, other

    cs.LG cs.CV

    The cluster structure function

    Authors: Andrew R. Cohen, Paul M. B. Vitányi

    Abstract: For each partition of a data set into a given number of parts there is a partition such that every part is as much as possible a good model (an "algorithmic sufficient statistic") for the data in that part. Since this can be done for every number between one and the number of data, the result is a function, the cluster structure function. It maps the number of parts of a partition to values relate… ▽ More

    Submitted 14 October, 2022; v1 submitted 4 January, 2022; originally announced January 2022.

  49. arXiv:2112.06883  [pdf, other

    cs.SE cs.AI cs.DC

    A Methodology for a Scalable, Collaborative, and Resource-Efficient Platform to Facilitate Healthcare AI Research

    Authors: Raphael Y. Cohen, Vesela P. Kovacheva

    Abstract: Healthcare AI holds the potential to increase patient safety, augment efficiency and improve patient outcomes, yet research is often limited by data access, cohort curation, and tooling for analysis. Collection and translation of electronic health record data, live data, and real-time high resolution device data can be challenging and time-consuming. The development of real-world AI tools requires… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.

  50. arXiv:2112.00794  [pdf, other

    eess.IV cs.CV

    DFTS2: Simulating Deep Feature Transmission Over Packet Loss Channels

    Authors: Ashiv Dhondea, Robert A. Cohen, Ivan V. Bajić

    Abstract: In edge-cloud collaborative intelligence (CI), an unreliable transmission channel exists in the information path of the AI model performing the inference. It is important to be able to simulate the performance of the CI system across an imperfect channel in order to understand system behavior and develop appropriate error control strategies. In this paper we present a simulation framework called D… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

    Comments: 6 pages, 4 figures, IEEE Conference on Visual Communications and Image Processing (VCIP) 2021