Skip to main content

Showing 1–50 of 122 results for author: Kazemi, M

.
  1. arXiv:2506.08249  [pdf, other

    cs.DB cs.CL

    RADAR: Benchmarking Language Models on Imperfect Tabular Data

    Authors: Ken Gu, Zhihan Zhang, Kate Lin, Yuwei Zhang, Akshay Paruchuri, Hong Yu, Mehran Kazemi, Kumar Ayush, A. Ali Heydari, Maxwell A. Xu, Girish Narayanswamy, Yun Liu, Ming-Zher Poh, Yuzhe Yang, Mark Malhotra, Shwetak Patel, Hamid Palangi, Xuhai Xu, Daniel McDuff, Tim Althoff, Xin Liu

    Abstract: Language models (LMs) are increasingly being deployed to perform autonomous data analyses. However, their data awareness -- the ability to recognize, reason over, and appropriately handle data artifacts such as missing values, outliers, and logical inconsistencies -- remains underexplored. These artifacts are especially common in real-world tabular data and, if mishandled, can significantly compro… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  2. arXiv:2506.02946  [pdf, ps, other

    cs.LG

    Abstract Counterfactuals for Language Model Agents

    Authors: Edoardo Pona, Milad Kazemi, Yali Du, David Watson, Nicola Paoletti

    Abstract: Counterfactual inference is a powerful tool for analysing and evaluating autonomous agents, but its application to language model (LM) agents remains challenging. Existing work on counterfactuals in LMs has primarily focused on token-level counterfactuals, which are often inadequate for LM agents due to their open-ended action spaces. Unlike traditional agents with fixed, clearly defined action sp… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  3. arXiv:2505.15693  [pdf, other

    cs.AI

    Average Reward Reinforcement Learning for Omega-Regular and Mean-Payoff Objectives

    Authors: Milad Kazemi, Mateo Perez, Fabio Somenzi, Sadegh Soudjani, Ashutosh Trivedi, Alvaro Velasquez

    Abstract: Recent advances in reinforcement learning (RL) have renewed focus on the design of reward functions that shape agent behavior. Manually designing reward functions is tedious and error-prone. A principled alternative is to specify behaviors in a formal language that can be automatically translated into rewards. Omega-regular languages are a natural choice for this purpose, given their established r… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: 29 pages, 6 figures and 2 tables

  4. arXiv:2505.01371  [pdf, other

    cs.CE

    SimICD: A Closed-Loop Simulation Framework For ICD Therapy

    Authors: Hannah Lydon, Milad Kazemi, Martin Bishop, Nicola Paoletti

    Abstract: Virtual studies of ICD behaviour are crucial for testing device functionality in a controlled environment prior to clinical application. Although previous works have shown the viability of using in silico testing for diagnosis, there is a notable gap in available models that can simulate therapy progression decisions during arrhythmic episodes. This work introduces SimICD, a simulation tool which… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

    Comments: Accepted for publication in the 47th annual Engineering in Medicine and Biology Conference (EMBC)

  5. arXiv:2504.11131  [pdf, other

    eess.SP

    A Fully Asynchronous Unsourced Random Access Scheme

    Authors: Mert Ozates, Mohammad Kazemi, Gianluigi Liva, Deniz Gündüz

    Abstract: We investigate fully asynchronous unsourced random access (URA), and propose a high-performing scheme that employs on-off division multiple access (ODMA). In this scheme, active users distribute their data over the transmit block based on a sparse transmission pattern without any limitations on the starting time. At the receiver side, we adopt a double sliding-window decoding approach, utilizing a… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

  6. arXiv:2503.19786  [pdf, other

    cs.CL cs.AI

    Gemma 3 Technical Report

    Authors: Gemma Team, Aishwarya Kamath, Johan Ferret, Shreya Pathak, Nino Vieillard, Ramona Merhej, Sarah Perrin, Tatiana Matejovicova, Alexandre Ramé, Morgane Rivière, Louis Rouillard, Thomas Mesnard, Geoffrey Cideron, Jean-bastien Grill, Sabela Ramos, Edouard Yvinec, Michelle Casbon, Etienne Pot, Ivo Penchev, Gaël Liu, Francesco Visin, Kathleen Kenealy, Lucas Beyer, Xiaohai Zhai, Anton Tsitsulin , et al. (191 additional authors not shown)

    Abstract: We introduce Gemma 3, a multimodal addition to the Gemma family of lightweight open models, ranging in scale from 1 to 27 billion parameters. This version introduces vision understanding abilities, a wider coverage of languages and longer context - at least 128K tokens. We also change the architecture of the model to reduce the KV-cache memory that tends to explode with long context. This is achie… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

  7. arXiv:2503.02545  [pdf, other

    cs.IT

    Characterization of Deletion/Substitution Channel Capacity for Small Deletion and Substitution Probabilities

    Authors: Mohammad Kazemi, Tolga M. Duman

    Abstract: We consider binary input deletion/substitution channels, which model certain channels with synchronization errors encountered in practice. Specifically, we focus on the regime of small deletion and substitution probabilities, and by extending an approach developed for the deletion-only channel, we obtain an asymptotic characterization of the channel capacity for independent and identically distrib… ▽ More

    Submitted 23 April, 2025; v1 submitted 4 March, 2025; originally announced March 2025.

  8. arXiv:2502.19187  [pdf, other

    cs.CL

    BIG-Bench Extra Hard

    Authors: Mehran Kazemi, Bahare Fatemi, Hritik Bansal, John Palowitch, Chrysovalantis Anastasiou, Sanket Vaibhav Mehta, Lalit K. Jain, Virginia Aglietti, Disha Jindal, Peter Chen, Nishanth Dikkala, Gladys Tyen, Xin Liu, Uri Shalit, Silvia Chiappa, Kate Olszewska, Yi Tay, Vinh Q. Tran, Quoc V. Le, Orhan Firat

    Abstract: Large language models (LLMs) are increasingly deployed in everyday applications, demanding robust general reasoning capabilities and diverse reasoning skillset. However, current LLM reasoning benchmarks predominantly focus on mathematical and coding abilities, leaving a gap in evaluating broader reasoning proficiencies. One particular exception is the BIG-Bench dataset, which has served as a cruci… ▽ More

    Submitted 6 May, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

  9. arXiv:2502.13731  [pdf, ps, other

    cs.AI

    Robust Counterfactual Inference in Markov Decision Processes

    Authors: Jessica Lally, Milad Kazemi, Nicola Paoletti

    Abstract: This paper addresses a key limitation in existing counterfactual inference methods for Markov Decision Processes (MDPs). Current approaches assume a specific causal model to make counterfactuals identifiable. However, there are usually many causal models that align with the observational and interventional distributions of an MDP, each yielding different counterfactual distributions, so fixing a p… ▽ More

    Submitted 27 March, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Comments: Fixed typo in Equation (5)

  10. arXiv:2502.10302  [pdf, other

    quant-ph

    Investigation of intrinsic nonlinear effects in driven-dissipative optomechanical systems using the generalized linear response theory

    Authors: B. Askari, A. Dalafi, M. J. Kazemi

    Abstract: In this article, we study the effects of intrinsic nonlinear optomechanical interaction on the linear response of a driven-dissipative optomechanical system to a weak time-dependent perturbation. By calculating the linear response of the cavity optical mode to a weak probe laser in the framework of the generalized linear response theory, it is shown how the Stokes and anti-Stokes sideband amplitud… ▽ More

    Submitted 14 February, 2025; originally announced February 2025.

  11. arXiv:2501.18888  [pdf, ps, other

    stat.ME

    Measuring Inaccuracies in the Proportional Hazard Rate Model based on Extropy using a Length-Biased Weighted Residual approach

    Authors: M. Hashempour, M. R. Kazemi

    Abstract: In this paper, we consider the concept of the residual inaccuracy measure and extend it to its weighted version based on extropy. Properties of this measure are studied and the discrimination principle is applied in the class of proportional hazard rate (PHR) models. A characterization problem for the proposed weighted extropy-inaccuracy measure is studied. We propose some alternative expressions… ▽ More

    Submitted 30 January, 2025; originally announced January 2025.

  12. arXiv:2501.18298  [pdf, other

    cs.LG cs.DC

    Update Estimation and Scheduling for Over-the-Air Federated Learning with Energy Harvesting Devices

    Authors: Furkan Bagci, Busra Tegin, Mohammad Kazemi, Tolga M. Duman

    Abstract: We study over-the-air (OTA) federated learning (FL) for energy harvesting devices with heterogeneous data distribution over wireless fading multiple access channel (MAC). To address the impact of low energy arrivals and data heterogeneity on global learning, we propose user scheduling strategies. Specifically, we develop two approaches: 1) entropy-based scheduling for known data distributions and… ▽ More

    Submitted 30 January, 2025; originally announced January 2025.

    Comments: 6 pages

  13. arXiv:2501.10201  [pdf, other

    cs.ET cs.IT eess.SY

    ODMA-Based Cell-Free Unsourced Random Access with Successive Interference Cancellation

    Authors: Mert Ozates, Mohammad Kazemi, Eduard Jorswieck, Deniz Gunduz

    Abstract: We consider the unsourced random access problem with multiple receivers and propose a cell-free type solution for that. In our proposed scheme, the active users transmit their signals to the access points (APs) distributed in a geographical area and connected to a central processing unit (CPU). The transmitted signals are composed of a pilot and polar codeword, where the polar codeword bits occupy… ▽ More

    Submitted 17 January, 2025; originally announced January 2025.

  14. arXiv:2412.04703  [pdf, other

    cs.CL cs.AI cs.LG

    Transformers Struggle to Learn to Search

    Authors: Abulhair Saparov, Srushti Pawar, Shreyas Pimpalgaonkar, Nitish Joshi, Richard Yuanzhe Pang, Vishakh Padmakumar, Seyed Mehran Kazemi, Najoung Kim, He He

    Abstract: Search is an ability foundational in many important tasks, and recent studies have shown that large language models (LLMs) struggle to perform search robustly. It is unknown whether this inability is due to a lack of data, insufficient model parameters, or fundamental limitations of the transformer architecture. In this work, we use the foundational graph connectivity problem as a testbed to gener… ▽ More

    Submitted 16 March, 2025; v1 submitted 5 December, 2024; originally announced December 2024.

    Comments: Published as a conference paper at ICLR 2025

  15. arXiv:2410.17002  [pdf, ps, other

    cs.GT cs.DS

    EFX Allocations and Orientations on Bipartite Multi-graphs: A Complete Picture

    Authors: Mahyar Afshinmehr, Alireza Danaei, Mehrafarin Kazemi, Kurt Mehlhorn, Nidhi Rathi

    Abstract: We consider the fundamental problem of fairly allocating a set of indivisible items among agents having valuations that are represented by a multi-graph -- here, agents appear as the vertices and items as the edges between them and each vertex (agent) only values the set of its incident edges (items). The goal is to find a fair, i.e., envy-free up to any item (EFX) allocation. This model has recen… ▽ More

    Submitted 26 October, 2024; v1 submitted 22 October, 2024; originally announced October 2024.

  16. arXiv:2410.13510  [pdf, other

    cs.CL cs.CV

    GeoCoder: Solving Geometry Problems by Generating Modular Code through Vision-Language Models

    Authors: Aditya Sharma, Aman Dalmia, Mehran Kazemi, Amal Zouaq, Christopher J. Pal

    Abstract: Geometry problem-solving demands advanced reasoning abilities to process multimodal inputs and employ mathematical knowledge effectively. Vision-language models (VLMs) have made significant progress in various multimodal tasks. Yet, they still struggle with geometry problems and are significantly limited by their inability to perform mathematical operations not seen during pre-training, such as ca… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  17. arXiv:2410.02342  [pdf, other

    cs.IT

    Capacity Bounds for the Poisson-Repeat Channel

    Authors: Mohammad Kazemi, Tolga M. Duman

    Abstract: We develop bounds on the capacity of Poisson-repeat channels (PRCs) for which each input bit is independently repeated according to a Poisson distribution. The upper bounds are obtained by considering an auxiliary channel where the output lengths corresponding to input blocks of a given length are provided as side information at the receiver. Numerical results show that the resulting upper bounds… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

  18. arXiv:2410.02274  [pdf, ps, other

    cs.GT

    MMS Approximations Under Additive Leveled Valuations

    Authors: Mahyar Afshinmehr, Mehrafarin Kazemi, Kurt Mehlhorn

    Abstract: We study the problem of fairly allocating indivisible goods to a set of agents with additive leveled valuations. A valuation function is called leveled if and only if bundles of larger size have larger value than bundles of smaller size. The economics literature has well studied such valuations. We use the maximin-share (MMS) and EFX as standard notions of fairness. We show that an algorithm int… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

  19. arXiv:2409.14911  [pdf, other

    cs.IT

    Unsourced Random Access: A Comprehensive Survey

    Authors: Mert Ozates, Mohammad Javad Ahmadi, Mohammad Kazemi, Deniz Gündüz, Tolga M. Duman

    Abstract: Multiple access communication systems enable numerous users to share common communication resources simultaneously, playing a crucial role in wireless networks. With the emergence of the sixth generation (6G) and beyond communication systems, supporting massive machine-type communications with sporadic activity patterns is expected to become a critical challenge. Unsourced random access (URA) has… ▽ More

    Submitted 24 February, 2025; v1 submitted 23 September, 2024; originally announced September 2024.

  20. arXiv:2409.12640  [pdf, other

    cs.CL cs.LG

    Michelangelo: Long Context Evaluations Beyond Haystacks via Latent Structure Queries

    Authors: Kiran Vodrahalli, Santiago Ontanon, Nilesh Tripuraneni, Kelvin Xu, Sanil Jain, Rakesh Shivanna, Jeffrey Hui, Nishanth Dikkala, Mehran Kazemi, Bahare Fatemi, Rohan Anil, Ethan Dyer, Siamak Shakeri, Roopali Vij, Harsh Mehta, Vinay Ramasesh, Quoc Le, Ed Chi, Yifeng Lu, Orhan Firat, Angeliki Lazaridou, Jean-Baptiste Lespiau, Nithya Attaluri, Kate Olszewska

    Abstract: We introduce Michelangelo: a minimal, synthetic, and unleaked long-context reasoning evaluation for large language models which is also easy to automatically score. This evaluation is derived via a novel, unifying framework for evaluations over arbitrarily long contexts which measure the model's ability to do more than retrieve a single piece of information from its context. The central idea of th… ▽ More

    Submitted 19 September, 2024; v1 submitted 19 September, 2024; originally announced September 2024.

  21. arXiv:2408.16737  [pdf, other

    cs.CL cs.AI

    Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling

    Authors: Hritik Bansal, Arian Hosseini, Rishabh Agarwal, Vinh Q. Tran, Mehran Kazemi

    Abstract: Training on high-quality synthetic data from strong language models (LMs) is a common strategy to improve the reasoning performance of LMs. In this work, we revisit whether this strategy is compute-optimal under a fixed inference budget (e.g., FLOPs). To do so, we investigate the trade-offs between generating synthetic data using a stronger but more expensive (SE) model versus a weaker but cheaper… ▽ More

    Submitted 7 October, 2024; v1 submitted 29 August, 2024; originally announced August 2024.

  22. arXiv:2408.15240  [pdf, other

    cs.LG

    Generative Verifiers: Reward Modeling as Next-Token Prediction

    Authors: Lunjun Zhang, Arian Hosseini, Hritik Bansal, Mehran Kazemi, Aviral Kumar, Rishabh Agarwal

    Abstract: Verifiers or reward models are often used to enhance the reasoning performance of large language models (LLMs). A common approach is the Best-of-N method, where N candidate solutions generated by the LLM are ranked by a verifier, and the best one is selected. While LLM-based verifiers are typically trained as discriminative classifiers to score solutions, they do not utilize the text generation ca… ▽ More

    Submitted 22 February, 2025; v1 submitted 27 August, 2024; originally announced August 2024.

    Comments: ICLR 2025

  23. arXiv:2408.13329  [pdf, other

    cs.IT

    RIS-Aided Unsourced Multiple Access (RISUMA): Coding Strategy and Performance Limits

    Authors: Mohammad Javad Ahmadi, Mohammad Kazemi, Tolga M. Duman

    Abstract: This paper considers an unsourced random access (URA) set-up equipped with a passive reconfigurable intelligent surface (RIS), where a massive number of unidentified users (only a small fraction of them being active at any given time) are connected to the base station (BS). We introduce a slotted coding scheme for which each active user chooses a slot at random for transmitting its signal, consist… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

  24. arXiv:2408.12416  [pdf, other

    cs.SE cs.LG

    Unlearning Trojans in Large Language Models: A Comparison Between Natural Language and Source Code

    Authors: Mahdi Kazemi, Aftab Hussain, Md Rafiqul Islam Rabin, Mohammad Amin Alipour, Sen Lin

    Abstract: This work investigates the application of Machine Unlearning (MU) for mitigating the impact of trojans embedded in conventional large language models of natural language (Text-LLMs) and large language models of code (Code-LLMs) We propose a novel unlearning approach, LYA, that leverages both gradient ascent and elastic weight consolidation, a Fisher Information Matrix (FIM) based regularization te… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

  25. arXiv:2408.00118  [pdf, other

    cs.CL cs.AI

    Gemma 2: Improving Open Language Models at a Practical Size

    Authors: Gemma Team, Morgane Riviere, Shreya Pathak, Pier Giuseppe Sessa, Cassidy Hardin, Surya Bhupatiraju, Léonard Hussenot, Thomas Mesnard, Bobak Shahriari, Alexandre Ramé, Johan Ferret, Peter Liu, Pouya Tafti, Abe Friesen, Michelle Casbon, Sabela Ramos, Ravin Kumar, Charline Le Lan, Sammy Jerome, Anton Tsitsulin, Nino Vieillard, Piotr Stanczyk, Sertan Girgin, Nikola Momchev, Matt Hoffman , et al. (173 additional authors not shown)

    Abstract: In this work, we introduce Gemma 2, a new addition to the Gemma family of lightweight, state-of-the-art open models, ranging in scale from 2 billion to 27 billion parameters. In this new version, we apply several known technical modifications to the Transformer architecture, such as interleaving local-global attentions (Beltagy et al., 2020a) and group-query attention (Ainslie et al., 2023). We al… ▽ More

    Submitted 2 October, 2024; v1 submitted 31 July, 2024; originally announced August 2024.

  26. arXiv:2407.16007  [pdf, other

    cs.CL

    SocialQuotes: Learning Contextual Roles of Social Media Quotes on the Web

    Authors: John Palowitch, Hamidreza Alvari, Mehran Kazemi, Tanvir Amin, Filip Radlinski

    Abstract: Web authors frequently embed social media to support and enrich their content, creating the potential to derive web-based, cross-platform social media representations that can enable more effective social media retrieval systems and richer scientific analyses. As step toward such capabilities, we introduce a novel language modeling framework that enables automatic annotation of roles that social m… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  27. arXiv:2406.09175  [pdf, other

    cs.CV cs.CL

    ReMI: A Dataset for Reasoning with Multiple Images

    Authors: Mehran Kazemi, Nishanth Dikkala, Ankit Anand, Petar Devic, Ishita Dasgupta, Fangyu Liu, Bahare Fatemi, Pranjal Awasthi, Dee Guo, Sreenivas Gollapudi, Ahmed Qureshi

    Abstract: With the continuous advancement of large language models (LLMs), it is essential to create new benchmarks to effectively evaluate their expanding capabilities and identify areas for improvement. This work focuses on multi-image reasoning, an emerging capability in state-of-the-art LLMs. We introduce ReMI, a dataset designed to assess LLMs' ability to Reason with Multiple Images. This dataset encom… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  28. arXiv:2406.09170  [pdf, other

    cs.CL

    Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning

    Authors: Bahare Fatemi, Mehran Kazemi, Anton Tsitsulin, Karishma Malkan, Jinyeong Yim, John Palowitch, Sungyong Seo, Jonathan Halcrow, Bryan Perozzi

    Abstract: Large language models (LLMs) have showcased remarkable reasoning capabilities, yet they remain susceptible to errors, particularly in temporal reasoning tasks involving complex temporal logic. Existing research has explored LLM performance on temporal reasoning using diverse datasets and benchmarks. However, these studies often rely on real-world data that LLMs may have encountered during pre-trai… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  29. arXiv:2406.06284  [pdf, other

    cs.IT eess.SY

    An ODMA-Based Unsourced Random Access Scheme with a Multiple Antenna Receiver

    Authors: Mert Ozates, Mohammad Kazemi, Tolga M. Duman

    Abstract: We investigate the unsourced random access scheme assuming that the base station is equipped with multiple antennas, and propose a high-performing solution utilizing on-off-division multiple access. We assume that each user spreads its pilot sequence and polar codeword to the pilot and data parts of the transmission frame, respectively, based on a transmission pattern. The iterative receiver opera… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  30. arXiv:2405.18512  [pdf, ps, other

    cs.LG cs.AI

    Understanding Transformer Reasoning Capabilities via Graph Algorithms

    Authors: Clayton Sanford, Bahare Fatemi, Ethan Hall, Anton Tsitsulin, Mehran Kazemi, Jonathan Halcrow, Bryan Perozzi, Vahab Mirrokni

    Abstract: Which transformer scaling regimes are able to perfectly solve different classes of algorithmic problems? While tremendous empirical advances have been attained by transformer-based neural networks, a theoretical understanding of their algorithmic reasoning capabilities in realistic parameter regimes is lacking. We investigate this question in terms of the network's depth, width, and number of extr… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 43 pages, 8 figures

  31. arXiv:2405.07144  [pdf, other

    quant-ph

    Optical transition parameters of the silicon T centre

    Authors: Chloe Clear, Sara Hosseini, Amirhossein AlizadehKhaledi, Nicholas Brunelle, Austin Woolverton, Joshua Kanaganayagam, Moein Kazemi, Camille Chartrand, Mehdi Keshavarz, Yihuang Xiong, Louis Alaerts, Oney O. Soykal, Geoffroy Hautier, Valentin Karassiouk, Mike Thewalt, Daniel Higginbottom, Stephanie Simmons

    Abstract: The silicon T centre's narrow, telecommunications-band optical emission, long spin coherence, and direct photonic integration have spurred interest in this emitter as a spin-photon interface for distributed quantum computing and networking. However, key parameters of the T centre's spin-selective optical transitions remain undetermined or ambiguous in literature. In this paper we present a Hamilto… ▽ More

    Submitted 8 November, 2024; v1 submitted 11 May, 2024; originally announced May 2024.

    Comments: 18 pages and 13 figures in the main manuscript with appendix included

  32. arXiv:2403.17853  [pdf, other

    cs.CL cs.LG

    Using Domain Knowledge to Guide Dialog Structure Induction via Neural Probabilistic Soft Logic

    Authors: Connor Pryor, Quan Yuan, Jeremiah Liu, Mehran Kazemi, Deepak Ramachandran, Tania Bedrax-Weiss, Lise Getoor

    Abstract: Dialog Structure Induction (DSI) is the task of inferring the latent dialog structure (i.e., a set of dialog states and their temporal transitions) of a given goal-oriented dialog. It is a critical component for modern dialog system design and discourse analysis. Existing DSI approaches are often purely data-driven, deploy models that infer latent states without access to domain knowledge, underpe… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  33. Conformal Off-Policy Prediction for Multi-Agent Systems

    Authors: Tom Kuipers, Renukanandan Tumu, Shuo Yang, Milad Kazemi, Rahul Mangharam, Nicola Paoletti

    Abstract: Off-Policy Prediction (OPP), i.e., predicting the outcomes of a target policy using only data collected under a nominal (behavioural) policy, is a paramount problem in data-driven analysis of safety-critical systems where the deployment of a new policy may be unsafe. To achieve dependable off-policy predictions, recent work on Conformal Off-Policy Prediction (COPP) leverage the conformal predictio… ▽ More

    Submitted 15 September, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted for publication in the 63rd IEEE Conference on Decision and Control (CDC) 2024

  34. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1112 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 16 December, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  35. arXiv:2403.02223  [pdf, other

    cond-mat.mes-hall

    All-optical control of skyrmion configuration in CrI$_3$ monolayer

    Authors: M. Kazemi, A. Kudlis, P. F. Bessarab, I. A. Shelykh

    Abstract: The potential for manipulating characteristics of skyrmions in a CrI$_3$ monolayer using circularly polarised light is explored. The effective skyrmion-light interaction is mediated by bright excitons whose magnetization is selectively influenced by the polarization of photons. The light-induced skyrmion dynamics is illustrated by the dependencies of the skyrmion size and the skyrmion lifetime on… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  36. arXiv:2402.11350  [pdf, ps, other

    quant-ph hep-th math-ph physics.atom-ph

    Non-Heisenbergian quantum mechanics

    Authors: MohammadJavad Kazemi, Ghadir Jafari

    Abstract: Relaxing the postulates of an axiomatic theory is a natural way to find more general theories, and historically, the discovery of non-Euclidean geometry is a famous example of this procedure. Here, we use this way to extend quantum mechanics by ignoring the heart of Heisenberg's quantum mechanics -- We do not assume the existence of a position operator that satisfies the Heisenberg commutation rel… ▽ More

    Submitted 14 October, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  37. arXiv:2402.08514  [pdf, other

    cs.AI

    Counterfactual Influence in Markov Decision Processes

    Authors: Milad Kazemi, Jessica Lally, Ekaterina Tishchenko, Hana Chockler, Nicola Paoletti

    Abstract: Our work addresses a fundamental problem in the context of counterfactual inference for Markov Decision Processes (MDPs). Given an MDP path $τ$, this kind of inference allows us to derive counterfactual paths $τ'$ describing what-if versions of $τ$ obtained under different action sequences than those observed in $τ$. However, as the counterfactual states and actions deviate from the observed ones… ▽ More

    Submitted 27 March, 2025; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: 12 pages, 6 figures

  38. arXiv:2402.05862  [pdf, other

    cs.LG cs.AI cs.SI stat.ML

    Let Your Graph Do the Talking: Encoding Structured Data for LLMs

    Authors: Bryan Perozzi, Bahare Fatemi, Dustin Zelle, Anton Tsitsulin, Mehran Kazemi, Rami Al-Rfou, Jonathan Halcrow

    Abstract: How can we best encode structured data into sequential form for use in large language models (LLMs)? In this work, we introduce a parameter-efficient method to explicitly represent structured data for LLMs. Our method, GraphToken, learns an encoding function to extend prompts with explicit structured information. Unlike other work which focuses on limited domains (e.g. knowledge graph representati… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    ACM Class: I.5.1; I.2.6; I.2.7

  39. arXiv:2401.11624  [pdf, other

    cs.CL cs.AI cs.IR

    In-context Learning with Retrieved Demonstrations for Language Models: A Survey

    Authors: Man Luo, Xin Xu, Yue Liu, Panupong Pasupat, Mehran Kazemi

    Abstract: Language models, especially pre-trained large language models, have showcased remarkable abilities as few-shot in-context learners (ICL), adept at adapting to new tasks with just a few demonstrations in the input context. However, the model's ability to perform ICL is sensitive to the choice of the few-shot demonstrations. Instead of using a fixed set of demonstrations, one recent development is t… ▽ More

    Submitted 23 March, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

  40. arXiv:2401.11284  [pdf, other

    cs.CV cs.AI cs.NE

    Evaluating Driver Readiness in Conditionally Automated Vehicles from Eye-Tracking Data and Head Pose

    Authors: Mostafa Kazemi, Mahdi Rezaei, Mohsen Azarmi

    Abstract: As automated driving technology advances, the role of the driver to resume control of the vehicle in conditionally automated vehicles becomes increasingly critical. In the SAE Level 3 or partly automated vehicles, the driver needs to be available and ready to intervene when necessary. This makes it essential to evaluate their readiness accurately. This article presents a comprehensive analysis of… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

  41. arXiv:2312.14075  [pdf, other

    cs.CC quant-ph

    Quantum Complexity vs Classical Complexity: A Survey

    Authors: Arash Vaezi, Ali Movaghar, Mohammad Ghodsi, Seyed Mohammad Hussein Kazemi, Negin Bagheri Noghrehy, Seyed Mohsen Kazemi

    Abstract: Scientists have demonstrated that quantum computing has presented novel approaches to address computational challenges, each varying in complexity. Adapting problem-solving strategies is crucial to harness the full potential of quantum computing. Nonetheless, there are defined boundaries to the capabilities of quantum computing. This paper concentrates on aggregating prior research efforts dedicat… ▽ More

    Submitted 11 September, 2024; v1 submitted 16 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: text overlap with arXiv:1612.07366 by other authors

  42. arXiv:2312.12241  [pdf, other

    cs.CV cs.CL

    GeomVerse: A Systematic Evaluation of Large Models for Geometric Reasoning

    Authors: Mehran Kazemi, Hamidreza Alvari, Ankit Anand, Jialin Wu, Xi Chen, Radu Soricut

    Abstract: Large language models have shown impressive results for multi-hop mathematical reasoning when the input question is only textual. Many mathematical reasoning problems, however, contain both text and image. With the ever-increasing adoption of vision language models (VLMs), understanding their reasoning abilities for such problems is crucial. In this paper, we evaluate the reasoning capabilities of… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  43. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1326 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 9 May, 2025; v1 submitted 18 December, 2023; originally announced December 2023.

  44. arXiv:2312.09938  [pdf, other

    cs.LG cs.AI cs.MA

    Assume-Guarantee Reinforcement Learning

    Authors: Milad Kazemi, Mateo Perez, Fabio Somenzi, Sadegh Soudjani, Ashutosh Trivedi, Alvaro Velasquez

    Abstract: We present a modular approach to \emph{reinforcement learning} (RL) in environments consisting of simpler components evolving in parallel. A monolithic view of such modular environments may be prohibitively large to learn, or may require unrealizable communication between the components in the form of a centralized controller. Our proposed approach is based on the assume-guarantee paradigm where t… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: This is the extended version of the paper accepted in the SRRAI Special Track at the Conference on Artificial Intelligence (AAAI-24)

  45. arXiv:2308.15299  [pdf, other

    cs.CL

    TaskLAMA: Probing the Complex Task Understanding of Language Models

    Authors: Quan Yuan, Mehran Kazemi, Xin Xu, Isaac Noble, Vaiva Imbrasaite, Deepak Ramachandran

    Abstract: Structured Complex Task Decomposition (SCTD) is the problem of breaking down a complex real-world task (such as planning a wedding) into a directed acyclic graph over individual steps that contribute to achieving the task, with edges specifying temporal dependencies between them. SCTD is an important component of assistive planning tools, and a challenge for commonsense reasoning systems. We probe… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  46. arXiv:2308.10737  [pdf, other

    cs.LG

    UGSL: A Unified Framework for Benchmarking Graph Structure Learning

    Authors: Bahare Fatemi, Sami Abu-El-Haija, Anton Tsitsulin, Mehran Kazemi, Dustin Zelle, Neslihan Bulut, Jonathan Halcrow, Bryan Perozzi

    Abstract: Graph neural networks (GNNs) demonstrate outstanding performance in a broad range of applications. While the majority of GNN applications assume that a graph structure is given, some recent methods substantially expanded the applicability of GNNs by showing that they may be effective even when no graph structure is explicitly provided. The GNN parameters and a graph structure are jointly learned.… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

  47. arXiv:2308.06138  [pdf

    cs.LG cs.AI stat.ML

    Application of Artificial Neural Networks for Investigation of Pressure Filtration Performance, a Zinc Leaching Filter Cake Moisture Modeling

    Authors: Masoume Kazemi, Davood Moradkhani, Alireza A. Alipour

    Abstract: Machine Learning (ML) is a powerful tool for material science applications. Artificial Neural Network (ANN) is a machine learning technique that can provide high prediction accuracy. This study aimed to develop an ANN model to predict the cake moisture of the pressure filtration process of zinc production. The cake moisture was influenced by seven parameters: temperature (35 and 65 Celsius), solid… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

  48. arXiv:2307.14199  [pdf

    cs.LG

    Application of Random Forest and Support Vector Machine for Investigation of Pressure Filtration Performance, a Zinc Plant Filter Cake Modeling

    Authors: Masoume Kazemi, Davood Moradkhani, Alireza Abbas Alipour

    Abstract: The hydrometallurgical method of zinc production involves leaching zinc from ore and then separating the solid residue from the liquid solution by pressure filtration. This separation process is very important since the solid residue contains some moisture that can reduce the amount of zinc recovered. This study modeled the pressure filtration process through Random Forest (RF) and Support Vector… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

  49. arXiv:2307.13848  [pdf, other

    cs.CR

    TeleBTC: Trustless Wrapped Bitcoin

    Authors: Mahyar Daneshpajooh, Niusha Moshrefi, Mahdi Darabi, Sina Hashemi, Mehrafarin Kazemi

    Abstract: This paper introduces TeleBTC, a fully decentralized protocol designed to wrap Bitcoin (BTC) on programmable blockchains. The creation of a decentralized wrapped BTC presents challenges due to the non-programmable nature of Bitcoin, making it difficult to custody BTCs in a decentralized way. Existing solutions have addressed this challenge by introducing an external layer of validators who take cu… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

  50. arXiv:2307.07310  [pdf, ps, other

    cs.IT

    Unsourced Random Access Using Multiple Stages of Orthogonal Pilots: MIMO and Single-Antenna Structures

    Authors: Mohammad Javad Ahmadi, Mohammad Kazemi, Tolga M. Duman

    Abstract: We study the problem of unsourced random access (URA) over Rayleigh block-fading channels with a receiver equipped with multiple antennas. We propose a slotted structure with multiple stages of orthogonal pilots, each of which is randomly picked from a codebook. In the proposed signaling structure, each user encodes its message using a polar code and appends it to the selected pilot sequences to c… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.