Showing 1–50 of 118 results for author: Hashemi, M

Searching in archive cs.

  1. arXiv:2507.02921  [pdf, ps, other]

    cs.LG cs.AI

    PlaceFM: A Training-free Geospatial Foundation Model of Places

    Authors: Mohammad Hashemi, Hossein Amiri, Andreas Zufle

    Abstract: Spatial structure is central to effective geospatial intelligence systems. While foundation models have shown promise, they often lack the flexibility to reason about places, which are context-rich regions spanning different spatial granularities. We propose PlaceFM, a spatial foundation model that captures place representations using a training-free graph condensation method. PlaceFM condenses a…

    Submitted 25 June, 2025; originally announced July 2025.

  2. arXiv:2507.00920  [pdf, ps, other]

    cs.LG eess.SP

    Privacy-Preserving Quantized Federated Learning with Diverse Precision

    Authors: Dang Qua Nguyen, Morteza Hashemi, Erik Perrins, Sergiy A. Vorobyov, David J. Love, Taejoon Kim

    Abstract: Federated learning (FL) has emerged as a promising paradigm for distributed machine learning, enabling collaborative training of a global model across multiple local devices without requiring them to share raw data. Despite its advancements, FL is limited by factors such as: (i) privacy risks arising from the unprotected transmission of local model updates to the fusion center (FC) and (ii) decrea…

    Submitted 2 July, 2025; v1 submitted 1 July, 2025; originally announced July 2025.

  3. arXiv:2506.14570  [pdf, ps, other]

    cs.AI

    From Points to Places: Towards Human Mobility-Driven Spatiotemporal Foundation Models via Understanding Places

    Authors: Mohammad Hashemi, Andreas Zufle

    Abstract: Capturing human mobility is essential for modeling how people interact with and move through physical spaces, reflecting social behavior, access to resources, and dynamic spatial patterns. To support scalable and transferable analysis across diverse geographies and contexts, there is a need for a generalizable foundation model for spatiotemporal data. While foundation models have transformed langu…

    Submitted 17 June, 2025; originally announced June 2025.

  4. arXiv:2505.16293  [pdf, ps, other]

    cs.CL

    Augmenting LLM Reasoning with Dynamic Notes Writing for Complex QA

    Authors: Rishabh Maheshwary, Masoud Hashemi, Khyati Mahajan, Shiva Krishna Reddy Malay, Sai Rajeswar, Sathwik Tejaswi Madhusudhan, Spandana Gella, Vikas Yadav

    Abstract: Iterative RAG for multi-hop question answering faces challenges with lengthy contexts and the buildup of irrelevant information. This hinders a model's capacity to process and reason over retrieved content and limits performance. While recent methods focus on compressing retrieved information, they are either restricted to single-round RAG, require finetuning or lack scalability in iterative RAG.…

    Submitted 22 May, 2025; originally announced May 2025.

  5. arXiv:2505.13337  [pdf, ps, other]

    cs.IT cs.ET cs.MM eess.SY

    Neural-Enhanced Rate Adaptation and Computation Distribution for Emerging mmWave Multi-User 3D Video Streaming Systems

    Authors: Babak Badnava, Jacob Chakareski, Morteza Hashemi

    Abstract: We investigate multitask edge-user communication-computation resource allocation for $360^\circ$ video streaming in an edge-computing enabled millimeter wave (mmWave) multi-user virtual reality system. To balance the communication-computation trade-offs that arise herein, we formulate a video quality maximization problem that integrates interdependent multitask/multi-user action spaces and rebuffe…

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: Accepted for publication in IEEE Transactions on Multimedia

  6. arXiv:2505.13331  [pdf, ps, other]

    cs.NI cs.ET cs.MM eess.SY

    Learning Driven Elastic Task Multi-Connectivity Immersive Computing Systems

    Authors: Babak Badnava, Jacob Chakareski, Morteza Hashemi

    Abstract: In virtual reality (VR) environments, computational tasks exhibit an elastic nature, meaning they can dynamically adjust based on various user and system constraints. This elasticity is essential for maintaining immersive experiences; however, it also introduces challenges for communication and computing in VR systems. In this paper, we investigate elastic task offloading for multi-user edge-compu…

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: Under review by IEEE Transactions on Mobile Computing

  7. arXiv:2504.14690  [pdf]

    cs.CL cs.AI

    FarsEval-PKBETS: A new diverse benchmark for evaluating Persian large language models

    Authors: Mehrnoush Shamsfard, Zahra Saaberi, Mostafa Karimi manesh, Seyed Mohammad Hossein Hashemi, Zahra Vatankhah, Motahareh Ramezani, Niki Pourazin, Tara Zare, Maryam Azimi, Sarina Chitsaz, Sama Khoraminejad, Morteza Mahdavi Mortazavi, Mohammad Mahdi Chizari, Sahar Maleki, Seyed Soroush Majd, Mostafa Masumi, Sayed Ali Musavi Khoeini, Amir Mohseni, Sogol Alipour

    Abstract: Research on evaluating and analyzing large language models (LLMs) has been extensive for resource-rich languages such as English, yet their performance in languages such as Persian has received considerably less attention. This paper introduces the FarsEval-PKBETS benchmark, a subset of the FarsEval project for evaluating large language models in Persian. This benchmark consists of 4000 questions and answ…

    Submitted 20 April, 2025; originally announced April 2025.

    Comments: 24 pages, 3 figures, 3 tables

    MSC Class: 68T50 ACM Class: I.2.7; E.0

  8. arXiv:2503.15793  [pdf, other]

    cs.LG

    DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs

    Authors: Masoud Hashemi, Oluwanifemi Bamgbose, Sathwik Tejaswi Madhusudhan, Jishnu Sethumadhavan Nair, Aman Tiwari, Vikas Yadav

    Abstract: Test-time scaling has significantly improved large language model performance, enabling deeper reasoning to solve complex problems. However, this increased reasoning capability also leads to excessive token generation and unnecessary problem-solving attempts. We introduce Don't Reason Bench (DNR Bench), a new benchmark designed to evaluate LLMs' ability to robustly understand the tricky reasoning tr…

    Submitted 17 April, 2025; v1 submitted 19 March, 2025; originally announced March 2025.

  9. arXiv:2503.15669  [pdf, other]

    cs.SE

    ECO: An LLM-Driven Efficient Code Optimizer for Warehouse Scale Computers

    Authors: Hannah Lin, Martin Maas, Maximilian Roquemore, Arman Hasanzadeh, Fred Lewis, Yusuf Simonson, Tzu-Wei Yang, Amir Yazdanbakhsh, Deniz Altinbüken, Florin Papa, Maggie Nolan Edmonds, Aditya Patil, Don Schwarz, Satish Chandra, Chris Kennelly, Milad Hashemi, Parthasarathy Ranganathan

    Abstract: With the end of Moore's Law, optimizing code for performance has become paramount for meeting ever-increasing compute demands, particularly in hyperscale data centers where even small efficiency gains translate to significant resource and energy savings. Traditionally, this process requires significant programmer effort to identify optimization opportunities, modify the code to implement the optim…

    Submitted 19 March, 2025; originally announced March 2025.

  10. arXiv:2503.05551  [pdf, other]

    cs.LG

    Revitalizing Saturated Benchmarks: A Weighted Metric Approach for Differentiating Large Language Model Performance

    Authors: Bryan Etzine, Masoud Hashemi, Nishanth Madhusudhan, Sagar Davasam, Roshnee Sharma, Sathwik Tejaswi Madhusudhan, Vikas Yadav

    Abstract: Existing benchmarks are becoming saturated and struggle to separate model performances due to factors like data contamination and advancing LLM capabilities. This paper introduces EMDM (Enhanced Model Differentiation Metric), a novel weighted metric that revitalizes benchmarks by enhancing model separation. EMDM integrates final answer and Chain-of-Thought (CoT) reasoning correctness, assigning we…

    Submitted 7 March, 2025; originally announced March 2025.

    Comments: conference NAACL, TrustNLP Workshop

    Journal ref: TrustNLP workshop NAACL, 2025

  11. arXiv:2503.02967  [pdf]

    cs.CV cs.LG

    Revolutionizing Traffic Management with AI-Powered Machine Vision: A Step Toward Smart Cities

    Authors: Seyed Hossein Hosseini DolatAbadi, Sayyed Mohammad Hossein Hashemi, Mohammad Hosseini, Moein-Aldin AliHosseini

    Abstract: The rapid urbanization of cities and increasing vehicular congestion have posed significant challenges to traffic management and safety. This study explores the transformative potential of artificial intelligence (AI) and machine vision technologies in revolutionizing traffic systems. By leveraging advanced surveillance cameras and deep learning algorithms, this research proposes a system for real…

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: 6 pages, 1 figure, 2 tables, accepted to the 1st AITC conference at the University of Isfahan

  12. arXiv:2502.17614  [pdf, other]

    cs.LG cs.SI

    Scalable Graph Condensation with Evolving Capabilities

    Authors: Shengbo Gong, Mohammad Hashemi, Juntong Ni, Carl Yang, Wei Jin

    Abstract: Graph data has become a pivotal modality due to its unique ability to model relational datasets. However, real-world graph data continues to grow exponentially, resulting in a quadratic increase in the complexity of most graph algorithms as graph sizes expand. Although graph condensation (GC) methods have been proposed to address these scalability issues, existing approaches often treat the traini…

    Submitted 24 February, 2025; originally announced February 2025.

    Comments: 16 pages, 6 figures

  13. arXiv:2501.09822  [pdf, other]

    cs.LG cs.NI

    pFedWN: A Personalized Federated Learning Framework for D2D Wireless Networks with Heterogeneous Data

    Authors: Zhou Ni, Masoud Ghazikor, Morteza Hashemi

    Abstract: Traditional Federated Learning (FL) approaches often struggle with data heterogeneity across clients, leading to suboptimal model performance for individual clients. To address this issue, Personalized Federated Learning (PFL) emerges as a solution to the challenges posed by non-independent and identically distributed (non-IID) and unbalanced data across clients. Furthermore, in most existing dece…

    Submitted 16 January, 2025; originally announced January 2025.

    Comments: 16 pages, 9 figures, 3 tables, submitted to Transactions on Networking

  14. arXiv:2412.14008  [pdf]

    cs.CL

    FarExStance: Explainable Stance Detection for Farsi

    Authors: Majid Zarharan, Maryam Hashemi, Malika Behroozrazegh, Sauleh Eetemadi, Mohammad Taher Pilehvar, Jennifer Foster

    Abstract: We introduce FarExStance, a new dataset for explainable stance detection in Farsi. Each instance in this dataset contains a claim, the stance of an article or social media post towards that claim, and an extractive explanation which provides evidence for the stance label. We compare the performance of a fine-tuned multilingual RoBERTa model to several large language models in zero-shot, few-shot,…

    Submitted 18 December, 2024; originally announced December 2024.

    Comments: Accepted in COLING 2025

  15. arXiv:2412.12612  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Auto-Cypher: Improving LLMs on Cypher generation via LLM-supervised generation-verification framework

    Authors: Aman Tiwari, Shiva Krishna Reddy Malay, Vikas Yadav, Masoud Hashemi, Sathwik Tejaswi Madhusudhan

    Abstract: Graph databases like Neo4j are gaining popularity over traditional relational databases for handling complex, interconnected data and for modeling and querying relationships. While translating natural language into SQL queries is well-researched, generating Cypher queries for Neo4j remains relatively underexplored. In this work, we present an automated, LLM-supervised pipeline to generate high-qualit…

    Submitted 24 January, 2025; v1 submitted 17 December, 2024; originally announced December 2024.

    Comments: Accepted at NAACL 2025 main conference

  16. arXiv:2412.08294  [pdf, other]

    cs.DC

    EaCO: Resource Sharing Dynamics and Its Impact on Energy Efficiency for DNN Training

    Authors: Kawsar Haghshenas, Mona Hashemi

    Abstract: Deep Learning Training (DLT) is a growing workload in shared GPU/CPU clusters due to its high computational cost and increasing number of jobs. This contributes to significant energy consumption in GPU clusters, further exacerbated by GPU under-utilization, as shown in production cluster logs. Addressing this challenge requires workload scheduling and resource allocation policies for efficient GPU…

    Submitted 11 December, 2024; originally announced December 2024.

  17. arXiv:2412.06032  [pdf]

    cs.CY

    Optimizing Location Allocation in Urban Management: A Brief Review

    Authors: Aref Ayati, Mohammad Mahdi Hashemi, Mohsen Saffar, Hamid Reza Naji

    Abstract: Regarding the concepts of urban management, digital transformation, and smart cities, various issues are presented. Here, we focus on location allocation problems, which can be a new part of digital transformation in urban management (such as locating and placing facilities, locating and arranging centers such as aid and rescue centers, or even postal hubs, telecommunications, electron…

    Submitted 8 December, 2024; originally announced December 2024.

    Comments: 7 pages, Keywords: Digital Transformation - Smart City - GIS - Location Allocation - Urban Management - Optimization

  18. Prompting with Phonemes: Enhancing LLMs' Multilinguality for Non-Latin Script Languages

    Authors: Hoang H Nguyen, Khyati Mahajan, Vikas Yadav, Julian Salazar, Philip S. Yu, Masoud Hashemi, Rishabh Maheshwary

    Abstract: Although multilingual LLMs have achieved remarkable performance across benchmarks, we find they continue to underperform on non-Latin script languages across contemporary LLM families. This discrepancy arises from the fact that LLMs are pretrained with orthographic scripts, which are dominated by Latin characters that obscure their shared phonology with non-Latin scripts. We propose leveraging pho…

    Submitted 26 June, 2025; v1 submitted 4 November, 2024; originally announced November 2024.

    Comments: Accepted to NAACL 2025 (Main Conference). This version contains minor improvements to the camera-ready

    Journal ref: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)

  19. arXiv:2408.03446  [pdf, other]

    cs.NI eess.SP

    Optimizing NOMA Transmissions to Advance Federated Learning in Vehicular Networks

    Authors: Ziru Chen, Zhou Ni, Peiyuan Guan, Lu Wang, Lin X. Cai, Morteza Hashemi, Zongzhi Li

    Abstract: Diverse critical data, such as location information and driving patterns, can be collected by IoT devices in vehicular networks to improve driving experiences and road safety. However, drivers are often reluctant to share their data due to privacy concerns. The Federated Vehicular Network (FVN) is a promising technology that tackles these concerns by transmitting model parameters instead of raw da…

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: The paper is accepted by IEEE Globecom 2024

  20. arXiv:2408.01885  [pdf, other]

    cs.IT

    Channel-Aware Distributed Transmission Control and Video Streaming in UAV Networks

    Authors: Masoud Ghazikor, Keenan Roach, Kenny Cheung, Morteza Hashemi

    Abstract: In this paper, we study the problem of distributed transmission control and video streaming optimization for unmanned aerial vehicles (UAVs) operating in unlicensed spectrum bands. We develop a rigorous cross-layer analysis framework that jointly considers three inter-dependent factors: (i) in-band interference introduced by ground/aerial nodes at the physical (PHY) layer, (ii) limited-size queues…

    Submitted 3 August, 2024; originally announced August 2024.

    Comments: Submitted to IEEE Transactions on Communications

  21. arXiv:2407.21788  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Vision-Language Model Based Handwriting Verification

    Authors: Mihir Chauhan, Abhishek Satbhai, Mohammad Abuzar Hashemi, Mir Basheer Ali, Bina Ramamurthy, Mingchen Gao, Siwei Lyu, Sargur Srihari

    Abstract: Handwriting Verification is critical in document forensics. Deep learning based approaches often face skepticism from forensic document examiners due to their lack of explainability and reliance on extensive training data and handcrafted features. This paper explores using Vision Language Models (VLMs), such as OpenAI's GPT-4o and Google's PaliGemma, to address these challenges. By leveraging th…

    Submitted 31 July, 2024; originally announced July 2024.

    Comments: 4 Pages, 1 Figure, 1 Table, Accepted as Short paper at Irish Machine Vision and Image Processing (IMVIP) Conference

  22. arXiv:2407.16221  [pdf, other]

    cs.CL

    Do LLMs Know When to NOT Answer? Investigating Abstention Abilities of Large Language Models

    Authors: Nishanth Madhusudhan, Sathwik Tejaswi Madhusudhan, Vikas Yadav, Masoud Hashemi

    Abstract: Abstention Ability (AA) is a critical aspect of Large Language Model (LLM) reliability, referring to an LLM's capability to withhold responses when uncertain or lacking a definitive answer, without compromising performance. Although previous studies have attempted to improve AA, they lack a standardised evaluation method and remain unsuitable for black-box models where token prediction probabiliti…

    Submitted 24 September, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

    Comments: 8 pages (excluding limitations, references and appendix) and 5 figures

  23. arXiv:2407.03426  [pdf, other]

    cs.NI cs.LG cs.MM

    Multi-Task Decision-Making for Multi-User 360 Video Processing over Wireless Networks

    Authors: Babak Badnava, Jacob Chakareski, Morteza Hashemi

    Abstract: We study a multi-task decision-making problem for 360 video processing in a wireless multi-user virtual reality (VR) system that includes an edge computing unit (ECU) to deliver 360 videos to VR users and offer computing assistance for decoding/rendering of video frames. However, this comes at the expense of increased data volume and required bandwidth. To balance this trade-off, we formulate a co…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 2024 IEEE International Conference on Multimedia Information Processing and Retrieval (MIPR)

  24. arXiv:2406.01727  [pdf, other]

    cs.LG cs.MA eess.SP

    Federated Learning-based Collaborative Wideband Spectrum Sensing and Scheduling for UAVs in UTM Systems

    Authors: Sravan Reddy Chintareddy, Keenan Roach, Kenny Cheung, Morteza Hashemi

    Abstract: In this paper, we propose a data-driven framework for collaborative wideband spectrum sensing and scheduling for networked unmanned aerial vehicles (UAVs), which act as the secondary users (SUs) to opportunistically utilize detected "spectrum holes". Our overall framework consists of three main stages. Firstly, in the model training stage, we explore dataset generation in a multi-cell environment…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: This is a preprint version submitted to IEEE Transactions on Machine Learning in Communications and Networking. arXiv admin note: text overlap with arXiv:2308.05036

  25. arXiv:2405.18320  [pdf, other]

    cs.CV cs.AI cs.CL

    Self-Supervised Learning Based Handwriting Verification

    Authors: Mihir Chauhan, Mohammad Abuzar Hashemi, Abhishek Satbhai, Mir Basheer Ali, Bina Ramamurthy, Mingchen Gao, Siwei Lyu, Sargur Srihari

    Abstract: We present SSL-HV: Self-Supervised Learning approaches applied to the task of Handwriting Verification. This task involves determining whether a given pair of handwritten images originate from the same or different writer distribution. We have compared the performance of multiple generative, contrastive SSL approaches against handcrafted feature extractors and supervised learning on CEDAR AND data…

    Submitted 1 August, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: 8 pages, 2 figures, 2 tables, Accepted at Irish Machine Vision and Image Processing Conference 2024

  26. arXiv:2404.12520  [pdf, other]

    cs.AI

    Centralized vs. Decentralized Multi-Agent Reinforcement Learning for Enhanced Control of Electric Vehicle Charging Networks

    Authors: Amin Shojaeighadikolaei, Zsolt Talata, Morteza Hashemi

    Abstract: The widespread adoption of electric vehicles (EVs) poses several challenges to power distribution networks and smart grid infrastructure due to the possibility of significantly increasing electricity demands, especially during peak hours. Furthermore, when EVs participate in demand-side management programs, charging expenses can be reduced by using optimal charging control policies that fully util…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 12 pages, 9 figures

    MSC Class: 68T07

  27. arXiv:2403.08948  [pdf, ps, other]

    eess.SY cs.GT

    Model-free Resilient Controller Design based on Incentive Feedback Stackelberg Game and Q-learning

    Authors: Jiajun Shen, Fengjun Li, Morteza Hashemi, Huazhen Fang

    Abstract: In the swift evolution of Cyber-Physical Systems (CPSs) within intelligent environments, especially in the industrial domain shaped by Industry 4.0, the surge in development brings forth unprecedented security challenges. This paper explores the intricate security issues of Industrial CPSs (ICPSs), with a specific focus on the unique threats presented by intelligent attackers capable of directly c…

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 8 pages

  28. arXiv:2402.03358  [pdf, other]

    cs.SI cs.AI cs.DS cs.LG

    A Comprehensive Survey on Graph Reduction: Sparsification, Coarsening, and Condensation

    Authors: Mohammad Hashemi, Shengbo Gong, Juntong Ni, Wenqi Fan, B. Aditya Prakash, Wei Jin

    Abstract: Many real-world datasets can be naturally represented as graphs, spanning a wide range of domains. However, the increasing complexity and size of graph datasets present significant challenges for analysis and computation. In response, graph reduction, or graph summarization, has gained prominence for simplifying large graphs while preserving essential properties. In this survey, we aim to provide…

    Submitted 29 June, 2024; v1 submitted 28 January, 2024; originally announced February 2024.

    Comments: Accepted by IJCAI 2024 (This ArXiv version is a long version of our IJCAI paper)

  29. arXiv:2402.01064  [pdf, other]

    cs.IT eess.IV

    Semantic-Aware and Goal-Oriented Communications for Object Detection in Wireless End-to-End Image Transmission

    Authors: Fatemeh Zahra Safaeipour, Morteza Hashemi

    Abstract: Semantic communication is focused on optimizing the exchange of information by transmitting only the most relevant data required to convey the intended message to the receiver and achieve the desired communication goal. For example, if we consider images as the information and the goal of the communication is object detection at the receiver side, the semantics of the information would be the objects i…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: International Conference on Computing, Networking and Communications (ICNC 2024)

  30. arXiv:2401.11084  [pdf, other]

    cs.IT

    Interference-Aware Queuing Analysis for Distributed Transmission Control in UAV Networks

    Authors: Masoud Ghazikor, Keenan Roach, Kenny Cheung, Morteza Hashemi

    Abstract: In this paper, we investigate the problem of distributed transmission control for unmanned aerial vehicles (UAVs) operating in unlicensed spectrum bands. We develop a rigorous interference-aware queuing analysis framework that jointly considers two inter-dependent factors: (i) limited-size queues with delay-constrained packet arrival, and (ii) in-band interference introduced by other ground/aerial…

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: IEEE International Conference on Communications (ICC)

  31. arXiv:2401.03302  [pdf, ps, other]

    eess.IV cs.AI cs.CV cs.LG stat.ML

    Realism in Action: Anomaly-Aware Diagnosis of Brain Tumors from Medical Images Using YOLOv8 and DeiT

    Authors: Seyed Mohammad Hossein Hashemi, Leila Safari, Mohsen Hooshmand, Amirhossein Dadashzadeh Taromi

    Abstract: Reliable diagnosis of brain tumors remains challenging due to low clinical incidence rates of such cases. However, this low rate is neglected in most of the proposed methods. We propose a clinically inspired framework for anomaly-resilient tumor detection and classification. Detection leverages YOLOv8n fine-tuned on a realistically imbalanced dataset (1:9 tumor-to-normal ratio; 30,000 MRI slices from…

    Submitted 1 July, 2025; v1 submitted 6 January, 2024; originally announced January 2024.

  32. arXiv:2312.03133  [pdf, other]

    eess.IV cs.CV physics.med-ph

    Predicting Bone Degradation Using Vision Transformer and Synthetic Cellular Microstructures Dataset

    Authors: Mohammad Saber Hashemi, Azadeh Sheidaei

    Abstract: Bone degradation, especially for astronauts in microgravity conditions, is crucial for space exploration missions since the lower applied external forces accelerate the diminution in bone stiffness and strength substantially. Although existing computational models help us understand this phenomenon and possibly restrict its effect in the future, they are time-consuming to simulate the changes in t…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 8 pages, 5 figures

  33. arXiv:2310.19069  [pdf, other]

    cs.LG cs.DC

    Efficient Cluster Selection for Personalized Federated Learning: A Multi-Armed Bandit Approach

    Authors: Zhou Ni, Morteza Hashemi

    Abstract: Federated learning (FL) offers a decentralized training approach for machine learning models, prioritizing data privacy. However, the inherent heterogeneity in FL networks, arising from variations in data distribution, size, and device capabilities, poses challenges in user federation. Recognizing this, Personalized Federated Learning (PFL) emphasizes tailoring learning processes to individual dat…

    Submitted 29 October, 2023; originally announced October 2023.

  34. arXiv:2310.15325  [pdf, other]

    cs.CV cs.CL cs.LG

    LXMERT Model Compression for Visual Question Answering

    Authors: Maryam Hashemi, Ghazaleh Mahmoudi, Sara Kodeiri, Hadi Sheikhi, Sauleh Eetemadi

    Abstract: Large-scale pretrained models such as LXMERT are becoming popular for learning cross-modal representations on text-image pairs for vision-language tasks. According to the lottery ticket hypothesis, NLP and computer vision models contain smaller subnetworks capable of being trained in isolation to full performance. In this paper, we combine these observations to evaluate whether such trainable subn…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: To appear in The Fourth Annual West Coast NLP (WeCNLP) Summit

  35. arXiv:2310.02405  [pdf, other]

    cs.LG cs.AI

    PCGPT: Procedural Content Generation via Transformers

    Authors: Sajad Mohaghegh, Mohammad Amin Ramezan Dehnavi, Golnoosh Abdollahinejad, Matin Hashemi

    Abstract: The paper presents the PCGPT framework, an innovative approach to procedural content generation (PCG) using offline reinforcement learning and transformer networks. PCGPT utilizes an autoregressive model based on transformers to generate game levels iteratively, addressing the challenges of traditional PCG methods such as repetitive, predictable, or inconsistent content. The framework models traje…

    Submitted 3 October, 2023; originally announced October 2023.

  36. arXiv:2308.12921  [pdf, other]

    cs.MA cs.LG eess.SY math.OC

    An Efficient Distributed Multi-Agent Reinforcement Learning for EV Charging Network Control

    Authors: Amin Shojaeighadikolaei, Morteza Hashemi

    Abstract: The increasing trend in adopting electric vehicles (EVs) will significantly impact the residential electricity demand, which results in an increased risk of transformer overload in the distribution grid. To mitigate such risks, there is an urgent need to develop effective EV charging controllers. Currently, the majority of the EV charge controllers are based on a centralized approach for managing i…

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: 8 pages, 4 figures, accepted at Allerton 2023

  37. arXiv:2308.06647  [pdf, other]

    cs.NI cs.DC cs.IT

    Energy-Efficient Deadline-Aware Edge Computing: Bandit Learning with Partial Observations in Multi-Channel Systems

    Authors: Babak Badnava, Keenan Roach, Kenny Cheung, Morteza Hashemi, Ness B Shroff

    Abstract: In this paper, we consider a task offloading problem in a multi-access edge computing (MEC) network, in which edge users can either use their local processing unit to compute their tasks or offload their tasks to a nearby edge server through multiple communication channels each with different characteristics. The main objective is to maximize the energy efficiency of the edge users while meeting c…

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: 2023 IEEE Global Communications Conference

  38. arXiv:2308.05187  [pdf, other]

    cs.IT eess.SP

    Exploring the Interplay of Interference and Queues in Unlicensed Spectrum Bands for UAV Networks

    Authors: Masoud Ghazikor, Keenan Roach, Kenny Cheung, Morteza Hashemi

    Abstract: In this paper, we present an analytical framework to explore the interplay of signal interference and transmission queue management, and their impacts on the performance of unmanned aerial vehicles (UAVs) when operating in the unlicensed spectrum bands. In particular, we develop a comprehensive framework to investigate the impact of other interference links on the UAV as it communicates with the g…

    Submitted 24 November, 2023; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: Asilomar Conference on Signals, Systems, and Computers

  39. arXiv:2308.05036  [pdf, other]

    eess.SP cs.LG cs.MA cs.NI

    Collaborative Wideband Spectrum Sensing and Scheduling for Networked UAVs in UTM Systems

    Authors: Sravan Reddy Chintareddy, Keenan Roach, Kenny Cheung, Morteza Hashemi

    Abstract: In this paper, we propose a data-driven framework for collaborative wideband spectrum sensing and scheduling for networked unmanned aerial vehicles (UAVs), which act as the secondary users to opportunistically utilize detected spectrum holes. To this end, we propose a multi-class classification problem for wideband spectrum sensing to detect vacant spectrum spots based on collected I/Q samples. To…

    Submitted 9 August, 2023; originally announced August 2023.

  40. arXiv:2306.02557  [pdf, other]

    stat.AP cs.AI

    Detecting individual-level infections using sparse group-testing through graph-coupled hidden Markov models

    Authors: Zahra Gholamalian, Zeinab Maleki, MasoudReza Hashemi, Pouria Ramazi

    Abstract: Identifying the infection status of each individual during infectious diseases informs public health management. However, performing frequent individual-level tests may not be feasible. Instead, sparse and sometimes group-level tests are performed. Determining the infection status of individuals using sparse group-level tests remains an open problem. We have tackled this problem by extending graph…

    Submitted 4 June, 2023; originally announced June 2023.

  41. arXiv:2302.14094  [pdf, other]

    eess.SY cs.AI cs.LG cs.MA

    Combating Uncertainties in Wind and Distributed PV Energy Sources Using Integrated Reinforcement Learning and Time-Series Forecasting

    Authors: Arman Ghasemi, Amin Shojaeighadikolaei, Morteza Hashemi

    Abstract: Renewable energy sources, such as wind and solar power, are increasingly being integrated into smart grid systems. However, when compared to traditional energy resources, the unpredictability of renewable energy generation poses significant challenges for both electricity providers and utility companies. Furthermore, the large-scale integration of distributed energy resources (such as PV systems)…

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: 10 pages, 12 figures

  42. arXiv:2302.07867  [pdf, other]

    cs.SE cs.AI cs.LG cs.PF

    Learning Performance-Improving Code Edits

    Authors: Alexander Shypula, Aman Madaan, Yimeng Zeng, Uri Alon, Jacob Gardner, Milad Hashemi, Graham Neubig, Parthasarathy Ranganathan, Osbert Bastani, Amir Yazdanbakhsh

    Abstract: With the decline of Moore's law, optimizing program performance has become a major focus of software research. However, high-level optimizations such as API and algorithm changes remain elusive due to the difficulty of understanding the semantics of code. Simultaneously, pretrained large language models (LLMs) have demonstrated strong capabilities at solving a wide range of programming tasks. To t…

    Submitted 26 April, 2024; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: Published as a conference paper at ICLR 2024 (Spotlight). Project website: https://pie4perf.com/

  43. arXiv:2302.03180  [pdf, other]

    cs.AI

    Understanding User Preferences in Explainable Artificial Intelligence: A Survey and a Mapping Function Proposal

    Authors: Maryam Hashemi, Ali Darejeh, Francisco Cruz

    Abstract: The increasing complexity of AI systems has led to the growth of the field of Explainable Artificial Intelligence (XAI), which aims to provide explanations and justifications for the outputs of AI algorithms. While there is considerable demand for XAI, there remains a scarcity of studies aimed at comprehensively understanding the practical distinctions among different methods and effectively align…

    Submitted 19 June, 2024; v1 submitted 6 February, 2023; originally announced February 2023.

  44. arXiv:2211.15858  [pdf, other]

    cs.MA cs.LG eess.SY

    Distributed Energy Management and Demand Response in Smart Grids: A Multi-Agent Deep Reinforcement Learning Framework

    Authors: Amin Shojaeighadikolaei, Arman Ghasemi, Kailani Jones, Yousif Dafalla, Alexandru G. Bardas, Reza Ahmadi, Morteza Haashemi

    Abstract: This paper presents a multi-agent Deep Reinforcement Learning (DRL) framework for autonomous control and integration of renewable energy resources into smart power grid systems. In particular, the proposed framework jointly considers demand response (DR) and distributed energy management (DEM) for residential end-users. DR has a widely recognized potential for improving power grid stability and re…

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: 16 pages, 10 figures

    MSC Class: 68T07; 68T42; 68T05

    ACM Class: I.2; I.6

  45. arXiv:2211.00692  [pdf, other]

    cs.LG

    Towards Better Out-of-Distribution Generalization of Neural Algorithmic Reasoning Tasks

    Authors: Sadegh Mahdavi, Kevin Swersky, Thomas Kipf, Milad Hashemi, Christos Thrampoulidis, Renjie Liao

    Abstract: In this paper, we study the OOD generalization of neural algorithmic reasoning tasks, where the goal is to learn an algorithm (e.g., sorting, breadth-first search, and depth-first search) from input-output pairs using deep neural networks. First, we argue that OOD generalization in this setting is significantly different than common OOD settings. For example, some phenomena in OOD generalization o…

    Submitted 18 March, 2023; v1 submitted 1 November, 2022; originally announced November 2022.

    Comments: Transactions on Machine Learning Research (TMLR), 2023

  46. Connective Reconstruction-based Novelty Detection

    Authors: Seyyed Morteza Hashemi, Parvaneh Aliniya, Parvin Razzaghi

    Abstract: Detection of out-of-distribution samples is one of the critical tasks for real-world applications of computer vision. The advancement of deep learning has enabled us to analyze real-world data which contain unexplained samples, accentuating the need to detect out-of-distribution instances more than before. GAN-based approaches have been widely used to address this problem due to their ability to p…

    Submitted 25 October, 2022; originally announced October 2022.

  47. arXiv:2210.06965  [pdf, other]

    cs.LG cs.CV

    CUF: Continuous Upsampling Filters

    Authors: Cristina Vasconcelos, Cengiz Oztireli, Mark Matthews, Milad Hashemi, Kevin Swersky, Andrea Tagliasacchi

    Abstract: Neural fields have rapidly been adopted for representing 3D signals, but their application to more classical 2D image-processing has been relatively limited. In this paper, we consider one of the most important operations in image processing: upsampling. In deep learning, learnable upsampling layers have extensively been used for single image super-resolution. We propose to parameterize upsampling…

    Submitted 20 October, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

  48. arXiv:2208.05297  [pdf, other]

    cs.SE cs.LG

    Learning to Improve Code Efficiency

    Authors: Binghong Chen, Daniel Tarlow, Kevin Swersky, Martin Maas, Pablo Heiber, Ashish Naik, Milad Hashemi, Parthasarathy Ranganathan

    Abstract: Improvements in the performance of computing systems, driven by Moore's Law, have transformed society. As such hardware-driven gains slow down, it becomes even more important for software developers to focus on performance and efficiency during development. While several studies have demonstrated the potential from such improved code efficiency (e.g., 2x better generational improvements compared t…

    Submitted 8 August, 2022; originally announced August 2022.

  49. arXiv:2208.04146  [pdf]

    cond-mat.mtrl-sci cs.LG

    Linking Properties to Microstructure in Liquid Metal Embedded Elastomers via Machine Learning

    Authors: Abhijith Thoopul Anantharanga, Mohammad Saber Hashemi, Azadeh Sheidaei

    Abstract: Liquid metals (LM) are embedded in an elastomer matrix to obtain soft composites with unique thermal, dielectric, and mechanical properties. They have applications in soft robotics, biomedical engineering, and wearable electronics. By linking the structure to the properties of these materials, it is possible to perform material design rationally. Liquid-metal embedded elastomers (LMEEs) have been…

    Submitted 24 July, 2022; originally announced August 2022.

    Comments: 25 pages, 9 figures, submitted to the journal of Composites Science and Technology

  50. arXiv:2208.03822  [pdf, other]

    cs.CR

    Garbled EDA: Privacy Preserving Electronic Design Automation

    Authors: Mohammad Hashemi, Steffi Roy, Fatemeh Ganji, Domenic Forte

    Abstract: The complexity of modern integrated circuits (ICs) necessitates collaboration between multiple distrusting parties, including third-party intellectual property (3PIP) vendors, design houses, CAD/EDA tool vendors, and foundries, which jeopardizes confidentiality and integrity of each party's IP. IP protection standards and the existing techniques proposed by researchers are ad hoc and vulnerable to…

    Submitted 7 August, 2022; originally announced August 2022.