Skip to main content

Showing 1–50 of 88 results for author: Hashemi, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.01995  [pdf, other

    cs.AI cs.LG

    Brains vs. Bytes: Evaluating LLM Proficiency in Olympiad Mathematics

    Authors: Hamed Mahdavi, Alireza Hashemi, Majid Daliri, Pegah Mohammadipour, Alireza Farhadi, Samira Malek, Yekta Yazdanifard, Amir Khasahmadi, Vasant Honavar

    Abstract: Recent advances in large language models (LLMs) have shown impressive progress in mathematical reasoning tasks. However, current evaluation benchmarks predominantly focus on the accuracy of final answers, often overlooking the crucial logical rigor for mathematical problem solving. The claim that state-of-the-art LLMs can solve Math Olympiad-level problems requires closer examination. To explore t… ▽ More

    Submitted 10 April, 2025; v1 submitted 31 March, 2025; originally announced April 2025.

  2. arXiv:2503.07464  [pdf, other

    cs.LG cs.CR

    Learning to Localize Leakage of Cryptographic Sensitive Variables

    Authors: Jimmy Gammell, Anand Raghunathan, Abolfazl Hashemi, Kaushik Roy

    Abstract: While cryptographic algorithms such as the ubiquitous Advanced Encryption Standard (AES) are secure, *physical implementations* of these algorithms in hardware inevitably 'leak' sensitive data such as cryptographic keys. A particularly insidious form of leakage arises from the fact that hardware consumes power and emits radiation in a manner that is statistically associated with the data it proces… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: 52 pages, 30 figures. Our code can be found at https://github.com/jimgammell/learning_to_localize_leakage

  3. arXiv:2503.02112  [pdf, other

    cs.LG astro-ph.IM

    Building Machine Learning Challenges for Anomaly Detection in Science

    Authors: Elizabeth G. Campolongo, Yuan-Tang Chou, Ekaterina Govorkova, Wahid Bhimji, Wei-Lun Chao, Chris Harris, Shih-Chieh Hsu, Hilmar Lapp, Mark S. Neubauer, Josephine Namayanja, Aneesh Subramanian, Philip Harris, Advaith Anand, David E. Carlyn, Subhankar Ghosh, Christopher Lawrence, Eric Moreno, Ryan Raikman, Jiaman Wu, Ziheng Zhang, Bayu Adhi, Mohammad Ahmadi Gharehtoragh, Saúl Alonso Monsalve, Marta Babicz, Furqan Baig , et al. (125 additional authors not shown)

    Abstract: Scientific discoveries are often made by finding a pattern or object that was not predicted by the known rules of science. Oftentimes, these anomalous events or objects that do not conform to the norms are an indication that the rules of science governing the data are incomplete, and something new needs to be present to explain these unexpected outliers. The challenge of finding anomalies can be c… ▽ More

    Submitted 29 March, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

    Comments: 17 pages 6 figures to be submitted to Nature Communications

  4. arXiv:2411.00143  [pdf, other

    eess.IV cs.LG

    Enhancing Brain Source Reconstruction through Physics-Informed 3D Neural Networks

    Authors: Marco Morik, Ali Hashemi, Klaus-Robert Müller, Stefan Haufe, Shinichi Nakajima

    Abstract: Reconstructing brain sources is a fundamental challenge in neuroscience, crucial for understanding brain function and dysfunction. Electroencephalography (EEG) signals have a high temporal resolution. However, identifying the correct spatial location of brain sources from these signals remains difficult due to the ill-posed structure of the problem. Traditional methods predominantly rely on manual… ▽ More

    Submitted 31 October, 2024; originally announced November 2024.

    Comments: Under Review in IEEE Transactions on Medical Imaging

  5. arXiv:2410.19207  [pdf, other

    cs.LG cs.AI eess.SP

    Equitable Federated Learning with Activation Clustering

    Authors: Antesh Upadhyay, Abolfazl Hashemi

    Abstract: Federated learning is a prominent distributed learning paradigm that incorporates collaboration among diverse clients, promotes data locality, and thus ensures privacy. These clients have their own technological, cultural, and other biases in the process of data generation. However, the present standard often ignores this bias/heterogeneity, perpetuating bias against certain groups rather than mit… ▽ More

    Submitted 1 November, 2024; v1 submitted 24 October, 2024; originally announced October 2024.

    Comments: 28 pages

  6. arXiv:2409.08917  [pdf, other

    cs.LG cs.AI stat.ML

    Latent Space Score-based Diffusion Model for Probabilistic Multivariate Time Series Imputation

    Authors: Guojun Liang, Najmeh Abiri, Atiye Sadat Hashemi, Jens Lundström, Stefan Byttner, Prayag Tiwari

    Abstract: Accurate imputation is essential for the reliability and success of downstream tasks. Recently, diffusion models have attracted great attention in this field. However, these models neglect the latent distribution in a lower-dimensional space derived from the observed data, which limits the generative capacity of the diffusion model. Additionally, dealing with the original missing data without labe… ▽ More

    Submitted 13 September, 2024; originally announced September 2024.

    Comments: 5 pages, conference

  7. arXiv:2408.13683  [pdf, ps, other

    cs.LG cs.AI eess.SP eess.SY

    Submodular Maximization Approaches for Equitable Client Selection in Federated Learning

    Authors: Andrés Catalino Castillo Jiménez, Ege C. Kaya, Lintao Ye, Abolfazl Hashemi

    Abstract: In a conventional Federated Learning framework, client selection for training typically involves the random sampling of a subset of clients in each iteration. However, this random selection often leads to disparate performance among clients, raising concerns regarding fairness, particularly in applications where equitable outcomes are crucial, such as in medical or financial machine learning tasks… ▽ More

    Submitted 27 August, 2024; v1 submitted 24 August, 2024; originally announced August 2024.

    Comments: 13 pages

  8. arXiv:2407.21788  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Vision-Language Model Based Handwriting Verification

    Authors: Mihir Chauhan, Abhishek Satbhai, Mohammad Abuzar Hashemi, Mir Basheer Ali, Bina Ramamurthy, Mingchen Gao, Siwei Lyu, Sargur Srihari

    Abstract: Handwriting Verification is a critical in document forensics. Deep learning based approaches often face skepticism from forensic document examiners due to their lack of explainability and reliance on extensive training data and handcrafted features. This paper explores using Vision Language Models (VLMs), such as OpenAI's GPT-4o and Google's PaliGemma, to address these challenges. By leveraging th… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

    Comments: 4 Pages, 1 Figure, 1 Table, Accepted as Short paper at Irish Machine Vision and Image Processing (IMVIP) Conference

  9. arXiv:2405.18320  [pdf, other

    cs.CV cs.AI cs.CL

    Self-Supervised Learning Based Handwriting Verification

    Authors: Mihir Chauhan, Mohammad Abuzar Hashemi, Abhishek Satbhai, Mir Basheer Ali, Bina Ramamurthy, Mingchen Gao, Siwei Lyu, Sargur Srihari

    Abstract: We present SSL-HV: Self-Supervised Learning approaches applied to the task of Handwriting Verification. This task involves determining whether a given pair of handwritten images originate from the same or different writer distribution. We have compared the performance of multiple generative, contrastive SSL approaches against handcrafted feature extractors and supervised learning on CEDAR AND data… ▽ More

    Submitted 1 August, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: 8 pages, 2 figures, 2 tables, Accepted at Irish Machine Vision and Image Processing Conference 2024

  10. arXiv:2405.18237  [pdf, other

    cs.LG math.ST stat.ML

    Unveiling the Cycloid Trajectory of EM Iterations in Mixed Linear Regression

    Authors: Zhankun Luo, Abolfazl Hashemi

    Abstract: We study the trajectory of iterations and the convergence rates of the Expectation-Maximization (EM) algorithm for two-component Mixed Linear Regression (2MLR). The fundamental goal of MLR is to learn the regression models from unlabeled observations. The EM algorithm finds extensive applications in solving the mixture of linear regressions. Recent results have established the super-linear converg… ▽ More

    Submitted 3 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: This paper was accepted by the 41st International Conference on Machine Learning (ICML 2024). The code for numerical experiments is available at https://github.com/dassein/cycloid_em_mlr

  11. arXiv:2405.02188  [pdf, other

    stat.ML cs.AI cs.LG

    Optimistic Regret Bounds for Online Learning in Adversarial Markov Decision Processes

    Authors: Sang Bin Moon, Abolfazl Hashemi

    Abstract: The Adversarial Markov Decision Process (AMDP) is a learning framework that deals with unknown and varying tasks in decision-making applications like robotics and recommendation systems. A major limitation of the AMDP formalism, however, is pessimistic regret analysis results in the sense that although the cost function can change from one episode to the next, the evolution in many settings is not… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  12. arXiv:2404.08003  [pdf, other

    cs.LG cs.DC cs.NI

    Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis

    Authors: Guangchen Lan, Dong-Jun Han, Abolfazl Hashemi, Vaneet Aggarwal, Christopher G. Brinton

    Abstract: To improve the efficiency of reinforcement learning (RL), we propose a novel asynchronous federated reinforcement learning (FedRL) framework termed AFedPG, which constructs a global model through collaboration among $N$ agents using policy gradient (PG) updates. To address the challenge of lagged policies in asynchronous settings, we design a delay-adaptive lookahead technique \textit{specifically… ▽ More

    Submitted 23 January, 2025; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: Published as a conference paper at ICLR 2025

    ACM Class: I.2.6; I.2.11

  13. arXiv:2404.05919  [pdf, other

    cs.LG

    AdaGossip: Adaptive Consensus Step-size for Decentralized Deep Learning with Communication Compression

    Authors: Sai Aparna Aketi, Abolfazl Hashemi, Kaushik Roy

    Abstract: Decentralized learning is crucial in supporting on-device learning over large distributed datasets, eliminating the need for a central server. However, the communication overhead remains a major bottleneck for the practical realization of such decentralized setups. To tackle this issue, several algorithms for decentralized training with compressed communication have been proposed in the literature… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 11 pages, 3 figures, 8 tables. arXiv admin note: text overlap with arXiv:2305.04792, arXiv:2310.15890

  14. arXiv:2404.03759  [pdf, other

    cs.LG eess.SP math.OC

    Localized Distributional Robustness in Submodular Multi-Task Subset Selection

    Authors: Ege C. Kaya, Abolfazl Hashemi

    Abstract: In this work, we approach the problem of multi-task submodular optimization with the perspective of local distributional robustness, within the neighborhood of a reference distribution which assigns an importance score to each task. We initially propose to introduce a regularization term which makes use of the relative entropy to the standard multi-task objective. We then demonstrate through duali… ▽ More

    Submitted 3 November, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: 41 pages, 7 figures. A preliminary version of this article was presented at the 2023 Allerton Conference on Communication, Control, and Computing. This version is to be published in IEEE Transactions on Signal Processing

  15. arXiv:2403.13247  [pdf, other

    cs.LG cs.DC

    FedNMUT -- Federated Noisy Model Update Tracking Convergence Analysis

    Authors: Vishnu Pandi Chellapandi, Antesh Upadhyay, Abolfazl Hashemi, Stanislaw H. Żak

    Abstract: A novel Decentralized Noisy Model Update Tracking Federated Learning algorithm (FedNMUT) is proposed that is tailored to function efficiently in the presence of noisy communication channels that reflect imperfect information exchange. This algorithm uses gradient tracking to minimize the impact of data heterogeneity while minimizing communication overhead. The proposed algorithm incorporates noise… ▽ More

    Submitted 24 March, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:2303.10695

  16. arXiv:2402.18726  [pdf, other

    cs.LG cs.AI cs.CR

    Unveiling Privacy, Memorization, and Input Curvature Links

    Authors: Deepak Ravikumar, Efstathia Soufleri, Abolfazl Hashemi, Kaushik Roy

    Abstract: Deep Neural Nets (DNNs) have become a pervasive tool for solving many emerging problems. However, they tend to overfit to and memorize the training set. Memorization is of keen interest since it is closely related to several concepts such as generalization, noisy learning, and privacy. To study memorization, Feldman (2019) proposed a formal score, however its computational requirements limit its p… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  17. arXiv:2401.05379  [pdf, other

    cs.CV

    AutoVisual Fusion Suite: A Comprehensive Evaluation of Image Segmentation and Voice Conversion Tools on HuggingFace Platform

    Authors: Amirreza Hashemi

    Abstract: This study presents a comprehensive evaluation of tools available on the HuggingFace platform for two pivotal applications in artificial intelligence: image segmentation and voice conversion. The primary objective was to identify the top three tools within each category and subsequently install and configure these tools on Linux systems. We leveraged the power of pre-trained segmentation models su… ▽ More

    Submitted 12 January, 2024; v1 submitted 17 December, 2023; originally announced January 2024.

    Comments: 27 pages, 21 figures

  18. arXiv:2310.05286  [pdf, other

    cs.LG cs.AI cs.HC

    Generalizable Error Modeling for Human Data Annotation: Evidence From an Industry-Scale Search Data Annotation Program

    Authors: Heinrich Peters, Alireza Hashemi, James Rae

    Abstract: Machine learning (ML) and artificial intelligence (AI) systems rely heavily on human-annotated data for training and evaluation. A major challenge in this context is the occurrence of annotation errors, as their effects can degrade model performance. This paper presents a predictive error model trained to detect potential errors in search relevance annotation tasks for three industry-scale ML appl… ▽ More

    Submitted 25 September, 2024; v1 submitted 8 October, 2023; originally announced October 2023.

  19. arXiv:2307.07406  [pdf, other

    cs.LG cs.IT eess.SP

    Improved Convergence Analysis and SNR Control Strategies for Federated Learning in the Presence of Noise

    Authors: Antesh Upadhyay, Abolfazl Hashemi

    Abstract: We propose an improved convergence analysis technique that characterizes the distributed learning paradigm of federated learning (FL) with imperfect/noisy uplink and downlink communications. Such imperfect communication scenarios arise in the practical deployment of FL in emerging communication systems and protocols. The analysis developed in this paper demonstrates, for the first time, that there… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

    Journal ref: in IEEE Access, vol. 11, pp. 63398-63416, 2023

  20. arXiv:2307.03298  [pdf

    eess.IV cs.LG physics.med-ph

    Application of Spherical Convolutional Neural Networks to Image Reconstruction and Denoising in Nuclear Medicine

    Authors: Amirreza Hashemi, Yuemeng Feng, Arman Rahmim, Hamid Sabet

    Abstract: This work investigates use of equivariant neural networks as efficient and high-performance frameworks for image reconstruction and denoising in nuclear medicine. Our work aims to tackle limitations of conventional Convolutional Neural Networks (CNNs), which require significant training. We investigated equivariant networks, aiming to reduce CNN's dependency on specific training sets. Specifically… ▽ More

    Submitted 30 January, 2025; v1 submitted 6 July, 2023; originally announced July 2023.

  21. Communication-Efficient Zeroth-Order Distributed Online Optimization: Algorithm, Theory, and Applications

    Authors: Ege C. Kaya, M. Berk Sahin, Abolfazl Hashemi

    Abstract: This paper focuses on a multi-agent zeroth-order online optimization problem in a federated learning setting for target tracking. The agents only sense their current distances to their targets and aim to maintain a minimum safe distance from each other to prevent collisions. The coordination among the agents and dissemination of collision-prevention information is managed by a central server using… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: 21 pages, 5 figures, and this paper has been accepted by IEEE Access

  22. arXiv:2305.04792  [pdf, other

    cs.LG cs.MA

    Global Update Tracking: A Decentralized Learning Algorithm for Heterogeneous Data

    Authors: Sai Aparna Aketi, Abolfazl Hashemi, Kaushik Roy

    Abstract: Decentralized learning enables the training of deep learning models over large distributed datasets generated at different locations, without the need for a central server. However, in practical scenarios, the data distribution across these devices can be significantly different, leading to a degradation in model performance. In this paper, we focus on designing a decentralized learning algorithm… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 22 pages, 10 tables, 3 figures

  23. arXiv:2303.10695  [pdf, other

    cs.LG eess.SY

    On the Convergence of Decentralized Federated Learning Under Imperfect Information Sharing

    Authors: Vishnu Pandi Chellapandi, Antesh Upadhyay, Abolfazl Hashemi, Stanislaw H /. Zak

    Abstract: Decentralized learning and optimization is a central problem in control that encompasses several existing and emerging applications, such as federated learning. While there exists a vast literature on this topic and most methods centered around the celebrated average-consensus paradigm, less attention has been devoted to scenarios where the communication between the agents may be imperfect. To thi… ▽ More

    Submitted 19 March, 2023; originally announced March 2023.

    Comments: 24 pages, 2 figures

  24. arXiv:2301.10960  [pdf, other

    cs.LG cs.SI

    Visiting Distant Neighbors in Graph Convolutional Networks

    Authors: Alireza Hashemi, Hernan Makse

    Abstract: We extend the graph convolutional network method for deep learning on graph data to higher order in terms of neighboring nodes. In order to construct representations for a node in a graph, in addition to the features of the node and its immediate neighboring nodes, we also include more distant nodes in the calculations. In experimenting with a number of publicly available citation graph datasets,… ▽ More

    Submitted 22 May, 2024; v1 submitted 26 January, 2023; originally announced January 2023.

  25. arXiv:2207.01105  [pdf, other

    cs.IT cs.AI

    Scalable Polar Code Construction for Successive Cancellation List Decoding: A Graph Neural Network-Based Approach

    Authors: Yun Liao, Seyyed Ali Hashemi, Hengjie Yang, John M. Cioffi

    Abstract: While constructing polar codes for successive-cancellation decoding can be implemented efficiently by sorting the bit-channels, finding optimal polar codes for cyclic-redundancy-check-aided successive-cancellation list (CA-SCL) decoding in an efficient and scalable manner still awaits investigation. This paper first maps a polar code to a unique heterogeneous graph called the polar-code-constructi… ▽ More

    Submitted 13 May, 2023; v1 submitted 3 July, 2022; originally announced July 2022.

    Comments: 33 pages, 11 figures, submitted to IEEE Transactions on Communications

  26. arXiv:2202.04786  [pdf, other

    cs.GT cs.LG

    No-Regret Learning in Dynamic Stackelberg Games

    Authors: Niklas Lauffer, Mahsa Ghasemi, Abolfazl Hashemi, Yagiz Savas, Ufuk Topcu

    Abstract: In a Stackelberg game, a leader commits to a randomized strategy, and a follower chooses their best strategy in response. We consider an extension of a standard Stackelberg game, called a discrete-time dynamic Stackelberg game, that has an underlying state space that affects the leader's rewards and available strategies and evolves in a Markovian manner depending on both the leader and follower's… ▽ More

    Submitted 9 February, 2022; originally announced February 2022.

    Comments: Preprint, under review

    ACM Class: I.2.6; I.2.11

  27. arXiv:2112.00057  [pdf, ps, other

    cs.IT

    Successive Syndrome-Check Decoding of Polar Codes

    Authors: Seyyed Ali Hashemi, Marco Mondelli, John Cioffi, Andrea Goldsmith

    Abstract: A two-part successive syndrome-check decoding of polar codes is proposed with the first part successively refining the received codeword and the second part checking its syndrome. A new formulation of the successive-cancellation (SC) decoding algorithm is presented that allows for successively refining the received codeword by comparing the log-likelihood ratio value of a frozen bit with its prede… ▽ More

    Submitted 30 November, 2021; originally announced December 2021.

    Comments: 2021 Asilomar Conference on Signals, Systems, and Computers

  28. arXiv:2111.01692  [pdf, other

    stat.ML cs.AI cs.LG eess.SP stat.AP

    Efficient Hierarchical Bayesian Inference for Spatio-temporal Regression Models in Neuroimaging

    Authors: Ali Hashemi, Yijing Gao, Chang Cai, Sanjay Ghosh, Klaus-Robert Müller, Srikantan S. Nagarajan, Stefan Haufe

    Abstract: Several problems in neuroimaging and beyond require inference on the parameters of multi-task sparse hierarchical regression models. Examples include M/EEG inverse problems, neural encoding models for task-based fMRI analyses, and climate science. In these domains, both the model parameters to be inferred and the measurement noise may exhibit a complex spatio-temporal structure. Existing work eith… ▽ More

    Submitted 23 November, 2021; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: Accepted to the 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  29. Fast Successive-Cancellation List Flip Decoding of Polar Codes

    Authors: Nghia Doan, Seyyed Ali Hashemi, Warren J. Gross

    Abstract: This work presents a fast successive-cancellation list flip (Fast-SCLF) decoding algorithm for polar codes that addresses the high latency issue associated with the successive-cancellation list flip (SCLF) decoding algorithm. We first propose a bit-flipping strategy tailored to the state-of-the-art fast successive-cancellation list (FSCL) decoding that avoids tree-traversal in the binary tree repr… ▽ More

    Submitted 23 January, 2022; v1 submitted 24 September, 2021; originally announced September 2021.

    Comments: Published in IEEE Access, Volume: 10, Page(s): 5568 - 5584, Date of Publication: 04 January 2022

  30. arXiv:2109.04993  [pdf, other

    cs.CV cs.AI cs.CL

    LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption Generation

    Authors: Mohammad Abuzar Hashemi, Zhanghexuan Li, Mihir Chauhan, Yan Shen, Abhishek Satbhai, Mir Basheer Ali, Mingchen Gao, Sargur Srihari

    Abstract: Pre-training visual and textual representations from large-scale image-text pairs is becoming a standard approach for many downstream vision-language tasks. The transformer-based models learn inter and intra-modal attention through a list of self-supervised learning tasks. This paper proposes LAViTeR, a novel architecture for visual and textual representation learning. The main module, Visual Text… ▽ More

    Submitted 1 October, 2024; v1 submitted 4 September, 2021; originally announced September 2021.

    Comments: 15 pages, 10 Figures, 5 Tables. Accepted for Oral Presentation at Irish Machine Vision and Image Processing Conference Proceedings (IMVIP), 2024

  31. arXiv:2109.02122  [pdf, other

    cs.IT

    Decoding Reed-Muller Codes with Successive Codeword Permutations

    Authors: Nghia Doan, Seyyed Ali Hashemi, Marco Mondelli, Warren J. Gross

    Abstract: A novel recursive list decoding (RLD) algorithm for Reed-Muller (RM) codes based on successive permutations (SP) of the codeword is presented. A low-complexity SP scheme applied to a subset of the symmetry group of RM codes is first proposed to carefully select a good codeword permutation on the fly. Then, the proposed SP technique is integrated into an improved RLD algorithm that initializes diff… ▽ More

    Submitted 20 September, 2022; v1 submitted 5 September, 2021; originally announced September 2021.

    Comments: Accepted for publication in IEEE Transactions on Communications

  32. arXiv:2108.12550  [pdf, ps, other

    cs.IT

    Successive-Cancellation Decoding of Reed-Muller Codes with Fast Hadamard Transform

    Authors: Nghia Doan, Seyyed Ali Hashemi, Warren J. Gross

    Abstract: A novel permuted fast successive-cancellation list decoding algorithm with fast Hadamard transform (FHT-FSCL) is presented. The proposed decoder initializes $L$ $(L\ge1)$ active decoding paths with $L$ random codeword permutations sampled from the full symmetry group of the codes. The path extension in the permutation domain is carried out until the first constituent RM code of order $1$ is visite… ▽ More

    Submitted 7 February, 2022; v1 submitted 27 August, 2021; originally announced August 2021.

    Comments: Submitted to an IEEE journal for possible publication

  33. arXiv:2107.08991  [pdf, ps, other

    cs.IT

    A Tree Search Approach for Maximum-Likelihood Decoding of Reed-Muller Codes

    Authors: Seyyed Ali Hashemi, Nghia Doan, Warren J. Gross, John Cioffi, Andrea Goldsmith

    Abstract: A low-complexity tree search approach is presented that achieves the maximum-likelihood (ML) decoding performance of Reed-Muller (RM) codes. The proposed approach generates a bit-flipping tree that is traversed to find the ML decoding result by performing successive-cancellation decoding after each node visit. A depth-first search (DFS) and a breadth-first search (BFS) scheme are developed and a l… ▽ More

    Submitted 19 July, 2021; originally announced July 2021.

  34. arXiv:2107.00116  [pdf, other

    cs.LG

    On the Benefits of Inducing Local Lipschitzness for Robust Generative Adversarial Imitation Learning

    Authors: Farzan Memarian, Abolfazl Hashemi, Scott Niekum, Ufuk Topcu

    Abstract: We explore methodologies to improve the robustness of generative adversarial imitation learning (GAIL) algorithms to observation noise. Towards this objective, we study the effect of local Lipschitzness of the discriminator and the generator on the robustness of policies learned by GAIL. In many robotics applications, the learned policies by GAIL typically suffer from a degraded performance at tes… ▽ More

    Submitted 15 January, 2024; v1 submitted 30 June, 2021; originally announced July 2021.

  35. arXiv:2106.08882  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    Robust Training in High Dimensions via Block Coordinate Geometric Median Descent

    Authors: Anish Acharya, Abolfazl Hashemi, Prateek Jain, Sujay Sanghavi, Inderjit S. Dhillon, Ufuk Topcu

    Abstract: Geometric median (\textsc{Gm}) is a classical method in statistics for achieving a robust estimation of the uncorrupted data; under gross corruption, it achieves the optimal breakdown point of 0.5. However, its computational complexity makes it infeasible for robustifying stochastic gradient descent (SGD) for high-dimensional optimization problems. In this paper, we show that by applying \textsc{G… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

  36. arXiv:2106.07094  [pdf, other

    cs.LG cs.DC eess.SP math.OC stat.ML

    On the Convergence of Differentially Private Federated Learning on Non-Lipschitz Objectives, and with Normalized Client Updates

    Authors: Rudrajit Das, Abolfazl Hashemi, Sujay Sanghavi, Inderjit S. Dhillon

    Abstract: There is a dearth of convergence results for differentially private federated learning (FL) with non-Lipschitz objective functions (i.e., when gradient norms are not bounded). The primary reason for this is that the clipping operation (i.e., projection onto an $\ell_2$ ball of a fixed radius called the clipping threshold) for bounding the sensitivity of the average update to each client's update i… ▽ More

    Submitted 15 April, 2022; v1 submitted 13 June, 2021; originally announced June 2021.

  37. arXiv:2103.03191  [pdf, other

    stat.ML cs.LG math.NA math.OC math.PR

    Generalization Bounds for Sparse Random Feature Expansions

    Authors: Abolfazl Hashemi, Hayden Schaeffer, Robert Shi, Ufuk Topcu, Giang Tran, Rachel Ward

    Abstract: Random feature methods have been successful in various machine learning tasks, are easy to compute, and come with theoretical accuracy bounds. They serve as an alternative approach to standard neural networks since they can represent similar function spaces without a costly training phase. However, for accuracy, random feature methods require more measurements than trainable parameters, limiting t… ▽ More

    Submitted 20 August, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

  38. arXiv:2012.13378  [pdf, ps, other

    cs.IT

    Parallelism versus Latency in Simplified Successive-Cancellation Decoding of Polar Codes

    Authors: Seyyed Ali Hashemi, Marco Mondelli, Arman Fazeli, Alexander Vardy, John Cioffi, Andrea Goldsmith

    Abstract: This paper characterizes the latency of the simplified successive-cancellation (SSC) decoding scheme for polar codes under hardware resource constraints. In particular, when the number of processing elements $P$ that can perform SSC decoding operations in parallel is limited, as is the case in practice, the latency of SSC decoding is $O\left(N^{1-1/μ}+\frac{N}{P}\log_2\log_2\frac{N}{P}\right)$, wh… ▽ More

    Submitted 24 December, 2020; originally announced December 2020.

  39. arXiv:2012.04061  [pdf, other

    stat.ML cs.DC cs.LG math.OC

    Faster Non-Convex Federated Learning via Global and Local Momentum

    Authors: Rudrajit Das, Anish Acharya, Abolfazl Hashemi, Sujay Sanghavi, Inderjit S. Dhillon, Ufuk Topcu

    Abstract: We propose \texttt{FedGLOMO}, a novel federated learning (FL) algorithm with an iteration complexity of $\mathcal{O}(ε^{-1.5})$ to converge to an $ε$-stationary point (i.e., $\mathbb{E}[\|\nabla f(\bm{x})\|^2] \leq ε$) for smooth non-convex functions -- under arbitrary client heterogeneity and compressed communication -- compared to the $\mathcal{O}(ε^{-2})$ complexity of most prior works. Our key… ▽ More

    Submitted 24 October, 2021; v1 submitted 7 December, 2020; originally announced December 2020.

  40. arXiv:2012.02232  [pdf, other

    cs.LG physics.flu-dyn

    Graph Convolutional Neural Networks for Body Force Prediction

    Authors: Francis Ogoke, Kazem Meidani, Amirreza Hashemi, Amir Barati Farimani

    Abstract: Many scientific and engineering processes produce spatially unstructured data. However, most data-driven models require a feature matrix that enforces both a set number and order of features for each sample. They thus cannot be easily constructed for an unstructured dataset. Therefore, a graph based data-driven model to perform inference on fields defined on an unstructured mesh, using a Graph Con… ▽ More

    Submitted 3 December, 2020; originally announced December 2020.

  41. arXiv:2011.12882  [pdf, other

    cs.IT

    Sparse Multi-Decoder Recursive Projection Aggregation for Reed-Muller Codes

    Authors: Dorsa Fathollahi, Nariman Farsad, Seyyed Ali Hashemi, Marco Mondelli

    Abstract: Reed-Muller (RM) codes are one of the oldest families of codes. Recently, a recursive projection aggregation (RPA) decoder has been proposed, which achieves a performance that is close to the maximum likelihood decoder for short-length RM codes. One of its main drawbacks, however, is the large amount of computations needed. In this paper, we devise a new algorithm to lower the computational budget… ▽ More

    Submitted 26 November, 2020; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: 6 pages, 12 figures

  42. arXiv:2011.10643  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    On the Benefits of Multiple Gossip Steps in Communication-Constrained Decentralized Optimization

    Authors: Abolfazl Hashemi, Anish Acharya, Rudrajit Das, Haris Vikalo, Sujay Sanghavi, Inderjit Dhillon

    Abstract: In decentralized optimization, it is common algorithmic practice to have nodes interleave (local) gradient descent iterations with gossip (i.e. averaging over the network) steps. Motivated by the training of large-scale machine learning models, it is also increasingly common to require that messages be {\em lossy compressed} versions of the local parameters. In this paper, we show that, in such co… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

  43. arXiv:2010.14919  [pdf, other

    cs.CV

    Transferable Universal Adversarial Perturbations Using Generative Models

    Authors: Atiye Sadat Hashemi, Andreas Bär, Saeed Mozaffari, Tim Fingscheidt

    Abstract: Deep neural networks tend to be vulnerable to adversarial perturbations, which by adding to a natural image can fool a respective model with high confidence. Recently, the existence of image-agnostic perturbations, also known as universal adversarial perturbations (UAPs), were discovered. However, existing UAPs still lack a sufficiently high fooling rate, when being applied to an unknown target mo… ▽ More

    Submitted 29 October, 2020; v1 submitted 28 October, 2020; originally announced October 2020.

  44. arXiv:2009.09277  [pdf, ps, other

    cs.IT cs.LG

    Construction of Polar Codes with Reinforcement Learning

    Authors: Yun Liao, Seyyed Ali Hashemi, John Cioffi, Andrea Goldsmith

    Abstract: This paper formulates the polar-code construction problem for the successive-cancellation list (SCL) decoder as a maze-traversing game, which can be solved by reinforcement learning techniques. The proposed method provides a novel technique for polar-code construction that no longer depends on sorting and selecting bit-channels by reliability. Instead, this technique decides whether the input bits… ▽ More

    Submitted 19 September, 2020; originally announced September 2020.

    Comments: To be published in Proceedings of IEEE Globecom 2020

  45. arXiv:2009.06796  [pdf, other

    cs.IT cs.LG eess.SP

    Decoding Polar Codes with Reinforcement Learning

    Authors: Nghia Doan, Seyyed Ali Hashemi, Warren Gross

    Abstract: In this paper we address the problem of selecting factor-graph permutations of polar codes under belief propagation (BP) decoding to significantly improve the error-correction performance of the code. In particular, we formalize the factor-graph permutation selection as the multi-armed bandit problem in reinforcement learning and propose a decoder that acts like an online-learning agent that learn… ▽ More

    Submitted 14 September, 2020; originally announced September 2020.

    Comments: Accepted for presentation at IEEE GLOBECOM 2020

  46. arXiv:2008.12483  [pdf

    physics.comp-ph cs.LG

    A transfer learning metamodel using artificial neural networks applied to natural convection flows in enclosures

    Authors: Majid Ashouri, Alireza Hashemi

    Abstract: In this paper, we employed a transfer learning technique to predict the Nusselt number for natural convection flows in enclosures. Specifically, we considered the benchmark problem of a two-dimensional square enclosure with isolated horizontal walls and vertical walls at constant temperatures. The Rayleigh and Prandtl numbers are sufficient parameters to simulate this problem numerically. We adopt… ▽ More

    Submitted 14 October, 2020; v1 submitted 28 August, 2020; originally announced August 2020.

  47. arXiv:2006.01266  [pdf, other

    cs.CL

    Leveraging Affective Bidirectional Transformers for Offensive Language Detection

    Authors: AbdelRahim Elmadany, Chiyu Zhang, Muhammad Abdul-Mageed, Azadeh Hashemi

    Abstract: Social media are pervasive in our life, making it necessary to ensure safe online experiences by detecting and removing offensive and hate speech. In this work, we report our submission to the Offensive Language and hate-speech Detection shared task organized with the 4th Workshop on Open-Source Arabic Corpora and Processing Tools Arabic (OSACT4). We focus on developing purely deep learning system… ▽ More

    Submitted 16 May, 2020; originally announced June 2020.

  48. arXiv:2005.04394  [pdf, other

    cs.IT

    Threshold-Based Fast Successive-Cancellation Decoding of Polar Codes

    Authors: Haotian Zheng, Seyyed Ali Hashemi, Alexios Balatsoukas-Stimming, Zizheng Cao, Ton Koonen, John Cioffi, Andrea Goldsmith

    Abstract: Fast SC decoding overcomes the latency caused by the serial nature of the SC decoding by identifying new nodes in the upper levels of the SC decoding tree and implementing their fast parallel decoders. In this work, we first present a novel sequence repetition node corresponding to a particular class of bit sequences. Most existing special node types are special cases of the proposed sequence repe… ▽ More

    Submitted 27 November, 2020; v1 submitted 9 May, 2020; originally announced May 2020.

    Comments: 14 pages, 8 figures, 5 tables, submitted to IEEE Transactions on Communications

  49. arXiv:1912.13072  [pdf, other

    cs.CL cs.IR cs.LG

    AraNet: A Deep Learning Toolkit for Arabic Social Media

    Authors: Muhammad Abdul-Mageed, Chiyu Zhang, Azadeh Hashemi, El Moatez Billah Nagoudi

    Abstract: We describe AraNet, a collection of deep learning Arabic social media processing tools. Namely, we exploit an extensive host of publicly available and novel social media datasets to train bidirectional encoders from transformer models (BERT) to predict age, dialect, gender, emotion, irony, and sentiment. AraNet delivers state-of-the-art performance on a number of the cited tasks and competitively… ▽ More

    Submitted 11 April, 2020; v1 submitted 30 December, 2019; originally announced December 2019.

    Comments: Accepted by The 4th Workshop on Open-Source Arabic Corpora and Processing Tools (OSACT)

  50. arXiv:1912.01086  [pdf, other

    cs.IT

    Deep-Learning-Aided Successive-Cancellation Decoding of Polar Codes

    Authors: Seyyed Ali Hashemi, Nghia Doan, Thibaud Tonnellier, Warren J. Gross

    Abstract: A deep-learning-aided successive-cancellation list (DL-SCL) decoding algorithm for polar codes is introduced with deep-learning-aided successive-cancellation (DL-SC) decoding being a specific case of it. The DL-SCL decoder works by allowing additional rounds of SCL decoding when the first SCL decoding attempt fails, using a novel bit-flipping metric. The proposed bit-flipping metric exploits the i… ▽ More

    Submitted 2 December, 2019; originally announced December 2019.

    Comments: 2019 Asilomar Conference on Signals, Systems, and Computers