Skip to main content

Showing 1–50 of 90 results for author: Hashemi, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.06027  [pdf, other

    cs.CL cs.LG

    Unilogit: Robust Machine Unlearning for LLMs Using Uniform-Target Self-Distillation

    Authors: Stefan Vasilev, Christian Herold, Baohao Liao, Seyyed Hadi Hashemi, Shahram Khadivi, Christof Monz

    Abstract: This paper introduces Unilogit, a novel self-distillation method for machine unlearning in Large Language Models. Unilogit addresses the challenge of selectively forgetting specific information while maintaining overall model utility, a critical task in compliance with data privacy regulations like GDPR. Unlike prior methods that rely on static hyperparameters or starting model outputs, Unilogit d… ▽ More

    Submitted 9 May, 2025; originally announced May 2025.

    Comments: 16 pages, 6 figures, 5 tables, under review at ACL

    MSC Class: 68T50 ACM Class: I.2.7

  2. arXiv:2504.14690  [pdf

    cs.CL cs.AI

    FarsEval-PKBETS: A new diverse benchmark for evaluating Persian large language models

    Authors: Mehrnoush Shamsfard, Zahra Saaberi, Mostafa Karimi manesh, Seyed Mohammad Hossein Hashemi, Zahra Vatankhah, Motahareh Ramezani, Niki Pourazin, Tara Zare, Maryam Azimi, Sarina Chitsaz, Sama Khoraminejad, Morteza Mahdavi Mortazavi, Mohammad Mahdi Chizari, Sahar Maleki, Seyed Soroush Majd, Mostafa Masumi, Sayed Ali Musavi Khoeini, Amir Mohseni, Sogol Alipour

    Abstract: Research on evaluating and analyzing large language models (LLMs) has been extensive for resource-rich languages such as English, yet their performance in languages such as Persian has received considerably less attention. This paper introduces FarsEval-PKBETS benchmark, a subset of FarsEval project for evaluating large language models in Persian. This benchmark consists of 4000 questions and answ… ▽ More

    Submitted 20 April, 2025; originally announced April 2025.

    Comments: 24 pages, 3 figures, 3 tables

    MSC Class: 68T50 ACM Class: I.2.7; E.0

  3. arXiv:2503.13089  [pdf, other

    cs.CL cs.AI

    ClusComp: A Simple Paradigm for Model Compression and Efficient Finetuning

    Authors: Baohao Liao, Christian Herold, Seyyed Hadi Hashemi, Stefan Vasilev, Shahram Khadivi, Christof Monz

    Abstract: As large language models (LLMs) scale, model compression is crucial for edge deployment and accessibility. Weight-only quantization reduces model size but suffers from performance degradation at lower bit widths. Moreover, standard finetuning is incompatible with quantized models, and alternative methods often fall short of full finetuning. In this paper, we propose ClusComp, a simple yet effectiv… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

    Comments: 26 pages, 11 figures, 18 tables

  4. arXiv:2503.02967  [pdf

    cs.CV cs.LG

    Revolutionizing Traffic Management with AI-Powered Machine Vision: A Step Toward Smart Cities

    Authors: Seyed Hossein Hosseini DolatAbadi, Sayyed Mohammad Hossein Hashemi, Mohammad Hosseini, Moein-Aldin AliHosseini

    Abstract: The rapid urbanization of cities and increasing vehicular congestion have posed significant challenges to traffic management and safety. This study explores the transformative potential of artificial intelligence (AI) and machine vision technologies in revolutionizing traffic systems. By leveraging advanced surveillance cameras and deep learning algorithms, this research proposes a system for real… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: 6 pages, 1 figure, 2 tables, accepted to 1th AITC conference in University Of Isfahan

  5. arXiv:2501.09706  [pdf, other

    cs.CL

    Domain Adaptation of Foundation LLMs for e-Commerce

    Authors: Christian Herold, Michael Kozielski, Tala Bazazo, Pavel Petrushkov, Seyyed Hadi Hashemi, Patrycja Cieplicka, Dominika Basaj, Shahram Khadivi

    Abstract: We present the e-Llama models: 8 billion and 70 billion parameter large language models that are adapted towards the e-commerce domain. These models are meant as foundation models with deep knowledge about e-commerce, that form a base for instruction- and fine-tuning. The e-Llama models are obtained by continuously pretraining the Llama 3.1 base models on 1 trillion tokens of domain-specific data.… ▽ More

    Submitted 19 January, 2025; v1 submitted 16 January, 2025; originally announced January 2025.

    Comments: include full author name

  6. arXiv:2411.17343  [pdf, other

    cs.CR cs.SE

    Assessing Vulnerability in Smart Contracts: The Role of Code Complexity Metrics in Security Analysis

    Authors: Masoud Jamshidiyan Tehrani, Sattar Hashemi

    Abstract: Codes with specific characteristics are more exposed to security vulnerabilities. Studies have revealed that codes that do not adhere to best practices are more challenging to verify and maintain, increasing the likelihood of unnoticed or unintentionally introduced vulnerabilities. Given the crucial role of smart contracts in blockchain systems, ensuring their security and conducting thorough vuln… ▽ More

    Submitted 13 March, 2025; v1 submitted 26 November, 2024; originally announced November 2024.

  7. arXiv:2411.15696  [pdf, other

    cs.IT

    RIS with Coupled Phase Shift and Amplitude: Capacity Maximization and Configuration Set Selection

    Authors: Seyedkhashayar Hashemi, Masoud Ardakani, Hai Jiang

    Abstract: A reconfigurable intelligent surface (RIS) is a planar surface that can enhance the quality of communication by providing control over the communication environment. Reflection optimization is one of the pivotal challenges in RIS setups. While there has been lots of research regarding the reflection optimization of RIS, most works consider the independence of the phase shift and the amplitude of R… ▽ More

    Submitted 23 November, 2024; originally announced November 2024.

  8. arXiv:2410.12380  [pdf, other

    cs.CL

    Evaluation of Attribution Bias in Retrieval-Augmented Large Language Models

    Authors: Amin Abolghasemi, Leif Azzopardi, Seyyed Hadi Hashemi, Maarten de Rijke, Suzan Verberne

    Abstract: Attributing answers to source documents is an approach used to enhance the verifiability of a model's output in retrieval augmented generation (RAG). Prior work has mainly focused on improving and evaluating the attribution quality of large language models (LLMs) in RAG, but this may come at the expense of inducing biases in the attribution of answers. We define and examine two aspects in the eval… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

  9. arXiv:2409.08917  [pdf, other

    cs.LG cs.AI stat.ML

    Latent Space Score-based Diffusion Model for Probabilistic Multivariate Time Series Imputation

    Authors: Guojun Liang, Najmeh Abiri, Atiye Sadat Hashemi, Jens Lundström, Stefan Byttner, Prayag Tiwari

    Abstract: Accurate imputation is essential for the reliability and success of downstream tasks. Recently, diffusion models have attracted great attention in this field. However, these models neglect the latent distribution in a lower-dimensional space derived from the observed data, which limits the generative capacity of the diffusion model. Additionally, dealing with the original missing data without labe… ▽ More

    Submitted 13 September, 2024; originally announced September 2024.

    Comments: 5 pages, conference

  10. arXiv:2408.00645  [pdf, other

    cs.SE

    Token Interdependency Parsing (Tipping) -- Fast and Accurate Log Parsing

    Authors: Shayan Hashemi, Mika Mäntylä

    Abstract: In the last decade, an impressive increase in software adaptions has led to a surge in log data production, making manual log analysis impractical and establishing the necessity for automated methods. Conversely, most automated analysis tools include a component designed to separate log templates from their parameters, commonly referred to as a "log parser". This paper aims to introduce a new fast… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

  11. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1112 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 16 December, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  12. arXiv:2402.11103  [pdf, other

    cs.LG cond-mat.mtrl-sci

    Toward Learning Latent-Variable Representations of Microstructures by Optimizing in Spatial Statistics Space

    Authors: Sayed Sajad Hashemi, Michael Guerzhoy, Noah H. Paulson

    Abstract: In Materials Science, material development involves evaluating and optimizing the internal structures of the material, generically referred to as microstructures. Microstructures structure is stochastic, analogously to image textures. A particular microstructure can be well characterized by its spatial statistics, analogously to image texture being characterized by the response to a Fourier-like f… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Journal ref: ICLR Tiny Papers 2024

  13. arXiv:2401.03302  [pdf

    eess.IV cs.AI cs.CV cs.LG stat.ML

    Realism in Action: Anomaly-Aware Diagnosis of Brain Tumors from Medical Images Using YOLOv8 and DeiT

    Authors: Seyed Mohammad Hossein Hashemi, Leila Safari, Amirhossein Dadashzadeh Taromi

    Abstract: In the field of medical sciences, reliable detection and classification of brain tumors from images remains a formidable challenge due to the rarity of tumors within the population of patients. Therefore, the ability to detect tumors in anomaly scenarios is paramount for ensuring timely interventions and improved patient outcomes. This study addresses the issue by leveraging deep learning (DL) tec… ▽ More

    Submitted 25 September, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: This work has been submitted to the Elsevier for possible publication

  14. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1326 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 9 May, 2025; v1 submitted 18 December, 2023; originally announced December 2023.

  15. arXiv:2312.03133  [pdf, other

    eess.IV cs.CV physics.med-ph

    Predicting Bone Degradation Using Vision Transformer and Synthetic Cellular Microstructures Dataset

    Authors: Mohammad Saber Hashemi, Azadeh Sheidaei

    Abstract: Bone degradation, especially for astronauts in microgravity conditions, is crucial for space exploration missions since the lower applied external forces accelerate the diminution in bone stiffness and strength substantially. Although existing computational models help us understand this phenomenon and possibly restrict its effect in the future, they are time-consuming to simulate the changes in t… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 8 pages, 5 figures

  16. arXiv:2311.07096  [pdf, other

    cs.IT eess.SP

    Optimal Configuration of Reconfigurable Intelligent Surfaces with Arbitrary Discrete Phase Shifts

    Authors: Seyedkhashayar Hashemi, Hai Jiang, Masoud Ardakani

    Abstract: We address the reflection optimization problem for a reconfigurable intelligent surface (RIS), where the RIS elements feature a set of non-uniformly spaced discrete phase shifts. This is motivated by the actual behavior of practical RIS elements, where it is shown that a uniform phase shift assumption is not realistic. A problem is formulated to find the optimal refection amplitudes and reflection… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  17. arXiv:2307.13848  [pdf, other

    cs.CR

    TeleBTC: Trustless Wrapped Bitcoin

    Authors: Mahyar Daneshpajooh, Niusha Moshrefi, Mahdi Darabi, Sina Hashemi, Mehrafarin Kazemi

    Abstract: This paper introduces TeleBTC, a fully decentralized protocol designed to wrap Bitcoin (BTC) on programmable blockchains. The creation of a decentralized wrapped BTC presents challenges due to the non-programmable nature of Bitcoin, making it difficult to custody BTCs in a decentralized way. Existing solutions have addressed this challenge by introducing an external layer of validators who take cu… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

  18. arXiv:2302.04163  [pdf, ps, other

    eess.SY cs.RO

    Task Space Control of Robot Manipulators based on Visual SLAM

    Authors: Seyed Hamed Hashemi, Jouni Mattila

    Abstract: This paper aims to address the open problem of designing a globally stable vision-based controller for robot manipulators. Accordingly, based on a hybrid mechanism, this paper proposes a novel task-space control law attained by taking the gradient of a potential function in SE(3). The key idea is to employ the Visual Simultaneous Localization and Mapping (VSLAM) algorithm to estimate a robot pose.… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

  19. Connective Reconstruction-based Novelty Detection

    Authors: Seyyed Morteza Hashemi, Parvaneh Aliniya, Parvin Razzaghi

    Abstract: Detection of out-of-distribution samples is one of the critical tasks for real-world applications of computer vision. The advancement of deep learning has enabled us to analyze real-world data which contain unexplained samples, accentuating the need to detect out-of-distribution instances more than before. GAN-based approaches have been widely used to address this problem due to their ability to p… ▽ More

    Submitted 25 October, 2022; originally announced October 2022.

  20. Using Word Embedding and Convolution Neural Network for Bug Triaging by Considering Design Flaws

    Authors: Reza Sepahvand, Reza Akbari, Behnaz Jamasb, Sattar Hashemi, Omid Boushehrian

    Abstract: Resolving bugs in the maintenance phase of software is a complicated task. Bug assignment is one of the main tasks for resolving bugs. Some Bugs cannot be fixed properly without making design decisions and have to be assigned to designers, rather than programmers, to avoid emerging bad smells that may cause subsequent bug reports. Hence, it is important to refer some bugs to the designer to check… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

  21. arXiv:2208.04146  [pdf

    cond-mat.mtrl-sci cs.LG

    Linking Properties to Microstructure in Liquid Metal Embedded Elastomers via Machine Learning

    Authors: Abhijith Thoopul Anantharanga, Mohammad Saber Hashemi, Azadeh Sheidaei

    Abstract: Liquid metals (LM) are embedded in an elastomer matrix to obtain soft composites with unique thermal, dielectric, and mechanical properties. They have applications in soft robotics, biomedical engineering, and wearable electronics. By linking the structure to the properties of these materials, it is possible to perform material design rationally. Liquid-metal embedded elastomers (LMEEs) have been… ▽ More

    Submitted 24 July, 2022; originally announced August 2022.

    Comments: 25 pages, 9 figures, submitted to the journal of Composites Science and Technology

  22. arXiv:2207.01105  [pdf, other

    cs.IT cs.AI

    Scalable Polar Code Construction for Successive Cancellation List Decoding: A Graph Neural Network-Based Approach

    Authors: Yun Liao, Seyyed Ali Hashemi, Hengjie Yang, John M. Cioffi

    Abstract: While constructing polar codes for successive-cancellation decoding can be implemented efficiently by sorting the bit-channels, finding optimal polar codes for cyclic-redundancy-check-aided successive-cancellation list (CA-SCL) decoding in an efficient and scalable manner still awaits investigation. This paper first maps a polar code to a unique heterogeneous graph called the polar-code-constructi… ▽ More

    Submitted 13 May, 2023; v1 submitted 3 July, 2022; originally announced July 2022.

    Comments: 33 pages, 11 figures, submitted to IEEE Transactions on Communications

  23. arXiv:2202.09214  [pdf, other

    cs.SE

    Pinpointing Anomaly Events in Logs from Stability Testing -- N-Grams vs. Deep-Learning

    Authors: Mika Mäntylä, Martín Varela, Shayan Hashemi

    Abstract: As stability testing execution logs can be very long, software engineers need help in locating anomalous events. We develop and evaluate two models for scoring individual log-events for anomalousness, namely an N-Gram model and a Deep Learning model with LSTM (Long short-term memory). Both are trained on normal log sequences only. We evaluate the models with long log sequences of Android stability… ▽ More

    Submitted 23 February, 2022; v1 submitted 18 February, 2022; originally announced February 2022.

    Comments: Accepted to 5th Workshop on NEXt level of Test Automation (NEXTA), ICST Workshops 2022

  24. arXiv:2112.00057  [pdf, ps, other

    cs.IT

    Successive Syndrome-Check Decoding of Polar Codes

    Authors: Seyyed Ali Hashemi, Marco Mondelli, John Cioffi, Andrea Goldsmith

    Abstract: A two-part successive syndrome-check decoding of polar codes is proposed with the first part successively refining the received codeword and the second part checking its syndrome. A new formulation of the successive-cancellation (SC) decoding algorithm is presented that allows for successively refining the received codeword by comparing the log-likelihood ratio value of a frozen bit with its prede… ▽ More

    Submitted 30 November, 2021; originally announced December 2021.

    Comments: 2021 Asilomar Conference on Signals, Systems, and Computers

  25. arXiv:2111.07369  [pdf, other

    eess.IV cs.AI cs.CV cs.LG physics.med-ph

    Estimation of Acetabular Version from Anteroposterior Pelvic Radiograph Employing Deep Learning

    Authors: Ata Jodeiri, Hadi Seyedarabi, Fatemeh Shahbazi, Seyed Mohammad Mahdi Hashemi, Seyyedhossein Shafiei

    Abstract: Background and Objective: The Acetabular version, an essential factor in total hip arthroplasty, is measured by CT scan as the gold standard. The dose of radiation and expensiveness of CT make anterior-posterior pelvic radiograph an appropriate alternative procedure. In this study, we applied a deep learning approach on anteroposterior pelvic X-rays to measure anatomical version, eliminating the n… ▽ More

    Submitted 14 November, 2021; originally announced November 2021.

    Comments: 12 pages, 8 figures

  26. Fast Successive-Cancellation List Flip Decoding of Polar Codes

    Authors: Nghia Doan, Seyyed Ali Hashemi, Warren J. Gross

    Abstract: This work presents a fast successive-cancellation list flip (Fast-SCLF) decoding algorithm for polar codes that addresses the high latency issue associated with the successive-cancellation list flip (SCLF) decoding algorithm. We first propose a bit-flipping strategy tailored to the state-of-the-art fast successive-cancellation list (FSCL) decoding that avoids tree-traversal in the binary tree repr… ▽ More

    Submitted 23 January, 2022; v1 submitted 24 September, 2021; originally announced September 2021.

    Comments: Published in IEEE Access, Volume: 10, Page(s): 5568 - 5584, Date of Publication: 04 January 2022

  27. arXiv:2109.02122  [pdf, other

    cs.IT

    Decoding Reed-Muller Codes with Successive Codeword Permutations

    Authors: Nghia Doan, Seyyed Ali Hashemi, Marco Mondelli, Warren J. Gross

    Abstract: A novel recursive list decoding (RLD) algorithm for Reed-Muller (RM) codes based on successive permutations (SP) of the codeword is presented. A low-complexity SP scheme applied to a subset of the symmetry group of RM codes is first proposed to carefully select a good codeword permutation on the fly. Then, the proposed SP technique is integrated into an improved RLD algorithm that initializes diff… ▽ More

    Submitted 20 September, 2022; v1 submitted 5 September, 2021; originally announced September 2021.

    Comments: Accepted for publication in IEEE Transactions on Communications

  28. arXiv:2108.12550  [pdf, ps, other

    cs.IT

    Successive-Cancellation Decoding of Reed-Muller Codes with Fast Hadamard Transform

    Authors: Nghia Doan, Seyyed Ali Hashemi, Warren J. Gross

    Abstract: A novel permuted fast successive-cancellation list decoding algorithm with fast Hadamard transform (FHT-FSCL) is presented. The proposed decoder initializes $L$ $(L\ge1)$ active decoding paths with $L$ random codeword permutations sampled from the full symmetry group of the codes. The path extension in the permutation domain is carried out until the first constituent RM code of order $1$ is visite… ▽ More

    Submitted 7 February, 2022; v1 submitted 27 August, 2021; originally announced August 2021.

    Comments: Submitted to an IEEE journal for possible publication

  29. arXiv:2107.08991  [pdf, ps, other

    cs.IT

    A Tree Search Approach for Maximum-Likelihood Decoding of Reed-Muller Codes

    Authors: Seyyed Ali Hashemi, Nghia Doan, Warren J. Gross, John Cioffi, Andrea Goldsmith

    Abstract: A low-complexity tree search approach is presented that achieves the maximum-likelihood (ML) decoding performance of Reed-Muller (RM) codes. The proposed approach generates a bit-flipping tree that is traversed to find the ML decoding result by performing successive-cancellation decoding after each node visit. A depth-first search (DFS) and a breadth-first search (BFS) scheme are developed and a l… ▽ More

    Submitted 19 July, 2021; originally announced July 2021.

  30. OneLog: Towards End-to-End Training in Software Log Anomaly Detection

    Authors: Shayan Hashemi, Mika Mäntylä

    Abstract: With the growth of online services, IoT devices, and DevOps-oriented software development, software log anomaly detection is becoming increasingly important. Prior works mainly follow a traditional four-staged architecture (Preprocessor, Parser, Vectorizer, and Classifier). This paper proposes OneLog, which utilizes a single Deep Neural Network (DNN) instead of multiple separate components. OneLog… ▽ More

    Submitted 27 February, 2024; v1 submitted 15 April, 2021; originally announced April 2021.

  31. arXiv:2103.03221  [pdf, ps, other

    cs.LG q-bio.QM

    GenoML: Automated Machine Learning for Genomics

    Authors: Mary B. Makarious, Hampton L. Leonard, Dan Vitale, Hirotaka Iwaki, David Saffo, Lana Sargent, Anant Dadu, Eduardo Salmerón Castaño, John F. Carter, Melina Maleknia, Juan A. Botia, Cornelis Blauwendraat, Roy H. Campbell, Sayed Hadi Hashemi, Andrew B. Singleton, Mike A. Nalls, Faraz Faghri

    Abstract: GenoML is a Python package automating machine learning workflows for genomics (genetics and multi-omics) with an open science philosophy. Genomics data require significant domain expertise to clean, pre-process, harmonize and perform quality control of the data. Furthermore, tuning, validation, and interpretation involve taking into account the biology and possibly the limitations of the underlyin… ▽ More

    Submitted 4 March, 2021; originally announced March 2021.

  32. Detecting Anomalies in Software Execution Logs with Siamese Network

    Authors: Shayan Hashemi, Mika Mäntylä

    Abstract: Logs are semi-structured text files that represent software's execution paths and states during its run-time. Therefore, detecting anomalies in software logs reflect anomalies in the software's execution path or state. So, it has become a notable concern in software engineering. We use LSTM like many prior works, and on top of LSTM, we propose a novel anomaly detection approach based on the Siames… ▽ More

    Submitted 2 February, 2021; originally announced February 2021.

  33. arXiv:2012.13378  [pdf, ps, other

    cs.IT

    Parallelism versus Latency in Simplified Successive-Cancellation Decoding of Polar Codes

    Authors: Seyyed Ali Hashemi, Marco Mondelli, Arman Fazeli, Alexander Vardy, John Cioffi, Andrea Goldsmith

    Abstract: This paper characterizes the latency of the simplified successive-cancellation (SSC) decoding scheme for polar codes under hardware resource constraints. In particular, when the number of processing elements $P$ that can perform SSC decoding operations in parallel is limited, as is the case in practice, the latency of SSC decoding is $O\left(N^{1-1/μ}+\frac{N}{P}\log_2\log_2\frac{N}{P}\right)$, wh… ▽ More

    Submitted 24 December, 2020; originally announced December 2020.

  34. arXiv:2011.12882  [pdf, other

    cs.IT

    Sparse Multi-Decoder Recursive Projection Aggregation for Reed-Muller Codes

    Authors: Dorsa Fathollahi, Nariman Farsad, Seyyed Ali Hashemi, Marco Mondelli

    Abstract: Reed-Muller (RM) codes are one of the oldest families of codes. Recently, a recursive projection aggregation (RPA) decoder has been proposed, which achieves a performance that is close to the maximum likelihood decoder for short-length RM codes. One of its main drawbacks, however, is the large amount of computations needed. In this paper, we devise a new algorithm to lower the computational budget… ▽ More

    Submitted 26 November, 2020; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: 6 pages, 12 figures

  35. arXiv:2010.14919  [pdf, other

    cs.CV

    Transferable Universal Adversarial Perturbations Using Generative Models

    Authors: Atiye Sadat Hashemi, Andreas Bär, Saeed Mozaffari, Tim Fingscheidt

    Abstract: Deep neural networks tend to be vulnerable to adversarial perturbations, which by adding to a natural image can fool a respective model with high confidence. Recently, the existence of image-agnostic perturbations, also known as universal adversarial perturbations (UAPs), were discovered. However, existing UAPs still lack a sufficiently high fooling rate, when being applied to an unknown target mo… ▽ More

    Submitted 29 October, 2020; v1 submitted 28 October, 2020; originally announced October 2020.

  36. arXiv:2010.12877  [pdf, other

    eess.SP cs.LG

    EEGsig: an open-source machine learning-based toolbox for EEG signal processing

    Authors: Fardin Ghorbani, Javad Shabanpour, Sepideh Monjezi, Hossein Soleimani, Soheil Hashemi, Ali Abdolali

    Abstract: In the quest to realize a comprehensive EEG signal processing framework, in this paper, we demonstrate a toolbox and graphic user interface, EEGsig, for the full process of EEG signals. Our goal is to provide a comprehensive suite, free and open-source framework for EEG signal processing where the users especially physicians who do not have programming experience can focus on their practical requi… ▽ More

    Submitted 26 August, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

  37. arXiv:2010.00041  [pdf

    physics.comp-ph cs.NE

    A Supervised Machine Learning Approach for Accelerating the Design of Particulate Composites: Application to Thermal Conductivity

    Authors: Mohammad Saber Hashemi, Masoud Safdari, Azadeh Sheidaei

    Abstract: A supervised machine learning (ML) based computational methodology for the design of particulate multifunctional composite materials with desired thermal conductivity (TC) is presented. The design variables are physical descriptors of the material microstructure that directly link microstructure to the material's properties. A sufficiently large and uniformly sampled database was generated based o… ▽ More

    Submitted 4 January, 2021; v1 submitted 30 September, 2020; originally announced October 2020.

    Comments: 24 pages, 6 figures, 3 tables

  38. arXiv:2009.09277  [pdf, ps, other

    cs.IT cs.LG

    Construction of Polar Codes with Reinforcement Learning

    Authors: Yun Liao, Seyyed Ali Hashemi, John Cioffi, Andrea Goldsmith

    Abstract: This paper formulates the polar-code construction problem for the successive-cancellation list (SCL) decoder as a maze-traversing game, which can be solved by reinforcement learning techniques. The proposed method provides a novel technique for polar-code construction that no longer depends on sorting and selecting bit-channels by reliability. Instead, this technique decides whether the input bits… ▽ More

    Submitted 19 September, 2020; originally announced September 2020.

    Comments: To be published in Proceedings of IEEE Globecom 2020

  39. arXiv:2009.06796  [pdf, other

    cs.IT cs.LG eess.SP

    Decoding Polar Codes with Reinforcement Learning

    Authors: Nghia Doan, Seyyed Ali Hashemi, Warren Gross

    Abstract: In this paper we address the problem of selecting factor-graph permutations of polar codes under belief propagation (BP) decoding to significantly improve the error-correction performance of the code. In particular, we formalize the factor-graph permutation selection as the multi-armed bandit problem in reinforcement learning and propose a decoder that acts like an online-learning agent that learn… ▽ More

    Submitted 14 September, 2020; originally announced September 2020.

    Comments: Accepted for presentation at IEEE GLOBECOM 2020

  40. arXiv:2005.04394  [pdf, other

    cs.IT

    Threshold-Based Fast Successive-Cancellation Decoding of Polar Codes

    Authors: Haotian Zheng, Seyyed Ali Hashemi, Alexios Balatsoukas-Stimming, Zizheng Cao, Ton Koonen, John Cioffi, Andrea Goldsmith

    Abstract: Fast SC decoding overcomes the latency caused by the serial nature of the SC decoding by identifying new nodes in the upper levels of the SC decoding tree and implementing their fast parallel decoders. In this work, we first present a novel sequence repetition node corresponding to a particular class of bit sequences. Most existing special node types are special cases of the proposed sequence repe… ▽ More

    Submitted 27 November, 2020; v1 submitted 9 May, 2020; originally announced May 2020.

    Comments: 14 pages, 8 figures, 5 tables, submitted to IEEE Transactions on Communications

  41. arXiv:2004.14020  [pdf, other

    cs.NI cs.DC cs.LG

    Caramel: Accelerating Decentralized Distributed Deep Learning with Computation Scheduling

    Authors: Sayed Hadi Hashemi, Sangeetha Abdu Jyothi, Brighten Godfrey, Roy Campbell

    Abstract: The method of choice for parameter aggregation in Deep Neural Network (DNN) training, a network-intensive task, is shifting from the Parameter Server model to decentralized aggregation schemes (AllReduce) inspired by theoretical guarantees of better performance. However, current implementations of AllReduce overlook the interdependence of communication and computation, resulting in significant per… ▽ More

    Submitted 29 April, 2020; originally announced April 2020.

  42. arXiv:2003.02820  [pdf

    cs.DC cs.NI

    Workload Scheduling on heterogeneous Mobile Edge Cloud in 5G networks to Minimize SLA Violation

    Authors: Mostafa Hadadian Nejad Yousefi, Amirmasoud Ghiassi, Boshra Sadat Hashemi, Maziar Goudarzi

    Abstract: Smart devices have become an indispensable part of our lives and gain increasing applicability in almost every area. Latency-aware applications such as Augmented Reality (AR), autonomous driving, and online gaming demand more resources such as network bandwidth and computational capabilities. Since the traditional mobile networks cannot fulfill the required bandwidth and latency, Mobile Edge Cloud… ▽ More

    Submitted 21 March, 2020; v1 submitted 5 March, 2020; originally announced March 2020.

    Comments: 12 pages, 8 figures, 4 tables contact: hadadian AT ce DOT sharif DOT edu

  43. arXiv:2002.06226  [pdf

    cs.LG stat.ML

    Wind speed prediction using a hybrid model of the multi-layer perceptron and whale optimization algorithm

    Authors: Saeed Samadianfard, Sajjad Hashemi, Katayoun Kargar, Mojtaba Izadyar, Ali Mostafaeipour, Amir Mosavi, Narjes Nabipour, Shahaboddin Shamshirband

    Abstract: Wind power as a renewable source of energy, has numerous economic, environmental and social benefits. In order to enhance and control renewable wind power, it is vital to utilize models that predict wind speed with high accuracy. Due to neglecting of requirement and significance of data preprocessing and disregarding the inadequacy of using a single predicting model, many traditional models have p… ▽ More

    Submitted 14 February, 2020; originally announced February 2020.

    Comments: 20 pages, 7 figures

    MSC Class: 68Q05

  44. arXiv:1912.01086  [pdf, other

    cs.IT

    Deep-Learning-Aided Successive-Cancellation Decoding of Polar Codes

    Authors: Seyyed Ali Hashemi, Nghia Doan, Thibaud Tonnellier, Warren J. Gross

    Abstract: A deep-learning-aided successive-cancellation list (DL-SCL) decoding algorithm for polar codes is introduced with deep-learning-aided successive-cancellation (DL-SC) decoding being a specific case of it. The DL-SCL decoder works by allowing additional rounds of SCL decoding when the first SCL decoding attempt fails, using a novel bit-flipping metric. The proposed bit-flipping metric exploits the i… ▽ More

    Submitted 2 December, 2019; originally announced December 2019.

    Comments: 2019 Asilomar Conference on Signals, Systems, and Computers

  45. arXiv:1911.04021  [pdf, other

    cs.AI cs.LG cs.NE

    DRiLLS: Deep Reinforcement Learning for Logic Synthesis

    Authors: Abdelrahman Hosny, Soheil Hashemi, Mohamed Shalan, Sherief Reda

    Abstract: Logic synthesis requires extensive tuning of the synthesis optimization flow where the quality of results (QoR) depends on the sequence of optimizations used. Efficient design space exploration is challenging due to the exponential number of possible optimization permutations. Therefore, automating the optimization process is necessary. In this work, we propose a novel reinforcement learning-based… ▽ More

    Submitted 12 November, 2019; v1 submitted 10 November, 2019; originally announced November 2019.

    Comments: ASPDAC'2020

  46. arXiv:1909.12746  [pdf

    cs.IR

    Cross-domain recommender system using Generalized Canonical Correlation Analysis

    Authors: Seyed Mohammad Hashemi, Mohammad Rahmati

    Abstract: Recommender systems provide personalized recommendations to the users from a large number of possible options in online stores. Matrix factorization is a well-known and accurate collaborative filtering approach for recommender system, which suffers from cold-start problem for new users and items. Whenever a new user participate with the system there is not enough interactions with the system, ther… ▽ More

    Submitted 15 September, 2019; originally announced September 2019.

  47. arXiv:1909.04892  [pdf, ps, other

    cs.IT

    Sublinear Latency for Simplified Successive Cancellation Decoding of Polar Codes

    Authors: Marco Mondelli, Seyyed Ali Hashemi, John Cioffi, Andrea Goldsmith

    Abstract: This work analyzes the latency of the simplified successive cancellation (SSC) decoding scheme for polar codes proposed by Alamdar-Yazdi and Kschischang. It is shown that, unlike conventional successive cancellation decoding, where latency is linear in the block length, the latency of SSC decoding is sublinear. More specifically, the latency of SSC decoding is $O(N^{1-1/μ})$, where $N$ is the bloc… ▽ More

    Submitted 5 September, 2020; v1 submitted 11 September, 2019; originally announced September 2019.

    Comments: 20 pages, 6 figures, presented in part at ISIT 2020 and accepted in IEEE Transactions on Wireless Communications

  48. arXiv:1908.05798  [pdf, other

    cs.IT eess.SP

    Efficient Flicker-Free FEC Codes using Knuth's Balancing Algorithm for VLC

    Authors: Elie Ngomseu Mambou, Thibaud Tonnellier, Seyyed Ali Hashemi, Warren J. Gross

    Abstract: Visible light communication (VLC) provides a short-range optical wireless communication through light-emitting diode (LED) lighting. Light beam flickering and dimming are among the challenges to be addressed in VLC. Conventional methods for generating flicker-free codes in VLC are based on run-length limited codes that have poor error correction performance, use lookup tables which are memory cons… ▽ More

    Submitted 15 August, 2019; originally announced August 2019.

    Comments: 6 pages, 8 figures, conference

  49. arXiv:1907.11563  [pdf, other

    cs.IT eess.SP

    Neural Dynamic Successive Cancellation Flip Decoding of Polar Codes

    Authors: Nghia Doan, Seyyed Ali Hashemi, Furkan Ercan, Thibaud Tonnellier, Warren Gross

    Abstract: Dynamic successive cancellation flip (DSCF) decoding of polar codes is a powerful algorithm that can achieve the error correction performance of successive cancellation list (SCL) decoding, with a complexity that is close to that of successive cancellation (SC) decoding at practical signal-to-noise ratio (SNR) regimes. However, DSCF decoding requires costly transcendental computations which advers… ▽ More

    Submitted 26 July, 2019; originally announced July 2019.

  50. arXiv:1904.06578  [pdf

    physics.comp-ph cs.LG

    Deep-learning PDEs with unlabeled data and hardwiring physics laws

    Authors: S. Mohammad H. Hashemi, Demetri Psaltis

    Abstract: Providing fast and accurate solutions to partial differential equations is a problem of continuous interest to the fields of applied mathematics and physics. With the recent advances in machine learning, the adoption learning techniques in this domain is being eagerly pursued. We build upon earlier works on linear and homogeneous PDEs, and develop convolutional deep neural networks that can accura… ▽ More

    Submitted 13 April, 2019; originally announced April 2019.