Skip to main content

Showing 1–50 of 80 results for author: Moradi, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.11957  [pdf, ps, other

    physics.med-ph cs.LG

    Automated Treatment Planning for Interstitial HDR Brachytherapy for Locally Advanced Cervical Cancer using Deep Reinforcement Learning

    Authors: Mohammadamin Moradi, Runyu Jiang, Yingzi Liu, Malvern Madondo, Tianming Wu, James J. Sohn, Xiaofeng Yang, Yasmin Hasan, Zhen Tian

    Abstract: High-dose-rate (HDR) brachytherapy plays a critical role in the treatment of locally advanced cervical cancer but remains highly dependent on manual treatment planning expertise. The objective of this study is to develop a fully automated HDR brachytherapy planning framework that integrates reinforcement learning (RL) and dose-based optimization to generate clinically acceptable treatment plans wi… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

    Comments: 12 pages, 2 figures, 3 tables

  2. arXiv:2506.11345  [pdf, ps, other

    cs.IT

    On the High-Rate FDPC Codes: Construction, Encoding, and a Generalization

    Authors: Mohsen Moradi, Sheida Rabeti, Hessam Mahdavifar

    Abstract: Recently introduced Fair-Density Parity-Check (FDPC) codes, targeting high-rate applications, offer superior error-correction performance (ECP) compared to 5G Low-Density Parity-Check (LDPC) codes, given the same number of message-passing decoding iterations. In this paper, we present a novel construction method for FDPC codes, introduce a generalization of these codes, and propose a low-complexit… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

  3. arXiv:2506.11268  [pdf, ps, other

    cs.IT

    Bounds and New Constructions for Girth-Constrained Regular Bipartite Graphs

    Authors: Sheida Rabeti, Mohsen Moradi, Hessam Mahdavifar

    Abstract: In this paper, we explore the design and analysis of regular bipartite graphs motivated by their application in low-density parity-check (LDPC) codes specifically with constrained girth and in the high-rate regime. We focus on the relation between the girth of the graph, and the size of the sets of variable and check nodes. We derive bounds on the size of the vertices in regular bipartite graphs,… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

  4. arXiv:2505.19475  [pdf, ps, other

    cs.CL

    Continuous Self-Improvement of Large Language Models by Test-time Training with Verifier-Driven Sample Selection

    Authors: Mohammad Mahdi Moradi, Hossam Amer, Sudhir Mudur, Weiwei Zhang, Yang Liu, Walid Ahmed

    Abstract: Learning to adapt pretrained language models to unlabeled, out-of-distribution data is a critical challenge, as models often falter on structurally novel reasoning tasks even while excelling within their training distribution. We introduce a new framework called VDS-TTT - Verifier-Driven Sample Selection for Test-Time Training to efficiently address this. We use a learned verifier to score a pool… ▽ More

    Submitted 28 May, 2025; v1 submitted 25 May, 2025; originally announced May 2025.

  5. arXiv:2505.19472  [pdf, ps, other

    cs.CL

    Balancing Computation Load and Representation Expressivity in Parallel Hybrid Neural Networks

    Authors: Mohammad Mahdi Moradi, Walid Ahmed, Shuangyue Wen, Sudhir Mudur, Weiwei Zhang, Yang Liu

    Abstract: Attention and State-Space Models (SSMs) when combined in a hybrid network in sequence or in parallel provide complementary strengths. In a hybrid sequential pipeline they alternate between applying a transformer to the input and then feeding its output into a SSM. This results in idle periods in the individual components increasing end-to-end latency and lowering throughput caps. In the parallel h… ▽ More

    Submitted 28 May, 2025; v1 submitted 25 May, 2025; originally announced May 2025.

  6. arXiv:2505.19354  [pdf, ps, other

    cs.CL cs.CV

    GC-KBVQA: A New Four-Stage Framework for Enhancing Knowledge Based Visual Question Answering Performance

    Authors: Mohammad Mahdi Moradi, Sudhir Mudur

    Abstract: Knowledge-Based Visual Question Answering (KB-VQA) methods focus on tasks that demand reasoning with information extending beyond the explicit content depicted in the image. Early methods relied on explicit knowledge bases to provide this auxiliary information. Recent approaches leverage Large Language Models (LLMs) as implicit knowledge sources. While KB-VQA methods have demonstrated promising re… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

  7. arXiv:2412.06072  [pdf, other

    cs.IT cs.CC

    PAC codes with Bounded-Complexity Sequential Decoding: Pareto Distribution and Code Design

    Authors: Mohsen Moradi, Hessam Mahdavifar

    Abstract: Recently, a novel variation of polar codes known as polarization-adjusted convolutional (PAC) codes has been introduced by Arıkan. These codes significantly outperform conventional polar and convolutional codes, particularly for short codeword lengths, and are shown to operate very close to the optimal bounds. It has also been shown that if the rate profile of PAC codes does not adhere to certain… ▽ More

    Submitted 8 December, 2024; originally announced December 2024.

    Comments: 11 pages. arXiv admin note: text overlap with arXiv:2012.05511

  8. arXiv:2411.05661  [pdf, other

    stat.ML cs.LG

    Multi-armed Bandits with Missing Outcome

    Authors: Ilia Mahrooghi, Mahshad Moradi, Sina Akbari, Negar Kiyavash

    Abstract: While significant progress has been made in designing algorithms that minimize regret in online decision-making, real-world scenarios often introduce additional complexities, perhaps the most challenging of which is missing outcomes. Overlooking this aspect or simply assuming random missingness invariably leads to biased estimates of the rewards and may result in linear regret. Despite the practic… ▽ More

    Submitted 8 November, 2024; originally announced November 2024.

    Comments: 38 pages, 5 figures, multi-armed bandits, missing data

  9. arXiv:2410.02077  [pdf, other

    cs.LG cs.AI cs.CV

    Kolmogorov-Arnold Network Autoencoders

    Authors: Mohammadamin Moradi, Shirin Panahi, Erik Bollt, Ying-Cheng Lai

    Abstract: Deep learning models have revolutionized various domains, with Multi-Layer Perceptrons (MLPs) being a cornerstone for tasks like data regression and image classification. However, a recent study has introduced Kolmogorov-Arnold Networks (KANs) as promising alternatives to MLPs, leveraging activation functions placed on edges rather than nodes. This structural shift aligns KANs closely with the Kol… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

    Comments: 12 pages, 5 figures, 1 table

  10. arXiv:2409.15167  [pdf, other

    cs.LG math.DS nlin.CD physics.data-an

    Data-driven model discovery with Kolmogorov-Arnold networks

    Authors: Mohammadamin Moradi, Shirin Panahi, Erik M. Bollt, Ying-Cheng Lai

    Abstract: Data-driven model discovery of complex dynamical systems is typically done using sparse optimization, but it has a fundamental limitation: sparsity in that the underlying governing equations of the system contain only a small number of elementary mathematical terms. Examples where sparse optimization fails abound, such as the classic Ikeda or optical-cavity map in nonlinear dynamics and a large va… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

    Comments: 6 pages, 4 figures

  11. arXiv:2409.14827  [pdf, other

    cs.CV cs.HC cs.MM

    AIM 2024 Challenge on Video Saliency Prediction: Methods and Results

    Authors: Andrey Moskalenko, Alexey Bryncev, Dmitry Vatolin, Radu Timofte, Gen Zhan, Li Yang, Yunlong Tang, Yiting Liao, Jiongzhi Lin, Baitao Huang, Morteza Moradi, Mohammad Moradi, Francesco Rundo, Concetto Spampinato, Ali Borji, Simone Palazzo, Yuxin Zhu, Yinan Sun, Huiyu Duan, Yuqin Cao, Ziheng Jia, Qiang Hu, Xiongkuo Min, Guangtao Zhai, Hao Fang , et al. (8 additional authors not shown)

    Abstract: This paper reviews the Challenge on Video Saliency Prediction at AIM 2024. The goal of the participants was to develop a method for predicting accurate saliency maps for the provided set of video sequences. Saliency maps are widely exploited in various applications, including video compression, quality assessment, visual perception studies, the advertising industry, etc. For this competition, a pr… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

    Comments: ECCVW 2024

    ACM Class: I.4.6; I.2.10

  12. arXiv:2408.03840  [pdf, other

    cs.IT

    On Fast SC-based Polar Decoders: Metric Polarization and a Pruning Technique

    Authors: Mohsen Moradi, Hessam Mahdavifar

    Abstract: Short- to medium-block-length polar-like and polarization-adjusted convolutional (PAC) codes have demonstrated exceptional error-correction performance through sequential decoding. Successive cancellation list (SCL) decoding of polar-like and PAC codes can potentially match the performance of sequential decoding though a relatively large list size is often required. By benefiting from an optimal m… ▽ More

    Submitted 19 March, 2025; v1 submitted 7 August, 2024; originally announced August 2024.

  13. arXiv:2405.00874  [pdf

    cs.SE cs.AI

    Artificial intelligence for context-aware visual change detection in software test automation

    Authors: Milad Moradi, Ke Yan, David Colwell, Rhona Asgari

    Abstract: Automated software testing is integral to the software development process, streamlining workflows and ensuring product reliability. Visual testing within this context, especially concerning user interface (UI) and user experience (UX) validation, stands as one of crucial determinants of overall software quality. Nevertheless, conventional methods like pixel-wise comparison and region-based visual… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  14. arXiv:2404.19669  [pdf, other

    cs.LG

    Enhancing Predictive Accuracy in Pharmaceutical Sales Through An Ensemble Kernel Gaussian Process Regression Approach

    Authors: Shahin Mirshekari, Mohammadreza Moradi, Hossein Jafari, Mehdi Jafari, Mohammad Ensaf

    Abstract: This research employs Gaussian Process Regression (GPR) with an ensemble kernel, integrating Exponential Squared, Revised Matérn, and Rational Quadratic kernels to analyze pharmaceutical sales data. Bayesian optimization was used to identify optimal kernel weights: 0.76 for Exponential Squared, 0.21 for Revised Matérn, and 0.13 for Rational Quadratic. The ensemble kernel demonstrated superior perf… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 6 pages, 5 figures

  15. arXiv:2404.11973  [pdf

    cs.AI

    Exploring the landscape of large language models: Foundations, techniques, and challenges

    Authors: Milad Moradi, Ke Yan, David Colwell, Matthias Samwald, Rhona Asgari

    Abstract: In this review paper, we delve into the realm of Large Language Models (LLMs), covering their foundational principles, diverse applications, and nuanced training processes. The article sheds light on the mechanics of in-context learning and a spectrum of fine-tuning approaches, with a special focus on methods that optimize efficiency in parameter usage. Additionally, it explores how LLMs can be mo… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  16. arXiv:2404.07560  [pdf, other

    cs.RO cs.AI

    Socially Pertinent Robots in Gerontological Healthcare

    Authors: Xavier Alameda-Pineda, Angus Addlesee, Daniel Hernández García, Chris Reinke, Soraya Arias, Federica Arrigoni, Alex Auternaud, Lauriane Blavette, Cigdem Beyan, Luis Gomez Camara, Ohad Cohen, Alessandro Conti, Sébastien Dacunha, Christian Dondrup, Yoav Ellinson, Francesco Ferro, Sharon Gannot, Florian Gras, Nancie Gunson, Radu Horaud, Moreno D'Incà, Imad Kimouche, Séverin Lemaignan, Oliver Lemon, Cyril Liotard , et al. (19 additional authors not shown)

    Abstract: Despite the many recent achievements in developing and deploying social robotics, there are still many underexplored environments and applications for which systematic evaluation of such systems by end-users is necessary. While several robotic platforms have been used in gerontological healthcare, the question of whether or not a social interactive robot with multi-modal conversational capabilitie… ▽ More

    Submitted 11 February, 2025; v1 submitted 11 April, 2024; originally announced April 2024.

  17. arXiv:2404.03097  [pdf, other

    cs.CV

    SalFoM: Dynamic Saliency Prediction with Video Foundation Models

    Authors: Morteza Moradi, Mohammad Moradi, Francesco Rundo, Concetto Spampinato, Ali Borji, Simone Palazzo

    Abstract: Recent advancements in video saliency prediction (VSP) have shown promising performance compared to the human visual system, whose emulation is the primary goal of VSP. However, current state-of-the-art models employ spatio-temporal transformers trained on limited amounts of data, hindering generalizability adaptation to downstream tasks. The benefits of vision foundation models present a potentia… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 15 pages, 4 figures

  18. arXiv:2402.14877  [pdf, other

    physics.ao-ph cs.LG math.DS physics.data-an physics.pop-ph

    Machine-learning prediction of tipping with applications to the Atlantic Meridional Overturning Circulation

    Authors: Shirin Panahi, Ling-Wei Kong, Mohammadamin Moradi, Zheng-Meng Zhai, Bryan Glaz, Mulugeta Haile, Ying-Cheng Lai

    Abstract: Anticipating a tipping point, a transition from one stable steady state to another, is a problem of broad relevance due to the ubiquity of the phenomenon in diverse fields. The steady-state nature of the dynamics about a tipping point makes its prediction significantly more challenging than predicting other types of critical transitions from oscillatory or chaotic dynamics. Exploiting the benefits… ▽ More

    Submitted 17 October, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 11 pages, 7 figures

  19. arXiv:2402.14131  [pdf, other

    eess.SP cs.LG physics.data-an

    Random forests for detecting weak signals and extracting physical information: a case study of magnetic navigation

    Authors: Mohammadamin Moradi, Zheng-Meng Zhai, Aaron Nielsen, Ying-Cheng Lai

    Abstract: It was recently demonstrated that two machine-learning architectures, reservoir computing and time-delayed feed-forward neural networks, can be exploited for detecting the Earth's anomaly magnetic field immersed in overwhelming complex signals for magnetic navigation in a GPS-denied environment. The accuracy of the detected anomaly field corresponds to a positioning accuracy in the range of 10 to… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 12 pages, 11 figures

    Journal ref: APL Machine Learning 2 (1), 016118 (2024)

  20. arXiv:2401.10376  [pdf, other

    cs.IT

    PAC Code Rate-Profile Design Using Search-Constrained Optimization Algorithms

    Authors: Mohsen Moradi, David G. M. Mitchell

    Abstract: In this paper, we introduce a novel rate-profile design based on search-constrained optimization techniques to assess the performance of polarization-adjusted convolutional (PAC) codes under Fano (sequential) decoding. The results demonstrate that the resulting PAC code offers much reduced computational complexity compared to a construction based on a conventional genetic algorithm without a perfo… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  21. arXiv:2401.07942  [pdf, other

    cs.CV cs.MM

    Transformer-based Video Saliency Prediction with High Temporal Dimension Decoding

    Authors: Morteza Moradi, Simone Palazzo, Concetto Spampinato

    Abstract: In recent years, finding an effective and efficient strategy for exploiting spatial and temporal information has been a hot research topic in video saliency prediction (VSP). With the emergence of spatio-temporal transformers, the weakness of the prior strategies, e.g., 3D convolutional networks and LSTM-based networks, for capturing long-range dependencies has been effectively compensated. While… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 8 pages, 2 figures, 3 tables

  22. arXiv:2401.03448  [pdf, other

    eess.AS cs.SD

    Single-Microphone Speaker Separation and Voice Activity Detection in Noisy and Reverberant Environments

    Authors: Renana Opochinsky, Mordehay Moradi, Sharon Gannot

    Abstract: Speech separation involves extracting an individual speaker's voice from a multi-speaker audio signal. The increasing complexity of real-world environments, where multiple speakers might converse simultaneously, underscores the importance of effective speech separation techniques. This work presents a single-microphone speaker separation network with TF attention aiming at noisy and reverberant en… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

  23. arXiv:2312.15550  [pdf

    cs.CL cs.AI cs.LG

    Multi-level biomedical NER through multi-granularity embeddings and enhanced labeling

    Authors: Fahime Shahrokh, Nasser Ghadiri, Rasoul Samani, Milad Moradi

    Abstract: Biomedical Named Entity Recognition (NER) is a fundamental task of Biomedical Natural Language Processing for extracting relevant information from biomedical texts, such as clinical records, scientific publications, and electronic health records. The conventional approaches for biomedical NER mainly use traditional machine learning techniques, such as Conditional Random Fields and Support Vector M… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    MSC Class: 68T50; 68T07 ACM Class: J.3; I.2.7; I.2.1

  24. arXiv:2311.09142  [pdf, other

    cs.LG math.DS nlin.CD physics.comp-ph

    Machine-learning parameter tracking with partial state observation

    Authors: Zheng-Meng Zhai, Mohammadamin Moradi, Bryan Glaz, Mulugeta Haile, Ying-Cheng Lai

    Abstract: Complex and nonlinear dynamical systems often involve parameters that change with time, accurate tracking of which is essential to tasks such as state estimation, prediction, and control. Existing machine-learning methods require full state observation of the underlying system and tacitly assume adiabatic changes in the parameter. Formulating an inverse problem and exploiting reservoir computing,… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 5 pages, 4 figures

  25. arXiv:2309.13305  [pdf, other

    cs.SI

    Multilevel User Credibility Assessment in Social Networks

    Authors: Mohammad Moradi, Mostafa Haghir Chehreghani

    Abstract: Online social networks are major platforms for disseminating both real and fake news. Many users, intentionally or unintentionally, spread harmful content, fake news, and rumors in fields such as politics and business. Consequently, numerous studies have been conducted in recent years to assess user credibility. A significant shortcoming of most existing methods is that they categorize users as ei… ▽ More

    Submitted 11 January, 2025; v1 submitted 23 September, 2023; originally announced September 2023.

  26. arXiv:2309.11470  [pdf, other

    cs.RO cs.LG math.DS nlin.CD

    Model-free tracking control of complex dynamical trajectories with machine learning

    Authors: Zheng-Meng Zhai, Mohammadamin Moradi, Ling-Wei Kong, Bryan Glaz, Mulugeta Haile, Ying-Cheng Lai

    Abstract: Nonlinear tracking control enabling a dynamical system to track a desired trajectory is fundamental to robotics, serving a wide range of civil and defense applications. In control engineering, designing tracking control requires complete knowledge of the system model and equations. We develop a model-free, machine-learning framework to control a two-arm robotic manipulator using only partially obs… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: 16 pages, 8 figures

    Journal ref: Nat Commun 14, 5698 (2023)

  27. arXiv:2304.01463  [pdf, other

    cs.IT

    Polarization-Adjusted Convolutional (PAC) Codes as a Concatenation of Inner Cyclic and Outer Polar- and Reed-Muller-like Codes

    Authors: Mohsen Moradi

    Abstract: Polarization-adjusted convolutional (PAC) codes are a new family of linear block codes that can perform close to the theoretical bounds in the short block-length regime. These codes combine polar coding and convolutional coding. In this study, we show that PAC codes are equivalent to a new class of codes consisting of inner cyclic codes and outer polar- and Reed-Muller-like codes. We leverage the… ▽ More

    Submitted 2 August, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

  28. Model-agnostic explainable artificial intelligence for object detection in image data

    Authors: Milad Moradi, Ke Yan, David Colwell, Matthias Samwald, Rhona Asgari

    Abstract: In recent years, deep neural networks have been widely used for building high-performance Artificial Intelligence (AI) systems for computer vision applications. Object detection is a fundamental task in computer vision, which has been greatly progressed through developing large and intricate AI models. However, the lack of transparency is a big challenge that may not allow the widespread adoption… ▽ More

    Submitted 4 September, 2024; v1 submitted 30 March, 2023; originally announced March 2023.

  29. ThoughtSource: A central hub for large language model reasoning data

    Authors: Simon Ott, Konstantin Hebenstreit, Valentin Liévin, Christoffer Egeberg Hother, Milad Moradi, Maximilian Mayrhauser, Robert Praas, Ole Winther, Matthias Samwald

    Abstract: Large language models (LLMs) such as GPT-4 have recently demonstrated impressive results across a wide range of tasks. LLMs are still limited, however, in that they frequently fail at complex reasoning, their reasoning processes are opaque, they are prone to 'hallucinate' facts, and there are concerns about their underlying biases. Letting models verbalize reasoning steps as natural language, a te… ▽ More

    Submitted 27 July, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: Revision: added datasets, formatting

    Journal ref: Scientific Data 10, 528 (2023)

  30. arXiv:2301.04619  [pdf, other

    cs.CV

    TinyHD: Efficient Video Saliency Prediction with Heterogeneous Decoders using Hierarchical Maps Distillation

    Authors: Feiyan Hu, Simone Palazzo, Federica Proietto Salanitri, Giovanni Bellitto, Morteza Moradi, Concetto Spampinato, Kevin McGuinness

    Abstract: Video saliency prediction has recently attracted attention of the research community, as it is an upstream task for several practical applications. However, current solutions are particularly computationally demanding, especially due to the wide usage of spatio-temporal 3D convolutions. We observe that, while different model architectures achieve similar performance on benchmarks, visual variation… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: WACV2023

  31. arXiv:2211.11749  [pdf

    eess.IV cs.CV physics.med-ph

    Towards Automatic Prediction of Outcome in Treatment of Cerebral Aneurysms

    Authors: Ashutosh Jadhav, Satyananda Kashyap, Hakan Bulu, Ronak Dholakia, Amon Y. Liu, Tanveer Syeda-Mahmood, William R. Patterson, Hussain Rangwala, Mehdi Moradi

    Abstract: Intrasaccular flow disruptors treat cerebral aneurysms by diverting the blood flow from the aneurysm sac. Residual flow into the sac after the intervention is a failure that could be due to the use of an undersized device, or to vascular anatomy and clinical condition of the patient. We report a machine learning model based on over 100 clinical and imaging features that predict the outcome of wide… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: 10 pages

    Report number: https://s4.goeshow.com/amia/annual/2022/schedule_at_a_glance.cfm?session_key=1965BCBD-A832-92DD-9D05-FB2CB132FADB&session_date=

    Journal ref: AMAI 2022 Annual Symposium

  32. arXiv:2208.04010  [pdf, other

    cs.IT

    Application of Guessing to Sequential Decoding of Polarization-Adjusted Convolutional (PAC) Codes

    Authors: Mohsen Moradi

    Abstract: Despite the extreme error-correction performance, the amount of computation of sequential decoding of the polarization-adjusted convolutional (PAC) codes is random. In sequential decoding of convolutional codes, the computational cutoff rate denotes the region between rates whose average computational complexity of decoding is finite and those which is infinite. In this paper, by benefiting from t… ▽ More

    Submitted 29 November, 2022; v1 submitted 8 August, 2022; originally announced August 2022.

  33. arXiv:2208.03873  [pdf, other

    cs.CV cs.LG

    CheXRelNet: An Anatomy-Aware Model for Tracking Longitudinal Relationships between Chest X-Rays

    Authors: Gaurang Karwande, Amarachi Mbakawe, Joy T. Wu, Leo A. Celi, Mehdi Moradi, Ismini Lourentzou

    Abstract: Despite the progress in utilizing deep learning to automate chest radiograph interpretation and disease diagnosis tasks, change between sequential Chest X-rays (CXRs) has received limited attention. Monitoring the progression of pathologies that are visualized through chest imaging poses several challenges in anatomical motion estimation and image registration, i.e., spatially aligning the two ima… ▽ More

    Submitted 15 September, 2022; v1 submitted 7 August, 2022; originally announced August 2022.

    Comments: Accepted at MICCAI 2022

  34. arXiv:2207.11946  [pdf, other

    cs.IT

    A Tree Pruning Technique for Decoding Complexity Reduction of Polar Codes and PAC Codes

    Authors: Mohsen Moradi, Amir Mozammel

    Abstract: Sorting operation is one of the main bottlenecks for the successive-cancellation list (SCL) decoding. This paper introduces an improvement to the SCL decoding for polar and pre-transformed polar codes that reduces the number of sorting operations without degrading the code's error-correction performance. In an SCL decoding with an optimum metric function we show that, on average, the correct branc… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

  35. arXiv:2204.11574  [pdf, other

    cs.CL cs.AI

    A global analysis of metrics used for measuring performance in natural language processing

    Authors: Kathrin Blagec, Georg Dorffner, Milad Moradi, Simon Ott, Matthias Samwald

    Abstract: Measuring the performance of natural language processing models is challenging. Traditionally used metrics, such as BLEU and ROUGE, originally devised for machine translation and summarization, have been shown to suffer from low correlation with human judgment and a lack of transferability to other tasks and languages. In the past 15 years, a wide range of alternative metrics have been proposed. H… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: "NLP Power" workshop at ACL 2022. This work is based on a previous arXiv submission: arXiv:2008.02577 [cs.AI]

  36. arXiv:2202.12678  [pdf

    cs.AI cs.CL cs.LG

    Deep Learning, Natural Language Processing, and Explainable Artificial Intelligence in the Biomedical Domain

    Authors: Milad Moradi, Matthias Samwald

    Abstract: In this article, we first give an introduction to artificial intelligence and its applications in biology and medicine in Section 1. Deep learning methods are then described in Section 2. We narrow down the focus of the study on textual data in Section 3, where natural language processing and its applications in the biomedical domain are described. In Section 4, we give an introduction to explaina… ▽ More

    Submitted 7 March, 2022; v1 submitted 25 February, 2022; originally announced February 2022.

  37. 3D Segmentation with Fully Trainable Gabor Kernels and Pearson's Correlation Coefficient

    Authors: Ken C. L. Wong, Mehdi Moradi

    Abstract: The convolutional layer and loss function are two fundamental components in deep learning. Because of the success of conventional deep learning kernels, the less versatile Gabor kernels become less popular despite the fact that they can provide abundant features at different frequencies, orientations, and scales with much fewer parameters. For existing loss functions for multi-class image segmenta… ▽ More

    Submitted 15 December, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: This paper was accepted by the International Workshop on Machine Learning in Medical Imaging (MLMI 2022)

  38. arXiv:2111.08529  [pdf

    cs.CL cs.AI

    Improving the robustness and accuracy of biomedical language models through adversarial training

    Authors: Milad Moradi, Matthias Samwald

    Abstract: Deep transformer neural network models have improved the predictive accuracy of intelligent text processing systems in the biomedical domain. They have obtained state-of-the-art performance scores on a wide variety of biomedical and clinical Natural Language Processing (NLP) benchmarks. However, the robustness and reliability of these models has been less explored so far. Neural NLP models can be… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

  39. arXiv:2109.02555  [pdf

    cs.CL cs.AI cs.LG

    GPT-3 Models are Poor Few-Shot Learners in the Biomedical Domain

    Authors: Milad Moradi, Kathrin Blagec, Florian Haberl, Matthias Samwald

    Abstract: Deep neural language models have set new breakthroughs in many tasks of Natural Language Processing (NLP). Recent work has shown that deep transformer language models (pretrained on large amounts of texts) can achieve high levels of task-specific few-shot performance comparable to state-of-the-art models. However, the ability of these large language models in few-shot transfer learning has not yet… ▽ More

    Submitted 1 June, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

  40. arXiv:2108.12242  [pdf

    cs.CL cs.AI

    Deep learning models are not robust against noise in clinical text

    Authors: Milad Moradi, Kathrin Blagec, Matthias Samwald

    Abstract: Artificial Intelligence (AI) systems are attracting increasing interest in the medical domain due to their ability to learn complicated tasks that require human intelligence and expert knowledge. AI systems that utilize high-performance Natural Language Processing (NLP) models have achieved state-of-the-art results on a wide variety of clinical text processing benchmarks. They have even outperform… ▽ More

    Submitted 27 August, 2021; originally announced August 2021.

  41. arXiv:2108.12237  [pdf

    cs.CL cs.AI

    Evaluating the Robustness of Neural Language Models to Input Perturbations

    Authors: Milad Moradi, Matthias Samwald

    Abstract: High-performance neural language models have obtained state-of-the-art results on a wide range of Natural Language Processing (NLP) tasks. However, results for common benchmark datasets often do not reflect model reliability and robustness when applied to noisy, real-world data. In this study, we design and implement various types of character-level and word-level perturbation methods to simulate… ▽ More

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: Accepted by EMNLP 2021

  42. Hybrid deep learning methods for phenotype prediction from clinical notes

    Authors: Sahar Khalafi, Nasser Ghadiri, Milad Moradi

    Abstract: Identifying patient cohorts from clinical notes in secondary electronic health records is a fundamental task in clinical information management. However, with the growing number of clinical notes, it becomes challenging to analyze the data manually for phenotype detection. Automatic extraction of clinical concepts would helps to identify the patient phenotypes correctly. This paper proposes a nove… ▽ More

    Submitted 3 May, 2022; v1 submitted 16 August, 2021; originally announced August 2021.

    MSC Class: 68T50; 68T07 ACM Class: J.3; I.2.7; I.2.1

  43. Basis Scaling and Double Pruning for Efficient Inference in Network-Based Transfer Learning

    Authors: Ken C. L. Wong, Satyananda Kashyap, Mehdi Moradi

    Abstract: Network-based transfer learning allows the reuse of deep learning features with limited data, but the resulting models can be unnecessarily large. Although network pruning can improve inference efficiency, existing algorithms usually require fine-tuning that may not be suitable for small datasets. In this paper, using the singular value decomposition, we decompose a convolutional layer into two la… ▽ More

    Submitted 20 December, 2023; v1 submitted 5 August, 2021; originally announced August 2021.

    Comments: This paper was accepted by Pattern Recognition Letters

  44. arXiv:2108.00316  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Chest ImaGenome Dataset for Clinical Reasoning

    Authors: Joy T. Wu, Nkechinyere N. Agu, Ismini Lourentzou, Arjun Sharma, Joseph A. Paguio, Jasper S. Yao, Edward C. Dee, William Mitchell, Satyananda Kashyap, Andrea Giovannini, Leo A. Celi, Mehdi Moradi

    Abstract: Despite the progress in automatic detection of radiologic findings from chest X-ray (CXR) images in recent years, a quantitative evaluation of the explainability of these models is hampered by the lack of locally labeled datasets for different findings. With the exception of a few expert-labeled small-scale datasets for specific findings, such as pneumonia and pneumothorax, most of the CXR deep le… ▽ More

    Submitted 31 July, 2021; originally announced August 2021.

    Comments: Dataset available on PhysioNet (https://doi.org/10.13026/wv01-y230)

  45. arXiv:2106.08822  [pdf, other

    cs.IT

    Concatenated Reed-Solomon and Polarization-Adjusted Convolutional (PAC) Codes

    Authors: Mohsen Moradi, Amir Mozammel

    Abstract: Two concatenated coding schemes incorporating algebraic Reed-Solomon (RS) codes and polarization-adjusted convolutional (PAC) codes are proposed. Simulation results show that at a bit error rate of $10^{-5}$, a concatenated scheme using RS and PAC codes has more than $0.25$ dB coding gain over the NASA standard concatenation scheme, which uses RS and convolutional codes.

    Submitted 16 June, 2021; originally announced June 2021.

  46. arXiv:2106.08118  [pdf, other

    cs.IT

    A Monte-Carlo Based Construction of Polarization-Adjusted Convolutional (PAC) Codes

    Authors: Mohsen Moradi, Amir Mozammel

    Abstract: This paper proposes a rate-profile construction method for polarization-adjusted convolutional (PAC) codes of any code length and rate, which is capable of maintaining trade-off between the error-correction performance and decoding complexity of PAC code. The proposed method can improve the error-correction performance of PAC codes while guaranteeing a low mean sequential decoding complexity for s… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

  47. arXiv:2105.09937  [pdf, other

    cs.CV cs.AI

    AnaXNet: Anatomy Aware Multi-label Finding Classification in Chest X-ray

    Authors: Nkechinyere N. Agu, Joy T. Wu, Hanqing Chao, Ismini Lourentzou, Arjun Sharma, Mehdi Moradi, Pingkun Yan, James Hendler

    Abstract: Radiologists usually observe anatomical regions of chest X-ray images as well as the overall image before making a decision. However, most existing deep learning models only look at the entire X-ray image for classification, failing to utilize important anatomical information. In this paper, we propose a novel multi-label chest X-ray classification model that accurately classifies the image findin… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.

    Comments: Accepted to MICCAI 2021

  48. arXiv:2103.12228  [pdf, other

    cs.CV

    Channel Scaling: A Scale-and-Select Approach for Transfer Learning

    Authors: Ken C. L. Wong, Satyananda Kashyap, Mehdi Moradi

    Abstract: Transfer learning with pre-trained neural networks is a common strategy for training classifiers in medical image analysis. Without proper channel selections, this often results in unnecessarily large models that hinder deployment and explainability. In this paper, we propose a novel approach to efficiently build small and well performing networks by introducing the channel-scaling layers. A chann… ▽ More

    Submitted 22 March, 2021; originally announced March 2021.

    Comments: This paper was accepted by the IEEE International Symposium on Biomedical Imaging (ISBI) 2021

  49. Explaining Black-box Models for Biomedical Text Classification

    Authors: Milad Moradi, Matthias Samwald

    Abstract: In this paper, we propose a novel method named Biomedical Confident Itemsets Explanation (BioCIE), aiming at post-hoc explanation of black-box machine learning models for biomedical text classification. Using sources of domain knowledge and a confident itemset mining method, BioCIE discretizes the decision space of a black-box into smaller subspaces and extracts semantic relationships between the… ▽ More

    Submitted 20 December, 2020; originally announced December 2020.

  50. arXiv:2012.05511  [pdf, other

    cs.IT

    On the Metric and Computation of PAC Codes

    Authors: Mohsen Moradi

    Abstract: In this paper, we present an optimal metric function on average, which leads to a significantly low decoding computation while maintaining the superiority of the polarization-adjusted convolutional (PAC) codes' error-correction performance. With our proposed metric function, the PAC codes' decoding computation is comparable to the conventional convolutional codes (CC) sequential decoding. Moreover… ▽ More

    Submitted 10 December, 2020; originally announced December 2020.