-
AI-Driven MRI-based Brain Tumour Segmentation Benchmarking
Authors:
Connor Ludwig,
Khashayar Namdar,
Farzad Khalvati
Abstract:
Medical image segmentation has greatly aided medical diagnosis, with U-Net based architectures and nnU-Net providing state-of-the-art performance. There have been numerous general promptable models and medical variations introduced in recent years, but there is currently a lack of evaluation and comparison of these models across a variety of prompt qualities on a common medical dataset. This resea…
▽ More
Medical image segmentation has greatly aided medical diagnosis, with U-Net based architectures and nnU-Net providing state-of-the-art performance. There have been numerous general promptable models and medical variations introduced in recent years, but there is currently a lack of evaluation and comparison of these models across a variety of prompt qualities on a common medical dataset. This research uses Segment Anything Model (SAM), Segment Anything Model 2 (SAM 2), MedSAM, SAM-Med-3D, and nnU-Net to obtain zero-shot inference on the BraTS 2023 adult glioma and pediatrics dataset across multiple prompt qualities for both points and bounding boxes. Several of these models exhibit promising Dice scores, particularly SAM and SAM 2 achieving scores of up to 0.894 and 0.893, respectively when given extremely accurate bounding box prompts which exceeds nnU-Net's segmentation performance. However, nnU-Net remains the dominant medical image segmentation network due to the impracticality of providing highly accurate prompts to the models. The model and prompt evaluation, as well as the comparison, are extended through fine-tuning SAM, SAM 2, MedSAM, and SAM-Med-3D on the pediatrics dataset. The improvements in point prompt performance after fine-tuning are substantial and show promise for future investigation, but are unable to achieve better segmentation than bounding boxes or nnU-Net.
△ Less
Submitted 25 June, 2025;
originally announced June 2025.
-
Red Teaming Large Language Models for Healthcare
Authors:
Vahid Balazadeh,
Michael Cooper,
David Pellow,
Atousa Assadi,
Jennifer Bell,
Mark Coastworth,
Kaivalya Deshpande,
Jim Fackler,
Gabriel Funingana,
Spencer Gable-Cook,
Anirudh Gangadhar,
Abhishek Jaiswal,
Sumanth Kaja,
Christopher Khoury,
Amrit Krishnan,
Randy Lin,
Kaden McKeen,
Sara Naimimohasses,
Khashayar Namdar,
Aviraj Newatia,
Allan Pang,
Anshul Pattoo,
Sameer Peesapati,
Diana Prepelita,
Bogdana Rakova
, et al. (10 additional authors not shown)
Abstract:
We present the design process and findings of the pre-conference workshop at the Machine Learning for Healthcare Conference (2024) entitled Red Teaming Large Language Models for Healthcare, which took place on August 15, 2024. Conference participants, comprising a mix of computational and clinical expertise, attempted to discover vulnerabilities -- realistic clinical prompts for which a large lang…
▽ More
We present the design process and findings of the pre-conference workshop at the Machine Learning for Healthcare Conference (2024) entitled Red Teaming Large Language Models for Healthcare, which took place on August 15, 2024. Conference participants, comprising a mix of computational and clinical expertise, attempted to discover vulnerabilities -- realistic clinical prompts for which a large language model (LLM) outputs a response that could cause clinical harm. Red-teaming with clinicians enables the identification of LLM vulnerabilities that may not be recognised by LLM developers lacking clinical expertise. We report the vulnerabilities found, categorise them, and present the results of a replication study assessing the vulnerabilities across all LLMs provided.
△ Less
Submitted 1 May, 2025;
originally announced May 2025.
-
The effects of fibre spatial distribution and relative orientation on the percolation and mechanics of stochastic fibre networks: A model of peptide hydrogels
Authors:
Amir Hossein Namdar,
Nastaran Zoghi,
Aline Miller,
Alberto Saiani,
Tom Shearer
Abstract:
The structures of fibre networks can vary greatly due to fibre interactions during formation. We have modified the steps of generating Mikado networks to create two new model classes by altering the spatial distribution and relative orientation of their fibres to mimic the structures of self-assembling peptide hydrogels (SAPHs), whose physical properties depend strongly on their fibres' interactio…
▽ More
The structures of fibre networks can vary greatly due to fibre interactions during formation. We have modified the steps of generating Mikado networks to create two new model classes by altering the spatial distribution and relative orientation of their fibres to mimic the structures of self-assembling peptide hydrogels (SAPHs), whose physical properties depend strongly on their fibres' interactions. The results of our models and experiments on a set of beta-sheet forming SAPHs show that modifying a network's structure affects the percolation threshold and the mechanical behaviour of the material, both near percolation and at higher densities.
△ Less
Submitted 24 March, 2025; v1 submitted 6 November, 2024;
originally announced November 2024.
-
Opportunities for Persian Digital Humanities Research with Artificial Intelligence Language Models; Case Study: Forough Farrokhzad
Authors:
Arash Rasti Meymandi,
Zahra Hosseini,
Sina Davari,
Abolfazl Moshiri,
Shabnam Rahimi-Golkhandan,
Khashayar Namdar,
Nikta Feizi,
Mohamad Tavakoli-Targhi,
Farzad Khalvati
Abstract:
This study explores the integration of advanced Natural Language Processing (NLP) and Artificial Intelligence (AI) techniques to analyze and interpret Persian literature, focusing on the poetry of Forough Farrokhzad. Utilizing computational methods, we aim to unveil thematic, stylistic, and linguistic patterns in Persian poetry. Specifically, the study employs AI models including transformer-based…
▽ More
This study explores the integration of advanced Natural Language Processing (NLP) and Artificial Intelligence (AI) techniques to analyze and interpret Persian literature, focusing on the poetry of Forough Farrokhzad. Utilizing computational methods, we aim to unveil thematic, stylistic, and linguistic patterns in Persian poetry. Specifically, the study employs AI models including transformer-based language models for clustering of the poems in an unsupervised framework. This research underscores the potential of AI in enhancing our understanding of Persian literary heritage, with Forough Farrokhzad's work providing a comprehensive case study. This approach not only contributes to the field of Persian Digital Humanities but also sets a precedent for future research in Persian literary studies using computational techniques.
△ Less
Submitted 10 May, 2024;
originally announced May 2024.
-
Improving Pediatric Low-Grade Neuroepithelial Tumors Molecular Subtype Identification Using a Novel AUROC Loss Function for Convolutional Neural Networks
Authors:
Khashayar Namdar,
Matthias W. Wagner,
Cynthia Hawkins,
Uri Tabori,
Birgit B. Ertl-Wagner,
Farzad Khalvati
Abstract:
Pediatric Low-Grade Neuroepithelial Tumors (PLGNT) are the most common pediatric cancer type, accounting for 40% of brain tumors in children, and identifying PLGNT molecular subtype is crucial for treatment planning. However, the gold standard to determine the PLGNT subtype is biopsy, which can be impractical or dangerous for patients. This research improves the performance of Convolutional Neural…
▽ More
Pediatric Low-Grade Neuroepithelial Tumors (PLGNT) are the most common pediatric cancer type, accounting for 40% of brain tumors in children, and identifying PLGNT molecular subtype is crucial for treatment planning. However, the gold standard to determine the PLGNT subtype is biopsy, which can be impractical or dangerous for patients. This research improves the performance of Convolutional Neural Networks (CNNs) in classifying PLGNT subtypes through MRI scans by introducing a loss function that specifically improves the model's Area Under the Receiver Operating Characteristic (ROC) Curve (AUROC), offering a non-invasive diagnostic alternative. In this study, a retrospective dataset of 339 children with PLGNT (143 BRAF fusion, 71 with BRAF V600E mutation, and 125 non-BRAF) was curated. We employed a CNN model with Monte Carlo random data splitting. The baseline model was trained using binary cross entropy (BCE), and achieved an AUROC of 86.11% for differentiating BRAF fusion and BRAF V600E mutations, which was improved to 87.71% using our proposed AUROC loss function (p-value 0.045). With multiclass classification, the AUROC improved from 74.42% to 76. 59% (p-value 0.0016).
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
Towards a phase-field based model for combustion in particle beds: Reactive fluid flow
Authors:
Reza Namdar,
Mohammad Norouzi,
Fathollah Varnik
Abstract:
The present study provide a systematic derivation of a phase-field version of the momentum, mass and heat transport equations, while accounting for chemical reactions in the fluid phase. To achieve this goal, the volume averaging technique is used to reformulate the conservation equations in the presence of multiple phases and their respective diffuse interfaces. A careful analysis, and neglecting…
▽ More
The present study provide a systematic derivation of a phase-field version of the momentum, mass and heat transport equations, while accounting for chemical reactions in the fluid phase. To achieve this goal, the volume averaging technique is used to reformulate the conservation equations in the presence of multiple phases and their respective diffuse interfaces. A careful analysis, and neglecting terms of the second order in fluctuations, reveals that the structure of the multiphase/diffuse interface version of the conservation equations is very similar to the original single phase/sharp interface formulation. The multiphase character of the problem reflects itself in a coupling term, which acts at the interface between two adjacent phases. The model is then applied to the special case of a reactive fluid in contact with an inert solid. Two coupling parameters are then introduced, which control the exchange of momentum and heat at the interface. For a numerical study of the thus obtained set of coupled partial differential equations, a hybrid lattice Boltzmann-finite difference-phase field (LB-FD-PF) framework is proposed and implemented in the open source software OpenPhase. The model is then thoroughly validated via numerical simulations, including isothermal flow, non-reactive non-isothermal flow, and reactive flows. To perform the simulations involving chemical reactions, OpenPhase is coupled to the open-source chemical kinetics software CANTERA, which delivers details of the chemical reaction mechanisms and the necessary thermodynamic and transport properties of the reacting chemical species. These simulations show that, once the coupling parameters are adequately tuned, the present approach yields excellent agreement with the sought for sharp interface method.
△ Less
Submitted 26 October, 2023;
originally announced October 2023.
-
Parametric 3D Convolutional Autoencoder for the Prediction of Flow Fields in a Bed Configuration of Hot Particles
Authors:
Ali Mjalled,
Reza Namdar,
Lucas Reineking,
Mohammad Norouzi,
Fathollah Varnik,
Martin Mönnigmann
Abstract:
The use of deep learning methods for modeling fluid flow has drawn a lot of attention in the past few years. In situations where conventional numerical approaches can be computationally expensive, these techniques have shown promise in offering accurate, rapid, and practical solutions for modeling complex fluid flow problems. The success of deep learning is often due to its ability to extract hidd…
▽ More
The use of deep learning methods for modeling fluid flow has drawn a lot of attention in the past few years. In situations where conventional numerical approaches can be computationally expensive, these techniques have shown promise in offering accurate, rapid, and practical solutions for modeling complex fluid flow problems. The success of deep learning is often due to its ability to extract hidden patterns and features from the data, enabling the creation of data-driven reduced models that can capture the underlying physics of the domain. We present a data-driven reduced model for predicting flow fields in a bed configuration of hot particles. The reduced model consists of a parametric 3D convolutional autoencoder. The first part resolves the spatial and temporal dependencies present in the input sequence, while the second part of the architecture is responsible for predicting the solution at the subsequent timestep based on the information gathered from the preceding part. We also propose the utilization of a post-processing non-trainable output layer following the decoding path to incorporate the physical knowledge, e.g., no-slip condition, into the prediction. The evaluation of the reduced model for a bed configuration with variable particle temperature showed accurate results at a fraction of the computational cost required by traditional numerical simulation methods.
△ Less
Submitted 12 February, 2024; v1 submitted 6 July, 2023;
originally announced July 2023.
-
Modeling gas flows in packed beds with the lattice Boltzmann method: validation against experiments
Authors:
Tanya Neeraj,
Christin Velten,
Gabor Janiga,
Katharina Zähringer,
Reza Namdar,
Fathollah Varnik,
Dominique Thévenin,
Seyed Ali Hosseini
Abstract:
This study aims to validate the lattice Boltzmann method and assess its ability to accurately describe the behavior of gaseous flows in packed beds. To that end, simulations of a model packed bed reactor, corresponding to an experimental bench, are conducted, and the results are directly compared with experimental data obtained by Particle Image Velocimetry measurements. It is found that the latti…
▽ More
This study aims to validate the lattice Boltzmann method and assess its ability to accurately describe the behavior of gaseous flows in packed beds. To that end, simulations of a model packed bed reactor, corresponding to an experimental bench, are conducted, and the results are directly compared with experimental data obtained by Particle Image Velocimetry measurements. It is found that the lattice Boltzmann solver exhibits very good agreement with experimental measurements. Then, the numerical solver is further used to analyze the effect of the number of packing layers on the flow structure and to determine the minimum bed height above which the changes in flow structure become insignificant. Finally, flow fluctuations in time are discussed. The findings of this study provide valuable insights into the behavior of the gas flow in packed bed reactors, opening the door for further investigations involving additionally chemical reactions, as found in many practical applications.
△ Less
Submitted 20 June, 2023;
originally announced June 2023.
-
Non-invasive Liver Fibrosis Screening on CT Images using Radiomics
Authors:
Jay J. Yoo,
Khashayar Namdar,
Sean Carey,
Sandra E. Fischer,
Chris McIntosh,
Farzad Khalvati,
Patrik Rogalla
Abstract:
Objectives: To develop and evaluate a radiomics machine learning model for detecting liver fibrosis on CT of the liver.
Methods: For this retrospective, single-centre study, radiomic features were extracted from Regions of Interest (ROIs) on CT images of patients who underwent simultaneous liver biopsy and CT examinations. Combinations of contrast, normalization, machine learning model, and feat…
▽ More
Objectives: To develop and evaluate a radiomics machine learning model for detecting liver fibrosis on CT of the liver.
Methods: For this retrospective, single-centre study, radiomic features were extracted from Regions of Interest (ROIs) on CT images of patients who underwent simultaneous liver biopsy and CT examinations. Combinations of contrast, normalization, machine learning model, and feature selection method were determined based on their mean test Area Under the Receiver Operating Characteristic curve (AUC) on randomly placed ROIs. The combination and selected features with the highest AUC were used to develop a final liver fibrosis screening model.
Results: The study included 101 male and 68 female patients (mean age = 51.2 years $\pm$ 14.7 [SD]). When averaging the AUC across all combinations, non-contrast enhanced (NC) CT (AUC, 0.6100; 95% CI: 0.5897, 0.6303) outperformed contrast-enhanced CT (AUC, 0.5680; 95% CI: 0.5471, 0.5890). The combination of hyperparameters and features that yielded the highest AUC was a logistic regression model with inputs features of maximum, energy, kurtosis, skewness, and small area high gray level emphasis extracted from non-contrast enhanced NC CT normalized using Gamma correction with $γ$ = 1.5 (AUC, 0.7833; 95% CI: 0.7821, 0.7845), (sensitivity, 0.9091; 95% CI: 0.9091, 0.9091).
Conclusions: Radiomics-based machine learning models allow for the detection of liver fibrosis with reasonable accuracy and high sensitivity on NC CT. Thus, these models can be used to non-invasively screen for liver fibrosis, contributing to earlier detection of the disease at a potentially curable stage.
△ Less
Submitted 26 February, 2024; v1 submitted 25 November, 2022;
originally announced November 2022.
-
Automating Cobb Angle Measurement for Adolescent Idiopathic Scoliosis using Instance Segmentation
Authors:
Chaojun Chen,
Khashayar Namdar,
Yujie Wu,
Shahob Hosseinpour,
Manohar Shroff,
Andrea S. Doria,
Farzad Khalvati
Abstract:
Scoliosis is a three-dimensional deformity of the spine, most often diagnosed in childhood. It affects 2-3% of the population, which is approximately seven million people in North America. Currently, the reference standard for assessing scoliosis is based on the manual assignment of Cobb angles at the site of the curvature center. This manual process is time consuming and unreliable as it is affec…
▽ More
Scoliosis is a three-dimensional deformity of the spine, most often diagnosed in childhood. It affects 2-3% of the population, which is approximately seven million people in North America. Currently, the reference standard for assessing scoliosis is based on the manual assignment of Cobb angles at the site of the curvature center. This manual process is time consuming and unreliable as it is affected by inter- and intra-observer variance. To overcome these inaccuracies, machine learning (ML) methods can be used to automate the Cobb angle measurement process. This paper proposes to address the Cobb angle measurement task using YOLACT, an instance segmentation model. The proposed method first segments the vertebrae in an X-Ray image using YOLACT, then it tracks the important landmarks using the minimum bounding box approach. Lastly, the extracted landmarks are used to calculate the corresponding Cobb angles. The model achieved a Symmetric Mean Absolute Percentage Error (SMAPE) score of 10.76%, demonstrating the reliability of this process in both vertebra localization and Cobb angle measurement.
△ Less
Submitted 25 November, 2022;
originally announced November 2022.
-
Unbounded composition operators on Orlicz spaces
Authors:
M. Namdar Baboli,
Y. Estaremi
Abstract:
In this paper we deal with unbounded composition operators defined in Orlicz spaces. Indeed, we provide some necessary and sufficient condition for densely definedness of composition operators on Orlicz spaces. Also, we will investigate the adjoint of densely defined composition operators and we give some equivalent conditions for it to be densely defined. In addition, we show that densely defined…
▽ More
In this paper we deal with unbounded composition operators defined in Orlicz spaces. Indeed, we provide some necessary and sufficient condition for densely definedness of composition operators on Orlicz spaces. Also, we will investigate the adjoint of densely defined composition operators and we give some equivalent conditions for it to be densely defined. In addition, we show that densely defined composition operator is continuous if and only if it is everywhere defined. Finally, we characterize densely defined continuous composition operators.
△ Less
Submitted 11 November, 2022;
originally announced November 2022.
-
Generative Adversarial Networks for Weakly Supervised Generation and Evaluation of Brain Tumor Segmentations on MR Images
Authors:
Jay J. Yoo,
Khashayar Namdar,
Matthias W. Wagner,
Liana Nobre,
Uri Tabori,
Cynthia Hawkins,
Birgit B. Ertl-Wagner,
Farzad Khalvati
Abstract:
Segmentation of regions of interest (ROIs) for identifying abnormalities is a leading problem in medical imaging. Using machine learning for this problem generally requires manually annotated ground-truth segmentations, demanding extensive time and resources from radiologists. This work presents a weakly supervised approach that utilizes binary image-level labels, which are much simpler to acquire…
▽ More
Segmentation of regions of interest (ROIs) for identifying abnormalities is a leading problem in medical imaging. Using machine learning for this problem generally requires manually annotated ground-truth segmentations, demanding extensive time and resources from radiologists. This work presents a weakly supervised approach that utilizes binary image-level labels, which are much simpler to acquire, to effectively segment anomalies in 2D magnetic resonance images without ground truth annotations. We train a generative adversarial network (GAN) that converts cancerous images to healthy variants, which are used along with localization seeds as priors to generate improved weakly supervised segmentations. The non-cancerous variants can also be used to evaluate the segmentations in a weakly supervised fashion, which allows for the most effective segmentations to be identified and then applied to downstream clinical classification tasks. On the Multimodal Brain Tumor Segmentation (BraTS) 2020 dataset, our proposed method generates and identifies segmentations that achieve test Dice coefficients of 83.91%. Using these segmentations for pathology classification results with a test AUC of 93.32% which is comparable to the test AUC of 95.80% achieved when using true segmentations.
△ Less
Submitted 15 August, 2024; v1 submitted 9 November, 2022;
originally announced November 2022.
-
Improving Deep Learning Models for Pediatric Low-Grade Glioma Tumors Molecular Subtype Identification Using 3D Probability Distributions of Tumor Location
Authors:
Khashayar Namdar,
Matthias W. Wagner,
Kareem Kudus,
Cynthia Hawkins,
Uri Tabori,
Brigit Ertl-Wagner,
Farzad Khalvati
Abstract:
Background and Purpose: Pediatric low-grade glioma (pLGG) is the most common type of brain tumor in children, and identification of molecular markers for pLGG is crucial for successful treatment planning. Convolutional Neural Network (CNN) models for pLGG subtype identification rely on tumor segmentation. We hypothesize tumor segmentations are suboptimal and thus, we propose to augment the CNN mod…
▽ More
Background and Purpose: Pediatric low-grade glioma (pLGG) is the most common type of brain tumor in children, and identification of molecular markers for pLGG is crucial for successful treatment planning. Convolutional Neural Network (CNN) models for pLGG subtype identification rely on tumor segmentation. We hypothesize tumor segmentations are suboptimal and thus, we propose to augment the CNN models using tumor location probability in MRI data.
Materials and Methods: Our REB-approved retrospective study included MRI Fluid-Attenuated Inversion Recovery (FLAIR) sequences of 143 BRAF fused and 71 BRAF V600E mutated tumors. Tumor segmentations (regions of interest (ROIs)) were provided by a pediatric neuroradiology fellow and verified by a senior pediatric neuroradiologist. In each experiment, we randomly split the data into development and test with an 80/20 ratio. We combined the 3D binary ROI masks for each class in the development dataset to derive the probability density functions (PDF) of tumor location, and developed three pipelines: location-based, CNN-based, and hybrid.
Results: We repeated the experiment with different model initializations and data splits 100 times and calculated the Area Under Receiver Operating Characteristic Curve (AUC). The location-based classifier achieved an AUC of 77.90, 95% confidence interval (CI) (76.76, 79.03). CNN-based classifiers achieved AUC of 86.11, CI (84.96, 87.25), while the tumor-location-guided CNNs outperformed the formers with an average AUC of 88.64 CI (87.57, 89.72), which was statistically significant (Student's t-test p-value 0.0018).
Conclusion: We achieved statistically significant improvements by incorporating tumor location into the CNN models. Our results suggest that manually segmented ROIs may not be optimal.
△ Less
Submitted 24 October, 2023; v1 submitted 13 October, 2022;
originally announced October 2022.
-
Deep Superpixel Generation and Clustering for Weakly Supervised Segmentation of Brain Tumors in MR Images
Authors:
Jay J. Yoo,
Khashayar Namdar,
Farzad Khalvati
Abstract:
Training machine learning models to segment tumors and other anomalies in medical images is an important step for developing diagnostic tools but generally requires manually annotated ground truth segmentations, which necessitates significant time and resources. This work proposes the use of a superpixel generation model and a superpixel clustering model to enable weakly supervised brain tumor seg…
▽ More
Training machine learning models to segment tumors and other anomalies in medical images is an important step for developing diagnostic tools but generally requires manually annotated ground truth segmentations, which necessitates significant time and resources. This work proposes the use of a superpixel generation model and a superpixel clustering model to enable weakly supervised brain tumor segmentations. The proposed method utilizes binary image-level classification labels, which are readily accessible, to significantly improve the initial region of interest segmentations generated by standard weakly supervised methods without requiring ground truth annotations. We used 2D slices of magnetic resonance brain scans from the Multimodal Brain Tumor Segmentation Challenge 2020 dataset and labels indicating the presence of tumors to train the pipeline. On the test cohort, our method achieved a mean Dice coefficient of 0.691 and a mean 95% Hausdorff distance of 18.1, outperforming existing superpixel-based weakly supervised segmentation methods.
△ Less
Submitted 22 January, 2024; v1 submitted 20 September, 2022;
originally announced September 2022.
-
Minimizing the Effect of Noise and Limited Dataset Size in Image Classification Using Depth Estimation as an Auxiliary Task with Deep Multitask Learning
Authors:
Khashayar Namdar,
Partoo Vafaeikia,
Farzad Khalvati
Abstract:
Generalizability is the ultimate goal of Machine Learning (ML) image classifiers, for which noise and limited dataset size are among the major concerns. We tackle these challenges through utilizing the framework of deep Multitask Learning (dMTL) and incorporating image depth estimation as an auxiliary task. On a customized and depth-augmented derivation of the MNIST dataset, we show a) multitask l…
▽ More
Generalizability is the ultimate goal of Machine Learning (ML) image classifiers, for which noise and limited dataset size are among the major concerns. We tackle these challenges through utilizing the framework of deep Multitask Learning (dMTL) and incorporating image depth estimation as an auxiliary task. On a customized and depth-augmented derivation of the MNIST dataset, we show a) multitask loss functions are the most effective approach of implementing dMTL, b) limited dataset size primarily contributes to classification inaccuracy, and c) depth estimation is mostly impacted by noise. In order to further validate the results, we manually labeled the NYU Depth V2 dataset for scene classification tasks. As a contribution to the field, we have made the data in python native format publicly available as an open-source dataset and provided the scene labels. Our experiments on MNIST and NYU-Depth-V2 show dMTL improves generalizability of the classifiers when the dataset is noisy and the number of examples is limited.
△ Less
Submitted 22 August, 2022;
originally announced August 2022.
-
Using Multi-modal Data for Improving Generalizability and Explainability of Disease Classification in Radiology
Authors:
Pranav Agnihotri,
Sara Ketabi,
Khashayar,
Namdar,
Farzad Khalvati
Abstract:
Traditional datasets for the radiological diagnosis tend to only provide the radiology image alongside the radiology report. However, radiology reading as performed by radiologists is a complex process, and information such as the radiologist's eye-fixations over the course of the reading has the potential to be an invaluable data source to learn from. Nonetheless, the collection of such data is e…
▽ More
Traditional datasets for the radiological diagnosis tend to only provide the radiology image alongside the radiology report. However, radiology reading as performed by radiologists is a complex process, and information such as the radiologist's eye-fixations over the course of the reading has the potential to be an invaluable data source to learn from. Nonetheless, the collection of such data is expensive and time-consuming. This leads to the question of whether such data is worth the investment to collect. This paper utilizes the recently published Eye-Gaze dataset to perform an exhaustive study on the impact on performance and explainability of deep learning (DL) classification in the face of varying levels of input features, namely: radiology images, radiology report text, and radiologist eye-gaze data. We find that the best classification performance of X-ray images is achieved with a combination of radiology report free-text and radiology image, with the eye-gaze data providing no performance boost. Nonetheless, eye-gaze data serving as secondary ground truth alongside the class label results in highly explainable models that generate better attention maps compared to models trained to do classification and attention map generation without eye-gaze data.
△ Less
Submitted 29 July, 2022;
originally announced July 2022.
-
Open-radiomics: A Collection of Standardized Datasets and a Technical Protocol for Reproducible Radiomics Machine Learning Pipelines
Authors:
Khashayar Namdar,
Matthias W. Wagner,
Birgit B. Ertl-Wagner,
Farzad Khalvati
Abstract:
Background: As an important branch of machine learning pipelines in medical imaging, radiomics faces two major challenges namely reproducibility and accessibility. In this work, we introduce open-radiomics, a set of radiomics datasets along with a comprehensive radiomics pipeline based on our proposed technical protocol to improve the reproducibility of the results. Methods: We curated large-scale…
▽ More
Background: As an important branch of machine learning pipelines in medical imaging, radiomics faces two major challenges namely reproducibility and accessibility. In this work, we introduce open-radiomics, a set of radiomics datasets along with a comprehensive radiomics pipeline based on our proposed technical protocol to improve the reproducibility of the results. Methods: We curated large-scale radiomics datasets based on three open-source datasets; BraTS 2020 for high-grade glioma (HGG) versus low-grade glioma (LGG) classification and survival analysis, BraTS 2023 for O6-methylguanine-DNA methyltransferase classification, and non-small cell lung cancer survival analysis from the Cancer Imaging Archive. Using BraTS 2020 Magnetic Resonance Imaging (MRI) dataset, we applied our protocol to 369 brain tumor patients (76 LGG, 293 HGG). Leveraging PyRadiomics for LGG vs. HGG classification, we generated 288 datasets from 4 MRI sequences, 3 binWidths, 6 normalization methods, and 4 tumor subregions. Random Forest classifiers were trained and validated (60%,20%,20%) across 100 different data splits (28,800 test results), evaluating Area Under the Receiver Operating Characteristic Curve (AUROC). Results: Unlike binWidth and image normalization, tumor subregion and imaging sequence significantly affected performance of the models. T1 contrast-enhanced sequence and the union of Necrotic and the non-enhancing tumor core subregions resulted in the highest AUROCs (average test AUROC 0.951, 95% confidence interval of (0.949, 0.952)). Although several settings and data splits (28 out of 28800) yielded test AUROC of 1, they were irreproducible. Conclusion: Our experiments demonstrate the sources of variability in radiomics pipelines (e.g., tumor subregion) can have a significant impact on the results, which may lead to superficial perfect performances that are irreproducible.
△ Less
Submitted 28 February, 2025; v1 submitted 29 July, 2022;
originally announced July 2022.
-
Improving Disease Classification Performance and Explainability of Deep Learning Models in Radiology with Heatmap Generators
Authors:
Akino Watanabe,
Sara Ketabi,
Khashayar,
Namdar,
Farzad Khalvati
Abstract:
As deep learning is widely used in the radiology field, the explainability of such models is increasingly becoming essential to gain clinicians' trust when using the models for diagnosis. In this research, three experiment sets were conducted with a U-Net architecture to improve the classification performance while enhancing the heatmaps corresponding to the model's focus through incorporating hea…
▽ More
As deep learning is widely used in the radiology field, the explainability of such models is increasingly becoming essential to gain clinicians' trust when using the models for diagnosis. In this research, three experiment sets were conducted with a U-Net architecture to improve the classification performance while enhancing the heatmaps corresponding to the model's focus through incorporating heatmap generators during training. All of the experiments used the dataset that contained chest radiographs, associated labels from one of the three conditions ("normal", "congestive heart failure (CHF)", and "pneumonia"), and numerical information regarding a radiologist's eye-gaze coordinates on the images. The paper (A. Karargyris and Moradi, 2021) that introduced this dataset developed a U-Net model, which was treated as the baseline model for this research, to show how the eye-gaze data can be used in multi-modal training for explainability improvement. To compare the classification performances, the 95% confidence intervals (CI) of the area under the receiver operating characteristic curve (AUC) were measured. The best method achieved an AUC of 0.913 (CI: 0.860-0.966). The greatest improvements were for the "pneumonia" and "CHF" classes, which the baseline model struggled most to classify, resulting in AUCs of 0.859 (CI: 0.732-0.957) and 0.962 (CI: 0.933-0.989), respectively. The proposed method's decoder was also able to produce probability masks that highlight the determining image parts in model classifications, similarly as the radiologist's eye-gaze data. Hence, this work showed that incorporating heatmap generators and eye-gaze information into training can simultaneously improve disease classification and provide explainable visuals that align well with how the radiologist viewed the chest radiographs when making diagnosis.
△ Less
Submitted 28 June, 2022;
originally announced July 2022.
-
Experimental Higher-Order Interference in a Nonlinear Triple Slit
Authors:
Peter Namdar,
Philipp K. Jenke,
Irati Alonso Calafell,
Alessandro Trenti,
Milan Radonjić,
Borivoje Dakić,
Philip Walther,
Lee A. Rozema
Abstract:
Interference between two waves is a well-known concept in physics, and its generalization to more than two waves is straight-forward. The order of interference is defined as the number of paths that interfere in a manner that cannot be reduced to patterns of a lower order. In practice, second-order interference means that in, say, a triple-slit experiment, the interference pattern when all three s…
▽ More
Interference between two waves is a well-known concept in physics, and its generalization to more than two waves is straight-forward. The order of interference is defined as the number of paths that interfere in a manner that cannot be reduced to patterns of a lower order. In practice, second-order interference means that in, say, a triple-slit experiment, the interference pattern when all three slits are open can be predicted from the interference patterns between all possible pairs of slits. Quantum mechanics is often said to only exhibit second-order interference. However, this is only true under specific assumptions, typically single-particles undergoing linear evolution. Here we experimentally show that nonlinear evolution can in fact lead to higher-order interference. The higher-order interference in our experiment has a simple quantum mechanical description; namely, optical coherent states interacting in a nonlinear medium. Our work shows that nonlinear evolution could open a loophole for experiments attempting to verify Born's rule by ruling out higher-order interference.
△ Less
Submitted 13 December, 2021;
originally announced December 2021.
-
VideoClick: Video Object Segmentation with a Single Click
Authors:
Namdar Homayounfar,
Justin Liang,
Wei-Chiu Ma,
Raquel Urtasun
Abstract:
Annotating videos with object segmentation masks typically involves a two stage procedure of drawing polygons per object instance for all the frames and then linking them through time. While simple, this is a very tedious, time consuming and expensive process, making the creation of accurate annotations at scale only possible for well-funded labs. What if we were able to segment an object in the f…
▽ More
Annotating videos with object segmentation masks typically involves a two stage procedure of drawing polygons per object instance for all the frames and then linking them through time. While simple, this is a very tedious, time consuming and expensive process, making the creation of accurate annotations at scale only possible for well-funded labs. What if we were able to segment an object in the full video with only a single click? This will enable video segmentation at scale with a very low budget opening the door to many applications. Towards this goal, in this paper we propose a bottom up approach where given a single click for each object in a video, we obtain the segmentation masks of these objects in the full video. In particular, we construct a correlation volume that assigns each pixel in a target frame to either one of the objects in the reference frame or the background. We then refine this correlation volume via a recurrent attention module and decode the final segmentation. To evaluate the performance, we label the popular and challenging Cityscapes dataset with video object segmentations. Results on this new CityscapesVideo dataset show that our approach outperforms all the baselines in this challenging setting.
△ Less
Submitted 16 January, 2021;
originally announced January 2021.
-
DAGMapper: Learning to Map by Discovering Lane Topology
Authors:
Namdar Homayounfar,
Wei-Chiu Ma,
Justin Liang,
Xinyu Wu,
Jack Fan,
Raquel Urtasun
Abstract:
One of the fundamental challenges to scale self-driving is being able to create accurate high definition maps (HD maps) with low cost. Current attempts to automate this process typically focus on simple scenarios, estimate independent maps per frame or do not have the level of precision required by modern self driving vehicles. In contrast, in this paper we focus on drawing the lane boundaries of…
▽ More
One of the fundamental challenges to scale self-driving is being able to create accurate high definition maps (HD maps) with low cost. Current attempts to automate this process typically focus on simple scenarios, estimate independent maps per frame or do not have the level of precision required by modern self driving vehicles. In contrast, in this paper we focus on drawing the lane boundaries of complex highways with many lanes that contain topology changes due to forks and merges. Towards this goal, we formulate the problem as inference in a directed acyclic graphical model (DAG), where the nodes of the graph encode geometric and topological properties of the local regions of the lane boundaries. Since we do not know a priori the topology of the lanes, we also infer the DAG topology (i.e., nodes and edges) for each region. We demonstrate the effectiveness of our approach on two major North American Highways in two different states and show high precision and recall as well as 89% correct topology.
△ Less
Submitted 22 December, 2020;
originally announced December 2020.
-
Hierarchical Recurrent Attention Networks for Structured Online Maps
Authors:
Namdar Homayounfar,
Wei-Chiu Ma,
Shrinidhi Kowshika Lakshmikanth,
Raquel Urtasun
Abstract:
In this paper, we tackle the problem of online road network extraction from sparse 3D point clouds. Our method is inspired by how an annotator builds a lane graph, by first identifying how many lanes there are and then drawing each one in turn. We develop a hierarchical recurrent network that attends to initial regions of a lane boundary and traces them out completely by outputting a structured po…
▽ More
In this paper, we tackle the problem of online road network extraction from sparse 3D point clouds. Our method is inspired by how an annotator builds a lane graph, by first identifying how many lanes there are and then drawing each one in turn. We develop a hierarchical recurrent network that attends to initial regions of a lane boundary and traces them out completely by outputting a structured polyline. We also propose a novel differentiable loss function that measures the deviation of the edges of the ground truth polylines and their predictions. This is more suitable than distances on vertices, as there exists many ways to draw equivalent polylines. We demonstrate the effectiveness of our method on a 90 km stretch of highway, and show that we can recover the right topology 92\% of the time.
△ Less
Submitted 22 December, 2020;
originally announced December 2020.
-
Convolutional Recurrent Network for Road Boundary Extraction
Authors:
Justin Liang,
Namdar Homayounfar,
Wei-Chiu Ma,
Shenlong Wang,
Raquel Urtasun
Abstract:
Creating high definition maps that contain precise information of static elements of the scene is of utmost importance for enabling self driving cars to drive safely. In this paper, we tackle the problem of drivable road boundary extraction from LiDAR and camera imagery. Towards this goal, we design a structured model where a fully convolutional network obtains deep features encoding the location…
▽ More
Creating high definition maps that contain precise information of static elements of the scene is of utmost importance for enabling self driving cars to drive safely. In this paper, we tackle the problem of drivable road boundary extraction from LiDAR and camera imagery. Towards this goal, we design a structured model where a fully convolutional network obtains deep features encoding the location and direction of road boundaries and then, a convolutional recurrent network outputs a polyline representation for each one of them. Importantly, our method is fully automatic and does not require a user in the loop. We showcase the effectiveness of our method on a large North American city where we obtain perfect topology of road boundaries 99.3% of the time at a high precision and recall.
△ Less
Submitted 21 December, 2020;
originally announced December 2020.
-
A Transfer Learning Based Active Learning Framework for Brain Tumor Classification
Authors:
Ruqian Hao,
Khashayar Namdar,
Lin Liu,
Farzad Khalvati
Abstract:
Brain tumor is one of the leading causes of cancer-related death globally among children and adults. Precise classification of brain tumor grade (low-grade and high-grade glioma) at early stage plays a key role in successful prognosis and treatment planning. With recent advances in deep learning, Artificial Intelligence-enabled brain tumor grading systems can assist radiologists in the interpretat…
▽ More
Brain tumor is one of the leading causes of cancer-related death globally among children and adults. Precise classification of brain tumor grade (low-grade and high-grade glioma) at early stage plays a key role in successful prognosis and treatment planning. With recent advances in deep learning, Artificial Intelligence-enabled brain tumor grading systems can assist radiologists in the interpretation of medical images within seconds. The performance of deep learning techniques is, however, highly depended on the size of the annotated dataset. It is extremely challenging to label a large quantity of medical images given the complexity and volume of medical data. In this work, we propose a novel transfer learning based active learning framework to reduce the annotation cost while maintaining stability and robustness of the model performance for brain tumor classification. We employed a 2D slice-based approach to train and finetune our model on the Magnetic Resonance Imaging (MRI) training dataset of 203 patients and a validation dataset of 66 patients which was used as the baseline. With our proposed method, the model achieved Area Under Receiver Operating Characteristic (ROC) Curve (AUC) of 82.89% on a separate test dataset of 66 patients, which was 2.92% higher than the baseline AUC while saving at least 40% of labeling cost. In order to further examine the robustness of our method, we created a balanced dataset, which underwent the same procedure. The model achieved AUC of 82% compared with AUC of 78.48% for the baseline, which reassures the robustness and stability of our proposed transfer learning augmented with active learning framework while significantly reducing the size of training data.
△ Less
Submitted 16 November, 2020;
originally announced November 2020.
-
LevelSet R-CNN: A Deep Variational Method for Instance Segmentation
Authors:
Namdar Homayounfar,
Yuwen Xiong,
Justin Liang,
Wei-Chiu Ma,
Raquel Urtasun
Abstract:
Obtaining precise instance segmentation masks is of high importance in many modern applications such as robotic manipulation and autonomous driving. Currently, many state of the art models are based on the Mask R-CNN framework which, while very powerful, outputs masks at low resolutions which could result in imprecise boundaries. On the other hand, classic variational methods for segmentation impo…
▽ More
Obtaining precise instance segmentation masks is of high importance in many modern applications such as robotic manipulation and autonomous driving. Currently, many state of the art models are based on the Mask R-CNN framework which, while very powerful, outputs masks at low resolutions which could result in imprecise boundaries. On the other hand, classic variational methods for segmentation impose desirable global and local data and geometry constraints on the masks by optimizing an energy functional. While mathematically elegant, their direct dependence on good initialization, non-robust image cues and manual setting of hyperparameters renders them unsuitable for modern applications. We propose LevelSet R-CNN, which combines the best of both worlds by obtaining powerful feature representations that are combined in an end-to-end manner with a variational segmentation framework. We demonstrate the effectiveness of our approach on COCO and Cityscapes datasets.
△ Less
Submitted 30 July, 2020;
originally announced July 2020.
-
A Brief Review of Deep Multi-task Learning and Auxiliary Task Learning
Authors:
Partoo Vafaeikia,
Khashayar Namdar,
Farzad Khalvati
Abstract:
Multi-task learning (MTL) optimizes several learning tasks simultaneously and leverages their shared information to improve generalization and the prediction of the model for each task. Auxiliary tasks can be added to the main task to ultimately boost the performance. In this paper, we provide a brief review on the recent deep multi-task learning (dMTL) approaches followed by methods on selecting…
▽ More
Multi-task learning (MTL) optimizes several learning tasks simultaneously and leverages their shared information to improve generalization and the prediction of the model for each task. Auxiliary tasks can be added to the main task to ultimately boost the performance. In this paper, we provide a brief review on the recent deep multi-task learning (dMTL) approaches followed by methods on selecting useful auxiliary tasks that can be used in dMTL to improve the performance of the model for the main task.
△ Less
Submitted 2 July, 2020;
originally announced July 2020.
-
A Modified AUC for Training Convolutional Neural Networks: Taking Confidence into Account
Authors:
Khashayar Namdar,
Masoom A. Haider,
Farzad Khalvati
Abstract:
Receiver operating characteristic (ROC) curve is an informative tool in binary classification and Area Under ROC Curve (AUC) is a popular metric for reporting performance of binary classifiers. In this paper, first we present a comprehensive review of ROC curve and AUC metric. Next, we propose a modified version of AUC that takes confidence of the model into account and at the same time, incorpora…
▽ More
Receiver operating characteristic (ROC) curve is an informative tool in binary classification and Area Under ROC Curve (AUC) is a popular metric for reporting performance of binary classifiers. In this paper, first we present a comprehensive review of ROC curve and AUC metric. Next, we propose a modified version of AUC that takes confidence of the model into account and at the same time, incorporates AUC into Binary Cross Entropy (BCE) loss used for training a Convolutional neural Network for classification tasks. We demonstrate this on three datasets: MNIST, prostate MRI, and brain MRI. Furthermore, we have published GenuineAI, a new python library, which provides the functions for conventional AUC and the proposed modified AUC along with metrics including sensitivity, specificity, recall, precision, and F1 for each point of the ROC curve.
△ Less
Submitted 12 September, 2021; v1 submitted 8 June, 2020;
originally announced June 2020.
-
A Comprehensive Study of Data Augmentation Strategies for Prostate Cancer Detection in Diffusion-weighted MRI using Convolutional Neural Networks
Authors:
Ruqian Hao,
Khashayar Namdar,
Lin Liu,
Masoom A. Haider,
Farzad Khalvati
Abstract:
Data augmentation refers to a group of techniques whose goal is to battle limited amount of available data to improve model generalization and push sample distribution toward the true distribution. While different augmentation strategies and their combinations have been investigated for various computer vision tasks in the context of deep learning, a specific work in the domain of medical imaging…
▽ More
Data augmentation refers to a group of techniques whose goal is to battle limited amount of available data to improve model generalization and push sample distribution toward the true distribution. While different augmentation strategies and their combinations have been investigated for various computer vision tasks in the context of deep learning, a specific work in the domain of medical imaging is rare and to the best of our knowledge, there has been no dedicated work on exploring the effects of various augmentation methods on the performance of deep learning models in prostate cancer detection. In this work, we have statically applied five most frequently used augmentation techniques (random rotation, horizontal flip, vertical flip, random crop, and translation) to prostate Diffusion-weighted Magnetic Resonance Imaging training dataset of 217 patients separately and evaluated the effect of each method on the accuracy of prostate cancer detection. The augmentation algorithms were applied independently to each data channel and a shallow as well as a deep Convolutional Neural Network (CNN) were trained on the five augmented sets separately. We used Area Under Receiver Operating Characteristic (ROC) curve (AUC) to evaluate the performance of the trained CNNs on a separate test set of 95 patients, using a validation set of 102 patients for finetuning. The shallow network outperformed the deep network with the best 2D slice-based AUC of 0.85 obtained by the rotation method.
△ Less
Submitted 1 June, 2020;
originally announced June 2020.
-
PolyTransform: Deep Polygon Transformer for Instance Segmentation
Authors:
Justin Liang,
Namdar Homayounfar,
Wei-Chiu Ma,
Yuwen Xiong,
Rui Hu,
Raquel Urtasun
Abstract:
In this paper, we propose PolyTransform, a novel instance segmentation algorithm that produces precise, geometry-preserving masks by combining the strengths of prevailing segmentation approaches and modern polygon-based methods. In particular, we first exploit a segmentation network to generate instance masks. We then convert the masks into a set of polygons that are then fed to a deforming networ…
▽ More
In this paper, we propose PolyTransform, a novel instance segmentation algorithm that produces precise, geometry-preserving masks by combining the strengths of prevailing segmentation approaches and modern polygon-based methods. In particular, we first exploit a segmentation network to generate instance masks. We then convert the masks into a set of polygons that are then fed to a deforming network that transforms the polygons such that they better fit the object boundaries. Our experiments on the challenging Cityscapes dataset show that our PolyTransform significantly improves the performance of the backbone instance segmentation network and ranks 1st on the Cityscapes test-set leaderboard. We also show impressive gains in the interactive annotation setting. We release the code at https://github.com/uber-research/PolyTransform.
△ Less
Submitted 16 January, 2021; v1 submitted 5 December, 2019;
originally announced December 2019.
-
Evolution-based Fine-tuning of CNNs for Prostate Cancer Detection
Authors:
Khashayar Namdar,
Isha Gujrathi,
Masoom A. Haider,
Farzad Khalvati
Abstract:
Convolutional Neural Networks (CNNs) have been used for automated detection of prostate cancer where Area Under Receiver Operating Characteristic (ROC) curve (AUC) is usually used as the performance metric. Given that AUC is not differentiable, common practice is to train the CNN using a loss functions based on other performance metrics such as cross entropy and monitoring AUC to select the best m…
▽ More
Convolutional Neural Networks (CNNs) have been used for automated detection of prostate cancer where Area Under Receiver Operating Characteristic (ROC) curve (AUC) is usually used as the performance metric. Given that AUC is not differentiable, common practice is to train the CNN using a loss functions based on other performance metrics such as cross entropy and monitoring AUC to select the best model. In this work, we propose to fine-tune a trained CNN for prostate cancer detection using a Genetic Algorithm to achieve a higher AUC. Our dataset contained 6-channel Diffusion-Weighted MRI slices of prostate. On a cohort of 2,955 training, 1,417 validation, and 1,334 test slices, we reached test AUC of 0.773; a 9.3% improvement compared to the base CNN model.
△ Less
Submitted 4 November, 2019;
originally announced November 2019.
-
Exploiting Sparse Semantic HD Maps for Self-Driving Vehicle Localization
Authors:
Wei-Chiu Ma,
Ignacio Tartavull,
Ioan Andrei Bârsan,
Shenlong Wang,
Min Bai,
Gellert Mattyus,
Namdar Homayounfar,
Shrinidhi Kowshika Lakshmikanth,
Andrei Pokrovsky,
Raquel Urtasun
Abstract:
In this paper we propose a novel semantic localization algorithm that exploits multiple sensors and has precision on the order of a few centimeters. Our approach does not require detailed knowledge about the appearance of the world, and our maps require orders of magnitude less storage than maps utilized by traditional geometry- and LiDAR intensity-based localizers. This is important as self-drivi…
▽ More
In this paper we propose a novel semantic localization algorithm that exploits multiple sensors and has precision on the order of a few centimeters. Our approach does not require detailed knowledge about the appearance of the world, and our maps require orders of magnitude less storage than maps utilized by traditional geometry- and LiDAR intensity-based localizers. This is important as self-driving cars need to operate in large environments. Towards this goal, we formulate the problem in a Bayesian filtering framework, and exploit lanes, traffic signs, as well as vehicle dynamics to localize robustly with respect to a sparse semantic map. We validate the effectiveness of our method on a new highway dataset consisting of 312km of roads. Our experiments show that the proposed approach is able to achieve 0.05m lateral accuracy and 1.12m longitudinal accuracy on average while taking up only 0.3% of the storage required by previous LiDAR intensity-based approaches.
△ Less
Submitted 8 August, 2019;
originally announced August 2019.
-
Deep Multi-Sensor Lane Detection
Authors:
Min Bai,
Gellert Mattyus,
Namdar Homayounfar,
Shenlong Wang,
Shrinidhi Kowshika Lakshmikanth,
Raquel Urtasun
Abstract:
Reliable and accurate lane detection has been a long-standing problem in the field of autonomous driving. In recent years, many approaches have been developed that use images (or videos) as input and reason in image space. In this paper we argue that accurate image estimates do not translate to precise 3D lane boundaries, which are the input required by modern motion planning algorithms. To addres…
▽ More
Reliable and accurate lane detection has been a long-standing problem in the field of autonomous driving. In recent years, many approaches have been developed that use images (or videos) as input and reason in image space. In this paper we argue that accurate image estimates do not translate to precise 3D lane boundaries, which are the input required by modern motion planning algorithms. To address this issue, we propose a novel deep neural network that takes advantage of both LiDAR and camera sensors and produces very accurate estimates directly in 3D space. We demonstrate the performance of our approach on both highways and in cities, and show very accurate estimates in complex scenarios such as heavy traffic (which produces occlusion), fork, merges and intersections.
△ Less
Submitted 4 May, 2019;
originally announced May 2019.
-
Soccer Field Localization from a Single Image
Authors:
Namdar Homayounfar,
Sanja Fidler,
Raquel Urtasun
Abstract:
In this work, we propose a novel way of efficiently localizing a soccer field from a single broadcast image of the game. Related work in this area relies on manually annotating a few key frames and extending the localization to similar images, or installing fixed specialized cameras in the stadium from which the layout of the field can be obtained. In contrast, we formulate this problem as a branc…
▽ More
In this work, we propose a novel way of efficiently localizing a soccer field from a single broadcast image of the game. Related work in this area relies on manually annotating a few key frames and extending the localization to similar images, or installing fixed specialized cameras in the stadium from which the layout of the field can be obtained. In contrast, we formulate this problem as a branch and bound inference in a Markov random field where an energy function is defined in terms of field cues such as grass, lines and circles. Moreover, our approach is fully automatic and depends only on single images from the broadcast video of the game. We demonstrate the effectiveness of our method by applying it to various games and obtain promising results. Finally, we posit that our approach can be applied easily to other sports such as hockey and basketball.
△ Less
Submitted 10 April, 2016;
originally announced April 2016.
-
Localized modes in defective multilayer structures
Authors:
S. Roshan Entezar,
A. Namdar
Abstract:
In this paper, the localized surface modes in a defective multilayer structure has been investigated. It is shown that the defective multilayer structures can support two different kind of localized modes depending on the position and the thickness of the defect layer. One of these modes is localized at the interface between the multilayer structure and a homogeneous medium (the so-called surfac…
▽ More
In this paper, the localized surface modes in a defective multilayer structure has been investigated. It is shown that the defective multilayer structures can support two different kind of localized modes depending on the position and the thickness of the defect layer. One of these modes is localized at the interface between the multilayer structure and a homogeneous medium (the so-called surface mode) and the other one is localized at the defect layer (defect localized mode). We reveal that the presence of defect layer pushes the dispersion curve of surface modes to the lower or the upper edge of the photonic bandgap depending on the homogeneous medium is a left-handed or right-handed medium (e.g. vacuum), respectively. So, the existence region of the surface modes restricted. Moreover, the effect of defect on the energy flow velocity of the surface modes is discussed.
△ Less
Submitted 21 January, 2009;
originally announced January 2009.
-
The Dirichlet Casimir effect for $φ^4$ theory in (3+1) dimensions: A new renormalization approach
Authors:
Reza Moazzemi,
Maryam Namdar,
Siamak S. Gousheh
Abstract:
We calculate the next to the leading order Casimir effect for a real scalar field, within $φ^4$ theory, confined between two parallel plates in three spatial dimensions with the Dirichlet boundary condition. In this paper we introduce a systematic perturbation expansion in which the counterterms automatically turn out to be consistent with the boundary conditions. This will inevitably lead to no…
▽ More
We calculate the next to the leading order Casimir effect for a real scalar field, within $φ^4$ theory, confined between two parallel plates in three spatial dimensions with the Dirichlet boundary condition. In this paper we introduce a systematic perturbation expansion in which the counterterms automatically turn out to be consistent with the boundary conditions. This will inevitably lead to nontrivial position dependence for physical quantities, as a manifestation of the breaking of the translational invariance. This is in contrast to the usual usage of the counterterms in problems with nontrivial boundary conditions, which are either completely derived from the free cases or at most supplemented with the addition of counterterms only at the boundaries. Our results for the massive and massless cases are different from those reported elsewhere. Secondly, and probably less importantly, we use a supplementary renormalization procedure, which makes the usage of any analytic continuation techniques unnecessary.
△ Less
Submitted 30 August, 2007;
originally announced August 2007.
-
Tamm states in one dimensional photonic crystals containing left-handed materials
Authors:
Abdolrahman Namdar
Abstract:
We present a theoretical study of electromagnetic surface waves localized at an interface separating a conventional uniform medium and a semi-infinite 1-D photonic crystal made of alternate left-handed metamaterial and right-handed material which we refer to as left-handed photonic crystal. We find novel type of surface mode's structure, the so-called surface Tamm states and demonstrate that the…
▽ More
We present a theoretical study of electromagnetic surface waves localized at an interface separating a conventional uniform medium and a semi-infinite 1-D photonic crystal made of alternate left-handed metamaterial and right-handed material which we refer to as left-handed photonic crystal. We find novel type of surface mode's structure, the so-called surface Tamm states and demonstrate that the presence of metamaterial in the photonic crystal structure allows for a flexible control of the dispersion properties of surface states, and can support the Tamm states with a backward energy flow and a vortex-like structure.
△ Less
Submitted 18 July, 2006;
originally announced July 2006.
-
Backward Tamm states in left-handed metamaterials
Authors:
Abdolrahman Namdar,
Ilya V. Shadrivov,
Yuri S. Kivshar
Abstract:
We study the electromagnetic surface waves localized at an interface separating a one-dimensional photonic crystal and left-handed metamaterial, the so-called surface Tamm states. We demonstrate that the metamaterial allows for a flexible control of the dispersion properties of surface states, and can support the Tamm states with a backward energy flow and a vortex-like structure.
We study the electromagnetic surface waves localized at an interface separating a one-dimensional photonic crystal and left-handed metamaterial, the so-called surface Tamm states. We demonstrate that the metamaterial allows for a flexible control of the dispersion properties of surface states, and can support the Tamm states with a backward energy flow and a vortex-like structure.
△ Less
Submitted 22 May, 2006;
originally announced May 2006.