Search | arXiv e-print repository

arXiv:2506.20786 [pdf]

AI-Driven MRI-based Brain Tumour Segmentation Benchmarking

Authors: Connor Ludwig, Khashayar Namdar, Farzad Khalvati

Abstract: Medical image segmentation has greatly aided medical diagnosis, with U-Net based architectures and nnU-Net providing state-of-the-art performance. There have been numerous general promptable models and medical variations introduced in recent years, but there is currently a lack of evaluation and comparison of these models across a variety of prompt qualities on a common medical dataset. This resea… ▽ More Medical image segmentation has greatly aided medical diagnosis, with U-Net based architectures and nnU-Net providing state-of-the-art performance. There have been numerous general promptable models and medical variations introduced in recent years, but there is currently a lack of evaluation and comparison of these models across a variety of prompt qualities on a common medical dataset. This research uses Segment Anything Model (SAM), Segment Anything Model 2 (SAM 2), MedSAM, SAM-Med-3D, and nnU-Net to obtain zero-shot inference on the BraTS 2023 adult glioma and pediatrics dataset across multiple prompt qualities for both points and bounding boxes. Several of these models exhibit promising Dice scores, particularly SAM and SAM 2 achieving scores of up to 0.894 and 0.893, respectively when given extremely accurate bounding box prompts which exceeds nnU-Net's segmentation performance. However, nnU-Net remains the dominant medical image segmentation network due to the impracticality of providing highly accurate prompts to the models. The model and prompt evaluation, as well as the comparison, are extended through fine-tuning SAM, SAM 2, MedSAM, and SAM-Med-3D on the pediatrics dataset. The improvements in point prompt performance after fine-tuning are substantial and show promise for future investigation, but are unable to achieve better segmentation than bounding boxes or nnU-Net. △ Less

Submitted 25 June, 2025; originally announced June 2025.

arXiv:2505.00467 [pdf, ps, other]

Red Teaming Large Language Models for Healthcare

Authors: Vahid Balazadeh, Michael Cooper, David Pellow, Atousa Assadi, Jennifer Bell, Mark Coastworth, Kaivalya Deshpande, Jim Fackler, Gabriel Funingana, Spencer Gable-Cook, Anirudh Gangadhar, Abhishek Jaiswal, Sumanth Kaja, Christopher Khoury, Amrit Krishnan, Randy Lin, Kaden McKeen, Sara Naimimohasses, Khashayar Namdar, Aviraj Newatia, Allan Pang, Anshul Pattoo, Sameer Peesapati, Diana Prepelita, Bogdana Rakova , et al. (10 additional authors not shown)

Abstract: We present the design process and findings of the pre-conference workshop at the Machine Learning for Healthcare Conference (2024) entitled Red Teaming Large Language Models for Healthcare, which took place on August 15, 2024. Conference participants, comprising a mix of computational and clinical expertise, attempted to discover vulnerabilities -- realistic clinical prompts for which a large lang… ▽ More We present the design process and findings of the pre-conference workshop at the Machine Learning for Healthcare Conference (2024) entitled Red Teaming Large Language Models for Healthcare, which took place on August 15, 2024. Conference participants, comprising a mix of computational and clinical expertise, attempted to discover vulnerabilities -- realistic clinical prompts for which a large language model (LLM) outputs a response that could cause clinical harm. Red-teaming with clinicians enables the identification of LLM vulnerabilities that may not be recognised by LLM developers lacking clinical expertise. We report the vulnerabilities found, categorise them, and present the results of a replication study assessing the vulnerabilities across all LLMs provided. △ Less

Submitted 1 May, 2025; originally announced May 2025.

arXiv:2411.03894 [pdf, other]

The effects of fibre spatial distribution and relative orientation on the percolation and mechanics of stochastic fibre networks: A model of peptide hydrogels

Authors: Amir Hossein Namdar, Nastaran Zoghi, Aline Miller, Alberto Saiani, Tom Shearer

Abstract: The structures of fibre networks can vary greatly due to fibre interactions during formation. We have modified the steps of generating Mikado networks to create two new model classes by altering the spatial distribution and relative orientation of their fibres to mimic the structures of self-assembling peptide hydrogels (SAPHs), whose physical properties depend strongly on their fibres' interactio… ▽ More The structures of fibre networks can vary greatly due to fibre interactions during formation. We have modified the steps of generating Mikado networks to create two new model classes by altering the spatial distribution and relative orientation of their fibres to mimic the structures of self-assembling peptide hydrogels (SAPHs), whose physical properties depend strongly on their fibres' interactions. The results of our models and experiments on a set of beta-sheet forming SAPHs show that modifying a network's structure affects the percolation threshold and the mechanical behaviour of the material, both near percolation and at higher densities. △ Less

Submitted 24 March, 2025; v1 submitted 6 November, 2024; originally announced November 2024.

arXiv:2405.06760 [pdf]

Opportunities for Persian Digital Humanities Research with Artificial Intelligence Language Models; Case Study: Forough Farrokhzad

Authors: Arash Rasti Meymandi, Zahra Hosseini, Sina Davari, Abolfazl Moshiri, Shabnam Rahimi-Golkhandan, Khashayar Namdar, Nikta Feizi, Mohamad Tavakoli-Targhi, Farzad Khalvati

Abstract: This study explores the integration of advanced Natural Language Processing (NLP) and Artificial Intelligence (AI) techniques to analyze and interpret Persian literature, focusing on the poetry of Forough Farrokhzad. Utilizing computational methods, we aim to unveil thematic, stylistic, and linguistic patterns in Persian poetry. Specifically, the study employs AI models including transformer-based… ▽ More This study explores the integration of advanced Natural Language Processing (NLP) and Artificial Intelligence (AI) techniques to analyze and interpret Persian literature, focusing on the poetry of Forough Farrokhzad. Utilizing computational methods, we aim to unveil thematic, stylistic, and linguistic patterns in Persian poetry. Specifically, the study employs AI models including transformer-based language models for clustering of the poems in an unsupervised framework. This research underscores the potential of AI in enhancing our understanding of Persian literary heritage, with Forough Farrokhzad's work providing a comprehensive case study. This approach not only contributes to the field of Persian Digital Humanities but also sets a precedent for future research in Persian literary studies using computational techniques. △ Less

Submitted 10 May, 2024; originally announced May 2024.

arXiv:2402.03547 [pdf]

Improving Pediatric Low-Grade Neuroepithelial Tumors Molecular Subtype Identification Using a Novel AUROC Loss Function for Convolutional Neural Networks

Authors: Khashayar Namdar, Matthias W. Wagner, Cynthia Hawkins, Uri Tabori, Birgit B. Ertl-Wagner, Farzad Khalvati

Abstract: Pediatric Low-Grade Neuroepithelial Tumors (PLGNT) are the most common pediatric cancer type, accounting for 40% of brain tumors in children, and identifying PLGNT molecular subtype is crucial for treatment planning. However, the gold standard to determine the PLGNT subtype is biopsy, which can be impractical or dangerous for patients. This research improves the performance of Convolutional Neural… ▽ More Pediatric Low-Grade Neuroepithelial Tumors (PLGNT) are the most common pediatric cancer type, accounting for 40% of brain tumors in children, and identifying PLGNT molecular subtype is crucial for treatment planning. However, the gold standard to determine the PLGNT subtype is biopsy, which can be impractical or dangerous for patients. This research improves the performance of Convolutional Neural Networks (CNNs) in classifying PLGNT subtypes through MRI scans by introducing a loss function that specifically improves the model's Area Under the Receiver Operating Characteristic (ROC) Curve (AUROC), offering a non-invasive diagnostic alternative. In this study, a retrospective dataset of 339 children with PLGNT (143 BRAF fusion, 71 with BRAF V600E mutation, and 125 non-BRAF) was curated. We employed a CNN model with Monte Carlo random data splitting. The baseline model was trained using binary cross entropy (BCE), and achieved an AUROC of 86.11% for differentiating BRAF fusion and BRAF V600E mutations, which was improved to 87.71% using our proposed AUROC loss function (p-value 0.045). With multiclass classification, the AUROC improved from 74.42% to 76. 59% (p-value 0.0016). △ Less

Submitted 5 February, 2024; originally announced February 2024.

arXiv:2310.17214 [pdf, other]

Towards a phase-field based model for combustion in particle beds: Reactive fluid flow

Authors: Reza Namdar, Mohammad Norouzi, Fathollah Varnik

Abstract: The present study provide a systematic derivation of a phase-field version of the momentum, mass and heat transport equations, while accounting for chemical reactions in the fluid phase. To achieve this goal, the volume averaging technique is used to reformulate the conservation equations in the presence of multiple phases and their respective diffuse interfaces. A careful analysis, and neglecting… ▽ More The present study provide a systematic derivation of a phase-field version of the momentum, mass and heat transport equations, while accounting for chemical reactions in the fluid phase. To achieve this goal, the volume averaging technique is used to reformulate the conservation equations in the presence of multiple phases and their respective diffuse interfaces. A careful analysis, and neglecting terms of the second order in fluctuations, reveals that the structure of the multiphase/diffuse interface version of the conservation equations is very similar to the original single phase/sharp interface formulation. The multiphase character of the problem reflects itself in a coupling term, which acts at the interface between two adjacent phases. The model is then applied to the special case of a reactive fluid in contact with an inert solid. Two coupling parameters are then introduced, which control the exchange of momentum and heat at the interface. For a numerical study of the thus obtained set of coupled partial differential equations, a hybrid lattice Boltzmann-finite difference-phase field (LB-FD-PF) framework is proposed and implemented in the open source software OpenPhase. The model is then thoroughly validated via numerical simulations, including isothermal flow, non-reactive non-isothermal flow, and reactive flows. To perform the simulations involving chemical reactions, OpenPhase is coupled to the open-source chemical kinetics software CANTERA, which delivers details of the chemical reaction mechanisms and the necessary thermodynamic and transport properties of the reacting chemical species. These simulations show that, once the coupling parameters are adequately tuned, the present approach yields excellent agreement with the sought for sharp interface method. △ Less

Submitted 26 October, 2023; originally announced October 2023.

Comments: 39 pages, 19 figures

MSC Class: 0000; 1111

arXiv:2307.02960 [pdf, other]

doi 10.1080/10407790.2024.2379006

Parametric 3D Convolutional Autoencoder for the Prediction of Flow Fields in a Bed Configuration of Hot Particles

Authors: Ali Mjalled, Reza Namdar, Lucas Reineking, Mohammad Norouzi, Fathollah Varnik, Martin Mönnigmann

Abstract: The use of deep learning methods for modeling fluid flow has drawn a lot of attention in the past few years. In situations where conventional numerical approaches can be computationally expensive, these techniques have shown promise in offering accurate, rapid, and practical solutions for modeling complex fluid flow problems. The success of deep learning is often due to its ability to extract hidd… ▽ More The use of deep learning methods for modeling fluid flow has drawn a lot of attention in the past few years. In situations where conventional numerical approaches can be computationally expensive, these techniques have shown promise in offering accurate, rapid, and practical solutions for modeling complex fluid flow problems. The success of deep learning is often due to its ability to extract hidden patterns and features from the data, enabling the creation of data-driven reduced models that can capture the underlying physics of the domain. We present a data-driven reduced model for predicting flow fields in a bed configuration of hot particles. The reduced model consists of a parametric 3D convolutional autoencoder. The first part resolves the spatial and temporal dependencies present in the input sequence, while the second part of the architecture is responsible for predicting the solution at the subsequent timestep based on the information gathered from the preceding part. We also propose the utilization of a post-processing non-trainable output layer following the decoding path to incorporate the physical knowledge, e.g., no-slip condition, into the prediction. The evaluation of the reduced model for a bed configuration with variable particle temperature showed accurate results at a fraction of the computational cost required by traditional numerical simulation methods. △ Less

Submitted 12 February, 2024; v1 submitted 6 July, 2023; originally announced July 2023.

arXiv:2306.11405 [pdf, other]

Modeling gas flows in packed beds with the lattice Boltzmann method: validation against experiments

Authors: Tanya Neeraj, Christin Velten, Gabor Janiga, Katharina Zähringer, Reza Namdar, Fathollah Varnik, Dominique Thévenin, Seyed Ali Hosseini

Abstract: This study aims to validate the lattice Boltzmann method and assess its ability to accurately describe the behavior of gaseous flows in packed beds. To that end, simulations of a model packed bed reactor, corresponding to an experimental bench, are conducted, and the results are directly compared with experimental data obtained by Particle Image Velocimetry measurements. It is found that the latti… ▽ More This study aims to validate the lattice Boltzmann method and assess its ability to accurately describe the behavior of gaseous flows in packed beds. To that end, simulations of a model packed bed reactor, corresponding to an experimental bench, are conducted, and the results are directly compared with experimental data obtained by Particle Image Velocimetry measurements. It is found that the lattice Boltzmann solver exhibits very good agreement with experimental measurements. Then, the numerical solver is further used to analyze the effect of the number of packing layers on the flow structure and to determine the minimum bed height above which the changes in flow structure become insignificant. Finally, flow fluctuations in time are discussed. The findings of this study provide valuable insights into the behavior of the gas flow in packed bed reactors, opening the door for further investigations involving additionally chemical reactions, as found in many practical applications. △ Less

Submitted 20 June, 2023; originally announced June 2023.

arXiv:2211.14396 [pdf]

Non-invasive Liver Fibrosis Screening on CT Images using Radiomics

Authors: Jay J. Yoo, Khashayar Namdar, Sean Carey, Sandra E. Fischer, Chris McIntosh, Farzad Khalvati, Patrik Rogalla

Abstract: Objectives: To develop and evaluate a radiomics machine learning model for detecting liver fibrosis on CT of the liver. Methods: For this retrospective, single-centre study, radiomic features were extracted from Regions of Interest (ROIs) on CT images of patients who underwent simultaneous liver biopsy and CT examinations. Combinations of contrast, normalization, machine learning model, and feat… ▽ More Objectives: To develop and evaluate a radiomics machine learning model for detecting liver fibrosis on CT of the liver. Methods: For this retrospective, single-centre study, radiomic features were extracted from Regions of Interest (ROIs) on CT images of patients who underwent simultaneous liver biopsy and CT examinations. Combinations of contrast, normalization, machine learning model, and feature selection method were determined based on their mean test Area Under the Receiver Operating Characteristic curve (AUC) on randomly placed ROIs. The combination and selected features with the highest AUC were used to develop a final liver fibrosis screening model. Results: The study included 101 male and 68 female patients (mean age = 51.2 years $\pm$ 14.7 [SD]). When averaging the AUC across all combinations, non-contrast enhanced (NC) CT (AUC, 0.6100; 95% CI: 0.5897, 0.6303) outperformed contrast-enhanced CT (AUC, 0.5680; 95% CI: 0.5471, 0.5890). The combination of hyperparameters and features that yielded the highest AUC was a logistic regression model with inputs features of maximum, energy, kurtosis, skewness, and small area high gray level emphasis extracted from non-contrast enhanced NC CT normalized using Gamma correction with $γ$ = 1.5 (AUC, 0.7833; 95% CI: 0.7821, 0.7845), (sensitivity, 0.9091; 95% CI: 0.9091, 0.9091). Conclusions: Radiomics-based machine learning models allow for the detection of liver fibrosis with reasonable accuracy and high sensitivity on NC CT. Thus, these models can be used to non-invasively screen for liver fibrosis, contributing to earlier detection of the disease at a potentially curable stage. △ Less

Submitted 26 February, 2024; v1 submitted 25 November, 2022; originally announced November 2022.

arXiv:2211.14122 [pdf]

Automating Cobb Angle Measurement for Adolescent Idiopathic Scoliosis using Instance Segmentation

Authors: Chaojun Chen, Khashayar Namdar, Yujie Wu, Shahob Hosseinpour, Manohar Shroff, Andrea S. Doria, Farzad Khalvati

Abstract: Scoliosis is a three-dimensional deformity of the spine, most often diagnosed in childhood. It affects 2-3% of the population, which is approximately seven million people in North America. Currently, the reference standard for assessing scoliosis is based on the manual assignment of Cobb angles at the site of the curvature center. This manual process is time consuming and unreliable as it is affec… ▽ More Scoliosis is a three-dimensional deformity of the spine, most often diagnosed in childhood. It affects 2-3% of the population, which is approximately seven million people in North America. Currently, the reference standard for assessing scoliosis is based on the manual assignment of Cobb angles at the site of the curvature center. This manual process is time consuming and unreliable as it is affected by inter- and intra-observer variance. To overcome these inaccuracies, machine learning (ML) methods can be used to automate the Cobb angle measurement process. This paper proposes to address the Cobb angle measurement task using YOLACT, an instance segmentation model. The proposed method first segments the vertebrae in an X-Ray image using YOLACT, then it tracks the important landmarks using the minimum bounding box approach. Lastly, the extracted landmarks are used to calculate the corresponding Cobb angles. The model achieved a Symmetric Mean Absolute Percentage Error (SMAPE) score of 10.76%, demonstrating the reliability of this process in both vertebra localization and Cobb angle measurement. △ Less

Submitted 25 November, 2022; originally announced November 2022.

arXiv:2211.07640 [pdf, ps, other]

Unbounded composition operators on Orlicz spaces

Authors: M. Namdar Baboli, Y. Estaremi

Abstract: In this paper we deal with unbounded composition operators defined in Orlicz spaces. Indeed, we provide some necessary and sufficient condition for densely definedness of composition operators on Orlicz spaces. Also, we will investigate the adjoint of densely defined composition operators and we give some equivalent conditions for it to be densely defined. In addition, we show that densely defined… ▽ More In this paper we deal with unbounded composition operators defined in Orlicz spaces. Indeed, we provide some necessary and sufficient condition for densely definedness of composition operators on Orlicz spaces. Also, we will investigate the adjoint of densely defined composition operators and we give some equivalent conditions for it to be densely defined. In addition, we show that densely defined composition operator is continuous if and only if it is everywhere defined. Finally, we characterize densely defined continuous composition operators. △ Less

Submitted 11 November, 2022; originally announced November 2022.

Comments: 11 pages, comments are welcome

arXiv:2211.05269 [pdf, other]

Generative Adversarial Networks for Weakly Supervised Generation and Evaluation of Brain Tumor Segmentations on MR Images

Authors: Jay J. Yoo, Khashayar Namdar, Matthias W. Wagner, Liana Nobre, Uri Tabori, Cynthia Hawkins, Birgit B. Ertl-Wagner, Farzad Khalvati

Abstract: Segmentation of regions of interest (ROIs) for identifying abnormalities is a leading problem in medical imaging. Using machine learning for this problem generally requires manually annotated ground-truth segmentations, demanding extensive time and resources from radiologists. This work presents a weakly supervised approach that utilizes binary image-level labels, which are much simpler to acquire… ▽ More Segmentation of regions of interest (ROIs) for identifying abnormalities is a leading problem in medical imaging. Using machine learning for this problem generally requires manually annotated ground-truth segmentations, demanding extensive time and resources from radiologists. This work presents a weakly supervised approach that utilizes binary image-level labels, which are much simpler to acquire, to effectively segment anomalies in 2D magnetic resonance images without ground truth annotations. We train a generative adversarial network (GAN) that converts cancerous images to healthy variants, which are used along with localization seeds as priors to generate improved weakly supervised segmentations. The non-cancerous variants can also be used to evaluate the segmentations in a weakly supervised fashion, which allows for the most effective segmentations to be identified and then applied to downstream clinical classification tasks. On the Multimodal Brain Tumor Segmentation (BraTS) 2020 dataset, our proposed method generates and identifies segmentations that achieve test Dice coefficients of 83.91%. Using these segmentations for pathology classification results with a test AUC of 93.32% which is comparable to the test AUC of 95.80% achieved when using true segmentations. △ Less

Submitted 15 August, 2024; v1 submitted 9 November, 2022; originally announced November 2022.

arXiv:2210.07287 [pdf]

Improving Deep Learning Models for Pediatric Low-Grade Glioma Tumors Molecular Subtype Identification Using 3D Probability Distributions of Tumor Location

Authors: Khashayar Namdar, Matthias W. Wagner, Kareem Kudus, Cynthia Hawkins, Uri Tabori, Brigit Ertl-Wagner, Farzad Khalvati

Abstract: Background and Purpose: Pediatric low-grade glioma (pLGG) is the most common type of brain tumor in children, and identification of molecular markers for pLGG is crucial for successful treatment planning. Convolutional Neural Network (CNN) models for pLGG subtype identification rely on tumor segmentation. We hypothesize tumor segmentations are suboptimal and thus, we propose to augment the CNN mod… ▽ More Background and Purpose: Pediatric low-grade glioma (pLGG) is the most common type of brain tumor in children, and identification of molecular markers for pLGG is crucial for successful treatment planning. Convolutional Neural Network (CNN) models for pLGG subtype identification rely on tumor segmentation. We hypothesize tumor segmentations are suboptimal and thus, we propose to augment the CNN models using tumor location probability in MRI data. Materials and Methods: Our REB-approved retrospective study included MRI Fluid-Attenuated Inversion Recovery (FLAIR) sequences of 143 BRAF fused and 71 BRAF V600E mutated tumors. Tumor segmentations (regions of interest (ROIs)) were provided by a pediatric neuroradiology fellow and verified by a senior pediatric neuroradiologist. In each experiment, we randomly split the data into development and test with an 80/20 ratio. We combined the 3D binary ROI masks for each class in the development dataset to derive the probability density functions (PDF) of tumor location, and developed three pipelines: location-based, CNN-based, and hybrid. Results: We repeated the experiment with different model initializations and data splits 100 times and calculated the Area Under Receiver Operating Characteristic Curve (AUC). The location-based classifier achieved an AUC of 77.90, 95% confidence interval (CI) (76.76, 79.03). CNN-based classifiers achieved AUC of 86.11, CI (84.96, 87.25), while the tumor-location-guided CNNs outperformed the formers with an average AUC of 88.64 CI (87.57, 89.72), which was statistically significant (Student's t-test p-value 0.0018). Conclusion: We achieved statistically significant improvements by incorporating tumor location into the CNN models. Our results suggest that manually segmented ROIs may not be optimal. △ Less

Submitted 24 October, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

Comments: arXiv admin note: text overlap with arXiv:2207.14776

arXiv:2209.09930 [pdf, other]

Deep Superpixel Generation and Clustering for Weakly Supervised Segmentation of Brain Tumors in MR Images

Authors: Jay J. Yoo, Khashayar Namdar, Farzad Khalvati

Abstract: Training machine learning models to segment tumors and other anomalies in medical images is an important step for developing diagnostic tools but generally requires manually annotated ground truth segmentations, which necessitates significant time and resources. This work proposes the use of a superpixel generation model and a superpixel clustering model to enable weakly supervised brain tumor seg… ▽ More Training machine learning models to segment tumors and other anomalies in medical images is an important step for developing diagnostic tools but generally requires manually annotated ground truth segmentations, which necessitates significant time and resources. This work proposes the use of a superpixel generation model and a superpixel clustering model to enable weakly supervised brain tumor segmentations. The proposed method utilizes binary image-level classification labels, which are readily accessible, to significantly improve the initial region of interest segmentations generated by standard weakly supervised methods without requiring ground truth annotations. We used 2D slices of magnetic resonance brain scans from the Multimodal Brain Tumor Segmentation Challenge 2020 dataset and labels indicating the presence of tumors to train the pipeline. On the test cohort, our method achieved a mean Dice coefficient of 0.691 and a mean 95% Hausdorff distance of 18.1, outperforming existing superpixel-based weakly supervised segmentation methods. △ Less

Submitted 22 January, 2024; v1 submitted 20 September, 2022; originally announced September 2022.

Comments: 12 pages, LaTeX; updated methodology, added additional results, revised discussion

arXiv:2208.10390 [pdf, other]

Minimizing the Effect of Noise and Limited Dataset Size in Image Classification Using Depth Estimation as an Auxiliary Task with Deep Multitask Learning

Authors: Khashayar Namdar, Partoo Vafaeikia, Farzad Khalvati

Abstract: Generalizability is the ultimate goal of Machine Learning (ML) image classifiers, for which noise and limited dataset size are among the major concerns. We tackle these challenges through utilizing the framework of deep Multitask Learning (dMTL) and incorporating image depth estimation as an auxiliary task. On a customized and depth-augmented derivation of the MNIST dataset, we show a) multitask l… ▽ More Generalizability is the ultimate goal of Machine Learning (ML) image classifiers, for which noise and limited dataset size are among the major concerns. We tackle these challenges through utilizing the framework of deep Multitask Learning (dMTL) and incorporating image depth estimation as an auxiliary task. On a customized and depth-augmented derivation of the MNIST dataset, we show a) multitask loss functions are the most effective approach of implementing dMTL, b) limited dataset size primarily contributes to classification inaccuracy, and c) depth estimation is mostly impacted by noise. In order to further validate the results, we manually labeled the NYU Depth V2 dataset for scene classification tasks. As a contribution to the field, we have made the data in python native format publicly available as an open-source dataset and provided the scene labels. Our experiments on MNIST and NYU-Depth-V2 show dMTL improves generalizability of the classifiers when the dataset is noisy and the number of examples is limited. △ Less

Submitted 22 August, 2022; originally announced August 2022.

arXiv:2207.14781 [pdf]

Using Multi-modal Data for Improving Generalizability and Explainability of Disease Classification in Radiology

Authors: Pranav Agnihotri, Sara Ketabi, Khashayar, Namdar, Farzad Khalvati

Abstract: Traditional datasets for the radiological diagnosis tend to only provide the radiology image alongside the radiology report. However, radiology reading as performed by radiologists is a complex process, and information such as the radiologist's eye-fixations over the course of the reading has the potential to be an invaluable data source to learn from. Nonetheless, the collection of such data is e… ▽ More Traditional datasets for the radiological diagnosis tend to only provide the radiology image alongside the radiology report. However, radiology reading as performed by radiologists is a complex process, and information such as the radiologist's eye-fixations over the course of the reading has the potential to be an invaluable data source to learn from. Nonetheless, the collection of such data is expensive and time-consuming. This leads to the question of whether such data is worth the investment to collect. This paper utilizes the recently published Eye-Gaze dataset to perform an exhaustive study on the impact on performance and explainability of deep learning (DL) classification in the face of varying levels of input features, namely: radiology images, radiology report text, and radiologist eye-gaze data. We find that the best classification performance of X-ray images is achieved with a combination of radiology report free-text and radiology image, with the eye-gaze data providing no performance boost. Nonetheless, eye-gaze data serving as secondary ground truth alongside the class label results in highly explainable models that generate better attention maps compared to models trained to do classification and attention map generation without eye-gaze data. △ Less

Submitted 29 July, 2022; originally announced July 2022.

arXiv:2207.14776 [pdf]

Open-radiomics: A Collection of Standardized Datasets and a Technical Protocol for Reproducible Radiomics Machine Learning Pipelines

Authors: Khashayar Namdar, Matthias W. Wagner, Birgit B. Ertl-Wagner, Farzad Khalvati

Abstract: Background: As an important branch of machine learning pipelines in medical imaging, radiomics faces two major challenges namely reproducibility and accessibility. In this work, we introduce open-radiomics, a set of radiomics datasets along with a comprehensive radiomics pipeline based on our proposed technical protocol to improve the reproducibility of the results. Methods: We curated large-scale… ▽ More Background: As an important branch of machine learning pipelines in medical imaging, radiomics faces two major challenges namely reproducibility and accessibility. In this work, we introduce open-radiomics, a set of radiomics datasets along with a comprehensive radiomics pipeline based on our proposed technical protocol to improve the reproducibility of the results. Methods: We curated large-scale radiomics datasets based on three open-source datasets; BraTS 2020 for high-grade glioma (HGG) versus low-grade glioma (LGG) classification and survival analysis, BraTS 2023 for O6-methylguanine-DNA methyltransferase classification, and non-small cell lung cancer survival analysis from the Cancer Imaging Archive. Using BraTS 2020 Magnetic Resonance Imaging (MRI) dataset, we applied our protocol to 369 brain tumor patients (76 LGG, 293 HGG). Leveraging PyRadiomics for LGG vs. HGG classification, we generated 288 datasets from 4 MRI sequences, 3 binWidths, 6 normalization methods, and 4 tumor subregions. Random Forest classifiers were trained and validated (60%,20%,20%) across 100 different data splits (28,800 test results), evaluating Area Under the Receiver Operating Characteristic Curve (AUROC). Results: Unlike binWidth and image normalization, tumor subregion and imaging sequence significantly affected performance of the models. T1 contrast-enhanced sequence and the union of Necrotic and the non-enhancing tumor core subregions resulted in the highest AUROCs (average test AUROC 0.951, 95% confidence interval of (0.949, 0.952)). Although several settings and data splits (28 out of 28800) yielded test AUROC of 1, they were irreproducible. Conclusion: Our experiments demonstrate the sources of variability in radiomics pipelines (e.g., tumor subregion) can have a significant impact on the results, which may lead to superficial perfect performances that are irreproducible. △ Less

Submitted 28 February, 2025; v1 submitted 29 July, 2022; originally announced July 2022.

arXiv:2207.00157 [pdf]

Improving Disease Classification Performance and Explainability of Deep Learning Models in Radiology with Heatmap Generators

Authors: Akino Watanabe, Sara Ketabi, Khashayar, Namdar, Farzad Khalvati

Abstract: As deep learning is widely used in the radiology field, the explainability of such models is increasingly becoming essential to gain clinicians' trust when using the models for diagnosis. In this research, three experiment sets were conducted with a U-Net architecture to improve the classification performance while enhancing the heatmaps corresponding to the model's focus through incorporating hea… ▽ More As deep learning is widely used in the radiology field, the explainability of such models is increasingly becoming essential to gain clinicians' trust when using the models for diagnosis. In this research, three experiment sets were conducted with a U-Net architecture to improve the classification performance while enhancing the heatmaps corresponding to the model's focus through incorporating heatmap generators during training. All of the experiments used the dataset that contained chest radiographs, associated labels from one of the three conditions ("normal", "congestive heart failure (CHF)", and "pneumonia"), and numerical information regarding a radiologist's eye-gaze coordinates on the images. The paper (A. Karargyris and Moradi, 2021) that introduced this dataset developed a U-Net model, which was treated as the baseline model for this research, to show how the eye-gaze data can be used in multi-modal training for explainability improvement. To compare the classification performances, the 95% confidence intervals (CI) of the area under the receiver operating characteristic curve (AUC) were measured. The best method achieved an AUC of 0.913 (CI: 0.860-0.966). The greatest improvements were for the "pneumonia" and "CHF" classes, which the baseline model struggled most to classify, resulting in AUCs of 0.859 (CI: 0.732-0.957) and 0.962 (CI: 0.933-0.989), respectively. The proposed method's decoder was also able to produce probability masks that highlight the determining image parts in model classifications, similarly as the radiologist's eye-gaze data. Hence, this work showed that incorporating heatmap generators and eye-gaze information into training can simultaneously improve disease classification and provide explainable visuals that align well with how the radiologist viewed the chest radiographs when making diagnosis. △ Less

Submitted 28 June, 2022; originally announced July 2022.

arXiv:2112.06965 [pdf, other]

Experimental Higher-Order Interference in a Nonlinear Triple Slit

Authors: Peter Namdar, Philipp K. Jenke, Irati Alonso Calafell, Alessandro Trenti, Milan Radonjić, Borivoje Dakić, Philip Walther, Lee A. Rozema

Abstract: Interference between two waves is a well-known concept in physics, and its generalization to more than two waves is straight-forward. The order of interference is defined as the number of paths that interfere in a manner that cannot be reduced to patterns of a lower order. In practice, second-order interference means that in, say, a triple-slit experiment, the interference pattern when all three s… ▽ More Interference between two waves is a well-known concept in physics, and its generalization to more than two waves is straight-forward. The order of interference is defined as the number of paths that interfere in a manner that cannot be reduced to patterns of a lower order. In practice, second-order interference means that in, say, a triple-slit experiment, the interference pattern when all three slits are open can be predicted from the interference patterns between all possible pairs of slits. Quantum mechanics is often said to only exhibit second-order interference. However, this is only true under specific assumptions, typically single-particles undergoing linear evolution. Here we experimentally show that nonlinear evolution can in fact lead to higher-order interference. The higher-order interference in our experiment has a simple quantum mechanical description; namely, optical coherent states interacting in a nonlinear medium. Our work shows that nonlinear evolution could open a loophole for experiments attempting to verify Born's rule by ruling out higher-order interference. △ Less

Submitted 13 December, 2021; originally announced December 2021.

Comments: 5 pages, 3 figures, plus an appendix

arXiv:2101.06545 [pdf, other]

VideoClick: Video Object Segmentation with a Single Click

Authors: Namdar Homayounfar, Justin Liang, Wei-Chiu Ma, Raquel Urtasun

Abstract: Annotating videos with object segmentation masks typically involves a two stage procedure of drawing polygons per object instance for all the frames and then linking them through time. While simple, this is a very tedious, time consuming and expensive process, making the creation of accurate annotations at scale only possible for well-funded labs. What if we were able to segment an object in the f… ▽ More Annotating videos with object segmentation masks typically involves a two stage procedure of drawing polygons per object instance for all the frames and then linking them through time. While simple, this is a very tedious, time consuming and expensive process, making the creation of accurate annotations at scale only possible for well-funded labs. What if we were able to segment an object in the full video with only a single click? This will enable video segmentation at scale with a very low budget opening the door to many applications. Towards this goal, in this paper we propose a bottom up approach where given a single click for each object in a video, we obtain the segmentation masks of these objects in the full video. In particular, we construct a correlation volume that assigns each pixel in a target frame to either one of the objects in the reference frame or the background. We then refine this correlation volume via a recurrent attention module and decode the final segmentation. To evaluate the performance, we label the popular and challenging Cityscapes dataset with video object segmentations. Results on this new CityscapesVideo dataset show that our approach outperforms all the baselines in this challenging setting. △ Less

Submitted 16 January, 2021; originally announced January 2021.

arXiv:2012.12377 [pdf, other]

DAGMapper: Learning to Map by Discovering Lane Topology

Authors: Namdar Homayounfar, Wei-Chiu Ma, Justin Liang, Xinyu Wu, Jack Fan, Raquel Urtasun

Abstract: One of the fundamental challenges to scale self-driving is being able to create accurate high definition maps (HD maps) with low cost. Current attempts to automate this process typically focus on simple scenarios, estimate independent maps per frame or do not have the level of precision required by modern self driving vehicles. In contrast, in this paper we focus on drawing the lane boundaries of… ▽ More One of the fundamental challenges to scale self-driving is being able to create accurate high definition maps (HD maps) with low cost. Current attempts to automate this process typically focus on simple scenarios, estimate independent maps per frame or do not have the level of precision required by modern self driving vehicles. In contrast, in this paper we focus on drawing the lane boundaries of complex highways with many lanes that contain topology changes due to forks and merges. Towards this goal, we formulate the problem as inference in a directed acyclic graphical model (DAG), where the nodes of the graph encode geometric and topological properties of the local regions of the lane boundaries. Since we do not know a priori the topology of the lanes, we also infer the DAG topology (i.e., nodes and edges) for each region. We demonstrate the effectiveness of our approach on two major North American Highways in two different states and show high precision and recall as well as 89% correct topology. △ Less

Submitted 22 December, 2020; originally announced December 2020.

Comments: Published at ICCV 2019

arXiv:2012.12314 [pdf, other]

Hierarchical Recurrent Attention Networks for Structured Online Maps

Authors: Namdar Homayounfar, Wei-Chiu Ma, Shrinidhi Kowshika Lakshmikanth, Raquel Urtasun

Abstract: In this paper, we tackle the problem of online road network extraction from sparse 3D point clouds. Our method is inspired by how an annotator builds a lane graph, by first identifying how many lanes there are and then drawing each one in turn. We develop a hierarchical recurrent network that attends to initial regions of a lane boundary and traces them out completely by outputting a structured po… ▽ More In this paper, we tackle the problem of online road network extraction from sparse 3D point clouds. Our method is inspired by how an annotator builds a lane graph, by first identifying how many lanes there are and then drawing each one in turn. We develop a hierarchical recurrent network that attends to initial regions of a lane boundary and traces them out completely by outputting a structured polyline. We also propose a novel differentiable loss function that measures the deviation of the edges of the ground truth polylines and their predictions. This is more suitable than distances on vertices, as there exists many ways to draw equivalent polylines. We demonstrate the effectiveness of our method on a 90 km stretch of highway, and show that we can recover the right topology 92\% of the time. △ Less

Submitted 22 December, 2020; originally announced December 2020.

Comments: Published at CVPR 2018

arXiv:2012.12160 [pdf, other]

Convolutional Recurrent Network for Road Boundary Extraction

Authors: Justin Liang, Namdar Homayounfar, Wei-Chiu Ma, Shenlong Wang, Raquel Urtasun

Abstract: Creating high definition maps that contain precise information of static elements of the scene is of utmost importance for enabling self driving cars to drive safely. In this paper, we tackle the problem of drivable road boundary extraction from LiDAR and camera imagery. Towards this goal, we design a structured model where a fully convolutional network obtains deep features encoding the location… ▽ More Creating high definition maps that contain precise information of static elements of the scene is of utmost importance for enabling self driving cars to drive safely. In this paper, we tackle the problem of drivable road boundary extraction from LiDAR and camera imagery. Towards this goal, we design a structured model where a fully convolutional network obtains deep features encoding the location and direction of road boundaries and then, a convolutional recurrent network outputs a polyline representation for each one of them. Importantly, our method is fully automatic and does not require a user in the loop. We showcase the effectiveness of our method on a large North American city where we obtain perfect topology of road boundaries 99.3% of the time at a high precision and recall. △ Less

Submitted 21 December, 2020; originally announced December 2020.

Journal ref: CVPR 2019

arXiv:2011.09265 [pdf]

A Transfer Learning Based Active Learning Framework for Brain Tumor Classification

Authors: Ruqian Hao, Khashayar Namdar, Lin Liu, Farzad Khalvati

Abstract: Brain tumor is one of the leading causes of cancer-related death globally among children and adults. Precise classification of brain tumor grade (low-grade and high-grade glioma) at early stage plays a key role in successful prognosis and treatment planning. With recent advances in deep learning, Artificial Intelligence-enabled brain tumor grading systems can assist radiologists in the interpretat… ▽ More Brain tumor is one of the leading causes of cancer-related death globally among children and adults. Precise classification of brain tumor grade (low-grade and high-grade glioma) at early stage plays a key role in successful prognosis and treatment planning. With recent advances in deep learning, Artificial Intelligence-enabled brain tumor grading systems can assist radiologists in the interpretation of medical images within seconds. The performance of deep learning techniques is, however, highly depended on the size of the annotated dataset. It is extremely challenging to label a large quantity of medical images given the complexity and volume of medical data. In this work, we propose a novel transfer learning based active learning framework to reduce the annotation cost while maintaining stability and robustness of the model performance for brain tumor classification. We employed a 2D slice-based approach to train and finetune our model on the Magnetic Resonance Imaging (MRI) training dataset of 203 patients and a validation dataset of 66 patients which was used as the baseline. With our proposed method, the model achieved Area Under Receiver Operating Characteristic (ROC) Curve (AUC) of 82.89% on a separate test dataset of 66 patients, which was 2.92% higher than the baseline AUC while saving at least 40% of labeling cost. In order to further examine the robustness of our method, we created a balanced dataset, which underwent the same procedure. The model achieved AUC of 82% compared with AUC of 78.48% for the baseline, which reassures the robustness and stability of our proposed transfer learning augmented with active learning framework while significantly reducing the size of training data. △ Less

Submitted 16 November, 2020; originally announced November 2020.

arXiv:2007.15629 [pdf, other]

LevelSet R-CNN: A Deep Variational Method for Instance Segmentation

Authors: Namdar Homayounfar, Yuwen Xiong, Justin Liang, Wei-Chiu Ma, Raquel Urtasun

Abstract: Obtaining precise instance segmentation masks is of high importance in many modern applications such as robotic manipulation and autonomous driving. Currently, many state of the art models are based on the Mask R-CNN framework which, while very powerful, outputs masks at low resolutions which could result in imprecise boundaries. On the other hand, classic variational methods for segmentation impo… ▽ More Obtaining precise instance segmentation masks is of high importance in many modern applications such as robotic manipulation and autonomous driving. Currently, many state of the art models are based on the Mask R-CNN framework which, while very powerful, outputs masks at low resolutions which could result in imprecise boundaries. On the other hand, classic variational methods for segmentation impose desirable global and local data and geometry constraints on the masks by optimizing an energy functional. While mathematically elegant, their direct dependence on good initialization, non-robust image cues and manual setting of hyperparameters renders them unsuitable for modern applications. We propose LevelSet R-CNN, which combines the best of both worlds by obtaining powerful feature representations that are combined in an end-to-end manner with a variational segmentation framework. We demonstrate the effectiveness of our approach on COCO and Cityscapes datasets. △ Less

Submitted 30 July, 2020; originally announced July 2020.

Comments: ECCV 2020

arXiv:2007.01126 [pdf, other]

A Brief Review of Deep Multi-task Learning and Auxiliary Task Learning

Authors: Partoo Vafaeikia, Khashayar Namdar, Farzad Khalvati

Abstract: Multi-task learning (MTL) optimizes several learning tasks simultaneously and leverages their shared information to improve generalization and the prediction of the model for each task. Auxiliary tasks can be added to the main task to ultimately boost the performance. In this paper, we provide a brief review on the recent deep multi-task learning (dMTL) approaches followed by methods on selecting… ▽ More Multi-task learning (MTL) optimizes several learning tasks simultaneously and leverages their shared information to improve generalization and the prediction of the model for each task. Auxiliary tasks can be added to the main task to ultimately boost the performance. In this paper, we provide a brief review on the recent deep multi-task learning (dMTL) approaches followed by methods on selecting useful auxiliary tasks that can be used in dMTL to improve the performance of the model for the main task. △ Less

Submitted 2 July, 2020; originally announced July 2020.

arXiv:2006.04836 [pdf]

A Modified AUC for Training Convolutional Neural Networks: Taking Confidence into Account

Authors: Khashayar Namdar, Masoom A. Haider, Farzad Khalvati

Abstract: Receiver operating characteristic (ROC) curve is an informative tool in binary classification and Area Under ROC Curve (AUC) is a popular metric for reporting performance of binary classifiers. In this paper, first we present a comprehensive review of ROC curve and AUC metric. Next, we propose a modified version of AUC that takes confidence of the model into account and at the same time, incorpora… ▽ More Receiver operating characteristic (ROC) curve is an informative tool in binary classification and Area Under ROC Curve (AUC) is a popular metric for reporting performance of binary classifiers. In this paper, first we present a comprehensive review of ROC curve and AUC metric. Next, we propose a modified version of AUC that takes confidence of the model into account and at the same time, incorporates AUC into Binary Cross Entropy (BCE) loss used for training a Convolutional neural Network for classification tasks. We demonstrate this on three datasets: MNIST, prostate MRI, and brain MRI. Furthermore, we have published GenuineAI, a new python library, which provides the functions for conventional AUC and the proposed modified AUC along with metrics including sensitivity, specificity, recall, precision, and F1 for each point of the ROC curve. △ Less

Submitted 12 September, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

arXiv:2006.01693 [pdf]

A Comprehensive Study of Data Augmentation Strategies for Prostate Cancer Detection in Diffusion-weighted MRI using Convolutional Neural Networks

Authors: Ruqian Hao, Khashayar Namdar, Lin Liu, Masoom A. Haider, Farzad Khalvati

Abstract: Data augmentation refers to a group of techniques whose goal is to battle limited amount of available data to improve model generalization and push sample distribution toward the true distribution. While different augmentation strategies and their combinations have been investigated for various computer vision tasks in the context of deep learning, a specific work in the domain of medical imaging… ▽ More Data augmentation refers to a group of techniques whose goal is to battle limited amount of available data to improve model generalization and push sample distribution toward the true distribution. While different augmentation strategies and their combinations have been investigated for various computer vision tasks in the context of deep learning, a specific work in the domain of medical imaging is rare and to the best of our knowledge, there has been no dedicated work on exploring the effects of various augmentation methods on the performance of deep learning models in prostate cancer detection. In this work, we have statically applied five most frequently used augmentation techniques (random rotation, horizontal flip, vertical flip, random crop, and translation) to prostate Diffusion-weighted Magnetic Resonance Imaging training dataset of 217 patients separately and evaluated the effect of each method on the accuracy of prostate cancer detection. The augmentation algorithms were applied independently to each data channel and a shallow as well as a deep Convolutional Neural Network (CNN) were trained on the five augmented sets separately. We used Area Under Receiver Operating Characteristic (ROC) curve (AUC) to evaluate the performance of the trained CNNs on a separate test set of 95 patients, using a validation set of 102 patients for finetuning. The shallow network outperformed the deep network with the best 2D slice-based AUC of 0.85 obtained by the rotation method. △ Less

Submitted 1 June, 2020; originally announced June 2020.

arXiv:1912.02801 [pdf, other]

PolyTransform: Deep Polygon Transformer for Instance Segmentation

Authors: Justin Liang, Namdar Homayounfar, Wei-Chiu Ma, Yuwen Xiong, Rui Hu, Raquel Urtasun

Abstract: In this paper, we propose PolyTransform, a novel instance segmentation algorithm that produces precise, geometry-preserving masks by combining the strengths of prevailing segmentation approaches and modern polygon-based methods. In particular, we first exploit a segmentation network to generate instance masks. We then convert the masks into a set of polygons that are then fed to a deforming networ… ▽ More In this paper, we propose PolyTransform, a novel instance segmentation algorithm that produces precise, geometry-preserving masks by combining the strengths of prevailing segmentation approaches and modern polygon-based methods. In particular, we first exploit a segmentation network to generate instance masks. We then convert the masks into a set of polygons that are then fed to a deforming network that transforms the polygons such that they better fit the object boundaries. Our experiments on the challenging Cityscapes dataset show that our PolyTransform significantly improves the performance of the backbone instance segmentation network and ranks 1st on the Cityscapes test-set leaderboard. We also show impressive gains in the interactive annotation setting. We release the code at https://github.com/uber-research/PolyTransform. △ Less

Submitted 16 January, 2021; v1 submitted 5 December, 2019; originally announced December 2019.

arXiv:1911.01477 [pdf, other]

Evolution-based Fine-tuning of CNNs for Prostate Cancer Detection

Authors: Khashayar Namdar, Isha Gujrathi, Masoom A. Haider, Farzad Khalvati

Abstract: Convolutional Neural Networks (CNNs) have been used for automated detection of prostate cancer where Area Under Receiver Operating Characteristic (ROC) curve (AUC) is usually used as the performance metric. Given that AUC is not differentiable, common practice is to train the CNN using a loss functions based on other performance metrics such as cross entropy and monitoring AUC to select the best m… ▽ More Convolutional Neural Networks (CNNs) have been used for automated detection of prostate cancer where Area Under Receiver Operating Characteristic (ROC) curve (AUC) is usually used as the performance metric. Given that AUC is not differentiable, common practice is to train the CNN using a loss functions based on other performance metrics such as cross entropy and monitoring AUC to select the best model. In this work, we propose to fine-tune a trained CNN for prostate cancer detection using a Genetic Algorithm to achieve a higher AUC. Our dataset contained 6-channel Diffusion-Weighted MRI slices of prostate. On a cohort of 2,955 training, 1,417 validation, and 1,334 test slices, we reached test AUC of 0.773; a 9.3% improvement compared to the base CNN model. △ Less

Submitted 4 November, 2019; originally announced November 2019.

Comments: Accepted for the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Medical Imaging Meets NEURIPS Workshop

arXiv:1908.03274 [pdf, other]

Exploiting Sparse Semantic HD Maps for Self-Driving Vehicle Localization

Authors: Wei-Chiu Ma, Ignacio Tartavull, Ioan Andrei Bârsan, Shenlong Wang, Min Bai, Gellert Mattyus, Namdar Homayounfar, Shrinidhi Kowshika Lakshmikanth, Andrei Pokrovsky, Raquel Urtasun

Abstract: In this paper we propose a novel semantic localization algorithm that exploits multiple sensors and has precision on the order of a few centimeters. Our approach does not require detailed knowledge about the appearance of the world, and our maps require orders of magnitude less storage than maps utilized by traditional geometry- and LiDAR intensity-based localizers. This is important as self-drivi… ▽ More In this paper we propose a novel semantic localization algorithm that exploits multiple sensors and has precision on the order of a few centimeters. Our approach does not require detailed knowledge about the appearance of the world, and our maps require orders of magnitude less storage than maps utilized by traditional geometry- and LiDAR intensity-based localizers. This is important as self-driving cars need to operate in large environments. Towards this goal, we formulate the problem in a Bayesian filtering framework, and exploit lanes, traffic signs, as well as vehicle dynamics to localize robustly with respect to a sparse semantic map. We validate the effectiveness of our method on a new highway dataset consisting of 312km of roads. Our experiments show that the proposed approach is able to achieve 0.05m lateral accuracy and 1.12m longitudinal accuracy on average while taking up only 0.3% of the storage required by previous LiDAR intensity-based approaches. △ Less

Submitted 8 August, 2019; originally announced August 2019.

Comments: 8 pages, 4 figures, 4 tables, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2019)

arXiv:1905.01555 [pdf, other]

Deep Multi-Sensor Lane Detection

Authors: Min Bai, Gellert Mattyus, Namdar Homayounfar, Shenlong Wang, Shrinidhi Kowshika Lakshmikanth, Raquel Urtasun

Abstract: Reliable and accurate lane detection has been a long-standing problem in the field of autonomous driving. In recent years, many approaches have been developed that use images (or videos) as input and reason in image space. In this paper we argue that accurate image estimates do not translate to precise 3D lane boundaries, which are the input required by modern motion planning algorithms. To addres… ▽ More Reliable and accurate lane detection has been a long-standing problem in the field of autonomous driving. In recent years, many approaches have been developed that use images (or videos) as input and reason in image space. In this paper we argue that accurate image estimates do not translate to precise 3D lane boundaries, which are the input required by modern motion planning algorithms. To address this issue, we propose a novel deep neural network that takes advantage of both LiDAR and camera sensors and produces very accurate estimates directly in 3D space. We demonstrate the performance of our approach on both highways and in cities, and show very accurate estimates in complex scenarios such as heavy traffic (which produces occlusion), fork, merges and intersections. △ Less

Submitted 4 May, 2019; originally announced May 2019.

Comments: IEEE International Conference on Intelligent Robots and Systems (IROS) 2018

arXiv:1604.02715 [pdf, other]

Soccer Field Localization from a Single Image

Authors: Namdar Homayounfar, Sanja Fidler, Raquel Urtasun

Abstract: In this work, we propose a novel way of efficiently localizing a soccer field from a single broadcast image of the game. Related work in this area relies on manually annotating a few key frames and extending the localization to similar images, or installing fixed specialized cameras in the stadium from which the layout of the field can be obtained. In contrast, we formulate this problem as a branc… ▽ More In this work, we propose a novel way of efficiently localizing a soccer field from a single broadcast image of the game. Related work in this area relies on manually annotating a few key frames and extending the localization to similar images, or installing fixed specialized cameras in the stadium from which the layout of the field can be obtained. In contrast, we formulate this problem as a branch and bound inference in a Markov random field where an energy function is defined in terms of field cues such as grass, lines and circles. Moreover, our approach is fully automatic and depends only on single images from the broadcast video of the game. We demonstrate the effectiveness of our method by applying it to various games and obtain promising results. Finally, we posit that our approach can be applied easily to other sports such as hockey and basketball. △ Less

Submitted 10 April, 2016; originally announced April 2016.

arXiv:0901.3208 [pdf, ps, other]

doi 10.1103/PhysRevA.80.013814

Localized modes in defective multilayer structures

Authors: S. Roshan Entezar, A. Namdar

Abstract: In this paper, the localized surface modes in a defective multilayer structure has been investigated. It is shown that the defective multilayer structures can support two different kind of localized modes depending on the position and the thickness of the defect layer. One of these modes is localized at the interface between the multilayer structure and a homogeneous medium (the so-called surfac… ▽ More In this paper, the localized surface modes in a defective multilayer structure has been investigated. It is shown that the defective multilayer structures can support two different kind of localized modes depending on the position and the thickness of the defect layer. One of these modes is localized at the interface between the multilayer structure and a homogeneous medium (the so-called surface mode) and the other one is localized at the defect layer (defect localized mode). We reveal that the presence of defect layer pushes the dispersion curve of surface modes to the lower or the upper edge of the photonic bandgap depending on the homogeneous medium is a left-handed or right-handed medium (e.g. vacuum), respectively. So, the existence region of the surface modes restricted. Moreover, the effect of defect on the energy flow velocity of the surface modes is discussed. △ Less

Submitted 21 January, 2009; originally announced January 2009.

Comments: 5 pages, 7 figures

arXiv:0708.4127 [pdf, ps, other]

doi 10.1088/1126-6708/2007/09/029

The Dirichlet Casimir effect for $φ^4$ theory in (3+1) dimensions: A new renormalization approach

Authors: Reza Moazzemi, Maryam Namdar, Siamak S. Gousheh

Abstract: We calculate the next to the leading order Casimir effect for a real scalar field, within $φ^4$ theory, confined between two parallel plates in three spatial dimensions with the Dirichlet boundary condition. In this paper we introduce a systematic perturbation expansion in which the counterterms automatically turn out to be consistent with the boundary conditions. This will inevitably lead to no… ▽ More We calculate the next to the leading order Casimir effect for a real scalar field, within $φ^4$ theory, confined between two parallel plates in three spatial dimensions with the Dirichlet boundary condition. In this paper we introduce a systematic perturbation expansion in which the counterterms automatically turn out to be consistent with the boundary conditions. This will inevitably lead to nontrivial position dependence for physical quantities, as a manifestation of the breaking of the translational invariance. This is in contrast to the usual usage of the counterterms in problems with nontrivial boundary conditions, which are either completely derived from the free cases or at most supplemented with the addition of counterterms only at the boundaries. Our results for the massive and massless cases are different from those reported elsewhere. Secondly, and probably less importantly, we use a supplementary renormalization procedure, which makes the usage of any analytic continuation techniques unnecessary. △ Less

Submitted 30 August, 2007; originally announced August 2007.

Comments: JHEP3 format,20 pages, 2 figures, to appear in JHEP

Journal ref: JHEP 0709:029,2007

arXiv:physics/0607171 [pdf, ps, other]

doi 10.1016/j.optcom.2007.05.063

Tamm states in one dimensional photonic crystals containing left-handed materials

Authors: Abdolrahman Namdar

Abstract: We present a theoretical study of electromagnetic surface waves localized at an interface separating a conventional uniform medium and a semi-infinite 1-D photonic crystal made of alternate left-handed metamaterial and right-handed material which we refer to as left-handed photonic crystal. We find novel type of surface mode's structure, the so-called surface Tamm states and demonstrate that the… ▽ More We present a theoretical study of electromagnetic surface waves localized at an interface separating a conventional uniform medium and a semi-infinite 1-D photonic crystal made of alternate left-handed metamaterial and right-handed material which we refer to as left-handed photonic crystal. We find novel type of surface mode's structure, the so-called surface Tamm states and demonstrate that the presence of metamaterial in the photonic crystal structure allows for a flexible control of the dispersion properties of surface states, and can support the Tamm states with a backward energy flow and a vortex-like structure. △ Less

Submitted 18 July, 2006; originally announced July 2006.

Comments: 8pages, 5 figures

arXiv:physics/0605188 [pdf, ps, other]

doi 10.1063/1.2352794

Backward Tamm states in left-handed metamaterials

Authors: Abdolrahman Namdar, Ilya V. Shadrivov, Yuri S. Kivshar

Abstract: We study the electromagnetic surface waves localized at an interface separating a one-dimensional photonic crystal and left-handed metamaterial, the so-called surface Tamm states. We demonstrate that the metamaterial allows for a flexible control of the dispersion properties of surface states, and can support the Tamm states with a backward energy flow and a vortex-like structure. We study the electromagnetic surface waves localized at an interface separating a one-dimensional photonic crystal and left-handed metamaterial, the so-called surface Tamm states. We demonstrate that the metamaterial allows for a flexible control of the dispersion properties of surface states, and can support the Tamm states with a backward energy flow and a vortex-like structure. △ Less

Submitted 22 May, 2006; originally announced May 2006.

Comments: 3 pages, 5 figures

Journal ref: Appl. Phys. Lett 89, 114104 (2006).

Showing 1–37 of 37 results for author: Namdar