Skip to main content

Showing 1–50 of 51 results for author: Minaee, S

.
  1. arXiv:2408.06687  [pdf, other

    cs.CV cs.AI cs.LG

    Masked Image Modeling: A Survey

    Authors: Vlad Hondru, Florinel Alin Croitoru, Shervin Minaee, Radu Tudor Ionescu, Nicu Sebe

    Abstract: In this work, we survey recent studies on masked image modeling (MIM), an approach that emerged as a powerful self-supervised learning technique in computer vision. The MIM task involves masking some information, e.g.~pixels, patches, or even latent representations, and training a model, usually an autoencoder, to predicting the missing information by using the context available in the visible par… ▽ More

    Submitted 9 January, 2025; v1 submitted 13 August, 2024; originally announced August 2024.

    Comments: Revised version

  2. arXiv:2402.06196  [pdf, other

    cs.CL cs.AI

    Large Language Models: A Survey

    Authors: Shervin Minaee, Tomas Mikolov, Narjes Nikzad, Meysam Chenaghlu, Richard Socher, Xavier Amatriain, Jianfeng Gao

    Abstract: Large Language Models (LLMs) have drawn a lot of attention due to their strong performance on a wide range of natural language tasks, since the release of ChatGPT in November 2022. LLMs' ability of general-purpose language understanding and generation is acquired by training billions of model's parameters on massive amounts of text data, as predicted by scaling laws \cite{kaplan2020scaling,hoffman… ▽ More

    Submitted 23 March, 2025; v1 submitted 9 February, 2024; originally announced February 2024.

  3. arXiv:2203.02573  [pdf, other

    cs.CV cs.AI cs.LG

    Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning

    Authors: Ligong Han, Jian Ren, Hsin-Ying Lee, Francesco Barbieri, Kyle Olszewski, Shervin Minaee, Dimitris Metaxas, Sergey Tulyakov

    Abstract: Most methods for conditional video synthesis use a single modality as the condition. This comes with major limitations. For example, it is problematic for a model conditioned on an image to generate a specific motion trajectory desired by the user since there is no means to provide motion information. Conversely, language information can describe the desired motion, while not precisely defining th… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022

  4. arXiv:2202.09450  [pdf, other

    cs.CV

    Modern Augmented Reality: Applications, Trends, and Future Directions

    Authors: Shervin Minaee, Xiaodan Liang, Shuicheng Yan

    Abstract: Augmented reality (AR) is one of the relatively old, yet trending areas in the intersection of computer vision and computer graphics with numerous applications in several areas, from gaming and entertainment, to education and healthcare. Although it has been around for nearly fifty years, it has seen a lot of interest by the research community in the recent years, mainly because of the huge succes… ▽ More

    Submitted 24 February, 2022; v1 submitted 18 February, 2022; originally announced February 2022.

  5. arXiv:2103.14983  [pdf, other

    cs.CV

    Going Deeper Into Face Detection: A Survey

    Authors: Shervin Minaee, Ping Luo, Zhe Lin, Kevin Bowyer

    Abstract: Face detection is a crucial first step in many facial recognition and face analysis systems. Early approaches for face detection were mainly based on classifiers built on top of hand-crafted features extracted from local image regions, such as Haar Cascades and Histogram of Oriented Gradients. However, these approaches were not powerful enough to achieve a high accuracy on images of from uncontrol… ▽ More

    Submitted 13 April, 2021; v1 submitted 27 March, 2021; originally announced March 2021.

  6. arXiv:2010.03791  [pdf, other

    cs.CV cs.LG eess.IV

    Age and Gender Prediction From Face Images Using Attentional Convolutional Network

    Authors: Amirali Abdolrashidi, Mehdi Minaei, Elham Azimi, Shervin Minaee

    Abstract: Automatic prediction of age and gender from face images has drawn a lot of attention recently, due it is wide applications in various facial analysis problems. However, due to the large intra-class variation of face images (such as variation in lighting, pose, scale, occlusion), the existing models are still behind the desired accuracy level, which is necessary for the use of these models in real-… ▽ More

    Submitted 7 December, 2020; v1 submitted 8 October, 2020; originally announced October 2020.

  7. arXiv:2009.05096  [pdf, other

    eess.IV cs.CV cs.LG

    COVID CT-Net: Predicting Covid-19 From Chest CT Images Using Attentional Convolutional Network

    Authors: Shakib Yazdani, Shervin Minaee, Rahele Kafieh, Narges Saeedizadeh, Milan Sonka

    Abstract: The novel corona-virus disease (COVID-19) pandemic has caused a major outbreak in more than 200 countries around the world, leading to a severe impact on the health and life of many people globally. As of Aug 25th of 2020, more than 20 million people are infected, and more than 800,000 death are reported. Computed Tomography (CT) images can be used as a as an alternative to the time-consuming "rev… ▽ More

    Submitted 10 September, 2020; originally announced September 2020.

  8. arXiv:2009.03947  [pdf, other

    cs.CL cs.LG

    Covid-Transformer: Detecting COVID-19 Trending Topics on Twitter Using Universal Sentence Encoder

    Authors: Meysam Asgari-Chenaghlu, Narjes Nikzad-Khasmakhi, Shervin Minaee

    Abstract: The novel corona-virus disease (also known as COVID-19) has led to a pandemic, impacting more than 200 countries across the globe. With its global impact, COVID-19 has become a major concern of people almost everywhere, and therefore there are a large number of tweets coming out from every corner of the world, about COVID-19 related topics. In this work, we try to analyze the tweets and detect the… ▽ More

    Submitted 19 September, 2020; v1 submitted 8 September, 2020; originally announced September 2020.

  9. arXiv:2007.12303  [pdf, other

    eess.IV cs.CV cs.LG

    COVID TV-UNet: Segmenting COVID-19 Chest CT Images Using Connectivity Imposed U-Net

    Authors: Narges Saeedizadeh, Shervin Minaee, Rahele Kafieh, Shakib Yazdani, Milan Sonka

    Abstract: The novel corona-virus disease (COVID-19) pandemic has caused a major outbreak in more than 200 countries around the world, leading to a severe impact on the health and life of many people globally. As of mid-July 2020, more than 12 million people were infected, and more than 570,000 death were reported. Computed Tomography (CT) images can be used as an alternative to the time-consuming RT-PCR tes… ▽ More

    Submitted 6 August, 2020; v1 submitted 23 July, 2020; originally announced July 2020.

  10. arXiv:2004.09363  [pdf, other

    cs.CV

    Deep-COVID: Predicting COVID-19 From Chest X-Ray Images Using Deep Transfer Learning

    Authors: Shervin Minaee, Rahele Kafieh, Milan Sonka, Shakib Yazdani, Ghazaleh Jamalipour Soufi

    Abstract: The COVID-19 pandemic is causing a major outbreak in more than 150 countries around the world, having a severe impact on the health and life of many people globally. One of the crucial step in fighting COVID-19 is the ability to detect the infected patients early enough, and put them under special care. Detecting this disease from radiography and radiology images is perhaps one of the fastest way… ▽ More

    Submitted 21 July, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

    Comments: Accepted by Medical Image Analysis

  11. arXiv:2004.03705  [pdf, other

    cs.CL cs.LG stat.ML

    Deep Learning Based Text Classification: A Comprehensive Review

    Authors: Shervin Minaee, Nal Kalchbrenner, Erik Cambria, Narjes Nikzad, Meysam Chenaghlu, Jianfeng Gao

    Abstract: Deep learning based models have surpassed classical machine learning based approaches in various text classification tasks, including sentiment analysis, news categorization, question answering, and natural language inference. In this paper, we provide a comprehensive review of more than 150 deep learning based models for text classification developed in recent years, and discuss their technical c… ▽ More

    Submitted 4 January, 2021; v1 submitted 5 April, 2020; originally announced April 2020.

  12. arXiv:2003.10834  [pdf, other

    cs.CV cs.LG eess.IV

    Palm-GAN: Generating Realistic Palmprint Images Using Total-Variation Regularized GAN

    Authors: Shervin Minaee, Mehdi Minaei, Amirali Abdolrashidi

    Abstract: Generating realistic palmprint (more generally biometric) images has always been an interesting and, at the same time, challenging problem. Classical statistical models fail to generate realistic-looking palmprint images, as they are not powerful enough to capture the complicated texture representation of palmprint images. In this work, we present a deep learning framework based on generative adve… ▽ More

    Submitted 20 March, 2020; originally announced March 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1812.10482, arXiv:1812.04822

  13. arXiv:2002.03503  [pdf, other

    cs.LG stat.ML

    Regularized Submodular Maximization at Scale

    Authors: Ehsan Kazemi, Shervin Minaee, Moran Feldman, Amin Karbasi

    Abstract: In this paper, we propose scalable methods for maximizing a regularized submodular function $f = g - \ell$ expressed as the difference between a monotone submodular function $g$ and a modular function $\ell$. Indeed, submodularity is inherently related to the notions of diversity, coverage, and representativeness. In particular, finding the mode of many popular probabilistic models of diversity, s… ▽ More

    Submitted 9 February, 2020; originally announced February 2020.

  14. arXiv:2001.05566  [pdf, other

    cs.CV cs.LG

    Image Segmentation Using Deep Learning: A Survey

    Authors: Shervin Minaee, Yuri Boykov, Fatih Porikli, Antonio Plaza, Nasser Kehtarnavaz, Demetri Terzopoulos

    Abstract: Image segmentation is a key topic in image processing and computer vision with applications such as scene understanding, medical image analysis, robotic perception, video surveillance, augmented reality, and image compression, among many others. Various algorithms for image segmentation have been developed in the literature. Recently, due to the success of deep learning models in a wide range of v… ▽ More

    Submitted 14 November, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

  15. arXiv:1912.00271  [pdf, other

    cs.CV cs.LG

    Biometrics Recognition Using Deep Learning: A Survey

    Authors: Shervin Minaee, Amirali Abdolrashidi, Hang Su, Mohammed Bennamoun, David Zhang

    Abstract: Deep learning-based models have been very successful in achieving state-of-the-art results in many of the computer vision, speech recognition, and natural language processing tasks in the last few years. These models seem a natural fit for handling the ever-increasing scale of biometric recognition problems, from cellphone authentication to airport security systems. Deep learning-based models have… ▽ More

    Submitted 8 February, 2021; v1 submitted 30 November, 2019; originally announced December 2019.

    Comments: Under Review

  16. arXiv:1910.03943  [pdf, other

    cs.IR cs.CL cs.LG stat.ML

    Hotel2vec: Learning Attribute-Aware Hotel Embeddings with Self-Supervision

    Authors: Ali Sadeghian, Shervin Minaee, Ioannis Partalas, Xinxin Li, Daisy Zhe Wang, Brooke Cowan

    Abstract: We propose a neural network architecture for learning vector representations of hotels. Unlike previous works, which typically only use user click information for learning item embeddings, we propose a framework that combines several sources of data, including user clicks, hotel attributes (e.g., property type, star rating, average user rating), amenity information (e.g., the hotel has free Wi-Fi… ▽ More

    Submitted 30 September, 2019; originally announced October 2019.

  17. arXiv:1909.08049  [pdf, other

    cs.CV cs.LG eess.IV

    Masked-RPCA: Sparse and Low-rank Decomposition Under Overlaying Model and Application to Moving Object Detection

    Authors: Amirhossein Khalilian-Gourtani, Shervin Minaee, Yao Wang

    Abstract: Foreground detection in a given video sequence is a pivotal step in many computer vision applications such as video surveillance system. Robust Principal Component Analysis (RPCA) performs low-rank and sparse decomposition and accomplishes such a task when the background is stationary and the foreground is dynamic and relatively small. A fundamental issue with RPCA is the assumption that the low-r… ▽ More

    Submitted 17 September, 2019; originally announced September 2019.

  18. arXiv:1907.12956  [pdf, other

    cs.CV cs.LG

    FingerNet: Pushing The Limits of Fingerprint Recognition Using Convolutional Neural Network

    Authors: Shervin Minaee, Elham Azimi, Amirali Abdolrashidi

    Abstract: Fingerprint recognition has been utilized for cellphone authentication, airport security and beyond. Many different features and algorithms have been proposed to improve fingerprint recognition. In this paper, we propose an end-to-end deep learning framework for fingerprint recognition using convolutional neural networks (CNNs) which can jointly learn the feature representation and perform recogni… ▽ More

    Submitted 28 July, 2019; originally announced July 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1907.09380

  19. arXiv:1907.09380  [pdf, other

    cs.CV cs.LG

    DeepIris: Iris Recognition Using A Deep Learning Approach

    Authors: Shervin Minaee, Amirali Abdolrashidi

    Abstract: Iris recognition has been an active research area during last few decades, because of its wide applications in security, from airports to homeland security border control. Different features and algorithms have been proposed for iris recognition in the past. In this paper, we propose an end-to-end deep learning framework for iris recognition based on residual convolutional neural network (CNN), wh… ▽ More

    Submitted 22 July, 2019; originally announced July 2019.

  20. arXiv:1904.04206  [pdf, other

    cs.CL cs.IR cs.LG stat.ML

    Deep-Sentiment: Sentiment Analysis Using Ensemble of CNN and Bi-LSTM Models

    Authors: Shervin Minaee, Elham Azimi, AmirAli Abdolrashidi

    Abstract: With the popularity of social networks, and e-commerce websites, sentiment analysis has become a more active area of research in the past few years. On a high level, sentiment analysis tries to understand the public opinion about a specific product or topic, or trends from reviews or tweets. Sentiment analysis plays an important role in better understanding customer/user opinion, and also extracti… ▽ More

    Submitted 8 April, 2019; originally announced April 2019.

  21. arXiv:1902.01019  [pdf, other

    cs.CV

    Deep-Emotion: Facial Expression Recognition Using Attentional Convolutional Network

    Authors: Shervin Minaee, Amirali Abdolrashidi

    Abstract: Facial expression recognition has been an active research area over the past few decades, and it is still challenging due to the high intra-class variation. Traditional approaches for this problem rely on hand-crafted features such as SIFT, HOG and LBP, followed by a classifier trained on a database of images or videos. Most of these works perform reasonably well on datasets of images captured… ▽ More

    Submitted 3 February, 2019; originally announced February 2019.

  22. arXiv:1812.10482  [pdf, other

    cs.CV cs.LG

    Finger-GAN: Generating Realistic Fingerprint Images Using Connectivity Imposed GAN

    Authors: Shervin Minaee, Amirali Abdolrashidi

    Abstract: Generating realistic biometric images has been an interesting and, at the same time, challenging problem. Classical statistical models fail to generate realistic-looking fingerprint images, as they are not powerful enough to capture the complicated texture representation in fingerprint images. In this work, we present a machine learning framework based on generative adversarial networks (GAN), whi… ▽ More

    Submitted 25 December, 2018; originally announced December 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1812.04822

  23. arXiv:1812.04822  [pdf, other

    cs.CV

    Iris-GAN: Learning to Generate Realistic Iris Images Using Convolutional GAN

    Authors: Shervin Minaee, Amirali Abdolrashidi

    Abstract: Generating iris images which look realistic is both an interesting and challenging problem. Most of the classical statistical models are not powerful enough to capture the complicated texture representation in iris images, and therefore fail to generate iris images which look realistic. In this work, we present a machine learning framework based on generative adversarial network (GAN), which is ab… ▽ More

    Submitted 25 December, 2018; v1 submitted 12 December, 2018; originally announced December 2018.

  24. arXiv:1812.04821  [pdf, other

    cs.CV

    Efficient Super Resolution For Large-Scale Images Using Attentional GAN

    Authors: Harsh Nilesh Pathak, Xinxin Li, Shervin Minaee, Brooke Cowan

    Abstract: Single Image Super Resolution (SISR) is a well-researched problem with broad commercial relevance. However, most of the SISR literature focuses on small-size images under 500px, whereas business needs can mandate the generation of very high resolution images. At Expedia Group, we were tasked with generating images of at least 2000px for display on the website, four times greater than the sizes typ… ▽ More

    Submitted 13 January, 2019; v1 submitted 12 December, 2018; originally announced December 2018.

    Comments: Accepted by IEEE International Conference on Big Data, 2018

  25. arXiv:1806.10419  [pdf, other

    cs.CV

    MTBI Identification From Diffusion MR Images Using Bag of Adversarial Visual Features

    Authors: Shervin Minaee, Yao Wang, Alp Aygar, Sohae Chung, Xiuyuan Wang, Yvonne W. Lui, Els Fieremans, Steven Flanagan, Joseph Rath

    Abstract: In this work, we propose bag of adversarial features (BAF) for identifying mild traumatic brain injury (MTBI) patients from their diffusion magnetic resonance images (MRI) (obtained within one month of injury) by incorporating unsupervised feature learning techniques. MTBI is a growing public health problem with an estimated incidence of over 1.7 million people annually in US. Diagnosis is based o… ▽ More

    Submitted 27 June, 2018; originally announced June 2018.

    Comments: IEEE Transactions on Medical Imaging

  26. arXiv:1806.08612  [pdf, other

    cs.CV

    Ad-Net: Audio-Visual Convolutional Neural Network for Advertisement Detection In Videos

    Authors: Shervin Minaee, Imed Bouazizi, Prakash Kolan, Hossein Najafzadeh

    Abstract: Personalized advertisement is a crucial task for many of the online businesses and video broadcasters. Many of today's broadcasters use the same commercial for all customers, but as one can imagine different viewers have different interests and it seems reasonable to have customized commercial for different group of people, chosen based on their demographic features, and history. In this project,… ▽ More

    Submitted 22 June, 2018; originally announced June 2018.

  27. arXiv:1804.02419  [pdf, other

    cs.CV

    Image Segmentation Using Subspace Representation and Sparse Decomposition

    Authors: Shervin Minaee

    Abstract: Image foreground extraction is a classical problem in image processing and vision, with a large range of applications. In this dissertation, we focus on the extraction of text and graphics in mixed-content images, and design novel approaches for various aspects of this problem. We first propose a sparse decomposition framework, which models the background by a subspace containing smooth basis ve… ▽ More

    Submitted 6 April, 2018; originally announced April 2018.

    Comments: PhD Dissertation, NYU, 2018

  28. arXiv:1802.02925  [pdf, other

    cs.CV

    A Deep Unsupervised Learning Approach Toward MTBI Identification Using Diffusion MRI

    Authors: Shervin Minaee, Yao Wang, Anna Choromanska, Sohae Chung, Xiuyuan Wang, Els Fieremans, Steven Flanagan, Joseph Rath, Yvonne W Lui

    Abstract: Mild traumatic brain injury is a growing public health problem with an estimated incidence of over 1.7 million people annually in US. Diagnosis is based on clinical history and symptoms, and accurate, concrete measures of injury are lacking. This work aims to directly use diffusion MR images obtained within one month of trauma to detect injury, by incorporating deep learning techniques. To overcom… ▽ More

    Submitted 11 April, 2018; v1 submitted 8 February, 2018; originally announced February 2018.

    Comments: arXiv admin note: text overlap with arXiv:1710.06824

  29. arXiv:1710.06824  [pdf, other

    cs.CV

    Identifying Mild Traumatic Brain Injury Patients From MR Images Using Bag of Visual Words

    Authors: Shervin Minaee, Siyun Wang, Yao Wang, Sohae Chung, Xiuyuan Wang, Els Fieremans, Steven Flanagan, Joseph Rath, Yvonne W. Lui

    Abstract: Mild traumatic brain injury (mTBI) is a growing public health problem with an estimated incidence of one million people annually in US. Neurocognitive tests are used to both assess the patient condition and to monitor the patient progress. This work aims to directly use MR images taken shortly after injury to detect whether a patient suffers from mTBI, by incorporating machine learning and compute… ▽ More

    Submitted 14 February, 2018; v1 submitted 18 October, 2017; originally announced October 2017.

  30. arXiv:1708.09000  [pdf, other

    cs.CV

    A Machine Learning Approach For Identifying Patients with Mild Traumatic Brain Injury Using Diffusion MRI Modeling

    Authors: Shervin Minaee, Yao Wang, Sohae Chung, Xiuyuan Wang, Els Fieremans, Steven Flanagan, Joseph Rath, Yvonne W. Lui

    Abstract: While diffusion MRI has been extremely promising in the study of MTBI, identifying patients with recent MTBI remains a challenge. The literature is mixed with regard to localizing injury in these patients, however, gray matter such as the thalamus and white matter including the corpus callosum and frontal deep white matter have been repeatedly implicated as areas at high risk for injury. The purpo… ▽ More

    Submitted 27 August, 2017; originally announced August 2017.

  31. arXiv:1708.01713  [pdf, other

    cs.CL

    Automatic Question-Answering Using A Deep Similarity Neural Network

    Authors: Shervin Minaee, Zhu Liu

    Abstract: Automatic question-answering is a classical problem in natural language processing, which aims at designing systems that can automatically answer a question, in the same way as human does. In this work, we propose a deep learning based model for automatic question-answering. First the questions and answers are embedded using neural probabilistic modeling. Then a deep similarity neural network is t… ▽ More

    Submitted 5 August, 2017; originally announced August 2017.

  32. arXiv:1706.04041  [pdf, other

    cs.CV

    Text Extraction From Texture Images Using Masked Signal Decomposition

    Authors: Shervin Minaee, Yao Wang

    Abstract: Text extraction is an important problem in image processing with applications from optical character recognition to autonomous driving. Most of the traditional text segmentation algorithms consider separating text from a simple background (which usually has a different color from texts). In this work we consider separating texts from a textured background, that has similar color to texts. We look… ▽ More

    Submitted 10 July, 2017; v1 submitted 11 June, 2017; originally announced June 2017.

    Comments: arXiv admin note: text overlap with arXiv:1704.07711

  33. arXiv:1704.07711  [pdf, other

    cs.CV

    An ADMM Approach to Masked Signal Decomposition Using Subspace Representation

    Authors: Shervin Minaee, Yao Wang

    Abstract: Signal decomposition is a classical problem in signal processing, which aims to separate an observed signal into two or more components each with its own property. Usually each component is described by its own subspace or dictionary. Extensive research has been done for the case where the components are additive, but in real world applications, the components are often non-additive. For example,… ▽ More

    Submitted 25 December, 2018; v1 submitted 25 April, 2017; originally announced April 2017.

  34. arXiv:1703.04611  [pdf, other

    cs.CV

    Subspace Learning in The Presence of Sparse Structured Outliers and Noise

    Authors: Shervin Minaee, Yao Wang

    Abstract: Subspace learning is an important problem, which has many applications in image and video processing. It can be used to find a low-dimensional representation of signals and images. But in many applications, the desired signal is heavily distorted by outliers and noise, which negatively affect the learned subspace. In this work, we present a novel algorithm for learning a subspace for signal repres… ▽ More

    Submitted 12 July, 2017; v1 submitted 14 March, 2017; originally announced March 2017.

    Comments: IEEE International Symposium on Circuits and Systems, 2017

  35. arXiv:1702.01334  [pdf, other

    cs.CV cs.LG

    An Experimental Study of Deep Convolutional Features For Iris Recognition

    Authors: Shervin Minaee, Amirali Abdolrashidi, Yao Wang

    Abstract: Iris is one of the popular biometrics that is widely used for identity authentication. Different features have been used to perform iris recognition in the past. Most of them are based on hand-crafted features designed by biometrics experts. Due to tremendous success of deep learning in computer vision problems, there has been a lot of interest in applying features learned by convolutional neural… ▽ More

    Submitted 4 February, 2017; originally announced February 2017.

    Comments: IEEE Signal Processing in Medicine and Biology Symposium, 2016

  36. arXiv:1611.07909  [pdf, other

    cs.CV

    Image Segmentation Using Overlapping Group Sparsity

    Authors: Shervin Minaee, Yao Wang

    Abstract: Sparse decomposition has been widely used for different applications, such as source separation, image classification and image denoising. This paper presents a new algorithm for segmentation of an image into background and foreground text and graphics using sparse decomposition. First, the background is represented using a suitable smooth model, which is a linear combination of a few smoothly var… ▽ More

    Submitted 21 December, 2016; v1 submitted 23 November, 2016; originally announced November 2016.

    Comments: arXiv admin note: substantial text overlap with arXiv:1602.02434. appears in IEEE Signal Processing in Medicine and Biology Symposium, 2016

  37. arXiv:1609.03874  [pdf, other

    cs.CV

    Image Decomposition Using a Robust Regression Approach

    Authors: Shervin Minaee, Yao Wang

    Abstract: This paper considers how to separate text and/or graphics from smooth background in screen content and mixed content images and proposes an algorithm to perform this segmentation task. The proposed methods make use of the fact that the background in each block is usually smoothly varying and can be modeled well by a linear combination of a few smoothly varying basis functions, while the foreground… ▽ More

    Submitted 4 December, 2017; v1 submitted 13 September, 2016; originally announced September 2016.

    Comments: arXiv admin note: substantial text overlap with arXiv:1607.02547

  38. arXiv:1608.00059  [pdf, other

    cs.CV

    Face Recognition Using Scattering Convolutional Network

    Authors: Shervin Minaee, Amirali Abdolrashidi, Yao Wang

    Abstract: Face recognition has been an active research area in the past few decades. In general, face recognition can be very challenging due to variations in viewpoint, illumination, facial expression, etc. Therefore it is essential to extract features which are invariant to some or all of these variations. Here a new image representation, called scattering transform/network, has been used to extract featu… ▽ More

    Submitted 30 November, 2017; v1 submitted 29 July, 2016; originally announced August 2016.

  39. arXiv:1607.02547  [pdf, other

    cs.CV

    Screen Content Image Segmentation Using Robust Regression and Sparse Decomposition

    Authors: Shervin Minaee, Yao Wang

    Abstract: This paper considers how to separate text and/or graphics from smooth background in screen content and mixed document images and proposes two approaches to perform this segmentation task. The proposed methods make use of the fact that the background in each block is usually smoothly varying and can be modeled well by a linear combination of a few smoothly varying basis functions, while the foregro… ▽ More

    Submitted 8 July, 2016; originally announced July 2016.

  40. arXiv:1603.09027  [pdf, other

    cs.CV

    Palmprint Recognition Using Deep Scattering Convolutional Network

    Authors: Shervin Minaee, Yao Wang

    Abstract: Palmprint recognition has drawn a lot of attention during the recent years. Many algorithms have been proposed for palmprint recognition in the past, majority of them being based on features extracted from the transform domain. Many of these transform domain features are not translation or rotation invariant, and therefore a great deal of preprocessing is needed to align the images. In this paper,… ▽ More

    Submitted 29 March, 2016; originally announced March 2016.

  41. arXiv:1602.02434  [pdf, other

    cs.CV

    Screen Content Image Segmentation Using Sparse Decomposition and Total Variation Minimization

    Authors: Shervin Minaee, Yao Wang

    Abstract: Sparse decomposition has been widely used for different applications, such as source separation, image classification, image denoising and more. This paper presents a new algorithm for segmentation of an image into background and foreground text and graphics using sparse decomposition and total variation minimization. The proposed method is designed based on the assumption that the background part… ▽ More

    Submitted 26 July, 2016; v1 submitted 7 February, 2016; originally announced February 2016.

    Comments: 5 pages in IEEE, International Conference on Image Processing, 2016

  42. Screen Content Image Segmentation Using Sparse-Smooth Decomposition

    Authors: Shervin Minaee, Amirali Abdolrashidi, Yao Wang

    Abstract: Sparse decomposition has been extensively used for different applications including signal compression and denoising and document analysis. In this paper, sparse decomposition is used for image segmentation. The proposed algorithm separates the background and foreground using a sparse-smooth decomposition technique such that the smooth and sparse components correspond to the background and foregro… ▽ More

    Submitted 21 November, 2015; originally announced November 2015.

    Comments: Asilomar Conference on Signals, Systems and Computers, IEEE, 2015, (to Appear)

  43. arXiv:1509.03542  [pdf, other

    cs.CV

    Fingerprint Recognition Using Translation Invariant Scattering Network

    Authors: Shervin Minaee, Yao Wang

    Abstract: Fingerprint recognition has drawn a lot of attention during last decades. Different features and algorithms have been used for fingerprint recognition in the past. In this paper, a powerful image representation called scattering transform/network, is used for recognition. Scattering network is a convolutional network where its architecture and filters are predefined wavelet transforms. The first l… ▽ More

    Submitted 25 November, 2015; v1 submitted 11 September, 2015; originally announced September 2015.

    Comments: IEEE Signal Processing in Medicine and Biology Symposium, 2015

  44. arXiv:1507.02177  [pdf, other

    cs.CV

    Iris Recognition Using Scattering Transform and Textural Features

    Authors: Shervin Minaee, AmirAli Abdolrashidi, Yao Wang

    Abstract: Iris recognition has drawn a lot of attention since the mid-twentieth century. Among all biometric features, iris is known to possess a rich set of features. Different features have been used to perform iris recognition in the past. In this paper, two powerful sets of features are introduced to be used for iris recognition: scattering transform-based features and textural features. PCA is also app… ▽ More

    Submitted 8 July, 2015; originally announced July 2015.

  45. arXiv:1501.03755  [pdf, other

    cs.CV

    Screen Content Image Segmentation Using Least Absolute Deviation Fitting

    Authors: Shervin Minaee, Yao Wang

    Abstract: We propose an algorithm for separating the foreground (mainly text and line graphics) from the smoothly varying background in screen content images. The proposed method is designed based on the assumption that the background part of the image is smoothly varying and can be represented by a linear combination of a few smoothly varying basis functions, while the foreground text and graphics create s… ▽ More

    Submitted 19 February, 2015; v1 submitted 15 January, 2015; originally announced January 2015.

    Comments: 5 pages

  46. arXiv:1412.5126  [pdf, other

    cs.CV

    A Robust Regression Approach for Background/Foreground Segmentation

    Authors: Shervin Minaee, Haoping Yu, Yao Wang

    Abstract: Background/foreground segmentation has a lot of applications in image and video processing. In this paper, a segmentation algorithm is proposed which is mainly designed for text and line extraction in screen content. The proposed method makes use of the fact that the background in each block is usually smoothly varying and can be modeled well by a linear combination of a few smoothly varying basis… ▽ More

    Submitted 1 September, 2015; v1 submitted 16 December, 2014; originally announced December 2014.

  47. arXiv:1409.7818  [pdf, ps, other

    cs.CV

    On The Power of Joint Wavelet-DCT Features for Multispectral Palmprint Recognition

    Authors: Shervin Minaee, AmirAli Abdolrashidi

    Abstract: Biometric-based identification has drawn a lot of attention in the recent years. Among all biometrics, palmprint is known to possess a rich set of features. In this paper we have proposed to use DCT-based features in parallel with wavelet-based ones for palmprint identification. PCA is applied to the features to reduce their dimensionality and the majority voting algorithm is used to perform class… ▽ More

    Submitted 25 November, 2015; v1 submitted 27 September, 2014; originally announced September 2014.

    Comments: Asilomar Conference on Signals, Systems and Computers, IEEE, 2015, (to Appear)

  48. arXiv:1408.6615  [pdf, ps, other

    cs.CV

    Multispectral Palmprint Recognition Using Textural Features

    Authors: Shervin Minaee, AmirAli Abdolrashidi

    Abstract: In order to utilize identification to the best extent, we need robust and fast algorithms and systems to process the data. Having palmprint as a reliable and unique characteristic of every person, we extract and use its features based on its geometry, lines and angles. There are countless ways to define measures for the recognition task. To analyze a new point of view, we extracted textural featur… ▽ More

    Submitted 11 February, 2015; v1 submitted 27 August, 2014; originally announced August 2014.

    Comments: 5 pages, Published in IEEE Signal Processing in Medicine and Biology Symposium 2014

  49. arXiv:1408.3772  [pdf, ps, other

    cs.CV

    Highly Accurate Multispectral Palmprint Recognition Using Statistical and Wavelet Features

    Authors: Shervin Minaee, AmirAli Abdolrashidi

    Abstract: Palmprint is one of the most useful physiological biometrics that can be used as a powerful means in personal recognition systems. The major features of the palmprints are palm lines, wrinkles and ridges, and many approaches use them in different ways towards solving the palmprint recognition problem. Here we have proposed to use a set of statistical and wavelet-based features; statistical to capt… ▽ More

    Submitted 24 June, 2015; v1 submitted 16 August, 2014; originally announced August 2014.

    Comments: 6 pages

  50. arXiv:1112.5997  [pdf, ps, other

    cs.CV

    Multispectral Palmprint Recognition Using a Hybrid Feature

    Authors: Sina Akbari Mistani, Shervin Minaee, Emad Fatemizadeh

    Abstract: Personal identification problem has been a major field of research in recent years. Biometrics-based technologies that exploit fingerprints, iris, face, voice and palmprints, have been in the center of attention to solve this problem. Palmprints can be used instead of fingerprints that have been of the earliest of these biometrics technologies. A palm is covered with the same skin as the fingertip… ▽ More

    Submitted 11 December, 2015; v1 submitted 27 December, 2011; originally announced December 2011.

    Comments: 6 pages