Skip to main content

Showing 1–11 of 11 results for author: Lew, M S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2104.04991  [pdf, other

    cs.CV

    Integrating Information Theory and Adversarial Learning for Cross-modal Retrieval

    Authors: Wei Chen, Yu Liu, Erwin M. Bakker, Michael S. Lew

    Abstract: Accurately matching visual and textual data in cross-modal retrieval has been widely studied in the multimedia community. To address these challenges posited by the heterogeneity gap and the semantic gap, we propose integrating Shannon information theory and adversarial learning. In terms of the heterogeneity gap, we integrate modality classification and information entropy maximization adversaria… ▽ More

    Submitted 11 April, 2021; originally announced April 2021.

    Comments: Accepted by Pattern Recognition

  2. arXiv:2103.12462  [pdf, other

    cs.CV

    Lifelong Person Re-Identification via Adaptive Knowledge Accumulation

    Authors: Nan Pu, Wei Chen, Yu Liu, Erwin M. Bakker, Michael S. Lew

    Abstract: Person ReID methods always learn through a stationary domain that is fixed by the choice of a given dataset. In many contexts (e.g., lifelong learning), those methods are ineffective because the domain is continually changing in which case incremental learning over multiple domains is required potentially. In this work we explore a new and challenging ReID task, namely lifelong person re-identific… ▽ More

    Submitted 23 March, 2021; originally announced March 2021.

    Comments: 10 pages, 5 figures, Accepted by CVPR2021

  3. arXiv:2101.11282  [pdf, other

    cs.CV

    Deep Learning for Instance Retrieval: A Survey

    Authors: Wei Chen, Yu Liu, Weiping Wang, Erwin Bakker, Theodoros Georgiou, Paul Fieguth, Li Liu, Michael S. Lew

    Abstract: In recent years a vast amount of visual content has been generated and shared from many fields, such as social media platforms, medical imaging, and robotics. This abundance of content creation and sharing has introduced new challenges, particularly that of searching databases for similar content-Content Based Image Retrieval (CBIR)-a long-established research area in which improved efficiency and… ▽ More

    Submitted 30 October, 2022; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence

  4. arXiv:2010.08189  [pdf, other

    cs.CV

    New Ideas and Trends in Deep Multimodal Content Understanding: A Review

    Authors: Wei Chen, Weiping Wang, Li Liu, Michael S. Lew

    Abstract: The focus of this survey is on the analysis of two modalities of multimodal deep learning: image and text. Unlike classic reviews of deep learning where monomodal image classifiers such as VGG, ResNet and Inception module are central topics, this paper will examine recent multimodal deep models and structures, including auto-encoders, generative adversarial nets and their variants. These models go… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: Accepted by Neurocomputing

  5. arXiv:2008.02520  [pdf, other

    cs.CV

    Dual Gaussian-based Variational Subspace Disentanglement for Visible-Infrared Person Re-Identification

    Authors: Nan Pu, Wei Chen, Yu Liu, Erwin M. Bakker, Michael S. Lew

    Abstract: Visible-infrared person re-identification (VI-ReID) is a challenging and essential task in night-time intelligent surveillance systems. Except for the intra-modality variance that RGB-RGB person re-identification mainly overcomes, VI-ReID suffers from additional inter-modality variance caused by the inherent heterogeneous gap. To solve the problem, we present a carefully designed dual Gaussian-bas… ▽ More

    Submitted 6 August, 2020; originally announced August 2020.

    Comments: Accepted by ACM MM 2020 poster. 12 pages, 10 appendixes

  6. arXiv:1611.05503  [pdf, other

    cs.CV

    On the Exploration of Convolutional Fusion Networks for Visual Recognition

    Authors: Yu Liu, Yanming Guo, Michael S. Lew

    Abstract: Despite recent advances in multi-scale deep representations, their limitations are attributed to expensive parameters and weak fusion modules. Hence, we propose an efficient approach to fuse multi-scale deep representations, called convolutional fusion networks (CFN). Owing to using 1$\times$1 convolution and global average pooling, CFN can efficiently generate the side branches while adding few p… ▽ More

    Submitted 16 November, 2016; originally announced November 2016.

    Comments: 23rd International Conference on MultiMedia Modeling (MMM 2017)

  7. arXiv:1101.0243  [pdf

    cs.GR

    Across Browsers SVG Implementation

    Authors: Liang Wang, Nies Huijsmans, Michael S. Lew, Dan Tsymbala

    Abstract: In this work SVG will be translated into VML or HTML by using Javascript based on Backbase Client Framework. The target of this project is to implement SVG to be viewed in Internet Explorer without any plug-in and work together with other Backbase Client Framework languages. The result of this project will be added as an extension to the current Backbase Client Framework.

    Submitted 31 December, 2010; originally announced January 2011.

    Report number: LML20090402

  8. arXiv:1101.0242  [pdf

    cs.CV

    Binary and nonbinary description of hypointensity in human brain MR images

    Authors: Xiaojing Chen, Michael S. Lew

    Abstract: Accumulating evidence has shown that iron is involved in the mechanism underlying many neurodegenerative diseases, such as Alzheimer's disease, Parkinson's disease and Huntington's disease. Abnormal (higher) iron accumulation has been detected in the brains of most neurodegenerative patients, especially in the basal ganglia region. Presence of iron leads to changes in MR signal in both magnitude a… ▽ More

    Submitted 31 December, 2010; originally announced January 2011.

    Report number: LML20080101

  9. arXiv:1101.0237  [pdf

    cs.CV

    A Framework for Real-Time Face and Facial Feature Tracking using Optical Flow Pre-estimation and Template Tracking

    Authors: E. R. Gast, Michael S. Lew

    Abstract: This work presents a framework for tracking head movements and capturing the movements of the mouth and both the eyebrows in real-time. We present a head tracker which is a combination of a optical flow and a template based tracker. The estimation of the optical flow head tracker is used as starting point for the template tracker which fine-tunes the head estimation. This approach together with re… ▽ More

    Submitted 31 December, 2010; originally announced January 2011.

    Report number: LML20100401

  10. arXiv:1101.0235  [pdf

    cs.HC

    Analysis of Using Browser-native Technology to Build Rich Internet Applications for Image Manipulation

    Authors: Thomas Steenbergen, Michael S. Lew

    Abstract: In this work we investigate whether browser-native technologies can be used to perform photo manipulation tasks e.g cropping, resizing or rotating an image within the current mainstream browser. By the use of a case study we will analyze problems that have occurred during the implementation of a prototype web application that utilizes browser-native web technology in order to create an online vers… ▽ More

    Submitted 31 December, 2010; originally announced January 2011.

    Report number: LML20090901

  11. arXiv:1101.0234  [pdf

    cs.HC

    Dynamic Feature Description in Human Action Recognition

    Authors: Ruoyun Gao, Michael S. Lew, Ling Shao

    Abstract: This work aims to present novel description methods for human action recognition. Generally, a video sequence can be represented as a collection of spatial temporal words by detecting space-time interest points and describing the unique features around the detected points (Bag of Words representation). Interest points as well as the cuboids around them are considered informative for feature descri… ▽ More

    Submitted 31 December, 2010; originally announced January 2011.

    Report number: LML20090701