-
Sparse Approximation of the Subdivision-Rips Bifiltration for Doubling Metrics
Authors:
Michael Lesnick,
Kenneth McCabe
Abstract:
The Vietoris-Rips filtration, the standard filtration on metric data in topological data analysis, is notoriously sensitive to outliers. Sheehy's subdivision-Rips bifiltration $\mathcal{SR}(-)$ is a density-sensitive refinement that is robust to outliers in a strong sense, but whose 0-skeleton has exponential size. For $X$ a finite metric space of constant doubling dimension and fixed $\epsilon>0$, we construct a $(1+\epsilon)$-homotopy interleaving approximation of $\mathcal{SR}(X)$ whose $k$-skeleton has size $O(|X|^{k+2})$. For $k\geq 1$ constant, the $k$-skeleton can be computed in time $O(|X|^{k+3})$.
Submitted 29 August, 2024;
originally announced August 2024.
-
Nerve Models of Subdivision Bifiltrations
Authors:
Michael Lesnick,
Kenneth McCabe
Abstract:
We study the size of Sheehy's subdivision bifiltrations, up to homotopy. We focus in particular on the subdivision-Rips bifiltration $\mathcal{SR}(X)$ of a metric space $X$, the only density-sensitive bifiltration on metric spaces known to satisfy a strong robustness property. Given a simplicial filtration $\mathcal{F}$ with a total of $m$ maximal simplices across all indices, we introduce a nerve-based simplicial model for its subdivision bifiltration $\mathcal{SF}$ whose $k$-skeleton has size $O(m^{k+1})$. We also show that the $0$-skeleton of any simplicial model of $\mathcal{SF}$ has size at least $m$. We give several applications: For an arbitrary metric space $X$, we introduce a $\sqrt{2}$-approximation to $\mathcal{SR}(X)$, denoted $\mathcal{J}(X)$, whose $k$-skeleton has size $O(|X|^{k+2})$. This improves on the previous best approximation bound of $\sqrt{3}$, achieved by the degree-Rips bifiltration, which implies that $\mathcal{J}(X)$ is more robust than degree-Rips. Moreover, we show that the approximation factor of $\sqrt{2}$ is tight; in particular, there exists no exact model of $\mathcal{SR}(X)$ with poly-size skeleta. On the other hand, we show that for $X$ in a fixed-dimensional Euclidean space with the $\ell_p$-metric, there exists an exact model of $\mathcal{SR}(X)$ with poly-size skeleta for $p\in \{1, \infty\}$, as well as a $(1+\epsilon)$-approximation to $\mathcal{SR}(X)$ with poly-size skeleta for any $p \in (1, \infty)$ and fixed $\epsilon > 0$.
Submitted 11 June, 2024;
originally announced June 2024.
-
Identifying Candidate Spaces for Advert Implantation
Authors:
Soumyabrata Dev,
Hossein Javidnia,
Murhaf Hossari,
Matthew Nicholson,
Killian McCabe,
Atul Nautiyal,
Clare Conran,
Jian Tang,
Wei Xu,
François Pitié
Abstract:
Virtual advertising is an important and promising feature in the area of online advertising. It involves integrating adverts onto live or recorded videos for product placements and targeted advertisements. Such integration of adverts is primarily done by video editors in the post-production stage, which is cumbersome and time-consuming. Therefore, it is important to automatically identify candidate spaces in a video frame, wherein new adverts can be implanted. The candidate space should match the scene perspective, and also have a high quality of experience according to human subjective judgment. In this paper, we propose the use of a bespoke neural net that can assist the video editors in identifying candidate spaces. We benchmark our approach against several deep-learning architectures on a large-scale image dataset of candidate spaces of outdoor scenes. Our work is the first of its kind in this area of multimedia and augmented reality applications, and achieves the best results.
Submitted 8 October, 2019;
originally announced October 2019.
-
Localizing Adverts in Outdoor Scenes
Authors:
Soumyabrata Dev,
Murhaf Hossari,
Matthew Nicholson,
Killian McCabe,
Atul Nautiyal,
Clare Conran,
Jian Tang,
Wei Xu,
François Pitié
Abstract:
Online videos have witnessed unprecedented growth over the last decade, owing to a wide range of content creation. This provides advertisement and marketing agencies a plethora of opportunities for targeted advertisements. Such techniques involve replacing an existing advertisement in a video frame with a new advertisement. However, such post-processing of online videos is mostly done manually by video editors. This is cumbersome and time-consuming. In this paper, we propose DeepAds -- a deep neural network, based on the simple encoder-decoder architecture, that can accurately localize the position of an advert in a video frame. Our approach of localizing billboards in outdoor scenes using neural nets is the first of its kind, and achieves the best performance. We benchmark our proposed method against other semantic segmentation algorithms on a public dataset of outdoor scenes with manually annotated billboard binary maps.
Submitted 6 May, 2019;
originally announced May 2019.
-
A Comparison of Online Automatic Speech Recognition Systems and the Nonverbal Responses to Unintelligible Speech
Authors:
Joshua Y. Kim,
Chunfeng Liu,
Rafael A. Calvo,
Kathryn McCabe,
Silas C. R. Taylor,
Björn W. Schuller,
Kaihang Wu
Abstract:
Automatic Speech Recognition (ASR) systems have proliferated in recent years to the point that free platforms such as YouTube now provide speech recognition services. Given the wide selection of ASR systems, we contribute to the field of automatic speech recognition by comparing the relative performance of two sets of manual transcriptions and five sets of automatic transcriptions (Google Cloud, IBM Watson, Microsoft Azure, Trint, and YouTube) to help researchers select accurate transcription services. In addition, we identify nonverbal behaviors that are associated with unintelligible speech, as indicated by high word error rates. We show that manual transcriptions remain superior to current automatic transcriptions. Amongst the automatic transcription services, YouTube offers the most accurate transcription service. For nonverbal behavioral involvement, we provide evidence that the variability of smile intensities from the listener is high (low) when the speaker is clear (unintelligible). These findings are derived from videoconferencing interactions between student doctors and simulated patients; therefore, we contribute towards both the ASR literature and the healthcare communication skills teaching community.
Submitted 28 April, 2019;
originally announced April 2019.
-
The ALOS Dataset for Advert Localization in Outdoor Scenes
Authors:
Soumyabrata Dev,
Murhaf Hossari,
Matthew Nicholson,
Killian McCabe,
Atul Nautiyal,
Clare Conran,
Jian Tang,
Wei Xu,
François Pitié
Abstract:
The rapid increase in the number of online videos provides the marketing and advertising agents ample opportunities to reach out to their audience. One of the most widely used strategies is product placement, or embedded marketing, wherein new advertisements are integrated seamlessly into existing advertisements in videos. Such strategies involve accurately localizing the position of the advert in the image frame, either manually in the video editing phase, or by using machine learning frameworks. However, these machine learning techniques and deep neural networks need a massive amount of data for training. In this paper, we propose and release the first large-scale dataset of advertisement billboards, captured in outdoor scenes. We also benchmark several state-of-the-art semantic segmentation algorithms on our proposed dataset.
Submitted 16 April, 2019;
originally announced April 2019.
-
The CASE Dataset of Candidate Spaces for Advert Implantation
Authors:
Soumyabrata Dev,
Murhaf Hossari,
Matthew Nicholson,
Killian McCabe,
Atul Nautiyal,
Clare Conran,
Jian Tang,
Wei Xu,
François Pitié
Abstract:
With the advent of faster internet services and the growth of multimedia content, we observe a massive growth in the number of online videos. Users generate this video content at an unprecedented rate, owing to the use of smartphones and other hand-held video-capturing devices. This creates immense potential for advertising and marketing agencies to create personalized content for users. In this paper, we attempt to assist video editors in generating augmented video content by proposing candidate spaces in video frames. We propose and release a large-scale dataset of outdoor scenes, along with manually annotated maps for candidate spaces. We also benchmark several deep-learning-based semantic segmentation algorithms on this proposed dataset.
Submitted 29 April, 2019; v1 submitted 21 March, 2019;
originally announced March 2019.
-
ADNet: A Deep Network for Detecting Adverts
Authors:
Murhaf Hossari,
Soumyabrata Dev,
Matthew Nicholson,
Killian McCabe,
Atul Nautiyal,
Clare Conran,
Jian Tang,
Wei Xu,
François Pitié
Abstract:
Online video advertising gives content providers the ability to deliver compelling content, reach a growing audience, and generate additional revenue from online media. Recently, advertising strategies have been designed to look for original advert(s) in a video frame and replace them with new adverts. These strategies, popularly known as product placement or embedded marketing, greatly help marketing agencies reach a wider audience. However, in the existing literature, such detection of candidate frames in a video sequence for the purpose of advert integration is done manually. In this paper, we propose a deep-learning architecture called ADNet that automatically detects the presence of advertisements in video frames. Our approach is the first of its kind that automatically detects the presence of adverts in a video frame, and achieves state-of-the-art results on a public dataset.
Submitted 9 November, 2018;
originally announced November 2018.
-
An Advert Creation System for Next-Gen Publicity
Authors:
Atul Nautiyal,
Killian McCabe,
Murhaf Hossari,
Soumyabrata Dev,
Matthew Nicholson,
Clare Conran,
Declan McKibben,
Jian Tang,
Xu Wei,
Francois Pitie
Abstract:
With the rapid proliferation of multimedia data on the internet, there has been a fast rise in the creation of videos for viewers. This enables viewers to skip the advertisement breaks in the videos, using ad blockers and 'skip ad' buttons -- bringing online marketing and publicity to a stall. In this paper, we demonstrate a system that can effectively integrate a new advertisement into a video sequence. We use state-of-the-art techniques from deep learning and computational photogrammetry for effective detection of existing adverts and seamless integration of new adverts into video sequences. This is helpful for targeted advertisement, paving the path for next-gen publicity.
Submitted 1 August, 2018;
originally announced August 2018.
-
Network-Centric Quantum Communications with Application to Critical Infrastructure Protection
Authors:
Richard J. Hughes,
Jane E. Nordholt,
Kevin P. McCabe,
Raymond T. Newell,
Charles G. Peterson,
Rolando D. Somma
Abstract:
Network-centric quantum communications (NQC) - a new, scalable instantiation of quantum cryptography providing key management with forward security for lightweight encryption, authentication and digital signatures in optical networks - is briefly described. Results from a multi-node experimental test-bed utilizing integrated photonics quantum communications components, known as QKarDs, include: quantum identification; verifiable quantum secret sharing; multi-party authenticated key establishment, including group keying; and single-fiber quantum-secured communications that can be applied as a security retrofit/upgrade to existing optical fiber installations. A demonstration that NQC meets the challenging simultaneous latency and security requirements of electric grid control communications, which cannot be met without compromises using conventional cryptography, is described.
Submitted 1 May, 2013;
originally announced May 2013.