Skip to main content

Showing 51–74 of 74 results for author: Bajic, I V

.
  1. arXiv:2009.05666  [pdf, other

    eess.IV

    Affine Transformation-Based Deep Frame Prediction

    Authors: Hyomin Choi, Ivan V. Bajić

    Abstract: We propose a neural network model to estimate the current frame from two reference frames, using affine transformation and adaptive spatially-varying filters. The estimated affine transformation allows for using shorter filters compared to existing approaches for deep frame prediction. The predicted frame is used as a reference for coding the current frame. Since the proposed model is available at… ▽ More

    Submitted 16 February, 2021; v1 submitted 11 September, 2020; originally announced September 2020.

    Comments: This paper is accepted for publication in IEEE Trans. Image Processing, Feb. 2021

  2. arXiv:2007.13645  [pdf, other

    eess.SP cs.LG

    PowerGAN: Synthesizing Appliance Power Signatures Using Generative Adversarial Networks

    Authors: Alon Harell, Richard Jones, Stephen Makonin, Ivan V. Bajic

    Abstract: Non-intrusive load monitoring (NILM) allows users and energy providers to gain insight into home appliance electricity consumption using only the building's smart meter. Most current techniques for NILM are trained using significant amounts of labeled appliances power data. The collection of such data is challenging, making data a major bottleneck in creating well generalizing NILM solutions. To h… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

  3. Soft Video Multicasting Using Adaptive Compressed Sensing

    Authors: Hadi Hadizadeh, Ivan V. bajic

    Abstract: Recently, soft video multicasting has gained a lot of attention, especially in broadcast and mobile scenarios where the bit rate supported by the channel may differ across receivers, and may vary quickly over time. Unlike the conventional designs that force the source to use a single bit rate according to the receiver with the worst channel quality, soft video delivery schemes transmit the video s… ▽ More

    Submitted 6 March, 2020; originally announced March 2020.

  4. arXiv:2002.07048  [pdf, other

    cs.LG cs.MM eess.IV

    Bit Allocation for Multi-Task Collaborative Intelligence

    Authors: Saeed Ranjbar Alvar, Ivan V. Bajić

    Abstract: Recent studies have shown that collaborative intelligence (CI) is a promising framework for deployment of Artificial Intelligence (AI)-based services on mobile devices. In CI, a deep neural network is split between the mobile device and the cloud. Deep features obtained at the mobile are compressed and transferred to the cloud to complete the inference. So far, the methods in the literature focuse… ▽ More

    Submitted 13 February, 2020; originally announced February 2020.

    Comments: Accepted for publication ICASSP'20

  5. arXiv:2002.07036  [pdf, other

    cs.LG eess.IV eess.SP

    Back-and-Forth prediction for deep tensor compression

    Authors: Hyomin Choi, Robert A. Cohen, Ivan V. Bajic

    Abstract: Recent AI applications such as Collaborative Intelligence with neural networks involve transferring deep feature tensors between various computing devices. This necessitates tensor compression in order to optimize the usage of bandwidth-constrained channels between devices. In this paper we present a prediction scheme called Back-and-Forth (BaF) prediction, developed for deep feature tensors, whic… ▽ More

    Submitted 13 February, 2020; originally announced February 2020.

    Comments: Accepted for publication in IEEE ICASSP'20

  6. arXiv:2002.00157  [pdf, other

    cs.AI eess.IV

    Shared Mobile-Cloud Inference for Collaborative Intelligence

    Authors: Mateen Ulhaq, Ivan V. Bajić

    Abstract: As AI applications for mobile devices become more prevalent, there is an increasing need for faster execution and lower energy consumption for neural model inference. Historically, the models run on mobile devices have been smaller and simpler in comparison to large state-of-the-art research models, which can only run on the cloud. However, cloud-only inference has drawbacks such as increased netw… ▽ More

    Submitted 1 February, 2020; originally announced February 2020.

    Comments: 5 pages, 3 figures

  7. arXiv:2001.04433  [pdf, other

    cs.CV

    Towards Automated Swimming Analytics Using Deep Neural Networks

    Authors: Timothy Woinoski, Alon Harell, Ivan V. Bajic

    Abstract: Methods for creating a system to automate the collection of swimming analytics on a pool-wide scale are considered in this paper. There has not been much work on swimmer tracking or the creation of a swimmer database for machine learning purposes. Consequently, methods for collecting swimmer data from videos of swim competitions are explored and analyzed. The result is a guide to the creation of a… ▽ More

    Submitted 13 January, 2020; originally announced January 2020.

  8. arXiv:1908.06261  [pdf, other

    eess.SP

    3D Point Cloud Super-Resolution via Graph Total Variation on Surface Normals

    Authors: Chinthaka Dinesh, Gene Cheung, Ivan V. Bajic

    Abstract: Point cloud is a collection of 3D coordinates that are discrete geometric samples of an object's 2D surfaces. Using a low-cost 3D scanner to acquire data means that point clouds are often in lower resolution than desired for rendering on high-resolution displays. Building on recent advances in graph signal processing, we design a local algorithm for 3D point cloud super-resolution (SR). First, we… ▽ More

    Submitted 17 August, 2019; originally announced August 2019.

  9. arXiv:1906.11942  [pdf

    cs.CV

    Datasets for Face and Object Detection in Fisheye Images

    Authors: Jianglin Fu, Ivan V. Bajic, Rodney G. Vaughan

    Abstract: We present two new fisheye image datasets for training face and object detection models: VOC-360 and Wider-360. The fisheye images are created by post-processing regular images collected from two well-known datasets, VOC2012 and Wider Face, using a model for mapping regular to fisheye images implemented in Matlab. VOC-360 contains 39,575 fisheye images for object detection, segmentation, and class… ▽ More

    Submitted 27 June, 2019; originally announced June 2019.

  10. arXiv:1902.08736  [pdf, other

    eess.SP

    Wavenilm: A causal neural network for power disaggregation from the complex power signal

    Authors: Alon Harell, Stephen Makonin, Ivan V. Bajić

    Abstract: Non-intrusive load monitoring (NILM) helps meet energy conservation goals by estimating individual appliance power usage from a single aggregate measurement. Deep neural networks have become increasingly popular in attempting to solve NILM problems; however, many of them are not causal which is important for real-time application. We present a causal 1-D convolutional neural network inspired by Wa… ▽ More

    Submitted 18 June, 2019; v1 submitted 23 February, 2019; originally announced February 2019.

    Comments: 44th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2019)

  11. arXiv:1902.05179  [pdf, other

    cs.MM

    Multi-task learning with compressible features for Collaborative Intelligence

    Authors: Saeed Ranjbar Alvar, Ivan V. Bajić

    Abstract: A promising way to deploy Artificial Intelligence (AI)-based services on mobile devices is to run a part of the AI model (a deep neural network) on the mobile itself, and the rest in the cloud. This is sometimes referred to as collaborative intelligence. In this framework, intermediate features from the deep network need to be transmitted to the cloud for further processing. We study the case wher… ▽ More

    Submitted 15 May, 2019; v1 submitted 13 February, 2019; originally announced February 2019.

  12. arXiv:1902.02777  [pdf, other

    cs.CV

    FDDB-360: Face Detection in 360-degree Fisheye Images

    Authors: Jianglin Fu, Saeed Ranjbar Alvar, Ivan V. Bajic, Rodney G. Vaughan

    Abstract: 360-degree cameras offer the possibility to cover a large area, for example an entire room, without using multiple distributed vision sensors. However, geometric distortions introduced by their lenses make computer vision problems more challenging. In this paper we address face detection in 360-degree fisheye images. We show how a face detector trained on regular images can be re-trained for this… ▽ More

    Submitted 7 February, 2019; originally announced February 2019.

  13. arXiv:1901.00062  [pdf, other

    eess.IV cs.CV

    Deep Frame Prediction for Video Coding

    Authors: Hyomin Choi, Ivan V. Bajic

    Abstract: We propose a novel frame prediction method using a deep neural network (DNN), with the goal of improving video coding efficiency. The proposed DNN makes use of decoded frames, at both encoder and decoder, to predict textures of the current coding block. Unlike conventional inter-prediction, the proposed method does not require any motion information to be transferred between the encoder and the de… ▽ More

    Submitted 20 June, 2019; v1 submitted 31 December, 2018; originally announced January 2019.

    Comments: This paper is accepted by IEEE Transactions on Circuits and Systems for Video Technology in 2019

  14. arXiv:1812.07711  [pdf, other

    eess.SP

    3D Point Cloud Denoising via Bipartite Graph Approximation and Reweighted Graph Laplacian

    Authors: Chinthaka Dinesh, Gene Cheung, Ivan V. Bajic

    Abstract: Point cloud is a collection of 3D coordinates that are discrete geometric samples of an object's 2D surfaces. Imperfection in the acquisition process means that point clouds are often corrupted with noise. Building on recent advances in graph signal processing, we design local algorithms for 3D point cloud denoising. Specifically, we design a reweighted graph Laplacian regularizer (RGLR) for surfa… ▽ More

    Submitted 18 December, 2018; originally announced December 2018.

    Comments: 14 pages, 7 figures, Journal

  15. arXiv:1805.00107  [pdf, other

    cs.CV

    MV-YOLO: Motion Vector-aided Tracking by Semantic Object Detection

    Authors: Saeed Ranjbar Alvar, Ivan V. Bajić

    Abstract: Object tracking is the cornerstone of many visual analytics systems. While considerable progress has been made in this area in recent years, robust, efficient, and accurate tracking in real-world video remains a challenge. In this paper, we present a hybrid tracker that leverages motion information from the compressed video stream and a general-purpose semantic object detector acting on decoded fr… ▽ More

    Submitted 15 June, 2018; v1 submitted 30 April, 2018; originally announced May 2018.

  16. arXiv:1804.10831  [pdf, other

    eess.SP

    Fast 3D Point Cloud Denoising via Bipartite Graph Approximation & Total Variation

    Authors: Chinthaka Dinesh, Gene Cheung, Ivan V. Bajic, Cheng Yang

    Abstract: Acquired 3D point cloud data, whether from active sensors directly or from stereo-matching algorithms indirectly, typically contain non-negligible noise. To address the point cloud denoising problem, we propose a fast graph-based local algorithm. Specifically, given a k-nearest-neighbor graph of the 3D points, we first approximate it with a bipartite graph(independent sets of red and blue nodes) u… ▽ More

    Submitted 28 April, 2018; originally announced April 2018.

    Comments: 6 pages, 5 figures, conference

  17. Adaptive Non-Rigid Inpainting of 3D Point Cloud Geometry

    Authors: Chinthaka Dinesh, Ivan V. Bajic, Gene Cheung

    Abstract: In this letter, we introduce several algorithms for geometry inpainting of 3D point clouds with large holes. The algorithms are examplar-based: hole filling is performed iteratively using templates near the hole boundary to find the best matching regions elsewhere in the cloud, from where existing points are transferred to the hole. We propose two improvements over the previous work on exemplar-ba… ▽ More

    Submitted 27 April, 2018; originally announced April 2018.

    Comments: 5 pages, 2 figures, a short journal paper (letter)

  18. arXiv:1804.09963  [pdf, other

    eess.IV cs.CV

    Near-Lossless Deep Feature Compression for Collaborative Intelligence

    Authors: Hyomin Choi, Ivan V. Bajic

    Abstract: Collaborative intelligence is a new paradigm for efficient deployment of deep neural networks across the mobile-cloud infrastructure. By dividing the network between the mobile and the cloud, it is possible to distribute the computational workload such that the overall energy and/or latency of the system is minimized. However, this necessitates sending deep feature data from the mobile to the clou… ▽ More

    Submitted 15 June, 2018; v1 submitted 26 April, 2018; originally announced April 2018.

  19. arXiv:1802.03931  [pdf, other

    cs.CV

    Deep feature compression for collaborative object detection

    Authors: Hyomin Choi, Ivan V. Bajic

    Abstract: Recent studies have shown that the efficiency of deep neural networks in mobile applications can be significantly improved by distributing the computational workload between the mobile device and the cloud. This paradigm, termed collaborative intelligence, involves communicating feature data between the mobile and the cloud. The efficiency of such approach can be further improved by lossy compress… ▽ More

    Submitted 12 February, 2018; originally announced February 2018.

  20. arXiv:1710.11151  [pdf, other

    eess.IV cs.CV

    High efficiency compression for object detection

    Authors: Hyomin Choi, Ivan V. Bajic

    Abstract: Image and video compression has traditionally been tailored to human vision. However, modern applications such as visual analytics and surveillance rely on computers seeing and analyzing the images before (or instead of) humans. For these applications, it is important to adjust compression to computer vision. In this paper we present a bit allocation and rate control strategy that is tailored to o… ▽ More

    Submitted 15 February, 2018; v1 submitted 30 October, 2017; originally announced October 2017.

    Comments: The paper is published in IEEE ICASSP 18'

  21. arXiv:1710.10736  [pdf, other

    cs.CV

    Can you find a face in a HEVC bitstream?

    Authors: Saeed Ranjbar Alvar, Hyomin Choi, Ivan V. Bajic

    Abstract: Finding faces in images is one of the most important tasks in computer vision, with applications in biometrics, surveillance, human-computer interaction, and other areas. In our earlier work, we demonstrated that it is possible to tell whether or not an image contains a face by only examining the HEVC syntax, without fully reconstructing the image. In the present work we move further in this direc… ▽ More

    Submitted 23 February, 2018; v1 submitted 29 October, 2017; originally announced October 2017.

  22. arXiv:1709.02993  [pdf, other

    cs.CV

    Can you tell a face from a HEVC bitstream?

    Authors: Saeed Ranjbar Alvar, Hyomin Choi, Ivan V. Bajic

    Abstract: Image and video analytics are being increasingly used on a massive scale. Not only is the amount of data growing, but the complexity of the data processing pipelines is also increasing, thereby exacerbating the problem. It is becoming increasingly important to save computational resources wherever possible. We focus on one of the poster problems of visual analytics -- face detection -- and approac… ▽ More

    Submitted 9 September, 2017; originally announced September 2017.

  23. arXiv:1604.07339  [pdf, ps, other

    cs.MM

    Compressed-domain visual saliency models: A comparative study

    Authors: Sayed Hossein Khatoonabadi, Ivan V. Bajic, Yufeng Shan

    Abstract: Computational modeling of visual saliency has become an important research problem in recent years, with applications in video quality estimation, video compression, object tracking, retargeting, summarization, and so on. While most visual saliency models for dynamic scenes operate on raw video, several models have been developed for use with compressed-domain information such as motion vectors an… ▽ More

    Submitted 25 April, 2016; originally announced April 2016.

    ACM Class: I.2.10; I.4.8

  24. Load Disaggregation Based on Aided Linear Integer Programming

    Authors: Md. Zulfiquar Ali Bhotto, Stephen Makonin, Ivan V. Bajic

    Abstract: Load disaggregation based on aided linear integer programming (ALIP) is proposed. We start with a conventional linear integer programming (IP) based disaggregation and enhance it in several ways. The enhancements include additional constraints, correction based on a state diagram, median filtering, and linear programming-based refinement. With the aid of these enhancements, the performance of IP-b… ▽ More

    Submitted 30 August, 2016; v1 submitted 23 March, 2016; originally announced March 2016.