Complexity Evaluation of Parallel Execution of the RAPiD Deep-Learning Algorithm on Intel CPU
Authors:
Dominic Konrad,
Zhihao Duan,
Mertcan Cokbas,
Prakash Ishwar
Abstract:
Knowing how many and where are people in various indoor spaces is critical for reducing HVAC energy waste, space management, spatial analytics and in emergency scenarios. While a range of technologies have been proposed to detect and track people in large indoor spaces, ceiling-mounted fisheye cameras have recently emerged as strong contenders. Currently, RAPiD is the SOTA algorithm for people det…
▽ More
Knowing how many and where are people in various indoor spaces is critical for reducing HVAC energy waste, space management, spatial analytics and in emergency scenarios. While a range of technologies have been proposed to detect and track people in large indoor spaces, ceiling-mounted fisheye cameras have recently emerged as strong contenders. Currently, RAPiD is the SOTA algorithm for people detection in images captured by fisheye cameras. However, in large spaces several overhead fisheye cameras are needed to assure high accuracy of counting and thus multiple instances of RAPiD must be executed simultaneously. This report evaluates inference time when multiple instances of RAPiD run in parallel on an Ubuntu NUC PC with Intel I7 8559U CPU. We consider three mechanisms of CPU-resource allocation to handle multiple instances of RAPiD: 1) managed by Ubuntu, 2) managed by user via operating-system calls to assign logical cores, and 3) managed by user via PyTorch-library calls to limit the number of threads used by PyTorch. Each scenario was evaluated on 300 images. The experimental results show, that when one or two instances of RAPiD are executed in parallel all three approaches result in similar inference times of 1.8sec and 3.2sec, respectively. However, when three or more instances of RAPiD run in parallel, limiting the number of threads used by PyTorch results in the shortest inference times. On average, RAPiD completes inference of 2 images simultaneously in about 3sec, 4 images in 6sec and 8 images in less than 14sec. This is important for real-time system design. In HVAC-application scenarios, with a typical reaction time of 10-15min, a latency of 14sec is negligible so a single 8559U CPU can support 8 camera streams thus reducing the system cost. However, in emergency scenarios, when time is of essence, a single CPU may be needed for each camera to reduce the latency to 1.8sec.
△ Less
Submitted 11 December, 2023;
originally announced December 2023.
Cyclic competition of four species: domains and interfaces
Authors:
Ahmed Roman,
David Konrad,
Michel Pleimling
Abstract:
We study numerically domain growth and interface fluctuations in one- and two-dimensional lattice systems composed of four species that interact in a cyclic way. Particle mobility is implemented through exchanges of particles located on neighboring lattice sites. For the chain we find that domain growth strongly depends on the mobility, with a higher mobility yielding a larger domain growth expone…
▽ More
We study numerically domain growth and interface fluctuations in one- and two-dimensional lattice systems composed of four species that interact in a cyclic way. Particle mobility is implemented through exchanges of particles located on neighboring lattice sites. For the chain we find that domain growth strongly depends on the mobility, with a higher mobility yielding a larger domain growth exponent. In two space dimensions, when also exchanges between mutually neutral particles are possible, both domain growth and interface fluctuations display universal regimes that are independent of the predation and exchange rates.
△ Less
Submitted 25 June, 2012; v1 submitted 22 May, 2012;
originally announced May 2012.