Skip to main content

Showing 1–28 of 28 results for author: Peleg, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.10759  [pdf, other

    cs.CV

    Clothes-Changing Person Re-identification Based On Skeleton Dynamics

    Authors: Asaf Joseph, Shmuel Peleg

    Abstract: Clothes-Changing Person Re-Identification (ReID) aims to recognize the same individual across different videos captured at various times and locations. This task is particularly challenging due to changes in appearance, such as clothing, hairstyle, and accessories. We propose a Clothes-Changing ReID method that uses only skeleton data and does not use appearance features. Traditional ReID methods… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

  2. arXiv:2408.17434  [pdf, other

    cs.SD eess.AS

    Audio Enhancement from Multiple Crowdsourced Recordings: A Simple and Effective Baseline

    Authors: Shiran Aziz, Yossi Adi, Shmuel Peleg

    Abstract: With the popularity of cellular phones, events are often recorded by multiple devices from different locations and shared on social media. Several different recordings could be found for many events. Such recordings are usually noisy, where noise for each device is local and unrelated to others. This case of multiple microphones at unknown locations, capturing local, uncorrelated noise, was rarely… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

  3. arXiv:2211.13807  [pdf, other

    cs.CV

    GEFF: Improving Any Clothes-Changing Person ReID Model using Gallery Enrichment with Face Features

    Authors: Daniel Arkushin, Bar Cohen, Shmuel Peleg, Ohad Fried

    Abstract: In the Clothes-Changing Re-Identification (CC-ReID) problem, given a query sample of a person, the goal is to determine the correct identity based on a labeled gallery in which the person appears in different clothes. Several models tackle this challenge by extracting clothes-independent features. However, the performance of these models is still lower for the clothes-changing setting compared to… ▽ More

    Submitted 21 November, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

  4. arXiv:2209.04177  [pdf, ps, other

    cs.CC cs.DS

    Tensor Reconstruction Beyond Constant Rank

    Authors: Shir Peleg, Amir Shpilka, Ben Lee Volk

    Abstract: We give reconstruction algorithms for subclasses of depth-3 arithmetic circuits. In particular, we obtain the first efficient algorithm for finding tensor rank, and an optimal tensor decomposition as a sum of rank-one tensors, when given black-box access to a tensor of super-constant rank. We obtain the following results: 1. A deterministic algorithm that reconstructs polynomials computed by… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

    Comments: Abstract shortened to meet arXiv requirements; 59 pages

  5. Deep Audio Waveform Prior

    Authors: Arnon Turetzky, Tzvi Michelson, Yossi Adi, Shmuel Peleg

    Abstract: Convolutional neural networks contain strong priors for generating natural looking images [1]. These priors enable image denoising, super resolution, and inpainting in an unsupervised manner. Previous attempts to demonstrate similar ideas in audio, namely deep audio priors, (i) use hand picked architectures such as harmonic convolutions, (ii) only work with spectrogram input, and (iii) have been u… ▽ More

    Submitted 21 July, 2022; originally announced July 2022.

    Comments: Interspeech 2022

  6. arXiv:2205.09791  [pdf, other

    cs.CV cs.HC cs.MM

    A Peek at Peak Emotion Recognition

    Authors: Tzvi Michelson, Hillel Aviezer, Shmuel Peleg

    Abstract: Despite much progress in the field of facial expression recognition, little attention has been paid to the recognition of peak emotion. Aviezer et al. [1] showed that humans have trouble discerning between positive and negative peak emotions. In this work we analyze how deep learning fares on this challenge. We find that (i) despite using very small datasets, features extracted from deep learning… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: Submitted to HBU Workshop at ICPR, 6 pages, 5 figures

  7. arXiv:2202.04932  [pdf, other

    cs.CG cs.CC

    Robust Sylvester-Gallai type theorem for quadratic polynomials

    Authors: Shir Peleg, Amir Shpilka

    Abstract: In this work, we extend the robust version of the Sylvester-Gallai theorem, obtained by Barak, Dvir, Wigderson and Yehudayoff, and by Dvir, Saraf and Wigderson, to the case of quadratic polynomials. Specifically, we prove that if $\mathcal{Q}\subset \mathbb{C}[x_1.\ldots,x_n]$ is a finite set, $|\mathcal{Q}|=m$, of irreducible quadratic polynomials that satisfy the following condition: There is… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: text overlap with arXiv:2006.08263

  8. arXiv:2110.01367  [pdf, other

    cs.SD cs.MM eess.AS

    Audio-Visual Evaluation of Oratory Skills

    Authors: Tzvi Michelson, Shmuel Peleg

    Abstract: What makes a talk successful? Is it the content or the presentation? We try to estimate the contribution of the speaker's oratory skills to the talk's success, while ignoring the content of the talk. By oratory skills we refer to facial expressions, motions and gestures, as well as the vocal features. We use TED Talks as our dataset, and measure the success of each talk by its view count. Using th… ▽ More

    Submitted 30 September, 2021; originally announced October 2021.

    Comments: TransAI 2021

  9. Lower Bounds on Stabilizer Rank

    Authors: Shir Peleg, Amir Shpilka, Ben Lee Volk

    Abstract: The stabilizer rank of a quantum state $ψ$ is the minimal $r$ such that $\left| ψ\right \rangle = \sum_{j=1}^r c_j \left|\varphi_j \right\rangle$ for $c_j \in \mathbb{C}$ and stabilizer states $\varphi_j$. The running time of several classical simulation methods for quantum circuits is determined by the stabilizer rank of the $n$-th tensor power of single-qubit magic states. We prove a lower bou… ▽ More

    Submitted 10 February, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

    Journal ref: Quantum 6, 652 (2022)

  10. arXiv:2102.07762  [pdf, other

    cs.LG cs.CR

    Membership Inference Attacks are Easier on Difficult Problems

    Authors: Avital Shafran, Shmuel Peleg, Yedid Hoshen

    Abstract: Membership inference attacks (MIA) try to detect if data samples were used to train a neural network model, e.g. to detect copyright abuses. We show that models with higher dimensional input and output are more vulnerable to MIA, and address in more detail models for image translation and semantic segmentation, including medical image segmentation. We show that reconstruction-errors can lead to ve… ▽ More

    Submitted 18 August, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

  11. arXiv:2006.08263  [pdf, other

    cs.CC

    Polynomial time deterministic identity testingalgorithm for $Σ^{[3]}ΠΣΠ^{[2]}$ circuits via Edelstein-Kelly type theorem for quadratic polynomials

    Authors: Shir Peleg, Amir Shpilka

    Abstract: In this work we resolve conjectures of Beecken, Mitmann and Saxena [BMS13] and Gupta [Gup14], by proving an analog of a theorem of Edelstein and Kelly for quadratic polynomials. As immediate corollary we obtain the first deterministic polynomial time black-box algorithm for testing zeroness of $Σ^{[3]}ΠΣΠ^{[2]}$ circuits.

    Submitted 15 June, 2020; originally announced June 2020.

  12. arXiv:2003.05152  [pdf, ps, other

    cs.CC cs.CG

    A generalized Sylvester-Gallai type theorem for quadratic polynomials

    Authors: Shir Peleg, Amir Shpilka

    Abstract: In this work we prove a version of the Sylvester-Gallai theorem for quadratic polynomials that takes us one step closer to obtaining a deterministic polynomial time algorithm for testing zeroness of $Σ^{[3]}ΠΣΠ^{[2]}$ circuits. Specifically, we prove that if a finite set of irreducible quadratic polynomials $\mathcal{Q}$ satisfy that for every two polynomials $Q_1,Q_2\in \mathcal{Q}$ there is a su… ▽ More

    Submitted 11 March, 2020; originally announced March 2020.

  13. arXiv:1911.12322  [pdf, other

    cs.LG cs.CR stat.ML

    Crypto-Oriented Neural Architecture Design

    Authors: Avital Shafran, Gil Segev, Shmuel Peleg, Yedid Hoshen

    Abstract: As neural networks revolutionize many applications, significant privacy conflicts between model users and providers emerge. The cryptography community developed a variety of techniques for secure computation to address such privacy issues. As generic techniques for secure computation are typically prohibitively ineffective, many efforts focus on optimizing their underlying cryptographic tools. Dif… ▽ More

    Submitted 16 February, 2021; v1 submitted 27 November, 2019; originally announced November 2019.

    Comments: Full version (shorter version published in ICASSP'21)

  14. arXiv:1808.06250  [pdf, other

    cs.CV

    Dynamic Temporal Alignment of Speech to Lips

    Authors: Tavi Halperin, Ariel Ephrat, Shmuel Peleg

    Abstract: Many speech segments in movies are re-recorded in a studio during postproduction, to compensate for poor sound quality as recorded on location. Manual alignment of the newly-recorded speech with the original lip movements is a tedious task. We present an audio-to-video alignment method for automating speech to lips alignment, stretching and compressing the audio signal to match the lip movements.… ▽ More

    Submitted 19 August, 2018; originally announced August 2018.

  15. arXiv:1711.08789  [pdf, other

    cs.CV cs.SD eess.AS

    Visual Speech Enhancement

    Authors: Aviv Gabbay, Asaph Shamir, Shmuel Peleg

    Abstract: When video is shot in noisy environment, the voice of a speaker seen in the video can be enhanced using the visible mouth movements, reducing background noise. While most existing methods use audio-only inputs, improved performance is obtained with our visual speech enhancement, based on an audio-visual neural network. We include in the training data videos to which we added the voice of the targe… ▽ More

    Submitted 13 June, 2018; v1 submitted 23 November, 2017; originally announced November 2017.

    Comments: Accepted to Interspeech 2018. Supplementary video: https://www.youtube.com/watch?v=nyYarDGpcYA

  16. arXiv:1708.06767  [pdf, other

    cs.CV cs.SD

    Seeing Through Noise: Visually Driven Speaker Separation and Enhancement

    Authors: Aviv Gabbay, Ariel Ephrat, Tavi Halperin, Shmuel Peleg

    Abstract: Isolating the voice of a specific person while filtering out other voices or background noises is challenging when video is shot in noisy environments. We propose audio-visual methods to isolate the voice of a single speaker and eliminate unrelated sounds. First, face motions captured in the video are used to estimate the speaker's voice, by passing the silent video frames through a video-to-speec… ▽ More

    Submitted 9 February, 2018; v1 submitted 22 August, 2017; originally announced August 2017.

    Comments: Supplementary video: https://www.youtube.com/watch?v=qmsyj7vAzoI

  17. arXiv:1708.01204  [pdf, other

    cs.CV cs.SD

    Improved Speech Reconstruction from Silent Video

    Authors: Ariel Ephrat, Tavi Halperin, Shmuel Peleg

    Abstract: Speechreading is the task of inferring phonetic information from visually observed articulatory facial movements, and is a notoriously difficult task for humans to perform. In this paper we present an end-to-end model based on a convolutional neural network (CNN) for generating an intelligible and natural-sounding acoustic speech signal from silent video frames of a speaking person. We train our m… ▽ More

    Submitted 29 August, 2017; v1 submitted 1 August, 2017; originally announced August 2017.

    Comments: Accepted to ICCV 2017 Workshop on Computer Vision for Audio-Visual Media. Supplementary video: https://www.youtube.com/watch?v=Xjbn7h7tpg0. arXiv admin note: text overlap with arXiv:1701.00495

  18. arXiv:1701.00495  [pdf, other

    cs.CV cs.SD

    Vid2speech: Speech Reconstruction from Silent Video

    Authors: Ariel Ephrat, Shmuel Peleg

    Abstract: Speechreading is a notoriously difficult task for humans to perform. In this paper we present an end-to-end model based on a convolutional neural network (CNN) for generating an intelligible acoustic speech signal from silent video frames of a speaking person. The proposed CNN generates sound features for each frame based on its neighboring frames. Waveforms are then synthesized from the learned s… ▽ More

    Submitted 9 January, 2017; v1 submitted 2 January, 2017; originally announced January 2017.

    Comments: Accepted for publication at ICASSP 2017

  19. arXiv:1607.07660  [pdf, other

    cs.CV

    Fundamental Matrices from Moving Objects Using Line Motion Barcodes

    Authors: Yoni Kasten, Gil Ben-Artzi, Shmuel Peleg, Michael Werman

    Abstract: Computing the epipolar geometry between cameras with very different viewpoints is often very difficult. The appearance of objects can vary greatly, and it is difficult to find corresponding feature points. Prior methods searched for corresponding epipolar lines using points on the convex hull of the silhouette of a single moving object. These methods fail when the scene includes multiple moving ob… ▽ More

    Submitted 26 July, 2016; originally announced July 2016.

    Journal ref: ECCV'16, Amsterdam, Oct. 2016, Vol II, pp. 220-118

  20. EgoSampling: Wide View Hyperlapse from Egocentric Videos

    Authors: Tavi Halperin, Yair Poleg, Chetan Arora, Shmuel Peleg

    Abstract: The possibility of sharing one's point of view makes use of wearable cameras compelling. These videos are often long, boring and coupled with extreme shake, as the camera is worn on a moving person. Fast forwarding (i.e. frame sampling) is a natural choice for quick video browsing. However, this accentuates the shake caused by natural head motion in an egocentric video, making the fast forwarded v… ▽ More

    Submitted 12 January, 2017; v1 submitted 26 April, 2016; originally announced April 2016.

    Comments: Accepted for publication in IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)

  21. arXiv:1604.04848  [pdf, other

    cs.CV

    Epipolar Geometry Based On Line Similarity

    Authors: Gil Ben-Artzi, Tavi Halperin, Michael Werman, Shmuel Peleg

    Abstract: It is known that epipolar geometry can be computed from three epipolar line correspondences but this computation is rarely used in practice since there are no simple methods to find corresponding lines. Instead, methods for finding corresponding points are widely used. This paper proposes a similarity measure between lines that indicates whether two lines are corresponding epipolar lines and enabl… ▽ More

    Submitted 7 January, 2017; v1 submitted 17 April, 2016; originally announced April 2016.

    Comments: ICPR 2016, Cancun, Dec 2016

    Journal ref: ICPR'16, Cancun, Dec. 2016, pp. 1865-1870

  22. arXiv:1506.07866  [pdf, other

    cs.CV

    Camera Calibration from Dynamic Silhouettes Using Motion Barcodes

    Authors: Gil Ben-Artzi, Yoni Kasten, Shmuel Peleg, Michael Werman

    Abstract: Computing the epipolar geometry between cameras with very different viewpoints is often problematic as matching points are hard to find. In these cases, it has been proposed to use information from dynamic objects in the scene for suggesting point and line correspondences. We propose a speed up of about two orders of magnitude, as well as an increase in robustness and accuracy, to methods comput… ▽ More

    Submitted 7 January, 2017; v1 submitted 25 June, 2015; originally announced June 2015.

    Comments: Update metadata

    Journal ref: Proc. CVPR'16, Las Vegas, June 2016, pp. 4095-4103

  23. arXiv:1506.02264  [pdf, other

    cs.LG cs.AI cs.CV

    Visual Learning of Arithmetic Operations

    Authors: Yedid Hoshen, Shmuel Peleg

    Abstract: A simple Neural Network model is presented for end-to-end visual learning of arithmetic operations from pictures of numbers. The input consists of two pictures, each showing a 7-digit number. The output, also a picture, displays the number showing the result of an arithmetic operation (e.g., addition or subtraction) on the two input numbers. The concepts of a number, or of an operator, are not exp… ▽ More

    Submitted 27 November, 2015; v1 submitted 7 June, 2015; originally announced June 2015.

    Comments: To appear in AAAI 2016

    Journal ref: Proc. AAAI'16, Phoenix, Feb. 2016, pp. 3733-3739

  24. arXiv:1505.05254  [pdf, other

    cs.CV

    Live Video Synopsis for Multiple Cameras

    Authors: Yedid Hoshen, Shmuel Peleg

    Abstract: Video surveillance cameras generate most of recorded video, and there is far more recorded video than operators can watch. Much progress has recently been made using summarization of recorded video, but such techniques do not have much impact on live video surveillance. We assume a camera hierarchy where a Master camera observes the decision-critical region, and one or more Slave cameras observe… ▽ More

    Submitted 20 May, 2015; originally announced May 2015.

    Comments: To be presented in ICIP 2015

    Journal ref: Proc. ICIP'15, Quebec City, Sept. 2015, pp. 212-216

  25. Compact CNN for Indexing Egocentric Videos

    Authors: Yair Poleg, Ariel Ephrat, Shmuel Peleg, Chetan Arora

    Abstract: While egocentric video is becoming increasingly popular, browsing it is very difficult. In this paper we present a compact 3D Convolutional Neural Network (CNN) architecture for long-term activity recognition in egocentric videos. Recognizing long-term activities enables us to temporally segment (index) long and unstructured egocentric videos. Existing methods for this task are based on hand tuned… ▽ More

    Submitted 24 November, 2015; v1 submitted 28 April, 2015; originally announced April 2015.

    Journal ref: IEEE WACV'16, March 2016, pp. 1-9

  26. EgoSampling: Fast-Forward and Stereo for Egocentric Videos

    Authors: Yair Poleg, Tavi Halperin, Chetan Arora, Shmuel Peleg

    Abstract: While egocentric cameras like GoPro are gaining popularity, the videos they capture are long, boring, and difficult to watch from start to end. Fast forwarding (i.e. frame sampling) is a natural choice for faster video browsing. However, this accentuates the shake caused by natural head motion, making the fast forwarded video useless. We propose EgoSampling, an adaptive frame sampling that gives… ▽ More

    Submitted 27 April, 2015; v1 submitted 11 December, 2014; originally announced December 2014.

    Comments: in IEEE CVPR 2015, Boston, MA, June 2015

    Journal ref: CVPR'15, Boston, June 2015

  27. arXiv:1412.1455  [pdf, other

    cs.CV

    Event Retrieval Using Motion Barcodes

    Authors: Gil Ben-Artzi, Michael Werman, Shmuel Peleg

    Abstract: We introduce a simple and effective method for retrieval of videos showing a specific event, even when the videos of that event were captured from significantly different viewpoints. Appearance-based methods fail in such cases, as appearances change with large changes of viewpoints. Our method is based on a pixel-based feature, "motion barcode", which records the existence/non-existence of motio… ▽ More

    Submitted 12 May, 2015; v1 submitted 3 December, 2014; originally announced December 2014.

    Journal ref: Proc. ICIP'15, Quebec City, Sept. 2015, pp 2621-2625

  28. arXiv:1411.7591  [pdf, other

    cs.CV

    An Egocentric Look at Video Photographer Identity

    Authors: Yedid Hoshen, Shmuel Peleg

    Abstract: Egocentric cameras are being worn by an increasing number of users, among them many security forces worldwide. GoPro cameras already penetrated the mass market, reporting substantial increase in sales every year. As head-worn cameras do not capture the photographer, it may seem that the anonymity of the photographer is preserved even when the video is publicly distributed. We show that camera mo… ▽ More

    Submitted 8 November, 2015; v1 submitted 27 November, 2014; originally announced November 2014.

    Journal ref: Proc. CVPR'16, Las Vegas, June 2016, pp. 4284-4292