Skip to main content

Showing 1–20 of 20 results for author: Strauss, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.10915  [pdf, other

    cs.HC

    Usable Privacy in Virtual Worlds: Design Implications for Data Collection Awareness and Control Interfaces in Virtual Reality

    Authors: Viktorija Paneva, Verena Winterhalter, Naga Sai Surya Vamsy Malladi, Marvin Strauss, Stefan Schneegass, Florian Alt

    Abstract: Extended reality (XR) devices have become ubiquitous. They are equipped with arrays of sensors, collecting extensive user and environmental data, allowing inferences about sensitive user information users may not realize they are sharing. Current VR privacy notices largely replicate mechanisms from 2D interfaces, failing to leverage the unique affordances of virtual 3D environments. To address thi… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

  2. FlowMAC: Conditional Flow Matching for Audio Coding at Low Bit Rates

    Authors: Nicola Pia, Martin Strauss, Markus Multrus, Bernd Edler

    Abstract: This paper introduces FlowMAC, a novel neural audio codec for high-quality general audio compression at low bit rates based on conditional flow matching (CFM). FlowMAC jointly learns a mel spectrogram encoder, quantizer and decoder. At inference time the decoder integrates a continuous normalizing flow via an ODE solver to generate a high-quality mel spectrogram. This is the first time that a CFM-… ▽ More

    Submitted 6 April, 2025; v1 submitted 26 September, 2024; originally announced September 2024.

    Comments: Published in: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  3. arXiv:2409.00739  [pdf, ps, other

    cs.HC

    Designing and Evaluating Scalable Privacy Awareness and Control User Interfaces for Mixed Reality

    Authors: Marvin Strauss, Viktorija Paneva, Florian Alt, Stefan Schneegass

    Abstract: As Mixed Reality (MR) devices become increasingly popular across industries, they raise significant privacy and ethical concerns due to their capacity to collect extensive data on users and their environments. This paper highlights the urgent need for privacy-aware user interfaces that educate and empower both users and bystanders, enabling them to understand, control, and manage data collection a… ▽ More

    Submitted 1 September, 2024; originally announced September 2024.

    Comments: Workshop Paper for CHI'24 Shaping The Future: Developing Principles for Policy Recommendations for Responsible Innovation in Virtual Worlds

  4. arXiv:2408.09810  [pdf, other

    eess.AS cs.SD

    Efficient Area-based and Speaker-Agnostic Source Separation

    Authors: Martin Strauss, Okan Köpüklü

    Abstract: This paper introduces an area-based source separation method designed for virtual meeting scenarios. The aim is to preserve speech signals from an unspecified number of sources within a defined spatial area in front of a linear microphone array, while suppressing all other sounds. Therefore, we employ an efficient neural network architecture adapted for multi-channel input to encompass the predefi… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: Preprint. Accepted to the International Workshop on Acoustic Signal Enhancement (IWAENC 2024)

  5. arXiv:2305.19100  [pdf, other

    eess.AS cs.SD

    Predicting Preferred Dialogue-to-Background Loudness Difference in Dialogue-Separated Audio

    Authors: Luca Resti, Martin Strauss, Matteo Torcoli, Emanuël Habets, Bernd Edler

    Abstract: Dialogue Enhancement (DE) enables the rebalancing of dialogue and background sounds to fit personal preferences and needs in the context of broadcast audio. When individual audio stems are unavailable from production, Dialogue Separation (DS) can be applied to the final audio mixture to obtain estimates of these stems. This work focuses on Preferred Loudness Differences (PLDs) between dialogue and… ▽ More

    Submitted 31 May, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: Paper accepted at the 15th International Conference on Quality of Multimedia Experience (QoMEX), 4 pages, 2 figures

  6. arXiv:2305.08812  [pdf, other

    cs.LO cs.SE eess.SY

    Slow Down, Move Over: A Case Study in Formal Verification, Refinement, and Testing of the Responsibility-Sensitive Safety Model for Self-Driving Cars

    Authors: Megan Strauss, Stefan Mitsch

    Abstract: Technology advances give us the hope of driving without human error, reducing vehicle emissions and simplifying an everyday task with the future of self-driving cars. Making sure these vehicles are safe is very important to the continuation of this field. In this paper, we formalize the Responsibility-Sensitive Safety model (RSS) for self-driving cars and prove the safety and optimality of this mo… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  7. arXiv:2210.11654  [pdf, other

    eess.AS cs.SD

    Improved Normalizing Flow-Based Speech Enhancement using an All-pole Gammatone Filterbank for Conditional Input Representation

    Authors: Martin Strauss, Matteo Torcoli, Bernd Edler

    Abstract: Deep generative models for Speech Enhancement (SE) received increasing attention in recent years. The most prominent example are Generative Adversarial Networks (GANs), while normalizing flows (NF) received less attention despite their potential. Building on previous work, architectural modifications are proposed, along with an investigation of different conditional input representations. Despite… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: Accepted for Presentation at IEEE SLT 2022

  8. Automated Learning of Interpretable Models with Quantified Uncertainty

    Authors: G. F. Bomarito, P. E. Leser, N. C. M Strauss, K. M. Garbrecht, J. D. Hochhalter

    Abstract: Interpretability and uncertainty quantification in machine learning can provide justification for decisions, promote scientific discovery and lead to a better understanding of model behavior. Symbolic regression provides inherently interpretable machine learning, but relatively little work has focused on the use of symbolic regression on noisy data and the accompanying necessity to quantify uncert… ▽ More

    Submitted 12 April, 2022; originally announced May 2022.

  9. arXiv:2106.09093  [pdf, other

    eess.AS cs.SD

    A Hands-on Comparison of DNNs for Dialog Separation Using Transfer Learning from Music Source Separation

    Authors: Martin Strauss, Jouni Paulus, Matteo Torcoli, Bernd Edler

    Abstract: This paper describes a hands-on comparison on using state-of-the-art music source separation deep neural networks (DNNs) before and after task-specific fine-tuning for separating speech content from non-speech content in broadcast audio (i.e., dialog separation). The music separation models are selected as they share the number of channels (2) and sampling rate (44.1 kHz or higher) with the consid… ▽ More

    Submitted 22 June, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: accepted in INTERSPEECH 2021

  10. A Flow-Based Neural Network for Time Domain Speech Enhancement

    Authors: Martin Strauss, Bernd Edler

    Abstract: Speech enhancement involves the distinction of a target speech signal from an intrusive background. Although generative approaches using Variational Autoencoders or Generative Adversarial Networks (GANs) have increasingly been used in recent years, normalizing flow (NF) based systems are still scarse, despite their success in related fields. Thus, in this paper we propose a NF framework to directl… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: Accepted to ICASSP 2021

  11. arXiv:1907.04655  [pdf, other

    eess.SP cs.SD eess.AS

    Audio-Based Search and Rescue with a Drone: Highlights from the IEEE Signal Processing Cup 2019 Student Competition

    Authors: Antoine Deleforge, Diego Di Carlo, Martin Strauss, Romain Serizel, Lucio Marcenaro

    Abstract: Unmanned aerial vehicles (UAV), commonly referred to as drones, have raised increasing interest in recent years. Search and rescue scenarios where humans in emergency situations need to be quickly found in areas difficult to access constitute an important field of application for this technology. While research efforts have mostly focused on developing video-based solutions for this task \cite{lop… ▽ More

    Submitted 3 July, 2019; originally announced July 2019.

    Journal ref: IEEE Signal Processing Magazine, Institute of Electrical and Electronics Engineers, In press

  12. arXiv:1707.00391  [pdf, other

    cs.CY cs.LG stat.ML

    Fair Pipelines

    Authors: Amanda Bower, Sarah N. Kitchen, Laura Niss, Martin J. Strauss, Alexander Vargas, Suresh Venkatasubramanian

    Abstract: This work facilitates ensuring fairness of machine learning in the real world by decoupling fairness considerations in compound decisions. In particular, this work studies how fairness propagates through a compound decision-making processes, which we call a pipeline. Prior work in algorithmic fairness only focuses on fairness with respect to one decision. However, many decision-making processes re… ▽ More

    Submitted 2 July, 2017; originally announced July 2017.

    Comments: Presented as a poster at the 2017 Workshop on Fairness, Accountability, and Transparency in Machine Learning (FAT/ML 2017)

  13. arXiv:1402.1726  [pdf, ps, other

    cs.DS cs.IT

    For-all Sparse Recovery in Near-Optimal Time

    Authors: Anna C. Gilbert, Yi Li, Ely Porat, Martin J. Strauss

    Abstract: An approximate sparse recovery system in $\ell_1$ norm consists of parameters $k$, $ε$, $N$, an $m$-by-$N$ measurement $Φ$, and a recovery algorithm, $\mathcal{R}$. Given a vector, $\mathbf{x}$, the system approximates $x$ by $\widehat{\mathbf{x}} = \mathcal{R}(Φ\mathbf{x})$, which must satisfy $\|\widehat{\mathbf{x}}-\mathbf{x}\|_1 \leq (1+ε)\|\mathbf{x}-\mathbf{x}_k\|_1$. We consider the 'for al… ▽ More

    Submitted 7 March, 2017; v1 submitted 7 February, 2014; originally announced February 2014.

    ACM Class: F.2.2; E.4

    Journal ref: ACM Transactions on Algorithms, Vol. 13, No. 3, pp 32:1--32:26, 2017

  14. arXiv:1304.6232  [pdf, other

    cs.DS

    L2/L2-foreach sparse recovery with low risk

    Authors: Anna C. Gilbert, Hung Q. Ngo, Ely Porat, Atri Rudra, Martin J. Strauss

    Abstract: In this paper, we consider the "foreach" sparse recovery problem with failure probability $p$. The goal of which is to design a distribution over $m \times N$ matrices $Φ$ and a decoding algorithm $\algo$ such that for every $\vx\in\R^N$, we have the following error guarantee with probability at least $1-p$ \[\|\vx-\algo(Φ\vx)\|_2\le C\|\vx-\vx_k\|_2,\] where $C$ is a constant (ideally arbitrarily… ▽ More

    Submitted 23 April, 2013; originally announced April 2013.

    Comments: 1 figure, extended abstract to appear in ICALP 2013

  15. arXiv:1012.1886  [pdf, ps, other

    cs.DS

    Sublinear Time, Measurement-Optimal, Sparse Recovery For All

    Authors: Ely Porat, Martin J. Strauss

    Abstract: An approximate sparse recovery system in ell_1 norm formally consists of parameters N, k, epsilon an m-by-N measurement matrix, Phi, and a decoding algorithm, D. Given a vector, x, where x_k denotes the optimal k-term approximation to x, the system approximates x by hat_x = D(Phi.x), which must satisfy ||hat_x - x||_1 <= (1+epsilon)||x - x_k||_1. Among the goals in designing such systems are m… ▽ More

    Submitted 14 July, 2011; v1 submitted 8 December, 2010; originally announced December 2010.

    Comments: Corrected argument with minor change to results

    ACM Class: F.2.2; E.4

  16. arXiv:0912.0229  [pdf, other

    cs.DS cs.IT

    Approximate Sparse Recovery: Optimizing Time and Measurements

    Authors: Anna C. Gilbert, Yi Li, Ely Porat, Martin J. Strauss

    Abstract: An approximate sparse recovery system consists of parameters $k,N$, an $m$-by-$N$ measurement matrix, $Φ$, and a decoding algorithm, $\mathcal{D}$. Given a vector, $x$, the system approximates $x$ by $\widehat x =\mathcal{D}(Φx)$, which must satisfy $\| \widehat x - x\|_2\le C \|x - x_k\|_2$, where $x_k$ denotes the optimal $k$-term approximation to $x$. For each vector $x$, the system must succ… ▽ More

    Submitted 1 December, 2009; originally announced December 2009.

    Journal ref: SIAM J. Comput. 41(2), pp. 436-453, 2012

  17. arXiv:0804.4666  [pdf, ps, other

    cs.DM cs.DS math.NA

    Combining geometry and combinatorics: A unified approach to sparse signal recovery

    Authors: R. Berinde, A. C. Gilbert, P. Indyk, H. Karloff, M. J. Strauss

    Abstract: There are two main algorithmic approaches to sparse signal recovery: geometric and combinatorial. The geometric approach starts with a geometric constraint on the measurement matrix and then uses linear programming to decode information about the signal from its measurements. The combinatorial approach constructs the measurement matrix and a combinatorial decoding algorithm to match. We present… ▽ More

    Submitted 29 April, 2008; originally announced April 2008.

    ACM Class: F.2; G.1; G.2

  18. arXiv:cs/0609166  [pdf, ps, other

    cs.CR

    Private Approximate Heavy Hitters

    Authors: Martin J. Strauss, Xuan Zheng

    Abstract: We consider the problem of private computation of approximate Heavy Hitters. Alice and Bob each hold a vector and, in the vector sum, they want to find the B largest values along with their indices. While the exact problem requires linear communication, protocols in the literature solve this problem approximately using polynomial computation time, polylogarithmic communication, and constantly ma… ▽ More

    Submitted 29 September, 2006; originally announced September 2006.

    Comments: 17 pages, submitted

  19. arXiv:cs/0608079  [pdf, ps, other

    cs.DS

    Algorithmic linear dimension reduction in the l_1 norm for sparse vectors

    Authors: A. C. Gilbert, M. J. Strauss, J. A. Tropp, R. Vershynin

    Abstract: This paper develops a new method for recovering m-sparse signals that is simultaneously uniform and quick. We present a reconstruction algorithm whose run time, O(m log^2(m) log^2(d)), is sublinear in the length d of the signal. The reconstruction error is within a logarithmic factor (in m) of the optimal m-term approximation error in l_1. In particular, the algorithm recovers m-sparse signals p… ▽ More

    Submitted 18 August, 2006; originally announced August 2006.

  20. arXiv:cs/0607098  [pdf, ps, other

    cs.DS cs.IT

    List decoding of noisy Reed-Muller-like codes

    Authors: A. R. Calderbank, Anna C. Gilbert, Martin J. Strauss

    Abstract: First- and second-order Reed-Muller (RM(1) and RM(2), respectively) codes are two fundamental error-correcting codes which arise in communication as well as in probabilistically-checkable proofs and learning. In this paper, we take the first steps toward extending the quick randomized decoding tools of RM(1) into the realm of quadratic binary and, equivalently, Z_4 codes. Our main algorithmic re… ▽ More

    Submitted 2 August, 2006; v1 submitted 20 July, 2006; originally announced July 2006.

    ACM Class: E.4; F.2.1