Skip to main content

Showing 1–7 of 7 results for author: Harar, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2304.02011  [pdf, other

    cs.LG eess.IV q-bio.QM

    FakET: Simulating Cryo-Electron Tomograms with Neural Style Transfer

    Authors: Pavol Harar, Lukas Herrmann, Philipp Grohs, David Haselbach

    Abstract: In cryo-electron microscopy, accurate particle localization and classification are imperative. Recent deep learning solutions, though successful, require extensive training data sets. The protracted generation time of physics-based models, often employed to produce these data sets, limits their broad applicability. We introduce FakET, a method based on Neural Style Transfer, capable of simulating… ▽ More

    Submitted 19 February, 2025; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: 25 pages, 3 tables, 19 figures including supplement. Updated LaTeX project structure, updated figure captions, added in-text references to figures, fixed page numbering, fixed typos and typesetting

    Journal ref: Structure, 2025

  2. arXiv:2210.14219  [pdf, other

    cs.CV cs.MS

    Redistributor: Transforming Empirical Data Distributions

    Authors: Pavol Harar, Dennis Elbrächter, Monika Dörfler, Kory D. Johnson

    Abstract: We present an algorithm and package, Redistributor, which forces a collection of scalar samples to follow a desired distribution. When given independent and identically distributed samples of some random variable $S$ and the continuous cumulative distribution function of some desired target $T$, it provably produces a consistent estimator of the transformation $R$ which satisfies $R(S)=T$ in distr… ▽ More

    Submitted 5 July, 2024; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: 16 pages, 13 figures - Added more use cases and comparisons with other methods

  3. arXiv:1907.06129  [pdf, other

    cs.SD cs.LG eess.AS

    Towards Robust Voice Pathology Detection

    Authors: Pavol Harar, Zoltan Galaz, Jesus B. Alonso-Hernandez, Jiri Mekyska, Radim Burget, Zdenek Smekal

    Abstract: Automatic objective non-invasive detection of pathological voice based on computerized analysis of acoustic signals can play an important role in early diagnosis, progression tracking and even effective treatment of pathological voices. In search towards such a robust voice pathology detection system we investigated 3 distinct classifiers within supervised learning and anomaly detection paradigms.… ▽ More

    Submitted 13 July, 2019; originally announced July 2019.

    Comments: 11 pages, 1 figure, 10 tables. Keywords: Voice pathology detection, deep learning, gradient boosting, anomaly detection

    Journal ref: Neural Computing and Applications (2018): 1-11

  4. arXiv:1907.05905  [pdf, other

    eess.AS cs.LG cs.SD

    Voice Pathology Detection Using Deep Learning: a Preliminary Study

    Authors: Pavol Harar, Jesus B. Alonso-Hernandez, Jiri Mekyska, Zoltan Galaz, Radim Burget, Zdenek Smekal

    Abstract: This paper describes a preliminary investigation of Voice Pathology Detection using Deep Neural Networks (DNN). We used voice recordings of sustained vowel /a/ produced at normal pitch from German corpus Saarbruecken Voice Database (SVD). This corpus contains voice recordings and electroglottograph signals of more than 2 000 speakers. The idea behind this experiment is the use of convolutional lay… ▽ More

    Submitted 12 July, 2019; originally announced July 2019.

    Comments: 4 pages, 1 figure, 5 tables

    Journal ref: In 2017 international conference and workshop on bioinspired intelligence (IWOBI), pp. 1-4. IEEE, 2017

  5. arXiv:1903.08950  [pdf, other

    cs.SD cs.LG eess.AS eess.SP stat.ML

    Improving Machine Hearing on Limited Data Sets

    Authors: Pavol Harar, Roswitha Bammer, Anna Breger, Monika Dörfler, Zdenek Smekal

    Abstract: Convolutional neural network (CNN) architectures have originated and revolutionized machine learning for images. In order to take advantage of CNNs in predictive modeling with audio data, standard FFT-based signal processing methods are often applied to convert the raw audio waveforms into an image-like representations (e.g. spectrograms). Even though conventional images and spectrograms differ in… ▽ More

    Submitted 12 July, 2019; v1 submitted 21 March, 2019; originally announced March 2019.

    Comments: 13 pages, 3 figures, 2 tables. Repository for reproducibility: https://gitlab.com/hararticles/gs-ms-mt/. Keywords: audio, CNN, limited data, Mel scattering, mel-spectrogram, augmented target loss function. Rewritten and restructured after peer revision. Recomputed and added new experiments and visualizations. Changed the presentation of the results

    Journal ref: 2019 11th International Congress on Ultra Modern Telecommunications and Control Systems and Workshops (ICUMT), pp. 1-6. IEEE, 2019

  6. arXiv:1901.07598  [pdf, other

    math.NA cs.LG stat.ML

    On orthogonal projections for dimension reduction and applications in augmented target loss functions for learning problems

    Authors: Anna Breger, Jose Ignacio Orlando, Pavol Harar, Monika Dörfler, Sophie Klimscha, Christoph Grechenig, Bianca S. Gerendas, Ursula Schmidt-Erfurth, Martin Ehler

    Abstract: The use of orthogonal projections on high-dimensional input and target data in learning frameworks is studied. First, we investigate the relations between two standard objectives in dimension reduction, preservation of variance and of pairwise relative distances. Investigations of their asymptotic correlation as well as numerical experiments show that a projection does usually not satisfy both obj… ▽ More

    Submitted 9 September, 2019; v1 submitted 22 January, 2019; originally announced January 2019.

    Journal ref: Journal of Mathematical Imaging and Vision, 2019

  7. Gabor frames and deep scattering networks in audio processing

    Authors: Roswitha Bammer, Monika Dörfler, Pavol Harar

    Abstract: This paper introduces Gabor scattering, a feature extractor based on Gabor frames and Mallat's scattering transform. By using a simple signal model for audio signals specific properties of Gabor scattering are studied. It is shown that for each layer, specific invariances to certain signal characteristics occur. Furthermore, deformation stability of the coefficient vector generated by the feature… ▽ More

    Submitted 1 October, 2019; v1 submitted 27 June, 2017; originally announced June 2017.

    Comments: 26 pages, 8 figures, 4 tables. Repository for reproducibility: https://gitlab.com/hararticles/gs-gt . Keywords: machine learning; scattering transform; Gabor transform; deep learning; time-frequency analysis; CNN. Accepted and published after peer revision

    Journal ref: Axioms 2019, 8(4), 106