Skip to main content

Showing 1–12 of 12 results for author: Gul, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2310.17142  [pdf

    eess.AS cs.SD

    Single channel speech enhancement by colored spectrograms

    Authors: Sania Gul, Muhammad Salman Khan, Muhammad Fazeel

    Abstract: Speech enhancement concerns the processes required to remove unwanted background sounds from the target speech to improve its quality and intelligibility. In this paper, a novel approach for single-channel speech enhancement is presented, using colored spectrograms. We propose the use of a deep neural network (DNN) architecture adapted from the pix2pix generative adversarial network (GAN) and trai… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 18 pages, 6 figures, 5 tables

  2. arXiv:2208.05184  [pdf

    eess.AS cs.SD

    Preserving the beamforming effect for spatial cue-based pseudo-binaural dereverberation of a single source

    Authors: Sania Gul, Muhammad Salman Khan, Syed Waqar Shah

    Abstract: Reverberations are unavoidable in enclosures, resulting in reduced intelligibility for hearing impaired and non native listeners and even for the normal hearing listeners in noisy circumstances. It also degrades the performance of machine listening applications. In this paper, we propose a novel approach of binaural dereverberation of a single speech source, using the differences in the interaural… ▽ More

    Submitted 10 August, 2022; originally announced August 2022.

    Comments: 25 pages, 7 figures

  3. arXiv:2208.04626  [pdf

    eess.AS cs.SD

    Recycling an anechoic pre-trained speech separation deep neural network for binaural dereverberation of a single source

    Authors: Sania Gul, Muhammad Salman Khan, Syed Waqar Shah, Ata Ur-Rehman

    Abstract: Reverberation results in reduced intelligibility for both normal and hearing-impaired listeners. This paper presents a novel psychoacoustic approach of dereverberation of a single speech source by recycling a pre-trained binaural anechoic speech separation neural network. As training the deep neural network (DNN) is a lengthy and computationally expensive process, the advantage of using a pre-trai… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

    Comments: 15 pages, 4 figures

  4. arXiv:2107.03056  [pdf, ps, other

    eess.SY

    Position Constrained, Adaptive Control of Robotic Manipulators without Velocity Measurements

    Authors: Samet Gul, Erkan Zergeroglu, Enver Tatlicioglu

    Abstract: This work presents the design and the corresponding stability analysis of a model based, joint position tracking error constrained, adaptive output feedback controller for robot manipulators. Specifically, provided that the initial joint position tracking error starts within a predefined region, the proposed controller algorithm ensures that the joint tracking error remains inside this region and… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: 10 pages, 3 figures

  5. arXiv:2102.13334  [pdf

    eess.AS

    Integration of deep learning with expectation maximization for spatial cue based speech separation in reverberant conditions

    Authors: Sania Gul, Muhammad Salman Khan, Syed Waqar Shah

    Abstract: In this paper, we formulate a blind source separation (BSS) framework, which allows integrating U-Net based deep learning source separation network with probabilistic spatial machine learning expectation maximization (EM) algorithm for separating speech in reverberant conditions. Our proposed model uses a pre-trained deep learning convolutional neural network, U-Net, for clustering the interaural… ▽ More

    Submitted 26 February, 2021; originally announced February 2021.

  6. arXiv:2012.01900  [pdf, other

    eess.IV

    Light-field view synthesis using convolutional block attention module

    Authors: M. Shahzeb Khan Gul, Umair Mukati, Michel Bätz, Søren Forchhammer, Joachim Keinert

    Abstract: Consumer light-field (LF) cameras suffer from a low or limited resolution because of the angular-spatial trade-off. To alleviate this drawback, we propose a novel learning-based approach utilizing attention mechanism to synthesize novel views of a light-field image using a sparse set of input views (i.e., 4 corner views) from a camera array. In the proposed method, we divide the process into three… ▽ More

    Submitted 31 May, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

  7. arXiv:2007.14084  [pdf, other

    cs.MM eess.IV eess.SP

    Kalman Filter-based Head Motion Prediction for Cloud-based Mixed Reality

    Authors: Serhan Gül, Sebastian Bosse, Dimitri Podborski, Thomas Schierl, Cornelius Hellge

    Abstract: Volumetric video allows viewers to experience highly-realistic 3D content with six degrees of freedom in mixed reality (MR) environments. Rendering complex volumetric videos can require a prohibitively high amount of computational power for mobile devices. A promising technique to reduce the computational burden on mobile devices is to perform the rendering at a cloud server. However, cloud-based… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: Accepted at the ACM Multimedia Conference (ACMMM) 2020. 9 pages, 9 figures

    Journal ref: Proceedings of the 28th ACM International Conference on Multimedia (2020) 3632-3641

  8. arXiv:2005.11413  [pdf, ps, other

    eess.SP

    FPGA based design for online computation of Multivariate EMD (MEMD)

    Authors: Sikender Gul, Muhammad Faisal Siddiqui, Naveed Ur Rehman

    Abstract: Multivariate or multichannel data have become ubiquitous in many modern scientific and engineering applications, e.g., biomedical engineering, owing to recent advances in sensor and computing technology. Processing these data sets is challenging owing to: i) their large size and multidimensional nature, thus requiring specialized algorithms and efficient hardware designs for on-line and real-time… ▽ More

    Submitted 22 May, 2020; originally announced May 2020.

    ACM Class: B.7.0; B.6.1

  9. arXiv:2004.11277  [pdf, ps, other

    eess.SY

    Desired Model Compensation based Position Constrained Control of Robotic Manipulators

    Authors: Samet Gul, Erkan Zergeroglu, Enver Tatlicioglu, Mesih Veysi Kilinc

    Abstract: This work presents the design and the corresponding stability analysis of desired model based, joint position constrained, robot controller. Specifically, provided that the initial joint position tracking error signal starts below some predefined value, the proposed controller ensures that the joint tracking error signal remains inside the region (defined by predefined upper--bound) and approaches… ▽ More

    Submitted 24 April, 2020; v1 submitted 23 April, 2020; originally announced April 2020.

    Comments: 3 figures 2 tables and total 13 pages

  10. Cloud Rendering-based Volumetric Video Streaming System for Mixed Reality Services

    Authors: Serhan Gül, Dimitri Podborski, Jangwoo Son, Gurdeep Singh Bhullar, Thomas Buchholz, Thomas Schierl, Cornelius Hellge

    Abstract: Volumetric video is an emerging technology for immersive representation of 3D spaces that captures objects from all directions using multiple cameras and creates a dynamic 3D model of the scene. However, processing volumetric content requires high amounts of processing power and is still a very demanding task for today's mobile devices. To mitigate this, we propose a volumetric video streaming sys… ▽ More

    Submitted 16 July, 2020; v1 submitted 5 March, 2020; originally announced March 2020.

    Comments: 4 pages, 2 figures

    Journal ref: 11th ACM Multimedia Systems Conference (MMSys) 2020

  11. Low-latency Cloud-based Volumetric Video Streaming Using Head Motion Prediction

    Authors: Serhan Gül, Dimitri Podborski, Thomas Buchholz, Thomas Schierl, Cornelius Hellge

    Abstract: Volumetric video is an emerging key technology for immersive representation of 3D spaces and objects. Rendering volumetric video requires lots of computational power which is challenging especially for mobile devices. To mitigate this, we developed a streaming system that renders a 2D view from the volumetric video at a cloud server and streams a 2D video stream to the client. However, such networ… ▽ More

    Submitted 16 July, 2020; v1 submitted 17 January, 2020; originally announced January 2020.

    Comments: 7 pages, 4 figures

    Journal ref: 30th ACM Workshop on Network and Operating Systems Support for Digital Audio and Video (NOSSDAV) 2020

  12. High Impedance Fault Detection and Isolation in Power Distribution Networks using Support Vector Machines

    Authors: Muhammad Sarwar, Faisal Mehmood, Muhammad Abid, Abdul Qayyum Khan, Sufi Tabassum Gul, Adil Sarwar Khan

    Abstract: This paper proposes an accurate High Impedance Fault (HIF) detection and isolation scheme in a power distribution network. The proposed schemes utilize the data available from voltage and current sensors. The technique employs multiple algorithms consisting of Principal Component Analysis, Fisher Discriminant Analysis, Binary and Multiclass Support Vector Machine for detection and identification o… ▽ More

    Submitted 9 August, 2019; originally announced September 2019.

    Comments: 16 pages, 19 figures, published in a journal

    Journal ref: Journal of King Saud University - Engineering Sciences, July 2019