Showing 1–13 of 13 results for author: Deng, G

Searching in archive eess.
  1. arXiv:2506.18094  [pdf]

    eess.SY

    G-SEED: A Spatio-temporal Encoding Framework for Forest and Grassland Data Based on GeoSOT

    Authors: Xuan Ouyang, Xinwen Yu, Yan Chen, Guang Deng, Xuanxin Liu

    Abstract: In recent years, the rapid development of remote sensing, Unmanned Aerial Vehicles, and IoT technologies has led to an explosive growth in spatio-temporal forest and grassland data, which are increasingly multimodal, heterogeneous, and subject to continuous updates. However, existing Geographic Information Systems (GIS)-based systems struggle to integrate and manage of such large-scale and diverse…

    Submitted 22 June, 2025; originally announced June 2025.

    Comments: 11 pages, 2 figures. Previously submitted to a non-academic conference (ICGARSA 2025) and formally withdrawn

  2. arXiv:2505.16211  [pdf, ps, other]

    cs.SD cs.AI cs.CL eess.AS

    AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models

    Authors: Kai Li, Can Shen, Yile Liu, Jirui Han, Kelong Zheng, Xuechao Zou, Zhe Wang, Shun Zhang, Xingjian Du, Hanjun Luo, Yingbin Jin, Xinxin Xing, Ziyang Ma, Yue Liu, Yifan Zhang, Junfeng Fang, Kun Wang, Yibo Yan, Gelei Deng, Haoyang Li, Yiming Li, Xiaobin Zhuang, Tianlong Chen, Qingsong Wen, Tianwei Zhang, et al. (9 additional authors not shown)

    Abstract: Audio Large Language Models (ALLMs) have gained widespread adoption, yet their trustworthiness remains underexplored. Existing evaluation frameworks, designed primarily for text, fail to address unique vulnerabilities introduced by audio's acoustic properties. We identify significant trustworthiness risks in ALLMs arising from non-semantic acoustic cues, including timbre, accent, and background no…

    Submitted 30 September, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

    Comments: Technical Report

  3. arXiv:2402.01933  [pdf, other]

    eess.AS cs.SD

    ToMoBrush: Exploring Dental Health Sensing using a Sonic Toothbrush

    Authors: Kuang Yuan, Mohamed Ibrahim, Yiwen Song, Guoxiang Deng, Suvendra Vijayan, Robert Nerone, Akshay Gadre, Swarun Kumar

    Abstract: Early detection of dental disease is crucial to prevent adverse outcomes. Today, dental X-rays are currently the most accurate gold standard for dental disease detection. Unfortunately, regular X-ray exam is still a privilege for billions of people around the world. In this paper, we ask: "Can we develop a low-cost sensing system that enables dental self-examination in the comfort of one's home?"…

    Submitted 2 February, 2024; originally announced February 2024.

    ACM Class: J.3; C.3; H.5.2

  4. arXiv:2311.15584  [pdf, other]

    eess.IV cs.CV cs.LG

    A deep learning approach for marine snow synthesis and removal

    Authors: Fernando Galetto, Guang Deng

    Abstract: Marine snow, the floating particles in underwater images, severely degrades the visibility and performance of human and machine vision systems. This paper proposes a novel method to reduce the marine snow interference using deep learning techniques. We first synthesize realistic marine snow samples by training a Generative Adversarial Network (GAN) model and combine them with natural underwater im…

    Submitted 27 November, 2023; originally announced November 2023.

  5. arXiv:2308.15742  [pdf, other]

    cs.SD cs.AI cs.SE eess.AS

    ASTER: Automatic Speech Recognition System Accessibility Testing for Stutterers

    Authors: Yi Liu, Yuekang Li, Gelei Deng, Felix Juefei-Xu, Yao Du, Cen Zhang, Chengwei Liu, Yeting Li, Lei Ma, Yang Liu

    Abstract: The popularity of automatic speech recognition (ASR) systems nowadays leads to an increasing need for improving their accessibility. Handling stuttering speech is an important feature for accessible ASR systems. To improve the accessibility of ASR systems for stutterers, we need to expose and analyze the failures of ASR systems on stuttering speech. The speech datasets recorded from stutterers are…

    Submitted 29 August, 2023; originally announced August 2023.

  6. arXiv:2306.01219  [pdf, other]

    eess.SP

    Brezinski Inverse and Geometric Product-Based Steffensen's Methods for Image Reverse Filtering

    Authors: Guang Deng

    Abstract: This work develops extensions of Steffensen's method to provide new tools for solving the semi-blind image reverse filtering problem. Two extensions are presented: a parametric Steffensen's method for accelerating the Mann iteration, and a family of 12 Steffensen's methods for vector variables. The development is based on Brezinski inverse and geometric product vector inverse. Variants of these me…

    Submitted 1 June, 2023; originally announced June 2023.

  7. arXiv:2206.10124  [pdf, other]

    eess.IV

    Fast image reverse filters through fixed point and gradient descent acceleration

    Authors: Fernando Galetto, Guang Deng

    Abstract: In this paper, we study the problem of reverse image filtering. An image filter denoted g(.), which is available as a black box, produces an observation b = g(x) when provided with an input x. The problem is to estimate the original input signal x from the black box filter g(.) and the observation b. We study and re-develop state-of-the-art methods from two points of view, fixed point iteration an…

    Submitted 21 June, 2022; originally announced June 2022.

  8. arXiv:2112.04121  [pdf, other]

    eess.IV cs.CV

    Reverse image filtering using total derivative approximation and accelerated gradient descent

    Authors: Fernando J. Galetto, Guang Deng

    Abstract: In this paper, we address a new problem of reversing the effect of an image filter, which can be linear or nonlinear. The assumption is that the algorithm of the filter is unknown and the filter is available as a black box. We formulate this inverse problem as minimizing a local patch-based cost function and use total derivative to approximate the gradient which is used in gradient descent to solv…

    Submitted 13 December, 2021; v1 submitted 8 December, 2021; originally announced December 2021.

  9. arXiv:2108.03799  [pdf, other]

    eess.IV cs.CV

    COVID-view: Diagnosis of COVID-19 using Chest CT

    Authors: Shreeraj Jadhav, Gaofeng Deng, Marlene Zawin, Arie E. Kaufman

    Abstract: Significant work has been done towards deep learning (DL) models for automatic lung and lesion segmentation and classification of COVID-19 on chest CT data. However, comprehensive visualization systems focused on supporting the dual visual+DL diagnosis of COVID-19 are non-existent. We present COVID-view, a visualization application specially tailored for radiologists to diagnose COVID-19 from ches…

    Submitted 9 August, 2021; originally announced August 2021.

    Comments: 11 pages, 10 figures, accepted to IEEE VIS 2021 conference and IEEE Transactions on Visualization and Computer Graphics

  10. A guided edge-aware smoothing-sharpening filter based on patch interpolation model and generalized Gamma distribution

    Authors: Guang Deng, Fernando J. Galetto, Mukhalad Al-nasrawi, Waseem Waheed

    Abstract: Smoothing and sharpening are two fundamental image processing operations. The latter is usually related to the former through the unsharp masking algorithm. In this paper, we develop a new type of filter which performs smoothing or sharpening via a tuning parameter. The development of the new filter is based on (1) a new Laplacian-based filter formulation which unifies the smoothing and sharpening…

    Submitted 30 July, 2021; originally announced July 2021.

    Comments: 23 pages, 16 figures

    Journal ref: IEEE Open Journal of Signal Processing, vol. 2, pp. 119-135, 2021

  11. arXiv:2107.14443  [pdf, other]

    eess.IV cs.CV

    Single image deep defocus estimation and its applications

    Authors: Fernando J. Galetto, Guang Deng

    Abstract: Depth information is useful in many image processing applications. However, since taking a picture is a process of projection of a 3D scene onto a 2D imaging sensor, the depth information is embedded in the image. Extracting the depth information from the image is a challenging task. A guiding principle is that the level of blurriness due to defocus is related to the distance between the object an…

    Submitted 13 December, 2021; v1 submitted 30 July, 2021; originally announced July 2021.

  12. arXiv:2101.00137  [pdf, other]

    eess.SP physics.optics

    Coherent optical communications using coherence-cloned Kerr soliton microcombs

    Authors: Yong Geng, Heng Zhou, Wenwen Cui, Xinjie Han, Qiang Zhang, Boyuan Liu, Guangwei Deng, Qiang Zhou, Kun Qiu

    Abstract: Dissipative Kerr soliton microcomb has been recognized as a promising on-chip multi-wavelength laser source for fiber optical communications, as its comb lines possess frequency and phase stability far beyond independent lasers. In the scenarios of coherent optical transmission and interconnect, a highly beneficial but rarely explored target is to re-generate a Kerr soliton microcomb at the receiv…

    Submitted 31 December, 2020; originally announced January 2021.

  13. arXiv:2006.10216  [pdf, other]

    eess.IV cs.CV

    Generating Fundus Fluorescence Angiography Images from Structure Fundus Images Using Generative Adversarial Networks

    Authors: Wanyue Li, Wen Kong, Yiwei Chen, Jing Wang, Yi He, Guohua Shi, Guohua Deng

    Abstract: Fluorescein angiography can provide a map of retinal vascular structure and function, which is commonly used in ophthalmology diagnosis, however, this imaging modality may pose risks of harm to the patients. To help physicians reduce the potential risks of diagnosis, an image translation method is adopted. In this work, we proposed a conditional generative adversarial network(GAN) - based method t…

    Submitted 17 June, 2020; originally announced June 2020.

    Comments: 16 pages, 6 figures, accepted by Medical Imaging on Deep Learning