Electrical Engineering and Systems Science

Authors and titles for June 2023

Total of 1472 entries : 1-100 101-200 201-300 301-400 ... 1401-1472
[1] arXiv:2306.00003 [pdf, html, other]
Title: Detecting Heart Disease from Multi-View Ultrasound Images via Supervised Attention Multiple Instance Learning
Zhe Huang, Benjamin S. Wessler, Michael C. Hughes
Comments: Echocardiogram; multiple-instance learning; self-supervised learning; semi-supervised learning; medical imaging
Journal-ref: MLHC 2023
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2] arXiv:2306.00034 [pdf, other]
Title: Diagnosis and Prognosis of Head and Neck Cancer Patients using Artificial Intelligence
Ikboljon Sobirov
Comments: This is Master's thesis work submitted to MBZUAI
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[3] arXiv:2306.00047 [pdf, other]
Title: Democratizing Pathological Image Segmentation with Lay Annotators via Molecular-empowered Learning
Ruining Deng, Yanwei Li, Peize Li, Jiacheng Wang, Lucas W. Remedios, Saydolimkhon Agzamkhodjaev, Zuhayr Asad, Quan Liu, Can Cui, Yaohong Wang, Yihan Wang, Yucheng Tang, Haichun Yang, Yuankai Huo
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[4] arXiv:2306.00147 [pdf, other]
Title: Stochastic Analysis of LMS Algorithm with Delayed Block Coefficient Adaptation
Mohd. Tasleem Khan (1), Oscar Gustafsson (2) ((1), (2) Division of Computer Engineering, Department of Electrical Engineering, Linköping University, Sweden)
Comments: 13 pages, 8 figures
Subjects: Signal Processing (eess.SP)
[5] arXiv:2306.00157 [pdf, other]
Title: Virtual and Real Data Populated Intersection Visualization and Testing Tool for V2X Application Development
Sukru Yaren Gelbal, Mustafa Ridvan Cantas, Bilin Aksun Guvenc, Levent Guvenc
Subjects: Systems and Control (eess.SY)
[6] arXiv:2306.00160 [pdf, other]
Title: Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model
Héctor Martel, Julius Richter, Kai Li, Xiaolin Hu, Timo Gerkmann
Comments: Accepted by Interspeech 2023
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[7] arXiv:2306.00203 [pdf, other]
Title: Speaker-independent Speech Inversion for Estimation of Nasalance
Yashish M. Siriwardena, Carol Espy-Wilson, Suzanne Boyce, Mark K. Tiede, Liran Oren
Comments: Interspeech 2023
Subjects: Audio and Speech Processing (eess.AS)
[8] arXiv:2306.00279 [pdf, other]
Title: Dynamic quantized consensus under DoS attacks: Towards a tight zooming-out factor
Shuai Feng, Maopeng Ran, Hideaki Ishii, Shengyuan Xu
Subjects: Systems and Control (eess.SY); Multiagent Systems (cs.MA)
[9] arXiv:2306.00331 [pdf, other]
Title: A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models
Pin-Jui Ku, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee
Comments: Accepted to Interspeech 2023. Code will be released at this https URL
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Sound (cs.SD); Signal Processing (eess.SP); Systems and Control (eess.SY)
[10] arXiv:2306.00372 [pdf, other]
Title: A Bi-level Decision Framework for Incentive-Based Demand Response in Distribution Systems
Vipin Chandra Pandey, Nikhil Gupta, Khaleequr Rehman Niazi, Anil Swarnkar, Tanuj Rawat, Charalambos Konstantinou
Comments: IEEE Transactions on Energy Markets, Policy and Regulation
Subjects: Systems and Control (eess.SY)
[11] arXiv:2306.00415 [pdf, other]
Title: Mixed-Integer MPC Strategies for Fueling and Density Control in Fusion Tokamaks
Christopher A. Orrico, Matthijs van Berkel, Thomas O. S. J. Bosman, W. P. M. H. Heemels, Dinesh Krishnamoorthy
Subjects: Systems and Control (eess.SY)
[12] arXiv:2306.00421 [pdf, other]
Title: Introduction to Medical Imaging Informatics
Md. Zihad Bin Jahangir, Ruksat Hossain, Riadul Islam, MD Abdullah Al Nasim, Md. Mahim Anjum Haque, Md Jahangir Alam, Sajedul Talukder
Comments: 18 pages, 11 figures, 2 tables; Acceptance of the chapter for the Springer book "Data-driven approaches to medical imaging"
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[13] arXiv:2306.00426 [pdf, other]
Title: Speaker verification using attentive multi-scale convolutional recurrent network
Yanxiong Li, Zhongjie Jiang, Wenchang Cao, Qisheng Huang
Comments: 21 pages, 6 figures, 8 tables. Accepted for publication in Applied Soft Computing
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[14] arXiv:2306.00433 [pdf, other]
Title: A 3-step Low-latency Low-Power Multichannel Time-to-Digital Converter based on Time Residual Amplifier
Florent Bouyjou
Subjects: Signal Processing (eess.SP)
[15] arXiv:2306.00442 [pdf, html, other]
Title: Fast Variational Block-Sparse Bayesian Learning
Jakob Möderl, Erik Leitinger, Bernard H. Fleury, Franz Pernkopf, Klaus Witrisal
Comments: 16 pages, 4 figures, submitted to IEEE Transactions on Signal Processing on 1st of June, 2023, Major Revision on Dec. 2023, Resubmission Feb 2024, Major Revision Oct. 2024
Subjects: Signal Processing (eess.SP)
[16] arXiv:2306.00446 [pdf, other]
Title: Evaluation of Multi-indicator And Multi-organ Medical Image Segmentation Models
Qi Ye, Lihua Guo
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[17] arXiv:2306.00451 [pdf, other]
Title: S$^2$ME: Spatial-Spectral Mutual Teaching and Ensemble Learning for Scribble-supervised Polyp Segmentation
An Wang, Mengya Xu, Yang Zhang, Mobarakol Islam, Hongliang Ren
Comments: MICCAI 2023 Early Acceptance
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[18] arXiv:2306.00452 [pdf, other]
Title: Speech Self-Supervised Representation Benchmarking: Are We Doing it Right?
Salah Zaiem, Youcef Kemiche, Titouan Parcollet, Slim Essid, Mirco Ravanelli
Comments: 6 pages
Journal-ref: INTERSPEECH 2023
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG)
[19] arXiv:2306.00466 [pdf, other]
Title: Space-Time Phase Coupling in STMM-based Wireless Communications
Marouan Mizmizi, Dario Tagliaferri, Marco Di Renzo, Umberto Spagnolini
Comments: 6 pages
Subjects: Signal Processing (eess.SP)
[20] arXiv:2306.00473 [pdf, other]
Title: Interpretable simultaneous localization of MRI corpus callosum and classification of atypical Parkinsonian disorders using YOLOv5
Vamshi Krishna Kancharla, Debanjali Bhattacharya, Neelam Sinha, Jitender Saini, Pramod Kumar Pal, Sandhya M
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[21] arXiv:2306.00481 [pdf, other]
Title: Automatic Data Augmentation for Domain Adapted Fine-Tuning of Self-Supervised Speech Representations
Salah Zaiem, Titouan Parcollet, Slim Essid
Comments: 6 pages, INTERSPEECH 2023
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG)
[22] arXiv:2306.00499 [pdf, html, other]
Title: DeSAM: Decoupled Segment Anything Model for Generalizable Medical Image Segmentation
Yifan Gao, Wei Xia, Dingdu Hu, Wenkui Wang, Xin Gao
Comments: MICCAI 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[23] arXiv:2306.00530 [pdf, other]
Title: CL-MRI: Self-Supervised Contrastive Learning to Improve the Accuracy of Undersampled MRI Reconstruction
Mevan Ekanayake, Zhifeng Chen, Mehrtash Harandi, Gary Egan, Zhaolin Chen
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[24] arXiv:2306.00548 [pdf, other]
Title: Label- and slide-free tissue histology using 3D epi-mode quantitative phase imaging and virtual H&E staining
Tanishq Mathew Abraham, Paloma Casteleiro Costa, Caroline Filan, Zhe Guang, Zhaobin Zhang, Stewart Neill, Jeffrey J. Olson, Richard Levenson, Francisco E. Robles
Comments: 30 pages, 9 main figures, 1 table, 5 supplementary figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph); Quantitative Methods (q-bio.QM)
[25] arXiv:2306.00598 [pdf, other]
Title: CRAP: Clutter Removal with Acquisitions Under Phase Noise
Marcus Henninger, Silvio Mandelli, Artjom Grudnitsky, Thorsten Wild, Stephan ten Brink
Comments: 8 pages, 6 figures. This work has been submitted to the IEEE for possible publication
Subjects: Signal Processing (eess.SP)
[26] arXiv:2306.00619 [pdf, other]
Title: General SIS diffusion process with indirect spreading pathways on a hypergraph
Shaoxuan Cui, Fangzhou Liu, Hildeberto Jardón-Kojakhmetov, Ming Cao
Subjects: Systems and Control (eess.SY)
[27] arXiv:2306.00625 [pdf, other]
Title: Frame-wise and overlap-robust speaker embeddings for meeting diarization
Tobias Cord-Landwehr, Christoph Boeddeker, Cătălin Zorilă, Rama Doddipatla, Reinhold Haeb-Umbach
Comments: ICASSP 2023
Subjects: Audio and Speech Processing (eess.AS)
[28] arXiv:2306.00633 [pdf, other]
Title: Low-Cost GNSS Simulators with Wireless Clock Synchronization for Indoor Positioning
Woohyun Kim, Jiwon Seo
Comments: Submitted to IEEE Access
Subjects: Signal Processing (eess.SP)
[29] arXiv:2306.00634 [pdf, other]
Title: A Teacher-Student approach for extracting informative speaker embeddings from speech mixtures
Tobias Cord-Landwehr, Christoph Boeddeker, Cătălin Zorilă, Rama Doddipatla, Reinhold Haeb-Umbach
Comments: Proceedings of INTERSPEECH
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[30] arXiv:2306.00688 [pdf, other]
Title: Coherent FDA Receiver and Joint Range-Space-Time Processing
Wenkai Jia, Andreas Jakobsson, Wen-Qin Wang
Comments: 11 pages, 9 figures
Subjects: Signal Processing (eess.SP)
[31] arXiv:2306.00716 [pdf, other]
Title: Revisiting the RBLE design based on Matlab simulation
Bohan Lou
Comments: 6 pages, 14 figures
Subjects: Signal Processing (eess.SP)
[32] arXiv:2306.00736 [pdf, other]
Title: Spoken Language Identification System for English-Mandarin Code-Switching Child-Directed Speech
Shashi Kant Gupta, Sushant Hiray, Prashant Kukde
Comments: Accepted by Interspeech 2023, 5 pages, 1 figure, 4 tables
Journal-ref: Proc. INTERSPEECH 2023, 4114--4118
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[33] arXiv:2306.00812 [pdf, other]
Title: Harmonic enhancement using learnable comb filter for light-weight full-band speech enhancement model
Xiaohuai Le, Tong Lei, Li Chen, Yiqing Guo, Chao He, Cheng Chen, Xianjun Xia, Hua Gao, Yijian Xiao, Piao Ding, Shenyi Song, Jing Lu
Comments: accepted by Interspeech 2023
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[34] arXiv:2306.00854 [pdf, other]
Title: Spatio-Angular Convolutions for Super-resolution in Diffusion MRI
Matthew Lyon, Paul Armitage, Mauricio A Álvarez
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[35] arXiv:2306.00952 [pdf, other]
Title: Meta-Learning Framework for End-to-End Imposter Identification in Unseen Speaker Recognition
Ashutosh Chaubey, Sparsh Sinha, Susmita Ghose
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[36] arXiv:2306.00985 [pdf, other]
Title: Using generative AI to investigate medical imagery models and datasets
Oran Lang, Doron Yaya-Stupp, Ilana Traynis, Heather Cole-Lewis, Chloe R. Bennett, Courtney Lyles, Charles Lau, Michal Irani, Christopher Semturs, Dale R. Webster, Greg S. Corrado, Avinatan Hassidim, Yossi Matias, Yun Liu, Naama Hammel, Boris Babenko
Comments: 43 pages, 1 figure
Journal-ref: EBioMedicine 102 (2024)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[37] arXiv:2306.00988 [pdf, other]
Title: Continual Learning for Abdominal Multi-Organ and Tumor Segmentation
Yixiao Zhang, Xinyi Li, Huimiao Chen, Alan Yuille, Yaoyao Liu, Zongwei Zhou
Comments: MICCAI-2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[38] arXiv:2306.00996 [pdf, other]
Title: Weakly-supervised forced alignment of disfluent speech using phoneme-level modeling
Theodoros Kouzelis, Georgios Paraskevopoulos, Athanasios Katsamanis, Vassilis Katsouros
Comments: Interspeech 2023
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[39] arXiv:2306.00998 [pdf, other]
Title: Towards Selection of Text-to-speech Data to Augment ASR Training
Shuo Liu, Leda Sarı, Chunyang Wu, Gil Keren, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[40] arXiv:2306.01002 [pdf, other]
Title: Adaptive ship-radiated noise recognition with learnable fine-grained wavelet transform
Yuan Xie, Jiawei Ren, Ji Xu
Journal-ref: Ocean Engineering 265 (2022): 112626
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[41] arXiv:2306.01011 [pdf, other]
Title: Data-driven modeling and parameter estimation of Nonlinear systems
Kaushal Kumar
Comments: 20 pages, 6 figures
Journal-ref: Eur. Phys. J. B 96, 107 (2023)
Subjects: Systems and Control (eess.SY); Optimization and Control (math.OC)
[42] arXiv:2306.01022 [pdf, other]
Title: Introduction of Medical Imaging Modalities
S. K. M Shadekul Islam, MD Abdullah Al Nasim, Ismail Hossain, Md Azim Ullah, Kishor Datta Gupta, Md Monjur Hossain Bhuiyan
Comments: 19 pages, 7 figures, 1 table; Acceptance of the chapter for the Springer book "Data-driven approaches to medical imaging"
Subjects: Image and Video Processing (eess.IV); Medical Physics (physics.med-ph)
[43] arXiv:2306.01025 [pdf, other]
Title: Safe Environmental Envelopes of Discrete Systems
Rômulo Meira-Góes, Ian Dardik, Eunsuk Kang, Stéphane Lafortune, Stavros Tripakis
Comments: Full version of CAV23 paper
Subjects: Systems and Control (eess.SY); Formal Languages and Automata Theory (cs.FL)
[44] arXiv:2306.01100 [pdf, other]
Title: ALO-VC: Any-to-any Low-latency One-shot Voice Conversion
Bohan Wang, Damien Ronssin, Milos Cernak
Comments: Accepted to Interspeech 2023. Some audio samples are available at this https URL
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[45] arXiv:2306.01120 [pdf, other]
Title: Frequency-dependent Switching Control for Disturbance Attenuation of Linear Systems
Jingjing Zhang, Jan Heiland, Peter Benner, Xin Du
Subjects: Systems and Control (eess.SY)
[46] arXiv:2306.01177 [pdf, other]
Title: The Effects of Varying Penetration Rates of L4-L5 Autonomous Vehicles on Fuel Efficiency and Mobility of Traffic Networks
Ozgenur Kavas-Torris, M. Ridvan Cantas, Karina Meneses Cime, Bilin Aksun-Guvenc, Levent Guvenc
Subjects: Systems and Control (eess.SY)
[47] arXiv:2306.01190 [pdf, html, other]
Title: Identifying visible tissue in intraoperative ultrasound: a method and application
Alistair Weld, Luke Dixon, Giulio Anichini, Michael Dyck, Alex Ranne, Sophie Camp, Stamatia Giannarou
Journal-ref: Int J CARS (2025)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[48] arXiv:2306.01208 [pdf, other]
Title: Adapting an Unadaptable ASR System
Rao Ma, Mengjie Qian, Mark J. F. Gales, Kate M. Knill
Comments: Proceedings of INTERSPEECH
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[49] arXiv:2306.01210 [pdf, other]
Title: A new method using deep transfer learning on ECG to predict the response to cardiac resynchronization therapy
Zhuo He, Hongjin Si, Xinwei Zhang, Qing-Hui Chen, Jiangang Zou, Weihua Zhou
Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
[50] arXiv:2306.01219 [pdf, other]
Title: Brezinski Inverse and Geometric Product-Based Steffensen's Methods for Image Reverse Filtering
Guang Deng
Subjects: Signal Processing (eess.SP)
[51] arXiv:2306.01232 [pdf, other]
Title: Deep Reinforcement Learning Framework for Thoracic Diseases Classification via Prior Knowledge Guidance
Weizhi Nie, Chen Zhang, Dan Song, Lina Zhao, Yunpeng Bai, Keliang Xie, Anan Liu
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[52] arXiv:2306.01247 [pdf, other]
Title: Tensor decomposition for minimization of E2E SLU model toward on-device processing
Yosuke Kashiwagi, Siddhant Arora, Hayato Futami, Jessica Huynh, Shih-Lun Wu, Yifan Peng, Brian Yan, Emiru Tsunoo, Shinji Watanabe
Comments: Accepted by INTERSPEECH 2023
Subjects: Audio and Speech Processing (eess.AS)
[53] arXiv:2306.01251 [pdf, other]
Title: Average AoI Minimization for Energy Harvesting Relay-aided Status Update Network Using Deep Reinforcement Learning
Sin-Yu Huang, Kuang-Hao (Stanley) Liu
Comments: This article has been accepted for publication in IEEE Wireless Communications Letters. Citation information: DOI https://doi.org/10.1109/LWC.2023.3278864
Subjects: Systems and Control (eess.SY)
[54] arXiv:2306.01252 [pdf, other]
Title: Deep Learning based Skin-layer Segmentation for Characterizing Cutaneous Wounds from Optical Coherence Tomography Images
Prashant Kumar, Swatantra Dhara, Ayan Gope, Jyotirmoy Chatterjee, Subhamoy Mandal
Comments: Accepted
Journal-ref: 45th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2023
Subjects: Image and Video Processing (eess.IV); Medical Physics (physics.med-ph)
[55] arXiv:2306.01289 [pdf, html, other]
Title: nnMobileNet: Rethinking CNN for Retinopathy Research
Wenhui Zhu, Peijie Qiu, Xiwen Chen, Xin Li, Natasha Lepore, Oana M. Dumitrascu, Yalin Wang
Comments: Accepted as a conference paper to 2024 CVPRW
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[56] arXiv:2306.01296 [pdf, other]
Title: Improved Training for End-to-End Streaming Automatic Speech Recognition Model with Punctuation
Hanbyul Kim, Seunghyun Seo, Lukas Lee, Seolki Baek
Comments: Accepted at INTERSPEECH 2023
Journal-ref: Proc. INTERSPEECH 2023, 1653-1657
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL)
[57] arXiv:2306.01320 [pdf, other]
Title: Synchro-Transient-Extracting Transform for the Analysis of Signals with Both Harmonic and Impulsive Components
Yunlong Ma, Gang Yu, Tianran Lin, Qingtang Jiang
Subjects: Signal Processing (eess.SP)
[58] arXiv:2306.01332 [pdf, other]
Title: Differentiable Grey-box Modelling of Phaser Effects using Frame-based Spectral Processing
Alistair Carson, Cassia Valentini-Botinhao, Simon King, Stefan Bilbao
Comments: Accepted for publication in Proc. DAFx23, Copenhagen, Denmark, September 2023
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[59] arXiv:2306.01357 [pdf, other]
Title: Model-based demosaicking for acquisitions by a RGBW color filter array
Matthieu Muller (GIPSA-SIGMAPHY), Daniele Picone (GIPSA-SIGMAPHY), Mauro Dalla Mura (GIPSA-SIGMAPHY, IUF), Magnus O Ulfarsson
Subjects: Image and Video Processing (eess.IV)
[60] arXiv:2306.01375 [pdf, other]
Title: Robust and Generalisable Segmentation of Subtle Epilepsy-causing Lesions: a Graph Convolutional Approach
Hannah Spitzer, Mathilde Ripart, Abdulah Fawaz, Logan Z. J. Williams, MELD project, Emma Robinson, Juan Eugenio Iglesias, Sophie Adler, Konrad Wagstyl
Comments: accepted at MICCAI 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[61] arXiv:2306.01385 [pdf, other]
Title: Task-Agnostic Structured Pruning of Speech Representation Models
Haoyu Wang, Siyuan Wang, Wei-Qiang Zhang, Hongbin Suo, Yulong Wan
Comments: Accepted by INTERSPEECH 2023
Journal-ref: INTERSPEECH (2023) 231-235
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[62] arXiv:2306.01387 [pdf, html, other]
Title: Physics-Augmented Data-EnablEd Predictive Control for Eco-driving of Mixed Traffic Considering Diverse Human Behaviors
Dongjun Li, Kaixiang Zhang, Haoxuan Dong, Qun Wang, Zhaojian Li, Ziyou Song
Subjects: Systems and Control (eess.SY)
[63] arXiv:2306.01408 [pdf, other]
Title: Efficient Ray-Tracing Channel Emulation in Industrial Environments: An Analysis of Propagation Model Impact
Gurjot Singh Bhatia, Yoann Corre, M. Di Renzo
Comments: copyright 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works
Subjects: Signal Processing (eess.SP); Networking and Internet Architecture (cs.NI)
[64] arXiv:2306.01411 [pdf, other]
Title: HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders
Doyeon Kim, Soo-Whan Chung, Hyewon Han, Youna Ji, Hong-Goo Kang
Comments: Accepted by INTERSPEECH 2023
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[65] arXiv:2306.01425 [pdf, other]
Title: Active Noise Control in The New Century: The Role and Prospect of Signal Processing
Dongyuan Shi, Bhan Lam, Woon-Seng Gan, Jordan Cheer, Stephen J. Elliott
Comments: Submitted to this http URL 2023, Chiba, Japan
Subjects: Audio and Speech Processing (eess.AS); Signal Processing (eess.SP); Systems and Control (eess.SY)
[66] arXiv:2306.01432 [pdf, other]
Title: Audio-Visual Speech Enhancement with Score-Based Generative Models
Julius Richter, Simone Frintrop, Timo Gerkmann
Comments: Submitted to ITG Conference on Speech Communication
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG)
[67] arXiv:2306.01433 [pdf, html, other]
Title: Blind Audio Bandwidth Extension: A Diffusion-Based Zero-Shot Approach
Eloi Moliner, Filip Elvander, Vesa Välimäki
Comments: Submitted to IEEE/ACM Transactions on Audio, Speech and Language Processing
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[68] arXiv:2306.01469 [pdf, other]
Title: GANs and alternative methods of synthetic noise generation for domain adaption of defect classification of Non-destructive ultrasonic testing
Shaun McKnight, S. Gareth Pierce, Ehsan Mohseni, Christopher MacKinnon, Charles MacLeod, Tom OHare, Charalampos Loukas
Comments: 16 Pages
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG)
[69] arXiv:2306.01482 [pdf, other]
Title: Joint User Association and UAV Location Optimization for Two-Tired Visible Light Communication Networks
Alireza Qazavi, Foroogh S Tabataba, Mehdi Naderi Soorki
Comments: 7 pages, 5 figures, conference
Journal-ref: 2022 30th International Conference on Electrical Engineering (ICEE)
Subjects: Signal Processing (eess.SP)
[70] arXiv:2306.01522 [pdf, other]
Title: Auditory Representation Effective for Estimating Vocal Tract Information
Toshio Irino, Shintaro Doan
Comments: This manuscript is a revised version after acceptance for publication in Proc. APSIPA ASC 2023 on August 25, 2023
Journal-ref: Proc. APSIPA ASC 2023
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[71] arXiv:2306.01538 [pdf, other]
Title: On Crowdsourcing-design with Comparison Category Rating for Evaluating Speech Enhancement Algorithms
Angélica S. Z. Suárez, Clément Laroche, Line H. Clemmensen, Sneha Das
Comments: Published at ICASSP 2023
Subjects: Audio and Speech Processing (eess.AS)
[72] arXiv:2306.01546 [pdf, html, other]
Title: Publicly available datasets of breast histopathology H&E whole-slide images: A scoping review
Masoud Tafavvoghi (1), Lars Ailo Bongo (2), Nikita Shvetsov (2), Lill-Tove Rasmussen Busund (3), Kajsa Møllersen (1) ((1) Department of Community Medicine, UiT The Arctic University of Norway, Tromsø, Norway, (2) Department of Computer Science, UiT The Arctic University of Norway, Tromsø, Norway, (3) Department of Medical Biology, UiT The Arctic University of Norway, Tromsø, Norway)
Comments: 27 pages (including references), 8 figures, 3 tables, 5 supporting information materials
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[73] arXiv:2306.01553 [pdf, other]
Title: Detecting Low Pass Graph Signals via Spectral Pattern: Sampling Complexity and Applications
Chenyue Zhang, Yiran He, Hoi-To Wai
Comments: 15 pages, 11 figures, accepted by IEEE Transactions on Signal Processing
Subjects: Signal Processing (eess.SP)
[74] arXiv:2306.01562 [pdf, other]
Title: An Attentive-based Generative Model for Medical Image Synthesis
Jiayuan Wang, Q. M. Jonathan Wu, Farhad Pourpanah
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[75] arXiv:2306.01582 [pdf, html, other]
Title: Scale-free Non-collaborative Linear Protocol Design for A Class of Homogeneous Multi-agent Systems
Zhenwei Liu, Ali Saberi, Anton A. Stoorvogel
Comments: This paper was accepted for publication in IEEE Transactions on Automatic Control, Vol. 69, Issue 5, 2024
Subjects: Systems and Control (eess.SY)
[76] arXiv:2306.01611 [pdf, other]
Title: Training Terahertz Wireless Systems to Battle I/Q Imbalance
Alexandros-Apostolos A. Boulogeorgos, Angeliki Alexiou
Comments: 6 pages, 4 figures
Journal-ref: IEEE Balkan 2023
Subjects: Signal Processing (eess.SP)
[77] arXiv:2306.01630 [pdf, other]
Title: A Conditional Normalizing Flow for Accelerated Multi-Coil MR Imaging
Jeffrey Wen, Rizwan Ahmad, Philip Schniter
Comments: Accepted to ICML 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[78] arXiv:2306.01689 [pdf, other]
Title: Unique Brain Network Identification Number for Parkinson's Individuals Using Structural MRI
Tanmayee Samantaray, Utsav Gupta, Jitender Saini, Cota Navin Gupta
Comments: 15 pages, 5 figures, 1 algorithm, 1 main table, 1 appendix table
Journal-ref: Brain Sciences, vol. 13, no. 9, 08 Sep. 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
[79] arXiv:2306.01752 [pdf, other]
Title: Handling Label Uncertainty on the Example of Automatic Detection of Shepherd's Crook RCA in Coronary CT Angiography
Felix Denzinger, Michael Wels, Oliver Taubmann, Florian Kordon, Fabian Wagner, Stephanie Mehltretter, Mehmet A. Gülsün, Max Schöbinger, Florian André, Sebastian Buss, Johannes Görich, Michael Sühling, Andreas Maier
Comments: Accepted at ISBI 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[80] arXiv:2306.01808 [pdf, html, other]
Title: Morphology Edge Attention Network and Optimal Geometric Matching Connection model for vascular segmentation
Yuntao Zhu, Yuxuan Qiao, Xiaoping Yang
Comments: 6 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[81] arXiv:2306.01827 [pdf, other]
Title: Active Learning on Medical Image
Angona Biswas, MD Abdullah Al Nasim, Md Shahin Ali, Ismail Hossain, Md Azim Ullah, Sajedul Talukder
Comments: 12 pages, 8 figures; Acceptance of the chapter for the Springer book "Data-driven approaches to medical imaging"
Subjects: Image and Video Processing (eess.IV)
[82] arXiv:2306.01853 [pdf, other]
Title: Multi-Contrast Computed Tomography Atlas of Healthy Pancreas
Yinchi Zhou, Ho Hin Lee, Yucheng Tang, Xin Yu, Qi Yang, Shunxing Bao, Jeffrey M. Spraggins, Yuankai Huo, Bennett A. Landman
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[83] arXiv:2306.01861 [pdf, other]
Title: Non-uniform Speaker Disentanglement For Depression Detection From Raw Speech Signals
Jinhan Wang, Vijay Ravi, Abeer Alwan
Comments: Accepted to Interspeech 2023
Subjects: Audio and Speech Processing (eess.AS)
[84] arXiv:2306.01894 [pdf, other]
Title: Atmospheric Influence on the Path Loss at High Frequencies for Deployment of 5G Cellular Communication Networks
Rashed Hasan Ratul, S M Mehedi Zaman, Hasib Arman Chowdhury, Md. Zayed Hassan Sagor, Mohammad Tawhid Kawser, Mirza Muntasir Nishat
Comments: Accepted for presentation at THE 14th INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT)
Subjects: Signal Processing (eess.SP)
[85] arXiv:2306.01914 [pdf, html, other]
Title: On the Sample Complexity of Imitation Learning for Smoothed Model Predictive Control
Daniel Pfrommer, Swati Padmanabhan, Kwangjun Ahn, Jack Umenberger, Tobia Marcucci, Zakaria Mhammedi, Ali Jadbabaie
Comments: 15 pages, 2 figures. Preliminary version accepted to CDC 2024
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG)
[86] arXiv:2306.01916 [pdf, other]
Title: In-the-wild Speech Emotion Conversion Using Disentangled Self-Supervised Representations and Neural Vocoder-based Resynthesis
Navin Raj Prabhu, Nale Lehmann-Willenbrock, Timo Gerkmann
Comments: Submitted to 15th ITG Conference on Speech Communication
Subjects: Audio and Speech Processing (eess.AS); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[87] arXiv:2306.01950 [pdf, other]
Title: Fast and Interpretable Nonlocal Neural Networks for Image Denoising via Group-Sparse Convolutional Dictionary Learning
Nikola Janjušević, Amirhossein Khalilian-Gourtani, Adeen Flinker, Yao Wang
Comments: 11 pages, 8 figures, 6 tables
Subjects: Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[88] arXiv:2306.01957 [pdf, other]
Title: Speaker-independent neural formant synthesis
Pablo Pérez Zarazaga, Zofia Malisz, Gustav Eje Henter, Lauri Juvela
Comments: 5 pages, 4 figures. Article accepted at INTERSPEECH 2023
Subjects: Audio and Speech Processing (eess.AS)
[89] arXiv:2306.01971 [pdf, other]
Title: Impact of Different Desired Velocity Profiles and Controller Gains on Convoy Driveability of Cooperative Adaptive Cruise Control Operated Platoons
Santhosh Tamilarasan, Levent Guvenc
Subjects: Systems and Control (eess.SY)
[90] arXiv:2306.01981 [pdf, other]
Title: SGEM: Test-Time Adaptation for Automatic Speech Recognition via Sequential-Level Generalized Entropy Minimization
Changhun Kim, Joonhyung Park, Hajin Shim, Eunho Yang
Comments: INTERSPEECH 2023 Oral Presentation; Code is available at this https URL
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[91] arXiv:2306.02016 [pdf, other]
Title: Converse negative imaginary theorems
Sei Zhen Khong, Di Zhao, Alexander Lanzon
Comments: This paper has been submitted for possible publication at Automatica
Subjects: Systems and Control (eess.SY); Optimization and Control (math.OC)
[92] arXiv:2306.02017 [pdf, other]
Title: Resilient Distributed Parameter Estimation in Sensor Networks
Jiaqi Yan, Kuo Li, Hideaki Ishii
Subjects: Systems and Control (eess.SY)
[93] arXiv:2306.02020 [pdf, other]
Title: Replay Attack Detection Based on Parity Space Method for Cyber-Physical Systems
Dong Zhao, Yang Shi, Steven X. Ding, Yueyang Li, Fangzhou Fu
Subjects: Systems and Control (eess.SY)
[94] arXiv:2306.02037 [pdf, other]
Title: A Peer-to-peer Federated Continual Learning Network for Improving CT Imaging from Multiple Institutions
Hao Wang, Ruihong He, Xiaoyu Zhang, Zhaoying Bian, Dong Zeng, Jianhua Ma
Subjects: Image and Video Processing (eess.IV)
[95] arXiv:2306.02044 [pdf, other]
Title: Why We Should Report the Details in Subjective Evaluation of TTS More Rigorously
Cheng-Han Chiang, Wei-Ping Huang, Hung-yi Lee
Comments: Interspeech 2023 camera-ready version
Subjects: Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[96] arXiv:2306.02053 [pdf, other]
Title: Few-shot Class-incremental Audio Classification Using Stochastic Classifier
Yanxiong Li, Wenchang Cao, Jialong Li, Wei Xie, Qianhua He
Comments: 5 pages, 3 figures, 4 tables. Accepted for publication in INTERSPEECH 2023
Subjects: Audio and Speech Processing (eess.AS)
[97] arXiv:2306.02054 [pdf, other]
Title: Low-Complexity Acoustic Scene Classification Using Data Augmentation and Lightweight ResNet
Yanxiong Li, Wenchang Cao, Wei Xie, Qisheng Huang, Wenfeng Pang, Qianhua He
Comments: 5 pages, 5 figures, 4 tables. Accepted for publication in the 16th IEEE International Conference on Signal Processing (IEEE ICSP)
Subjects: Audio and Speech Processing (eess.AS)
[98] arXiv:2306.02057 [pdf, other]
Title: DataAI-6G: A System Parameters Configurable Channel Dataset for AI-6G Research
Zibing Shen, Jianhua Zhang, Li Yu, Yuxiang Zhang, Zhen Zhang, Xidong Hu
Subjects: Signal Processing (eess.SP)
[99] arXiv:2306.02070 [pdf, other]
Title: Adaptive Approximation-Based Control for Nonlinear Systems: A Unified Solution with Accurate and Inaccurate Measurements
Dong Zhao
Subjects: Systems and Control (eess.SY)
[100] arXiv:2306.02176 [pdf, other]
Title: TransRUPNet for Improved Polyp Segmentation
Debesh Jha, Nikhil Kumar Tomar, Debayan Bhattacharya, Ulas Bagci
Comments: Accepted at EMBC 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)