Skip to main content

Showing 1–50 of 67 results for author: Smith, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2505.04082  [pdf, ps, other

    eess.AS cs.SD eess.SP

    Aliasing Reduction in Neural Amp Modeling by Smoothing Activations

    Authors: Ryota Sato, Julius O. Smith III

    Abstract: The increasing demand for high-quality digital emulations of analog audio hardware such as vintage guitar amplifiers has led to numerous works in neural-network-based black-box modeling, with deep learning architectures like WaveNet showing promising results. However, a key limitation in all of these models is the aliasing artifacts that arise from the use of nonlinear activation functions in neur… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: Accepted to DAFx 2025

  2. arXiv:2503.08978  [pdf, other

    cs.RO cs.LG eess.SY

    TetraGrip: Sensor-Driven Multi-Suction Reactive Object Manipulation in Cluttered Scenes

    Authors: Paolo Torrado, Joshua Levin, Markus Grotz, Joshua Smith

    Abstract: Warehouse robotic systems equipped with vacuum grippers must reliably grasp a diverse range of objects from densely packed shelves. However, these environments present significant challenges, including occlusions, diverse object orientations, stacked and obstructed items, and surfaces that are difficult to suction. We introduce \tetra, a novel vacuum-based grasping strategy featuring four suction… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

  3. arXiv:2502.19315  [pdf, ps, other

    physics.app-ph cond-mat.mes-hall cond-mat.mtrl-sci eess.SY physics.chem-ph

    Epitaxial high-K AlBN barrier GaN HEMTs

    Authors: Chandrashekhar Savant, Thai-Son Nguyen, Kazuki Nomoto, Saurabh Vishwakarma, Siyuan Ma, Akshey Dhar, Yu-Hsin Chen, Joseph Casamento, David J. Smith, Huili Grace Xing, Debdeep Jena

    Abstract: We report a polarization-induced 2D electron gas (2DEG) at an epitaxial AlBN/GaN heterojunction grown on a SiC substrate. Using this 2DEG in a long conducting channel, we realize ultra-thin barrier AlBN/GaN high electron mobility transistors that exhibit current densities of more than 0.25 A/mm, clean current saturation, a low pinch-off voltage of -0.43 V, and a peak transconductance of 0.14 S/mm.… ▽ More

    Submitted 26 February, 2025; originally announced February 2025.

    Comments: Manuscript: 7 pages, 5 figures and Supplementary data: 2 pages, 4 figures

  4. arXiv:2502.00249  [pdf, other

    eess.SP

    A Hodge-FAST Framework for High-Resolution Dynamic Functional Connectivity Analysis of Higher Order Interactions in EEG Signals

    Authors: Om Roy, Yashar Moshfeghi, Jason Smith, Agustin Ibanez, Mario A. Parra, Keith M. Smith

    Abstract: We introduce a novel framework that integrates Hodge decomposition with Filtered Average Short-Term (FAST) functional connectivity to analyze dynamic functional connectivity (DFC) in EEG signals. This method leverages graph-based topology and simplicial analysis to explore transient connectivity patterns at multiple scales, addressing noise, sparsity, and computational efficiency. The temporal EEG… ▽ More

    Submitted 7 February, 2025; v1 submitted 31 January, 2025; originally announced February 2025.

  5. arXiv:2501.08469  [pdf, other

    cs.RO eess.SY

    Electrostatic Clutches Enable Simultaneous Mechanical Multiplexing

    Authors: Timothy E. Amish, Jeffrey T. Auletta, Chad C. Kessens, Joshua R. Smith, Jeffrey I. Lipton

    Abstract: Actuating robotic systems with multiple degrees of freedom (DoF) traditionally requires numerous motors, leading to increased size, weight, cost, and power consumption. Mechanical multiplexing offers a solution by enabling a single actuator to control multiple DoF. However, existing multiplexers have either been limited to electrically controlled time-based multiplexing that control one DoF at a t… ▽ More

    Submitted 21 March, 2025; v1 submitted 14 January, 2025; originally announced January 2025.

  6. arXiv:2412.11967  [pdf, other

    cs.LG eess.SY

    A Digital twin for Diesel Engines: Operator-infused PINNs with Transfer Learning for Engine Health Monitoring

    Authors: Kamaljyoti Nath, Varun Kumar, Daniel J. Smith, George Em Karniadakis

    Abstract: Improving diesel engine efficiency and emission reduction have been critical research topics. Recent government regulations have shifted this focus to another important area related to engine health and performance monitoring. Although the advancements in the use of deep learning methods for system monitoring have shown promising results in this direction, designing efficient methods suitable for… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

  7. arXiv:2411.15965  [pdf, other

    eess.SP

    Phase Selection and Analysis for Multi-frequency Multi-user RIS Systems Employing Subsurfaces in Correlated Ricean and Rayleigh Environments

    Authors: Amy S. Inwood, Peter J. Smith, Philippa A. Martin, Graeme K. Woodward

    Abstract: We analyse the performance of a reconfigurable intelligent surface (RIS) aided system where the RIS is divided into subsurfaces. Each subsurface is designed specifically for one user, who is served on their own frequency band. The other subsurfaces (those not designed for this user) provide additional uncontrolled scattering. We derive the exact closed-form expression for the mean signal-to-noise… ▽ More

    Submitted 24 November, 2024; originally announced November 2024.

  8. arXiv:2411.01878  [pdf, ps, other

    cs.IT eess.SP

    Rician Channel Modelling for Super Wideband MIMO Communications

    Authors: Sachitha C. Bandara, Peter J. Smith, Erfan Khordad, Robin Evans, Rajitha Senanayake

    Abstract: Recent developments in Multiple-Input-Multiple-Output (MIMO) technology include packing a large number of antenna elements in a compact array to access the bandwidth benefits provided by higher mutual coupling (MC). The resulting super-wideband (SW) systems require a circuit-theoretic framework to handle the MC and channel models which span extremely large bands. Hence, in this paper, we make two… ▽ More

    Submitted 26 May, 2025; v1 submitted 4 November, 2024; originally announced November 2024.

    Comments: This paper has been accepted and presented at the IEEE WCNC 2025

  9. arXiv:2409.03055  [pdf, other

    cs.SD eess.AS

    SymPAC: Scalable Symbolic Music Generation With Prompts And Constraints

    Authors: Haonan Chen, Jordan B. L. Smith, Janne Spijkervet, Ju-Chiang Wang, Pei Zou, Bochen Li, Qiuqiang Kong, Xingjian Du

    Abstract: Progress in the task of symbolic music generation may be lagging behind other tasks like audio and text generation, in part because of the scarcity of symbolic training data. In this paper, we leverage the greater scale of audio music data by applying pre-trained MIR models (for transcription, beat tracking, structure analysis, etc.) to extract symbolic events and encode them into token sequences.… ▽ More

    Submitted 9 September, 2024; v1 submitted 4 September, 2024; originally announced September 2024.

    Comments: ISMIR 2024

  10. arXiv:2409.00078  [pdf, other

    eess.SP cs.LG cs.NI

    SGP-RI: A Real-Time-Trainable and Decentralized IoT Indoor Localization Model Based on Sparse Gaussian Process with Reduced-Dimensional Inputs

    Authors: Zhe Tang, Sihao Li, Zichen Huang, Guandong Yang, Kyeong Soo Kim, Jeremy S. Smith

    Abstract: Internet of Things (IoT) devices are deployed in the filed, there is an enormous amount of untapped potential in local computing on those IoT devices. Harnessing this potential for indoor localization, therefore, becomes an exciting research area. Conventionally, the training and deployment of indoor localization models are based on centralized servers with substantial computational resources. Thi… ▽ More

    Submitted 24 August, 2024; originally announced September 2024.

    Comments: 10 pages, 4 figures, under review for journal publication

  11. arXiv:2408.16623  [pdf, other

    cs.CV cs.LG eess.IV

    Turbulence Strength $C_n^2$ Estimation from Video using Physics-based Deep Learning

    Authors: Ripon Kumar Saha, Esen Salcin, Jihoo Kim, Joseph Smith, Suren Jayasuriya

    Abstract: Images captured from a long distance suffer from dynamic image distortion due to turbulent flow of air cells with random temperatures, and thus refractive indices. This phenomenon, known as image dancing, is commonly characterized by its refractive-index structure constant $C_n^2$ as a measure of the turbulence strength. For many applications such as atmospheric forecast model, long-range/astronom… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: Code Available: https://github.com/Riponcs/Cn2Estimation

    Journal ref: Optics Express 30, 40854-40870 (2022)

  12. arXiv:2405.06147  [pdf, other

    cs.LG eess.SY

    State-Free Inference of State-Space Models: The Transfer Function Approach

    Authors: Rom N. Parnichkun, Stefano Massaroli, Alessandro Moro, Jimmy T. H. Smith, Ramin Hasani, Mathias Lechner, Qi An, Christopher Ré, Hajime Asama, Stefano Ermon, Taiji Suzuki, Atsushi Yamashita, Michael Poli

    Abstract: We approach designing a state-space model for deep learning applications through its dual representation, the transfer function, and uncover a highly efficient sequence parallel inference algorithm that is state-free: unlike other proposed algorithms, state-free inference does not incur any significant memory or computational cost with an increase in state size. We achieve this using properties of… ▽ More

    Submitted 1 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: Resubmission 02/06/2024: Fixed minor typo of recurrent form RTF

  13. arXiv:2405.03762  [pdf, other

    eess.IV cs.CV

    Swin transformers are robust to distribution and concept drift in endoscopy-based longitudinal rectal cancer assessment

    Authors: Jorge Tapias Gomez, Aneesh Rangnekar, Hannah Williams, Hannah Thompson, Julio Garcia-Aguilar, Joshua Jesse Smith, Harini Veeraraghavan

    Abstract: Endoscopic images are used at various stages of rectal cancer treatment starting from cancer screening, diagnosis, during treatment to assess response and toxicity from treatments such as colitis, and at follow up to detect new tumor or local regrowth (LR). However, subjective assessment is highly variable and can underestimate the degree of response in some patients, subjecting them to unnecessar… ▽ More

    Submitted 30 January, 2025; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted at SPIE Medical Imaging 2025

  14. arXiv:2405.00945  [pdf, other

    cs.IT eess.SP

    Can FSK Be Optimised for Integrated Sensing and Communications?

    Authors: Tian Han, Peter J Smith, Urbashi Mitra, Jamie S Evans, Rajitha Senanayake

    Abstract: Motivated by the ideal peak-to-average-power ratio and radar sensing capability of traditional frequency-coded radar waveforms, this paper considers the frequency shift keying (FSK) based waveform for joint communications and radar (JCR). An analysis of the probability distributions of its ambiguity function (AF) sidelobe levels (SLs) and peak sidelobe level (PSL) is conducted to study the radar s… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: Submitted to IEEE Transactions on Wireless Communications, 13 pages, 6 figures

  15. arXiv:2312.13976  [pdf

    physics.med-ph cs.AI cs.CG eess.IV q-bio.QM

    Anatomical basis of human sex differences in ECG identified by automated torso-cardiac three-dimensional reconstruction

    Authors: Hannah J. Smith, Blanca Rodriguez, Yuling Sang, Marcel Beetz, Robin Choudhury, Vicente Grau, Abhirup Banerjee

    Abstract: Background and Aims: The electrocardiogram (ECG) is routinely used for diagnosis and risk stratification following myocardial infarction (MI), though its interpretation is confounded by anatomical variability and sex differences. Women have a higher incidence of missed MI diagnosis and poorer outcomes following infarction. Sex differences in ECG biomarkers and torso-ventricular anatomy have not be… ▽ More

    Submitted 17 July, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Paper under revision

  16. Johnsen-Rahbek Capstan Clutch: A High Torque Electrostatic Clutch

    Authors: Timothy E. Amish, Jeffrey T. Auletta, Chad C. Kessens, Joshua R. Smith, Jeffrey I. Lipton

    Abstract: In many robotic systems, the holding state consumes power, limits operating time, and increases operating costs. Electrostatic clutches have the potential to improve robotic performance by generating holding torques with low power consumption. A key limitation of electrostatic clutches has been their low specific shear stresses which restrict generated holding torque, limiting many applications. H… ▽ More

    Submitted 27 March, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Journal ref: 2024 IEEE International Conference on Robotics and Automation (ICRA)

  17. arXiv:2312.12244  [pdf, other

    cs.IT eess.SP

    Protecting Massive MIMO-Radar Coexistence: Precoding Design and Power Control

    Authors: Mohamed Elfiatoure, Mohammadali Mohammadi, Hien Quoc Ngo, Peter J. Smith, Michail Matthaiou

    Abstract: This paper studies the coexistence between a downlink multiuser massive multi-input-multi-output (MIMO) communication system and MIMO radar. The performance of the massive MIMO system with maximum ratio ($\MR$), zero-forcing ($\ZF$), and protective $\ZF$ ($\PZF$) precoding designs is characterized in terms of spectral efficiency (SE) and by taking the channel estimation errors and power control in… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 10 Figures, IEEE Open Journal of the Communication society

  18. arXiv:2311.01058  [pdf, ps, other

    eess.SP

    Continuous Fluid Antenna Systems: Modeling and Analysis

    Authors: Constantinos Psomas, Peter J. Smith, Himal A. Suraweera, Ioannis Krikidis

    Abstract: Fluid antennas (FAs) is a promising technology for introducing flexibility and reconfigurability in wireless networks. Recent research efforts have highlighted the potential gains that can be achieved in comparison to conventional antennas. These works assume that the FA has a discrete number of positions that the liquid can take. However, from a practical standpoint, the liquid moves in a continu… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: IEEE Communications Letters

  19. arXiv:2309.09352  [pdf, other

    eess.SP

    Frequency Estimation Using Complex-Valued Shifted Window Transformer

    Authors: Josiah W. Smith, Murat Torlak

    Abstract: Estimating closely spaced frequency components of a signal is a fundamental problem in statistical signal processing. In this letter, we introduce 1-D real-valued and complex-valued shifted window (Swin) transformers, referred to as SwinFreq and CVSwinFreq, respectively, for line-spectra frequency estimation on 1-D complex-valued signals. Whereas 2-D Swin transformer-based models have gained tract… ▽ More

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: Submitted to IEEE Geoscience and Remote Sensing Letters

  20. arXiv:2309.08844  [pdf, other

    eess.SP cs.AI

    Emerging Approaches for THz Array Imaging: A Tutorial Review and Software Tool

    Authors: Josiah W. Smith, Murat Torlak

    Abstract: Accelerated by the increasing attention drawn by 5G, 6G, and Internet of Things applications, communication and sensing technologies have rapidly evolved from millimeter-wave (mmWave) to terahertz (THz) in recent years. Enabled by significant advancements in electromagnetic (EM) hardware, mmWave and THz frequency regimes spanning 30 GHz to 300 GHz and 300 GHz to 3000 GHz, respectively, can be empl… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: Submitted to Proceedings of IEEE

  21. arXiv:2309.07948  [pdf, other

    eess.SP cs.LG

    Complex-Valued Neural Networks for Data-Driven Signal Processing and Signal Understanding

    Authors: Josiah W. Smith

    Abstract: Complex-valued neural networks have emerged boasting superior modeling performance for many tasks across the signal processing, sensing, and communications arenas. However, developing complex-valued models currently demands development of basic deep learning operations, such as linear or convolution layers, as modern deep learning frameworks like PyTorch and Tensor flow do not adequately support c… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: Preprint

  22. arXiv:2309.00006  [pdf, other

    eess.SP cs.CV eess.IV

    Dual Radar SAR Controller

    Authors: Josiah Smith

    Abstract: The following is a user guide for the Dual Radar SAR Controller graphical user interface (GUI) to operate the dual radar synthetic aperture radar (SAR) scanner. The scanner was designed in the Spring semester of 2022 by Josiah Smith (RA), Yusef Alimam (UG), and Geetika Vedula (UG) with multiple axes of motion for the radar and target under test. The system is operated by a personal computer (PC) r… ▽ More

    Submitted 27 June, 2023; originally announced September 2023.

  23. arXiv:2307.09063  [pdf, other

    eess.SP

    Radar-STDA: A High-Performance Spatial-Temporal Denoising Autoencoder for Interference Mitigation of FMCW Radars

    Authors: Lulu Liu, Runwei Guan, Fei Ma, Jeremy Smith, Yutao Yue

    Abstract: With its small size, low cost and all-weather operation, millimeter-wave radar can accurately measure the distance, azimuth and radial velocity of a target compared to other traffic sensors. However, in practice, millimeter-wave radars are plagued by various interferences, leading to a drop in target detection accuracy or even failure to detect targets. This is undesirable in autonomous vehicles a… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  24. arXiv:2306.15341  [pdf, other

    eess.SP cs.AI cs.CV eess.IV

    Novel Hybrid-Learning Algorithms for Improved Millimeter-Wave Imaging Systems

    Authors: Josiah Smith

    Abstract: Increasing attention is being paid to millimeter-wave (mmWave), 30 GHz to 300 GHz, and terahertz (THz), 300 GHz to 10 THz, sensing applications including security sensing, industrial packaging, medical imaging, and non-destructive testing. Traditional methods for perception and imaging are challenged by novel data-driven algorithms that offer improved resolution, localization, and detection rates.… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: PhD Dissertation Submitted to UTD ECE Department

  25. arXiv:2306.15244  [pdf, other

    cs.CV eess.IV

    Cutting-Edge Techniques for Depth Map Super-Resolution

    Authors: Ryan Peterson, Josiah Smith

    Abstract: To overcome hardware limitations in commercially available depth sensors which result in low-resolution depth maps, depth map super-resolution (DMSR) is a practical and valuable computer vision task. DMSR requires upscaling a low-resolution (LR) depth map into a high-resolution (HR) space. Joint image filtering for DMSR has been applied using spatially-invariant and spatially-variant convolutional… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  26. Efficient CNN-based Super Resolution Algorithms for mmWave Mobile Radar Imaging

    Authors: Christos Vasileiou, Josiah W. Smith, Shiva Thiagarajan, Matthew Nigh, Yiorgos Makris, Murat Torlak

    Abstract: In this paper, we introduce an innovative super resolution approach to emerging modes of near-field synthetic aperture radar (SAR) imaging. Recent research extends convolutional neural network (CNN) architectures from the optical to the electromagnetic domain to achieve super resolution on images generated from radar signaling. Specifically, near-field synthetic aperture radar (SAR) imaging, a met… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE ICIP

  27. A Vision Transformer Approach for Efficient Near-Field Irregular SAR Super-Resolution

    Authors: Josiah Smith, Yusef Alimam, Geetika Vedula, Murat Torlak

    Abstract: In this paper, we develop a novel super-resolution algorithm for near-field synthetic-aperture radar (SAR) under irregular scanning geometries. As fifth-generation (5G) millimeter-wave (mmWave) devices are becoming increasingly affordable and available, high-resolution SAR imaging is feasible for end-user applications and non-laboratory environments. Emerging applications such freehand imaging, wh… ▽ More

    Submitted 27 June, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted to Proc. IEEE WMCS

  28. Efficient 3-D Near-Field MIMO-SAR Imaging for Irregular Scanning Geometries

    Authors: Josiah Smith, Murat Torlak

    Abstract: In this article, we introduce a novel algorithm for efficient near-field synthetic aperture radar (SAR) imaging for irregular scanning geometries. With the emergence of fifth-generation (5G) millimeter-wave (mmWave) devices, near-field SAR imaging is no longer confined to laboratory environments. Recent advances in positioning technology have attracted significant interest for a diverse set of new… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE Access

    Journal ref: IEEE Access, vol. 10, pp. 10283-10294, 2022

  29. arXiv:2305.02039  [pdf, other

    cs.CV cs.AI eess.SP

    Improved Static Hand Gesture Classification on Deep Convolutional Neural Networks using Novel Sterile Training Technique

    Authors: Josiah Smith, Shiva Thiagarajan, Richard Willis, Yiorgos Makris, Murat Torlak

    Abstract: In this paper, we investigate novel data collection and training techniques towards improving classification accuracy of non-moving (static) hand gestures using a convolutional neural network (CNN) and frequency-modulated-continuous-wave (FMCW) millimeter-wave (mmWave) radars. Recently, non-contact hand pose and static gesture recognition have received considerable attention in many applications r… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE Access

    Journal ref: IEEE Access, vol. 9, pp. 10893-10902, 2021

  30. Near-Field MIMO-ISAR Millimeter-Wave Imaging

    Authors: Josiah W. Smith, Muhammet Emin Yanik, Murat Torlak

    Abstract: Multiple-input-multiple-output (MIMO) millimeter-wave (mmWave) sensors for synthetic aperture radar (SAR) and inverse SAR (ISAR) address the fundamental challenges of cost-effectiveness and scalability inherent to near-field imaging. In this paper, near-field MIMO-ISAR mmWave imaging systems are discussed and developed. The rotational ISAR (R-ISAR) regime investigated in this paper requires rotati… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE Radar Conference 2020

  31. arXiv:2305.02017  [pdf, other

    cs.CV cs.AI eess.SP

    Deep Learning-Based Multiband Signal Fusion for 3-D SAR Super-Resolution

    Authors: Josiah Smith, Murat Torlak

    Abstract: Three-dimensional (3-D) synthetic aperture radar (SAR) is widely used in many security and industrial applications requiring high-resolution imaging of concealed or occluded objects. The ability to resolve intricate 3-D targets is essential to the performance of such applications and depends directly on system bandwidth. However, because high-bandwidth systems face several prohibitive hurdles, an… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE Transactions on Aerospace and Electronic Systems

  32. An FCNN-Based Super-Resolution mmWave Radar Framework for Contactless Musical Instrument Interface

    Authors: Josiah W. Smith, Orges Furxhi, Murat Torlak

    Abstract: In this article, we propose a framework for contactless human-computer interaction (HCI) using novel tracking techniques based on deep learning-based super-resolution and tracking algorithms. Our system offers unprecedented high-resolution tracking of hand position and motion characteristics by leveraging spatial and temporal features embedded in the reflected radar waveform. Rather than classifyi… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE Transactions on Multimedia

    Journal ref: IEEE Transactions on Multimedia, vol. 24, pp. 2315-2328, 2022

  33. Real-Time Prediction of Gas Flow Dynamics in Diesel Engines using a Deep Neural Operator Framework

    Authors: Varun Kumar, Somdatta Goswami, Daniel J. Smith, George Em Karniadakis

    Abstract: We develop a data-driven deep neural operator framework to approximate multiple output states for a diesel engine and generate real-time predictions with reasonable accuracy. As emission norms become more stringent, the need for fast and accurate models that enable analysis of system behavior have become an essential requirement for system development. The fast transient processes involved in the… ▽ More

    Submitted 6 July, 2023; v1 submitted 2 April, 2023; originally announced April 2023.

    Comments: Updated manuscript title to better reflect this work and field of study

    Journal ref: Applied Intelligence, 2023

  34. arXiv:2303.03793  [pdf

    physics.optics eess.IV physics.app-ph physics.bio-ph

    Roadmap on Deep Learning for Microscopy

    Authors: Giovanni Volpe, Carolina Wählby, Lei Tian, Michael Hecht, Artur Yakimovich, Kristina Monakhova, Laura Waller, Ivo F. Sbalzarini, Christopher A. Metzler, Mingyang Xie, Kevin Zhang, Isaac C. D. Lenton, Halina Rubinsztein-Dunlop, Daniel Brunner, Bijie Bai, Aydogan Ozcan, Daniel Midtvedt, Hao Wang, Nataša Sladoje, Joakim Lindblad, Jason T. Smith, Marien Ochoa, Margarida Barroso, Xavier Intes, Tong Qiu , et al. (50 additional authors not shown)

    Abstract: Through digital imaging, microscopy has evolved from primarily being a means for visual observation of life at the micro- and nano-scale, to a quantitative tool with ever-increasing resolution and throughput. Artificial intelligence, deep neural networks, and machine learning are all niche terms describing computational methods that have gained a pivotal role in microscopy-based research over the… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

  35. arXiv:2301.01361  [pdf, other

    eess.AS cs.SD

    Modeling the Rhythm from Lyrics for Melody Generation of Pop Song

    Authors: Daiyu Zhang, Ju-Chiang Wang, Katerina Kosta, Jordan B. L. Smith, Shicen Zhou

    Abstract: Creating a pop song melody according to pre-written lyrics is a typical practice for composers. A computational model of how lyrics are set as melodies is important for automatic composition systems, but an end-to-end lyric-to-melody model would require enormous amounts of paired training data. To mitigate the data constraints, we adopt a two-stage approach, dividing the task into lyric-to-rhythm… ▽ More

    Submitted 3 January, 2023; originally announced January 2023.

    Comments: Published in ISMIR 2022

  36. arXiv:2211.15787  [pdf, other

    cs.SD eess.AS

    MuSFA: Improving Music Structural Function Analysis with Partially Labeled Data

    Authors: Ju-Chiang Wang, Jordan B. L. Smith, Yun-Ning Hung

    Abstract: Music structure analysis (MSA) systems aim to segment a song recording into non-overlapping sections with useful labels. Previous MSA systems typically predict abstract labels in a post-processing step and require the full context of the song. By contrast, we recently proposed a supervised framework, called "Music Structural Function Analysis" (MuSFA), that models and predicts meaningful labels li… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: ISMIR2022, LBD paper

  37. arXiv:2211.15752  [pdf, other

    cs.RO eess.SY

    Hierarchical Control Strategy for Moving A Robot Manipulator Between Small Containers

    Authors: Paolo Torrado, Boling Yang, Joshua Smith

    Abstract: In this paper, we study the implementation of a model predictive controller (MPC) for the task of object manipulation in a highly uncertain environment (e.g., picking objects from a semi-flexible array of densely packed bins). As a real-time perception-driven feedback controller, MPC is robust to the uncertainties in this environment. However, our experiment shows MPC cannot control a robot to com… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  38. arXiv:2210.16680  [pdf

    eess.SY

    An Analytical Model for Stepwise Adiabatic Driver Energy Consumption

    Authors: Eric J. Carlson, Joshua R. Smith

    Abstract: This paper presents a complete closed-form analytical model for determining the per-cycle energy consumption of stepwise adiabatic drivers used for driving a capacitive load such as a power FET gate. The model takes into account the number of steps used, the stepwise driver tank capacitance, the load capacitance, and the stepwise driver switch resistance and on-time. Model accuracy is compared to… ▽ More

    Submitted 29 October, 2022; originally announced October 2022.

    Comments: 17 pages, 7 Figures

  39. arXiv:2210.09605  [pdf, other

    cs.IT eess.SP

    Optimal Phase Design for RIS Channel Estimation

    Authors: Chelsea L. Miller, Peter J. Smith, Pawel A. Dmochowski

    Abstract: We develop an optimal version of a prior two-stage channel estimation protocol for RIS-assisted channels. The new design uses a modified DFT matrix (MDFT) for the training phases at the RIS and is shown to minimize the total channel estimation error variance. In conjunction with interpolation (estimating fewer RIS channels), the MDFT approach accelerates channel estimation even when the channel fr… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

  40. arXiv:2209.09204  [pdf

    eess.IV cs.AI

    Robustness of an Artificial Intelligence Solution for Diagnosis of Normal Chest X-Rays

    Authors: Tom Dyer, Jordan Smith, Gaetan Dissez, Nicole Tay, Qaiser Malik, Tom Naunton Morgan, Paul Williams, Liliana Garcia-Mondragon, George Pearse, Simon Rasalingham

    Abstract: Purpose: Artificial intelligence (AI) solutions for medical diagnosis require thorough evaluation to demonstrate that performance is maintained for all patient sub-groups and to ensure that proposed improvements in care will be delivered equitably. This study evaluates the robustness of an AI solution for the diagnosis of normal chest X-rays (CXRs) by comparing performance across multiple patient… ▽ More

    Submitted 31 August, 2022; originally announced September 2022.

  41. arXiv:2208.14742  [pdf

    eess.IV cs.AI

    Enhancing Early Lung Cancer Detection on Chest Radiographs with AI-assistance: A Multi-Reader Study

    Authors: Gaetan Dissez, Nicole Tay, Tom Dyer, Matthew Tam, Richard Dittrich, David Doyne, James Hoare, Jackson J. Pat, Stephanie Patterson, Amanda Stockham, Qaiser Malik, Tom Naunton Morgan, Paul Williams, Liliana Garcia-Mondragon, Jordan Smith, George Pearse, Simon Rasalingham

    Abstract: Objectives: The present study evaluated the impact of a commercially available explainable AI algorithm in augmenting the ability of clinicians to identify lung cancer on chest X-rays (CXR). Design: This retrospective study evaluated the performance of 11 clinicians for detecting lung cancer from chest radiographs, with and without assistance from a commercially available AI algorithm (red dot,… ▽ More

    Submitted 31 August, 2022; originally announced August 2022.

  42. arXiv:2205.14700  [pdf, other

    eess.AS cs.SD

    To catch a chorus, verse, intro, or anything else: Analyzing a song with structural functions

    Authors: Ju-Chiang Wang, Yun-Ning Hung, Jordan B. L. Smith

    Abstract: Conventional music structure analysis algorithms aim to divide a song into segments and to group them with abstract labels (e.g., 'A', 'B', and 'C'). However, explicitly identifying the function of each segment (e.g., 'verse' or 'chorus') is rarely attempted, but has many applications. We introduce a multi-task deep learning framework to model these structural semantic labels directly from audio b… ▽ More

    Submitted 29 May, 2022; originally announced May 2022.

    Comments: This manuscript is accepted by ICASSP 2022

  43. arXiv:2202.01980  [pdf, other

    cs.NI cs.LG eess.SP

    Multi-Output Gaussian Process-Based Data Augmentation for Multi-Building and Multi-Floor Indoor Localization

    Authors: Zhe Tang, Sihao Li, Kyeong Soo Kim, Jeremy Smith

    Abstract: Location fingerprinting based on RSSI becomes a mainstream indoor localization technique due to its advantage of not requiring the installation of new infrastructure and the modification of existing devices, especially given the prevalence of Wi-Fi-enabled devices and the ubiquitous Wi-Fi access in modern buildings. The use of AI/ML technologies like DNNs makes location fingerprinting more accurat… ▽ More

    Submitted 31 July, 2023; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: 6 pages, 5 figures

  44. arXiv:2201.00820  [pdf, other

    eess.IV cs.CV cs.LG physics.data-an physics.ins-det physics.optics

    Low dosage 3D volume fluorescence microscopy imaging using compressive sensing

    Authors: Varun Mannam, Jacob Brandt, Cody J. Smith, Scott Howard

    Abstract: Fluorescence microscopy has been a significant tool to observe long-term imaging of embryos (in vivo) growth over time. However, cumulative exposure is phototoxic to such sensitive live samples. While techniques like light-sheet fluorescence microscopy (LSFM) allow for reduced exposure, it is not well suited for deep imaging models. Other computational techniques are computationally expensive and… ▽ More

    Submitted 3 January, 2022; originally announced January 2022.

  45. arXiv:2111.08629  [pdf, other

    cs.NI cs.ET eess.SP

    Communication by means of Modulated Johnson Noise

    Authors: Zerina Kapetanovic, Miguel Morales, Joshua R. Smith

    Abstract: We present the design of a new passive wireless communication system that does not rely on ambient or generated RF sources. Instead, we exploit the Johnson (thermal) noise generated by a resistor to transmit information bits wirelessly. By switching the load connected to an antenna between a resistor and open circuit, we can achieve data rates of up to 26bps and distances of up to 7.3 meters. This… ▽ More

    Submitted 6 August, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

  46. arXiv:2110.09000  [pdf, other

    eess.AS cs.SD

    Supervised Metric Learning for Music Structure Features

    Authors: Ju-Chiang Wang, Jordan B. L. Smith, Wei-Tsung Lu, Xuchen Song

    Abstract: Music structure analysis (MSA) methods traditionally search for musically meaningful patterns in audio: homogeneity, repetition, novelty, and segment-length regularity. Hand-crafted audio features such as MFCCs or chromagrams are often used to elicit these patterns. However, with more annotations of section labels (e.g., verse, chorus, and bridge) becoming available, one can use supervised feature… ▽ More

    Submitted 29 April, 2022; v1 submitted 17 October, 2021; originally announced October 2021.

    Comments: This paper was accepted and presented at ISMIR 2021

  47. arXiv:2103.14253  [pdf, other

    eess.AS cs.AI cs.SD

    Supervised Chorus Detection for Popular Music Using Convolutional Neural Network and Multi-task Learning

    Authors: Ju-Chiang Wang, Jordan B. L. Smith, Jitong Chen, Xuchen Song, Yuxuan Wang

    Abstract: This paper presents a novel supervised approach to detecting the chorus segments in popular music. Traditional approaches to this task are mostly unsupervised, with pipelines designed to target some quality that is assumed to define "chorusness," which usually means seeking the loudest or most frequently repeated sections. We propose to use a convolutional neural network with a multi-task learning… ▽ More

    Submitted 21 April, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: This version is a preprint of an accepted paper by ICASSP2021. Please cite the publication in the Proceedings of IEEE International Conference on Acoustics, Speech, & Signal Processing

  48. arXiv:2103.14208  [pdf, other

    cs.SD cs.AI eess.AS

    Modeling the Compatibility of Stem Tracks to Generate Music Mashups

    Authors: Jiawen Huang, Ju-Chiang Wang, Jordan B. L. Smith, Xuchen Song, Yuxuan Wang

    Abstract: A music mashup combines audio elements from two or more songs to create a new work. To reduce the time and effort required to make them, researchers have developed algorithms that predict the compatibility of audio elements. Prior work has focused on mixing unaltered excerpts, but advances in source separation enable the creation of mashups from isolated stems (e.g., vocals, drums, bass, etc.). In… ▽ More

    Submitted 25 March, 2021; originally announced March 2021.

    Comments: This is a preprint of the paper accepted by AAAI-21. Please cite the version included in the Proceedings of the 35th AAAI Conference on Artificial Intelligence

  49. arXiv:2103.05448  [pdf, other

    eess.IV cs.CV eess.SY

    Convolutional Neural Network Denoising in Fluorescence Lifetime Imaging Microscopy (FLIM)

    Authors: Varun Mannam, Yide Zhang, Xiaotong Yuan, Takashi Hato, Pierre C. Dagher, Evan L. Nichols, Cody J. Smith, Kenneth W. Dunn, Scott Howard

    Abstract: Fluorescence lifetime imaging microscopy (FLIM) systems are limited by their slow processing speed, low signal-to-noise ratio (SNR), and expensive and challenging hardware setups. In this work, we demonstrate applying a denoising convolutional network to improve FLIM SNR. The network will be integrated with an instant FLIM system with fast data acquisition based on analog signal processing, high S… ▽ More

    Submitted 6 March, 2021; originally announced March 2021.

    Comments: SPIE Proceedings Volume 11648, Multiphoton Microscopy in the Biomedical Sciences XXI; 116481C (2021)

    Report number: 116481C

  50. arXiv:2012.00244  [pdf, ps, other

    eess.SP

    The Optimal Location and Size of an Intermediate Coil in a Magnetic Resonant Coupling Wireless Power Transfer System

    Authors: Kedi Yan, Gregory E. Moore, Joshua R. Smith

    Abstract: To increase the transmission distance of Wireless Power Transfer (WPT) systems, we provide guidelines on choosing the optimal location of an Intermediate Coil with respect to size within a standard five-coil axially aligned experimental setup. From our results, for maximum magnitude of S21 at the resonant frequency we found the optimal location to exist where the coupling coefficient between the T… ▽ More

    Submitted 30 November, 2020; originally announced December 2020.