Showing 1–50 of 412 results for author: Smith, J

Searching in archive cs.
  1. arXiv:2505.05370  [pdf, ps, other]

    cs.DC cs.CR

    Walrus: An Efficient Decentralized Storage Network

    Authors: George Danezis, Giacomo Giuliari, Eleftherios Kokoris Kogias, Markus Legner, Jean-Pierre Smith, Alberto Sonnino, Karl Wüst

    Abstract: Decentralized storage systems face a fundamental trade-off between replication overhead, recovery efficiency, and security guarantees. Current approaches either rely on full replication, incurring substantial storage costs, or employ trivial erasure coding schemes that struggle with efficient recovery especially under high storage-node churn. We present Walrus, a novel decentralized blob storage s…

    Submitted 8 May, 2025; originally announced May 2025.

  2. arXiv:2505.04082  [pdf, ps, other]

    eess.AS cs.SD eess.SP

    Aliasing Reduction in Neural Amp Modeling by Smoothing Activations

    Authors: Ryota Sato, Julius O. Smith III

    Abstract: The increasing demand for high-quality digital emulations of analog audio hardware such as vintage guitar amplifiers has led to numerous works in neural-network-based black-box modeling, with deep learning architectures like WaveNet showing promising results. However, a key limitation in all of these models is the aliasing artifacts that arise from the use of nonlinear activation functions in neur…

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: Accepted to DAFx 2025

  3. arXiv:2504.17321  [pdf, other]

    physics.geo-ph cs.LG

    Dargana: fine-tuning EarthPT for dynamic tree canopy mapping from space

    Authors: Michael J. Smith, Luke Fleming, James E. Geach, Ryan J. Roberts, Freddie Kalaitzis, James Banister

    Abstract: We present Dargana, a fine-tuned variant of the EarthPT time-series foundation model that achieves specialisation using <3% of its pre-training data volume and 5% of its pre-training compute. Dargana is fine-tuned to generate regularly updated classification of tree canopy cover at 10m resolution, distinguishing conifer and broadleaved tree types. Using Cornwall, UK, as a test case, the model achi…

    Submitted 24 April, 2025; originally announced April 2025.

    Comments: 9 pages, 6 figures, spotlight at 'Tackling Climate Change with Machine Learning', ICLR 2025

  4. arXiv:2504.17029  [pdf, other]

    astro-ph.IM cs.AI

    Fried Parameter Estimation from Single Wavefront Sensor Image with Artificial Neural Networks

    Authors: Jeffrey Smith, Taisei Fujii, Jesse Cranney, Charles Gretton

    Abstract: Atmospheric turbulence degrades the quality of astronomical observations in ground-based telescopes, leading to distorted and blurry images. Adaptive Optics (AO) systems are designed to counteract these effects, using atmospheric measurements captured by a wavefront sensor to make real-time corrections to the incoming wavefront. The Fried parameter, r0, characterises the strength of atmospheric tu…

    Submitted 23 April, 2025; originally announced April 2025.

  5. arXiv:2504.10700  [pdf, other]

    cs.DC cs.AI

    Optimizing Data Distribution and Kernel Performance for Efficient Training of Chemistry Foundation Models: A Case Study with MACE

    Authors: Jesun Firoz, Franco Pellegrini, Mario Geiger, Darren Hsu, Jenna A. Bilbrey, Han-Yi Chou, Maximilian Stadler, Markus Hoehnerbach, Tingyu Wang, Dejun Lin, Emine Kucukbenli, Henry W. Sprueill, Ilyes Batatia, Sotiris S. Xantheas, MalSoon Lee, Chris Mundy, Gabor Csanyi, Justin S. Smith, Ponnuswamy Sadayappan, Sutanay Choudhury

    Abstract: Chemistry Foundation Models (CFMs) that leverage Graph Neural Networks (GNNs) operating on 3D molecular graph structures are becoming indispensable tools for computational chemists and materials scientists. These models facilitate the understanding of matter and the discovery of new molecules and materials. In contrast to GNNs operating on large homogeneous graphs, GNNs used by CFMs process a la…

    Submitted 14 April, 2025; originally announced April 2025.

    Comments: Accepted at The 34th ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC 2025)

  6. arXiv:2504.08583  [pdf, other]

    astro-ph.IM cs.LG

    AstroLLaVA: towards the unification of astronomical data and natural language

    Authors: Sharaf Zaman, Michael J. Smith, Pranav Khetarpal, Rishabh Chakrabarty, Michele Ginolfi, Marc Huertas-Company, Maja Jabłońska, Sandor Kruk, Matthieu Le Lain, Sergio José Rodríguez Méndez, Dimitrios Tanoglidis

    Abstract: We present AstroLLaVA, a vision language model for astronomy that enables interaction with astronomical imagery through natural dialogue. By fine-tuning the LLaVA model on a diverse dataset of $\sim$30k images with captions and question-answer pairs sourced from NASA's 'Astronomy Picture of the Day', the European Southern Observatory, and the NASA/ESA Hubble Space Telescope, we create a model capa…

    Submitted 11 April, 2025; originally announced April 2025.

    Comments: 8 pages, 3 figures, accepted to SCI-FM@ICLR 2025. Code at https://w3id.org/UniverseTBD/AstroLLaVA

  7. arXiv:2504.08022  [pdf, other]

    cs.GR

    ChildlikeSHAPES: Semantic Hierarchical Region Parsing for Animating Figure Drawings

    Authors: Astitva Srivastava, Harrison Jesse Smith, Thu Nguyen-Phuoc, Yuting Ye

    Abstract: Childlike human figure drawings represent one of humanity's most accessible forms of character expression, yet automatically analyzing their contents remains a significant challenge. While semantic segmentation of realistic humans has recently advanced considerably, existing models often fail when confronted with the abstract, representational nature of childlike drawings. This semantic understand…

    Submitted 10 April, 2025; originally announced April 2025.

  8. arXiv:2504.05670   

    cs.LG

    Dual Boost-Driven Graph-Level Clustering Network

    Authors: John Smith, Wenxuan Tu, Junlong Wu, Wenxin Zhang, Jingxin Liu, Haotian Wang, Jieren Cheng, Huajie Lei, Guangzhen Yao, Lingren Wang, Mengfei Li, Renda Han, Yu Li

    Abstract: Graph-level clustering remains a pivotal yet formidable challenge in graph learning. Recently, the integration of deep learning with representation learning has demonstrated notable advancements, yielding performance enhancements to a certain degree. However, existing methods suffer from at least one of the following issues: 1. the original graph structure has noise, and 2. during feature propagat…

    Submitted 13 April, 2025; v1 submitted 8 April, 2025; originally announced April 2025.

    Comments: Since I did not obtain the consent of all authors and provided this version to the arxiv community without authorization, I request to withdraw the manuscript

  9. arXiv:2504.05496  [pdf, ps, other]

    cs.CL

    A Survey on Hypothesis Generation for Scientific Discovery in the Era of Large Language Models

    Authors: Atilla Kaan Alkan, Shashwat Sourav, Maja Jablonska, Simone Astarita, Rishabh Chakrabarty, Nikhil Garuda, Pranav Khetarpal, Maciej Pióro, Dimitrios Tanoglidis, Kartheik G. Iyer, Mugdha S. Polimera, Michael J. Smith, Tirthankar Ghosal, Marc Huertas-Company, Sandor Kruk, Kevin Schawinski, Ioana Ciucă

    Abstract: Hypothesis generation is a fundamental step in scientific discovery, yet it is increasingly challenged by information overload and disciplinary fragmentation. Recent advances in Large Language Models (LLMs) have sparked growing interest in their potential to enhance and automate this process. This paper presents a comprehensive survey of hypothesis generation with LLMs by (i) reviewing existing me…

    Submitted 7 April, 2025; originally announced April 2025.

    Comments: 9 pages (+2 pages of references), 2 figures

    MSC Class: 68T50

  10. arXiv:2503.15321  [pdf, other]

    astro-ph.GA cs.CV

    Euclid Quick Data Release (Q1). Active galactic nuclei identification using diffusion-based inpainting of Euclid VIS images

    Authors: Euclid Collaboration, G. Stevens, S. Fotopoulou, M. N. Bremer, T. Matamoro Zatarain, K. Jahnke, B. Margalef-Bentabol, M. Huertas-Company, M. J. Smith, M. Walmsley, M. Salvato, M. Mezcua, A. Paulino-Afonso, M. Siudek, M. Talia, F. Ricci, W. Roster, N. Aghanim, B. Altieri, S. Andreon, H. Aussel, C. Baccigalupi, M. Baldi, S. Bardelli, P. Battaglia, et al. (249 additional authors not shown)

    Abstract: Light emission from galaxies exhibits diverse brightness profiles, influenced by factors such as galaxy type, structural features and interactions with other galaxies. Elliptical galaxies feature more uniform light distributions, while spiral and irregular galaxies have complex, varied light profiles due to their structural heterogeneity and star-forming activity. In addition, galaxies with an acti…

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: Paper submitted as part of the A&A Special Issue 'Euclid Quick Data Release (Q1)', 32 pages, 26 figures

  11. arXiv:2503.13220   

    quant-ph cs.NI

    Simulating Raman Scattering Impairments with Depolarization Noise in Quantum-Classical Links

    Authors: Jake Smith, Roberto Proietti

    Abstract: We model spontaneous Raman scattering noise in polarization-encoded quantum communication channels co-propagating with classical signals using the depolarization channel. Utilizing NetSquid simulations, we validate the model against demonstrations of qubit transmission, entanglement distribution, and teleportation.

    Submitted 18 March, 2025; v1 submitted 17 March, 2025; originally announced March 2025.

    Comments: This version has been removed by arXiv administrators as the submitter did not have the right to agree to the license at the time of submission

  12. arXiv:2503.08978  [pdf, other]

    cs.RO cs.LG eess.SY

    TetraGrip: Sensor-Driven Multi-Suction Reactive Object Manipulation in Cluttered Scenes

    Authors: Paolo Torrado, Joshua Levin, Markus Grotz, Joshua Smith

    Abstract: Warehouse robotic systems equipped with vacuum grippers must reliably grasp a diverse range of objects from densely packed shelves. However, these environments present significant challenges, including occlusions, diverse object orientations, stacked and obstructed items, and surfaces that are difficult to suction. We introduce TetraGrip, a novel vacuum-based grasping strategy featuring four suction…

    Submitted 11 March, 2025; originally announced March 2025.

  13. arXiv:2502.17866  [pdf, other]

    cs.GR

    Animating Childlike Drawings with 2.5D Character Rigs

    Authors: Harrison Jesse Smith, Nicky He, Yuting Ye

    Abstract: Drawing is a fun and intuitive way to create a character, accessible even to small children. However, animating 2D figure drawings is a much more challenging task, requiring specialized tools and skills. Bringing 2D figures to 3D so they can be animated and consumed in immersive media poses an even greater challenge. Moreover, it is desirable to preserve the unique style and identity of the figure…

    Submitted 25 February, 2025; originally announced February 2025.

  14. arXiv:2502.09396  [pdf, other]

    cs.LG cs.CR

    A hierarchical approach for assessing the vulnerability of tree-based classification models to membership inference attack

    Authors: Richard J. Preen, Jim Smith

    Abstract: Machine learning models can inadvertently expose confidential properties of their training data, making them vulnerable to membership inference attacks (MIA). While numerous evaluation methods exist, many require computationally expensive processes, such as training multiple shadow models. This article presents two new complementary approaches for efficiently identifying vulnerable tree-based mode…

    Submitted 13 February, 2025; originally announced February 2025.

  15. arXiv:2501.14713  [pdf, other]

    cs.CL cs.LG

    FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing

    Authors: James Seale Smith, Chi-Heng Lin, Shikhar Tuli, Haris Jeelani, Shangqian Gao, Yilin Shen, Hongxia Jin, Yen-Chang Hsu

    Abstract: The rapid proliferation of large language models (LLMs) in natural language processing (NLP) has created a critical need for techniques that enable efficient deployment on memory-constrained devices without compromising performance. We present a method to prune LLMs that selectively prunes model blocks based on an importance score and replaces them with a low-parameter replacement strategy. Specif…

    Submitted 31 January, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: Accepted to NAACL 2025 - Main Conference

  16. arXiv:2501.14084  [pdf, other]

    cs.SE cs.AI cs.CY cs.HC

    The Role of Generative AI in Software Student CollaborAItion

    Authors: Natalie Kiesler, Jacqueline Smith, Juho Leinonen, Armando Fox, Stephen MacNeil, Petri Ihantola

    Abstract: Collaboration is a crucial part of computing education. The increase in AI capabilities over the last couple of years is bound to profoundly affect all aspects of systems and software engineering, including collaboration. In this position paper, we consider a scenario where AI agents would be able to take on any role in collaborative processes in computing education. We outline these roles, the ac…

    Submitted 23 January, 2025; originally announced January 2025.

    Comments: 7 pages, 1 figure

  17. arXiv:2501.10383  [pdf, other]

    cs.CY cs.HC

    The Generative AI Ethics Playbook

    Authors: Jessie J. Smith, Wesley Hanwen Deng, William H. Smith, Maarten Sap, Nicole DeCario, Jesse Dodge

    Abstract: The Generative AI Ethics Playbook provides guidance for identifying and mitigating risks of machine learning systems across various domains, including natural language processing, computer vision, and generative AI. This playbook aims to assist practitioners in diagnosing potential harms that may arise during the design, development, and deployment of datasets and models. It offers concrete strate…

    Submitted 17 December, 2024; originally announced January 2025.

  18. arXiv:2501.08469  [pdf, other]

    cs.RO eess.SY

    Electrostatic Clutches Enable Simultaneous Mechanical Multiplexing

    Authors: Timothy E. Amish, Jeffrey T. Auletta, Chad C. Kessens, Joshua R. Smith, Jeffrey I. Lipton

    Abstract: Actuating robotic systems with multiple degrees of freedom (DoF) traditionally requires numerous motors, leading to increased size, weight, cost, and power consumption. Mechanical multiplexing offers a solution by enabling a single actuator to control multiple DoF. However, existing multiplexers have either been limited to electrically controlled time-based multiplexing that controls one DoF at a t…

    Submitted 21 March, 2025; v1 submitted 14 January, 2025; originally announced January 2025.

  19. arXiv:2501.02314  [pdf, ps, other]

    cs.CV

    RadarNeXt: Real-Time and Reliable 3D Object Detector Based On 4D mmWave Imaging Radar

    Authors: Liye Jia, Runwei Guan, Haocheng Zhao, Qiuchi Zhao, Ka Lok Man, Jeremy Smith, Limin Yu, Yutao Yue

    Abstract: 3D object detection is crucial for Autonomous Driving (AD) and Advanced Driver Assistance Systems (ADAS). However, most 3D detectors prioritize detection accuracy, often overlooking network inference speed in practical applications. In this paper, we propose RadarNeXt, a real-time and reliable 3D object detector based on the 4D mmWave radar point clouds. It leverages the re-parameterizable neural…

    Submitted 4 January, 2025; originally announced January 2025.

    Comments: 8 pages, 5 figures, 3 tables. Code: https://github.com/Pay246-git468/RadarNeXt

  20. arXiv:2412.11967  [pdf, other]

    cs.LG eess.SY

    A Digital twin for Diesel Engines: Operator-infused PINNs with Transfer Learning for Engine Health Monitoring

    Authors: Kamaljyoti Nath, Varun Kumar, Daniel J. Smith, George Em Karniadakis

    Abstract: Improving diesel engine efficiency and emission reduction have been critical research topics. Recent government regulations have shifted this focus to another important area related to engine health and performance monitoring. Although the advancements in the use of deep learning methods for system monitoring have shown promising results in this direction, designing efficient methods suitable for…

    Submitted 16 December, 2024; originally announced December 2024.

  21. arXiv:2412.04429  [pdf, other]

    cs.CV cs.LG

    Grounding Descriptions in Images informs Zero-Shot Visual Recognition

    Authors: Shaunak Halbe, Junjiao Tian, K J Joseph, James Seale Smith, Katherine Stevo, Vineeth N Balasubramanian, Zsolt Kira

    Abstract: Vision-language models (VLMs) like CLIP have been cherished for their ability to perform zero-shot visual recognition on open-vocabulary concepts. This is achieved by selecting the object category whose textual representation bears the highest similarity with the query image. While successful in some domains, this method struggles with identifying fine-grained entities as well as generalizing to u…

    Submitted 5 December, 2024; originally announced December 2024.

  22. arXiv:2411.18750  [pdf, other]

    cs.HC cs.RO

    OSU-Wing PIC Phase I Evaluation: Baseline Workload and Situation Awareness Results

    Authors: Julie A. Adams, Christopher A. Sanchez, Vivek Mallampati, Joshua Bhagat Smith, Emily Burgess, Andrew Dassonville

    Abstract: The common theory is that a human pilot's performance degrades when responsible for an increased number of uncrewed aircraft systems (UAS). This theory was developed in the early 2010s for ground robots and not highly autonomous UAS. It has been shown that increasing autonomy can mitigate some performance impacts associated with increasing the number of UAS. Overall, the Oregon State University-Win…

    Submitted 27 November, 2024; originally announced November 2024.

    Comments: 45 pages, 10 figures, 21 tables

  23. arXiv:2411.02627  [pdf, other]

    physics.geo-ph cs.CV

    Towards more efficient agricultural practices via transformer-based crop type classification

    Authors: E. Ulises Moya-Sánchez, Yazid S. Mikail, Daisy Nyang'anyi, Michael J. Smith, Isabella Smythe

    Abstract: Machine learning has great potential to increase crop production and resilience to climate change. Accurate maps of where crops are grown are a key input to a number of downstream policy and research applications. In this proposal, we present preliminary work showing that it is possible to accurately classify crops from time series derived from Sentinel 1 and 2 satellite imagery in Mexico using a…

    Submitted 4 November, 2024; originally announced November 2024.

  24. arXiv:2411.01878  [pdf, ps, other]

    cs.IT eess.SP

    Rician Channel Modelling for Super Wideband MIMO Communications

    Authors: Sachitha C. Bandara, Peter J. Smith, Erfan Khordad, Robin Evans, Rajitha Senanayake

    Abstract: Recent developments in Multiple-Input-Multiple-Output (MIMO) technology include packing a large number of antenna elements in a compact array to access the bandwidth benefits provided by higher mutual coupling (MC). The resulting super-wideband (SW) systems require a circuit-theoretic framework to handle the MC and channel models which span extremely large bands. Hence, in this paper, we make two…

    Submitted 4 November, 2024; originally announced November 2024.

    Comments: This paper has been submitted to the IEEE for possible publication

  25. arXiv:2411.01030  [pdf, other]

    cs.CL cs.AI cs.LG

    Birdie: Advancing State Space Models with Reward-Driven Objectives and Curricula

    Authors: Sam Blouir, Jimmy T. H. Smith, Antonios Anastasopoulos, Amarda Shehu

    Abstract: Efficient state space models (SSMs), such as linear recurrent neural networks and linear attention variants, offer computational advantages over Transformers but struggle with tasks requiring long-range in-context retrieval, such as text copying, associative recall, and question answering over long contexts. Previous efforts to address these challenges have focused on architectural modifications, ofte…

    Submitted 21 February, 2025; v1 submitted 1 November, 2024; originally announced November 2024.

    Comments: Accepted to EMNLP 2024 (Main Conference)

  26. arXiv:2411.01008  [pdf, other]

    cs.ET cs.LG cs.NE

    AI-Guided Codesign Framework for Novel Material and Device Design applied to MTJ-based True Random Number Generators

    Authors: Karan P. Patel, Andrew Maicke, Jared Arzate, Jaesuk Kwon, J. Darby Smith, James B. Aimone, Jean Anne C. Incorvia, Suma G. Cardwell, Catherine D. Schuman

    Abstract: Novel devices and novel computing paradigms are key for energy efficient, performant future computing systems. However, designing devices for new applications is often time consuming and tedious. Here, we investigate the design and optimization of spin orbit torque and spin transfer torque magnetic tunnel junction models as the probabilistic devices for true random number generation. We leverage r…

    Submitted 1 November, 2024; originally announced November 2024.

  27. arXiv:2410.17937  [pdf]

    cs.CE

    Toward path-invariant embeddings for local distance source characterization

    Authors: Lisa Linville, Chengping Chai, Nathan Marthindale, Jacob Smith, Scott Stewart, Asmeret Naugle

    Abstract: This work builds on recent advances in foundation models in the language and image domains to explore similar approaches for seismic source characterization. We rely on an architecture called Barlow Twins, borrowed from an understanding of the human visual cortical system and originally envisioned for the image domain, and adapt it for learning path invariance in seismic event time series. Our mode…

    Submitted 23 October, 2024; originally announced October 2024.

  28. arXiv:2410.02462  [pdf, other]

    cs.CR

    Scalable Differential Privacy Mechanisms for Real-Time Machine Learning Applications

    Authors: Jessica Smith, David Williams, Emily Brown

    Abstract: Large language models (LLMs) are increasingly integrated into real-time machine learning applications, where safeguarding user privacy is paramount. Traditional differential privacy mechanisms often struggle to balance privacy and accuracy, particularly in fast-changing environments with continuously flowing data. To address these issues, we introduce Scalable Differential Privacy (SDP), a framewo…

    Submitted 16 September, 2024; originally announced October 2024.

    Comments: First v of SDP

  29. arXiv:2409.19494  [pdf, other]

    cs.RO cs.CV

    OptiGrasp: Optimized Grasp Pose Detection Using RGB Images for Warehouse Picking Robots

    Authors: Soofiyan Atar, Yi Li, Markus Grotz, Michael Wolf, Dieter Fox, Joshua Smith

    Abstract: In warehouse environments, robots require robust picking capabilities to manage a wide variety of objects. Effective deployment demands minimal hardware, strong generalization to new products, and resilience in diverse settings. Current methods often rely on depth sensors for structural information, which suffer from high costs, complex setups, and technical limitations. Inspired by recent advance…

    Submitted 28 September, 2024; originally announced September 2024.

    Comments: 8 pages, 6 figures

  30. arXiv:2409.12805  [pdf, other]

    stat.ML cs.LG quant-ph

    Robust estimation of the intrinsic dimension of data sets with quantum cognition machine learning

    Authors: Luca Candelori, Alexander G. Abanov, Jeffrey Berger, Cameron J. Hogan, Vahagn Kirakosyan, Kharen Musaelian, Ryan Samson, James E. T. Smith, Dario Villani, Martin T. Wells, Mengjia Xu

    Abstract: We propose a new data representation method based on Quantum Cognition Machine Learning and apply it to manifold learning, specifically to the estimation of intrinsic dimension of data sets. The idea is to learn a representation of each data point as a quantum state, encoding both local properties of the point as well as its relation with the entire data. Inspired by ideas from quantum geometry, w…

    Submitted 19 September, 2024; originally announced September 2024.

  31. arXiv:2409.09563  [pdf, other]

    astro-ph.IM cs.LG physics.data-an

    Astrometric Binary Classification Via Artificial Neural Networks

    Authors: Joe Smith

    Abstract: With nearly two billion stars observed and their corresponding astrometric parameters evaluated in the recent Gaia mission, the number of astrometric binary candidates has risen significantly. Due to the surplus of astrometric data, the current computational methods employed to inspect these astrometric binary candidates are both computationally expensive and cannot be executed in a reasonable ti…

    Submitted 14 September, 2024; originally announced September 2024.

    Comments: Accepted for publication in Astrophysical Journal (ApJ)

  32. arXiv:2409.09530  [pdf, other]

    cs.CV

    An Augmentation-based Model Re-adaptation Framework for Robust Image Segmentation

    Authors: Zheming Zuo, Joseph Smith, Jonathan Stonehouse, Boguslaw Obara

    Abstract: Image segmentation is a crucial task in computer vision, with wide-ranging applications in industry. The Segment Anything Model (SAM) has recently attracted intensive attention; however, its application in industrial inspection, particularly for segmenting commercial anti-counterfeit codes, remains challenging. Unlike open-source datasets, industrial settings often face issues such as small sample…

    Submitted 14 September, 2024; originally announced September 2024.

    Comments: Accepted in the European Conference on Computer Vision (ECCV) 2024 workshop

  33. arXiv:2409.07389  [pdf, other]

    stat.AP cs.CR stat.ME

    Dynamic Bayesian Networks, Elicitation and Data Embedding for Secure Environments

    Authors: Kieran Drury, Jim Q. Smith

    Abstract: Serious crime modelling typically needs to be undertaken securely behind a firewall where police knowledge and capabilities can remain undisclosed. Data informing an ongoing incident is often sparse, with a large proportion of relevant data only coming to light after the incident culminates or after police intervene - by which point it is too late to make use of the data to aid real-time decision…

    Submitted 11 September, 2024; originally announced September 2024.

    Comments: 34 pages, 1 figure. Submitted to Entropy journal

  34. arXiv:2409.03055  [pdf, other]

    cs.SD eess.AS

    SymPAC: Scalable Symbolic Music Generation With Prompts And Constraints

    Authors: Haonan Chen, Jordan B. L. Smith, Janne Spijkervet, Ju-Chiang Wang, Pei Zou, Bochen Li, Qiuqiang Kong, Xingjian Du

    Abstract: Progress in the task of symbolic music generation may be lagging behind other tasks like audio and text generation, in part because of the scarcity of symbolic training data. In this paper, we leverage the greater scale of audio music data by applying pre-trained MIR models (for transcription, beat tracking, structure analysis, etc.) to extract symbolic events and encode them into token sequences.…

    Submitted 9 September, 2024; v1 submitted 4 September, 2024; originally announced September 2024.

    Comments: ISMIR 2024

  35. arXiv:2409.00078  [pdf, other]

    eess.SP cs.LG cs.NI

    SGP-RI: A Real-Time-Trainable and Decentralized IoT Indoor Localization Model Based on Sparse Gaussian Process with Reduced-Dimensional Inputs

    Authors: Zhe Tang, Sihao Li, Zichen Huang, Guandong Yang, Kyeong Soo Kim, Jeremy S. Smith

    Abstract: As Internet of Things (IoT) devices are deployed in the field, there is an enormous amount of untapped potential in local computing on those IoT devices. Harnessing this potential for indoor localization, therefore, becomes an exciting research area. Conventionally, the training and deployment of indoor localization models are based on centralized servers with substantial computational resources. Thi…

    Submitted 24 August, 2024; originally announced September 2024.

    Comments: 10 pages, 4 figures, under review for journal publication

  36. arXiv:2408.17207  [pdf, other]

    cs.CV cs.RO

    NanoMVG: USV-Centric Low-Power Multi-Task Visual Grounding based on Prompt-Guided Camera and 4D mmWave Radar

    Authors: Runwei Guan, Jianan Liu, Liye Jia, Haocheng Zhao, Shanliang Yao, Xiaohui Zhu, Ka Lok Man, Eng Gee Lim, Jeremy Smith, Yutao Yue

    Abstract: Recently, visual grounding and multi-sensor settings have been incorporated into perception systems for terrestrial autonomous driving systems and Unmanned Surface Vehicles (USVs), yet the high complexity of modern learning-based visual grounding models using multiple sensors prevents such models from being deployed on USVs in real life. To this end, we design a low-power multi-task model named NanoMVG f…

    Submitted 11 February, 2025; v1 submitted 30 August, 2024; originally announced August 2024.

    Comments: 8 pages, 6 figures

  37. arXiv:2408.16623  [pdf, other]

    cs.CV cs.LG eess.IV

    Turbulence Strength $C_n^2$ Estimation from Video using Physics-based Deep Learning

    Authors: Ripon Kumar Saha, Esen Salcin, Jihoo Kim, Joseph Smith, Suren Jayasuriya

    Abstract: Images captured from a long distance suffer from dynamic image distortion due to turbulent flow of air cells with random temperatures, and thus refractive indices. This phenomenon, known as image dancing, is commonly characterized by its refractive-index structure constant $C_n^2$ as a measure of the turbulence strength. For many applications such as atmospheric forecast model, long-range/astronom…

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: Code Available: https://github.com/Riponcs/Cn2Estimation

    Journal ref: Optics Express 30, 40854-40870 (2022)

  38. arXiv:2408.11140  [pdf, other]

    cs.HC

    Predictive Anchoring: A Novel Interaction to Support Contextualized Suggestions for Grid Displays

    Authors: Cynthia Zastudil, Christine Holyfield, June A. Smith, Hannah Vy Nguyen, Stephen MacNeil

    Abstract: Grid displays are the most common form of augmentative and alternative communication device recommended by speech-language pathologists for children. Grid displays present a large variety of vocabulary which can be beneficial for a user's language development. However, the extensive navigation and cognitive overhead required of users of grid displays can negatively impact users' ability to activel…

    Submitted 20 August, 2024; originally announced August 2024.

  39. arXiv:2408.09632  [pdf, other]

    cs.LG cs.CL stat.ML

    MoDeGPT: Modular Decomposition for Large Language Model Compression

    Authors: Chi-Heng Lin, Shangqian Gao, James Seale Smith, Abhishek Patel, Shikhar Tuli, Yilin Shen, Hongxia Jin, Yen-Chang Hsu

    Abstract: Large Language Models (LLMs) have reshaped the landscape of artificial intelligence by demonstrating exceptional performance across various tasks. However, substantial computational requirements make their deployment challenging on devices with limited resources. Recently, compression methods using low-rank matrix techniques have shown promise, yet these often lead to degraded accuracy or introduc…

    Submitted 2 May, 2025; v1 submitted 18 August, 2024; originally announced August 2024.

    Comments: ICLR 2025 Oral

    MSC Class: 15A23 (Primary); ACM Class: I.2.7

  40. arXiv:2408.05604  [pdf, other]

    cs.RO

    Cellular Plasticity Model for Bottom-Up Robotic Design

    Authors: Trevor R. Smith, Thomas J. Smith, Nicholas S. Szczecinski, Sergiy Yakovenko, Yu Gu

    Abstract: Traditional top-down robotic design often lacks the adaptability needed to handle real-world complexities, prompting the need for more flexible approaches. Therefore, this study introduces a novel cellular plasticity model tailored for bottom-up robotic design. The proposed model utilizes an activator-inhibitor reaction, a common foundation of Turing patterns, which are fundamental in morphogenesi…

    Submitted 10 August, 2024; originally announced August 2024.

    Comments: 15 pages, 7 figures, Living Machines 2024

  41. arXiv:2408.01556  [pdf, other]

    astro-ph.IM cs.DL cs.IR

    pathfinder: A Semantic Framework for Literature Review and Knowledge Discovery in Astronomy

    Authors: Kartheik G. Iyer, Mikaeel Yunus, Charles O'Neill, Christine Ye, Alina Hyk, Kiera McCormick, Ioana Ciuca, John F. Wu, Alberto Accomazzi, Simone Astarita, Rishabh Chakrabarty, Jesse Cranney, Anjalie Field, Tirthankar Ghosal, Michele Ginolfi, Marc Huertas-Company, Maja Jablonska, Sandor Kruk, Huiling Liu, Gabriel Marchidan, Rohit Mistry, J. P. Naiman, J. E. G. Peek, Mugdha Polimera, Sergio J. Rodriguez, et al. (5 additional authors not shown)

    Abstract: The exponential growth of astronomical literature poses significant challenges for researchers navigating and synthesizing general insights or even domain-specific knowledge. We present Pathfinder, a machine learning framework designed to enable literature review and knowledge discovery in astronomy, focusing on semantic searching with natural language instead of syntactic searches with keywords.…

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: 25 pages, 9 figures, submitted to AAS journals. Comments are welcome, and the tools mentioned are available online at https://pfdr.app

  42. arXiv:2407.19115  [pdf, other]

    cs.LG

    Towards Scalable and Stable Parallelization of Nonlinear RNNs

    Authors: Xavier Gonzalez, Andrew Warrington, Jimmy T. H. Smith, Scott W. Linderman

    Abstract: Transformers and linear state space models can be evaluated in parallel on modern hardware, but evaluating nonlinear RNNs appears to be an inherently sequential problem. Recently, however, Lim et al. '24 developed an approach called DEER, which evaluates nonlinear RNNs in parallel by posing the states as the solution to a fixed-point problem. They derived a parallel form of Newton's method to solv…

    Submitted 15 January, 2025; v1 submitted 26 July, 2024; originally announced July 2024.

    Comments: 33 pages, 9 figures, NeurIPS 2024

    ACM Class: I.2.6

  43. arXiv:2407.13303  [pdf, other]

    cs.LG

    Mean Teacher based SSL Framework for Indoor Localization Using Wi-Fi RSSI Fingerprinting

    Authors: Sihao Li, Zhe Tang, Kyeong Soo Kim, Jeremy S. Smith

    Abstract: Wi-Fi fingerprinting is widely applied for indoor localization due to the widespread availability of Wi-Fi devices. However, traditional methods are not ideal for multi-building and multi-floor environments due to scalability issues. Therefore, more and more researchers have employed deep learning techniques to enable scalable indoor localization. This paper introduces a novel semi-supervised…

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 12 pages, 10 figures, under preparation for a journal publication

  44. Hierarchical Stage-Wise Training of Linked Deep Neural Networks for Multi-Building and Multi-Floor Indoor Localization Based on Wi-Fi RSSI Fingerprinting

    Authors: Sihao Li, Kyeong Soo Kim, Zhe Tang, Jeremy S. Smith

    Abstract: In this paper, we present a new solution to the problem of large-scale multi-building and multi-floor indoor localization based on linked neural networks, where each neural network is dedicated to a sub-problem and trained under a hierarchical stage-wise training framework. When the measured data from sensors have a hierarchical representation as in multi-building and multi-floor indoor localizati…

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 9 pages, 5 figures, under review for journal publication

    Journal ref: IEEE Sensors Journal, Early Access, 12 September 2024

  45. arXiv:2407.07279  [pdf, other]

    cs.LG stat.ML

    Towards a theory of learning dynamics in deep state space models

    Authors: Jakub Smékal, Jimmy T. H. Smith, Michael Kleinman, Dan Biderman, Scott W. Linderman

    Abstract: State space models (SSMs) have shown remarkable empirical performance on many long sequence modeling tasks, but a theoretical understanding of these models is still lacking. In this work, we study the learning dynamics of linear SSMs to understand how covariance structure in data, latent state size, and initialization affect the evolution of parameters throughout learning with gradient descent. We…

    Submitted 9 July, 2024; originally announced July 2024.

  46. arXiv:2407.04667  [pdf, other]

    stat.ME cs.LG

    The diameter of a stochastic matrix: A new measure for sensitivity analysis in Bayesian networks

    Authors: Manuele Leonelli, Jim Q. Smith, Sophia K. Wright

    Abstract: Bayesian networks are one of the most widely used classes of probabilistic models for risk management and decision support because of their interpretability and flexibility in including heterogeneous pieces of information. In any applied modelling, it is critical to assess how robust the inferences on certain target variables are to changes in the model. In Bayesian networks, these analyses fall u…

    Submitted 5 July, 2024; originally announced July 2024.

  47. arXiv:2406.04364  [pdf]

    cs.CV cs.HC cs.LG

    Use of a Multiscale Vision Transformer to predict Nursing Activities Score from Low Resolution Thermal Videos in an Intensive Care Unit

    Authors: Isaac YL Lee, Thanh Nguyen-Duc, Ryo Ueno, Jesse Smith, Peter Y Chan

    Abstract: Excessive caregiver workload in hospital nurses has been implicated in poorer patient care and increased worker burnout. Measurement of this workload in the Intensive Care Unit (ICU) is often done using the Nursing Activities Score (NAS), but this is usually recorded manually and sporadically. Previous work has made use of Ambient Intelligence (AmI) by using computer vision to passively derive car…

    Submitted 30 May, 2024; originally announced June 2024.

    Comments: 4 pages, 1 figure

  48. arXiv:2405.14930  [pdf, other]

    astro-ph.IM astro-ph.GA cs.LG

    AstroPT: Scaling Large Observation Models for Astronomy

    Authors: Michael J. Smith, Ryan J. Roberts, Eirini Angeloudi, Marc Huertas-Company

    Abstract: This work presents AstroPT, an autoregressive pretrained transformer developed with astronomical use-cases in mind. The AstroPT models presented here have been pretrained on 8.6 million $512 \times 512$ pixel $grz$-band galaxy postage stamp observations from the DESI Legacy Survey DR8. We train a selection of foundation models of increasing size from 1 million to 2.1 billion parameters, and find t…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 12 pages, 4 figures, 1 table. Code available at https://github.com/Smith42/astroPT

  49. arXiv:2405.12821  [pdf, other]

    cs.RO cs.CV

    Talk2Radar: Bridging Natural Language with 4D mmWave Radar for 3D Referring Expression Comprehension

    Authors: Runwei Guan, Ruixiao Zhang, Ningwei Ouyang, Jianan Liu, Ka Lok Man, Xiaohao Cai, Ming Xu, Jeremy Smith, Eng Gee Lim, Yutao Yue, Hui Xiong

    Abstract: Embodied perception is essential for intelligent vehicles and robots in interactive environmental understanding. However, these advancements primarily focus on vision, with limited attention given to using 3D modeling sensors, restricting a comprehensive understanding of objects in response to prompts containing qualitative and quantitative queries. Recently, as a promising automotive sensor with…

    Submitted 9 February, 2025; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: Accepted by ICRA 2025

  50. arXiv:2405.06147  [pdf, other]

    cs.LG eess.SY

    State-Free Inference of State-Space Models: The Transfer Function Approach

    Authors: Rom N. Parnichkun, Stefano Massaroli, Alessandro Moro, Jimmy T. H. Smith, Ramin Hasani, Mathias Lechner, Qi An, Christopher Ré, Hajime Asama, Stefano Ermon, Taiji Suzuki, Atsushi Yamashita, Michael Poli

    Abstract: We approach designing a state-space model for deep learning applications through its dual representation, the transfer function, and uncover a highly efficient sequence parallel inference algorithm that is state-free: unlike other proposed algorithms, state-free inference does not incur any significant memory or computational cost with an increase in state size. We achieve this using properties of…

    Submitted 1 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: Resubmission 02/06/2024: Fixed minor typo of recurrent form RTF