Showing 1–50 of 57 results for author: Kelly, P

  1. arXiv:2505.17048  [pdf, other]

    cs.CL cs.AI cs.CY q-fin.CP q-fin.GN

    Words That Unite The World: A Unified Framework for Deciphering Central Bank Communications Globally

    Authors: Agam Shah, Siddhant Sukhani, Huzaifa Pardawala, Saketh Budideti, Riya Bhadani, Rudra Gopal, Siddhartha Somani, Michael Galarnyk, Soungmin Lee, Arnav Hiray, Akshar Ravichandran, Eric Kim, Pranav Aluru, Joshua Zhang, Sebastian Jaskowski, Veer Guda, Meghaj Tarte, Liqin Ye, Spencer Gosden, Rutwik Routu, Rachel Yuh, Sloka Chava, Sahasra Chava, Dylan Patrick Kelly, Aiden Chiang, et al. (2 additional authors not shown)

    Abstract: Central banks around the world play a crucial role in maintaining economic stability. Deciphering policy implications in their communications is essential, especially as misinterpretations can disproportionately impact vulnerable populations. To address this, we introduce the World Central Banks (WCB) dataset, the most comprehensive monetary policy corpus to date, comprising over 380k sentences fr…

    Submitted 15 May, 2025; originally announced May 2025.

  2. arXiv:2503.19894  [pdf, other]

    quant-ph cs.ET cs.PF

    Versatile Cross-platform Compilation Toolchain for Schrödinger-style Quantum Circuit Simulation

    Authors: Yuncheng Lu, Shuang Liang, Hongxiang Fan, Ce Guo, Wayne Luk, Paul H. J. Kelly

    Abstract: While existing quantum hardware resources have limited availability and reliability, there is a growing demand for exploring and verifying quantum algorithms. Efficient classical simulators for high-performance quantum simulation are critical to meeting this demand. However, due to the vastly varied characteristics of classical hardware, implementing hardware-specific optimizations for different h…

    Submitted 25 March, 2025; originally announced March 2025.

    Comments: To appear in DAC 25

  3. arXiv:2503.12315  [pdf, other]

    stat.ME cs.LG stat.CO stat.ML

    Simulation-based Bayesian inference under model misspecification

    Authors: Ryan P. Kelly, David J. Warne, David T. Frazier, David J. Nott, Michael U. Gutmann, Christopher Drovandi

    Abstract: Simulation-based Bayesian inference (SBI) methods are widely used for parameter estimation in complex models where evaluating the likelihood is challenging but generating simulations is relatively straightforward. However, these methods commonly assume that the simulation model accurately reflects the true data-generating process, an assumption that is frequently violated in realistic scenarios. I…

    Submitted 15 March, 2025; originally announced March 2025.

    Comments: 46 pages, 8 figures

  4. arXiv:2404.13557  [pdf, other]

    stat.ML cs.LG

    Preconditioned Neural Posterior Estimation for Likelihood-free Inference

    Authors: Xiaoyu Wang, Ryan P. Kelly, David J. Warne, Christopher Drovandi

    Abstract: Simulation based inference (SBI) methods enable the estimation of posterior distributions when the likelihood function is intractable, but where model simulation is feasible. Popular neural approaches to SBI are the neural posterior estimator (NPE) and its sequential version (SNPE). These methods can outperform statistical SBI approaches such as approximate Bayesian computation (ABC), particularly…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 31 pages, 11 figures

  5. A shared compilation stack for distributed-memory parallelism in stencil DSLs

    Authors: George Bisbas, Anton Lydike, Emilien Bauer, Nick Brown, Mathieu Fehr, Lawrence Mitchell, Gabriel Rodriguez-Canal, Maurice Jamieson, Paul H. J. Kelly, Michel Steuwer, Tobias Grosser

    Abstract: Domain Specific Languages (DSLs) increase programmer productivity and provide high performance. Their targeted abstractions allow scientists to express problems at a high level, providing rich details that optimizing compilers can exploit to target current- and next-generation supercomputers. The convenience and performance of DSLs come with significant development and maintenance costs. The siloe…

    Submitted 7 March, 2025; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: Fix some bibtex links, journal ref

    Journal ref: In ASPLOS '24: Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3, 38-56 (2024)

  6. arXiv:2403.08056  [pdf, other]

    cs.PL cs.AR

    Improving Memory Dependence Prediction with Static Analysis

    Authors: Luke Panayi, Rohan Gandhi, Jim Whittaker, Vassilios Chouliaras, Martin Berger, Paul Kelly

    Abstract: This paper explores the potential of communicating information gained by static analysis from compilers to Out-of-Order (OoO) machines, focusing on the memory dependence predictor (MDP). The MDP enables loads to issue without all in-flight store addresses being known, with minimal memory order violations. We use LLVM to find loads with no dependencies and label them via their opcode. These labelle…

    Submitted 5 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: 15 pages

    ACM Class: B.0; B.8; C.1

  7. Distributed Simultaneous Localisation and Auto-Calibration using Gaussian Belief Propagation

    Authors: Riku Murai, Ignacio Alzugaray, Paul H. J. Kelly, Andrew J. Davison

    Abstract: We present a novel scalable, fully distributed, and online method for simultaneous localisation and extrinsic calibration for multi-robot setups. Individual a priori unknown robot poses are probabilistically inferred as robots sense each other while simultaneously calibrating their sensors and markers extrinsic using Gaussian Belief Propagation. In the presented experiments, we show how our method…

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: Published in IEEE Robotics and Automation Letters (RA-L) 2024

    Journal ref: IEEE Robotics and Automation Letters, vol. 9, no. 3, pp. 2136-2143, March 2024

  8. arXiv:2312.13094  [pdf, other]

    cs.DC cs.MS cs.PF

    Automated MPI-X code generation for scalable finite-difference solvers

    Authors: George Bisbas, Rhodri Nelson, Mathias Louboutin, Fabio Luporini, Paul H. J. Kelly, Gerard Gorman

    Abstract: Partial differential equations (PDEs) are crucial in modeling diverse phenomena across scientific disciplines, including seismic and medical imaging, computational fluid dynamics, image processing, and neural networks. Solving these PDEs at scale is an intricate and time-intensive process that demands careful tuning. This paper introduces automated code-generation techniques specifically tailored…

    Submitted 6 January, 2025; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: 11 pages, 12 figures (23 pages with References and Appendix)

  9. arXiv:2312.06741  [pdf, other]

    cs.CV cs.RO

    Gaussian Splatting SLAM

    Authors: Hidenobu Matsuki, Riku Murai, Paul H. J. Kelly, Andrew J. Davison

    Abstract: We present the first application of 3D Gaussian Splatting in monocular SLAM, the most fundamental but the hardest setup for Visual SLAM. Our method, which runs live at 3fps, utilises Gaussians as the only 3D representation, unifying the required representation for accurate, efficient tracking, mapping, and high-quality rendering. Designed for challenging monocular settings, our approach is seamles…

    Submitted 14 April, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: CVPR2024 Highlight. First two authors contributed equally to this work. Project Page: https://rmurai.co.uk/projects/GaussianSplattingSLAM/

  10. Fashion CUT: Unsupervised domain adaptation for visual pattern classification in clothes using synthetic data and pseudo-labels

    Authors: Enric Moreu, Alex Martinelli, Martina Naughton, Philip Kelly, Noel E. O'Connor

    Abstract: Accurate product information is critical for e-commerce stores to allow customers to browse, filter, and search for products. Product data quality is affected by missing or incorrect information resulting in poor customer experience. While machine learning can be used to correct inaccurate or missing information, achieving high performance on fashion image classification tasks requires large amoun…

    Submitted 9 May, 2023; originally announced May 2023.

  11. arXiv:2305.02491  [pdf, other]

    eess.IV cs.CV

    Self-Supervised Learning for Organs At Risk and Tumor Segmentation with Uncertainty Quantification

    Authors: Ilkin Isler, Debesh Jha, Curtis Lisle, Justin Rineer, Patrick Kelly, Bulent Aydogan, Mohamed Abazeed, Damla Turgut, Ulas Bagci

    Abstract: In this study, our goal is to show the impact of self-supervised pre-training of transformers for organ at risk (OAR) and tumor segmentation as compared to costly fully-supervised learning. The proposed algorithm is called Monte Carlo Transformer based U-Net (MC-Swin-U). Unlike many other available models, our approach presents uncertainty quantification with Monte Carlo dropout strategy while gen…

    Submitted 3 May, 2023; originally announced May 2023.

  12. arXiv:2301.13368  [pdf, other]

    stat.ME cs.LG stat.CO stat.ML

    Misspecification-robust Sequential Neural Likelihood for Simulation-based Inference

    Authors: Ryan P. Kelly, David J. Nott, David T. Frazier, David J. Warne, Chris Drovandi

    Abstract: Simulation-based inference techniques are indispensable for parameter estimation of mechanistic and simulable models with intractable likelihoods. While traditional statistical approaches like approximate Bayesian computation and Bayesian synthetic likelihood have been studied under well-specified and misspecified settings, they often suffer from inefficiencies due to wasted model simulations. Neu…

    Submitted 7 March, 2024; v1 submitted 30 January, 2023; originally announced January 2023.

  13. arXiv:2203.03092  [pdf, other]

    cs.RO cs.AI

    Systematic Comparison of Path Planning Algorithms using PathBench

    Authors: Hao-Ya Hsueh, Alexandru-Iosif Toma, Hussein Ali Jaafar, Edward Stow, Riku Murai, Paul H. J. Kelly, Sajad Saeedi

    Abstract: Path planning is an essential component of mobile robotics. Classical path planning algorithms, such as wavefront and rapidly-exploring random tree (RRT) are used heavily in autonomous robots. With the recent advances in machine learning, development of learning-based path planning algorithms has been experiencing rapid growth. A unified path planning interface that facilitates the development an…

    Submitted 6 March, 2022; originally announced March 2022.

    Comments: Accepted to Advanced Robotics Journal; 23 pages, 9 figures, 4 tables. arXiv admin note: substantial text overlap with arXiv:2105.01777

  14. arXiv:2202.03314  [pdf, other]

    cs.RO cs.AI cs.MA

    A Robot Web for Distributed Many-Device Localisation

    Authors: Riku Murai, Joseph Ortiz, Sajad Saeedi, Paul H. J. Kelly, Andrew J. Davison

    Abstract: We show that a distributed network of robots or other devices which make measurements of each other can collaborate to globally localise via efficient ad-hoc peer to peer communication. Our Robot Web solution is based on Gaussian Belief Propagation on the fundamental non-linear factor graph describing the probabilistic structure of all of the observations robots make internally or of each other, a…

    Submitted 26 January, 2024; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: Published in IEEE Transactions on Robotics (TRO) 2023

    Journal ref: IEEE Transactions on Robotics, vol. 40, pp. 121-138, 2024

  15. arXiv:2202.01866  [pdf, other]

    eess.IV cs.CV

    Enhancing Organ at Risk Segmentation with Improved Deep Neural Networks

    Authors: Ilkin Isler, Curtis Lisle, Justin Rineer, Patrick Kelly, Damla Turgut, Jacob Ricci, Ulas Bagci

    Abstract: Organ at risk (OAR) segmentation is a crucial step for treatment planning and outcome determination in radiotherapy treatments of cancer patients. Several deep learning based segmentation algorithms have been developed in recent years, however, U-Net remains the de facto algorithm designed specifically for biomedical image segmentation and has spawned many variants with known weaknesses. In this s…

    Submitted 3 February, 2022; originally announced February 2022.

    Comments: 7 pages, 3 figures, 6 tables, The paper is published in SPIE Medical Imaging 2022

  16. arXiv:2106.07456  [pdf, other]

    cs.AR

    Extending the RISC-V ISA for exploring advanced reconfigurable SIMD instructions

    Authors: Philippos Papaphilippou, Paul H. J. Kelly, Wayne Luk

    Abstract: This paper presents a novel, non-standard set of vector instruction types for exploring custom SIMD instructions in a softcore. The new types allow simultaneous access to a relatively high number of operands, reducing the instruction count where applicable. Additionally, a high-performance open-source RISC-V (RV32 IM) softcore is introduced, optimised for exploring custom SIMD instructions and str…

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: Accepted at the Fifth Workshop on Computer Architecture Research with RISC-V (CARRV 2021), co-located with ISCA 2021

  17. arXiv:2106.06086  [pdf, ps, other]

    cs.SE cs.NE

    PSB2: The Second Program Synthesis Benchmark Suite

    Authors: Thomas Helmuth, Peter Kelly

    Abstract: For the past six years, researchers in genetic programming and other program synthesis disciplines have used the General Program Synthesis Benchmark Suite to benchmark many aspects of automatic program synthesis systems. These problems have been used to make notable progress toward the goal of general program synthesis: automatically creating the types of software that human programmers code. Many…

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: To be published in GECCO 2021

  18. arXiv:2105.01777  [pdf, other]

    cs.RO cs.LG

    PathBench: A Benchmarking Platform for Classical and Learned Path Planning Algorithms

    Authors: Alexandru-Iosif Toma, Hao-Ya Hsueh, Hussein Ali Jaafar, Riku Murai, Paul H. J. Kelly, Sajad Saeedi

    Abstract: Path planning is a key component in mobile robotics. A wide range of path planning algorithms exist, but few attempts have been made to benchmark the algorithms holistically or unify their interface. Moreover, with the recent advances in deep neural networks, there is an urgent need to facilitate the development and benchmarking of such learning-based planning algorithms. This paper presents PathB…

    Submitted 4 May, 2021; originally announced May 2021.

    Comments: The Conference on Robots and Vision (CRV2021), Supplementary Website: https://sites.google.com/view/pathbench/

  19. arXiv:2104.05834  [pdf, other]

    cs.RO

    Generative Design of NU's Husky Carbon, A Morpho-Functional, Legged Robot

    Authors: Alireza Ramezani, Pravin Dangol, Eric Sihite, Andrew Lessieur, Peter Kelly

    Abstract: We report the design of a morpho-functional robot called Husky Carbon. Our goal is to integrate two forms of mobility, aerial and quadrupedal-legged locomotion, within a single platform. There are prohibitive design restrictions such as tight power budget and payload, which can particularly become important in aerial flights. To address these challenges, we pose a problem called the Mobility Value…

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 7 Pages, 7 figures, submitted to ICRA 2021

  20. arXiv:2101.08886  [pdf]

    cs.HC eess.SY

    A co-Design approach to develop a smart cooking appliance. Applying a Domain Specific Language for a community supported appliance

    Authors: Matteo Zallio, Paula Kelly, Barry Cryan, Damon Berry

    Abstract: Our environment, whether at work, in public spaces, or at home, is becoming more connected, and increasingly responsive. Meal preparation even when it involves simply heating ready-made food can be perceived as a complex process for people with disabilities. This research aimed to prototype, using a co-Design approach a Community Supported Appliance (CSA) by developing a Domain Specific Language (…

    Submitted 21 June, 2021; v1 submitted 21 January, 2021; originally announced January 2021.

    Comments: 9 pages, 7 figures

  21. arXiv:2101.08715  [pdf, other]

    cs.AR cs.CV cs.DC

    Cain: Automatic Code Generation for Simultaneous Convolutional Kernels on Focal-plane Sensor-processors

    Authors: Edward Stow, Riku Murai, Sajad Saeedi, Paul H. J. Kelly

    Abstract: Focal-plane Sensor-processors (FPSPs) are a camera technology that enable low power, high frame rate computation, making them suitable for edge computation. Unfortunately, these devices' limited instruction sets and registers make developing complex algorithms difficult. In this work, we present Cain - a compiler that targets SCAMP-5, a general-purpose FPSP - which generates code from multiple con…

    Submitted 21 January, 2021; originally announced January 2021.

    Comments: 17 pages, 4 figures, Accepted at LCPC 2020 to be published by Springer

    ACM Class: D.3.4; I.4.m

  22. arXiv:2010.10248  [pdf, other]

    cs.DC cs.MS cs.PF

    Temporal blocking of finite-difference stencil operators with sparse "off-the-grid" sources

    Authors: George Bisbas, Fabio Luporini, Mathias Louboutin, Rhodri Nelson, Gerard Gorman, Paul H. J. Kelly

    Abstract: Stencil kernels dominate a range of scientific applications, including seismic and medical imaging, image processing, and neural networks. Temporal blocking is a performance optimization that aims to reduce the required memory bandwidth of stencil computations by re-using data from the cache for multiple time steps. It has already been shown to be beneficial for this class of algorithms. However,…

    Submitted 25 February, 2021; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: Accepted for publication at 35th IEEE International Parallel & Distributed Processing Symposium

  23. arXiv:2010.04702  [pdf, other]

    cs.RO

    Mechanism Design of a Bio-inspired Armwing Mechanism for Mimicking Bat Flapping Gait

    Authors: E. Sihite, P. Kelly, A. Ramezani

    Abstract: The objective of this work is to design and develop a bio-inspired soft and articulated armwing structure which will be an integral component of a morphing aerial co-bot, Aerobat. In our design, we draw inspiration from bats. Bat membranous wings possess unique functions that make them a good example to take inspiration from and transform current aerial drones. In contrast with other flying verteb…

    Submitted 9 October, 2020; originally announced October 2020.

    Comments: 2 pages, abstract, 2 figures, accepted at International Conference on Intelligent Robots and Systems (IROS),2020

  24. arXiv:2006.07187  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG stat.ML

    HMIC: Hierarchical Medical Image Classification, A Deep Learning Approach

    Authors: Kamran Kowsari, Rasoul Sali, Lubaina Ehsan, William Adorno, Asad Ali, Sean Moore, Beatrice Amadi, Paul Kelly, Sana Syed, Donald Brown

    Abstract: Image classification is central to the big data revolution in medicine. Improved information processing methods for diagnosis and classification of digital medical images have shown to be successful via deep learning approaches. As this field is explored, there are limitations to the performance of traditional supervised classifiers. This paper outlines an approach that is different from the curre…

    Submitted 23 June, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Journal ref: Information 11, no. 6 (2020): 318

  25. arXiv:2006.04794  [pdf]

    cs.SE

    Abstracting spreadsheet data flow through hypergraph redrawing

    Authors: David Birch, Nicolai Stawinoga, Jack Binks, Bruno Nicoletti, Paul Kelly

    Abstract: We believe the error prone nature of traditional spreadsheets is due to their low level of abstraction. End user programmers are forced to construct their data models from low level cells which we define as "a data container or manipulator linked by user-intent to model their world and positioned to reflect its structure". Spreadsheet cells are limited in what they may contain (scalar values) and…

    Submitted 4 June, 2020; originally announced June 2020.

    Comments: 23 Pages, 12 Colour Figures

    Journal ref: Proceedings of the EuSpRIG 2019 Conference "Spreadsheet Risk Management", Browns, Covent Garden, London, pp79-102, ISBN: 978-1-905404-56-8

  26. arXiv:2006.01765  [pdf, other]

    eess.SP cs.CV cs.LG

    AnalogNet: Convolutional Neural Network Inference on Analog Focal Plane Sensor Processors

    Authors: Matthew Z. Wong, Benoit Guillard, Riku Murai, Sajad Saeedi, Paul H. J. Kelly

    Abstract: We present a high-speed, energy-efficient Convolutional Neural Network (CNN) architecture utilising the capabilities of a unique class of devices known as analog Focal Plane Sensor Processors (FPSP), in which the sensor and the processor are embedded together on the same silicon chip. Unlike traditional vision systems, where the sensor array sends collected data to a separate processor for process…

    Submitted 21 June, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

    Comments: 8 pages, 7 figures

  27. arXiv:2005.03868  [pdf, other]

    eess.IV cs.LG stat.ML

    Hierarchical Deep Convolutional Neural Networks for Multi-category Diagnosis of Gastrointestinal Disorders on Histopathological Images

    Authors: Rasoul Sali, Sodiq Adewole, Lubaina Ehsan, Lee A. Denson, Paul Kelly, Beatrice C. Amadi, Lori Holtz, Syed Asad Ali, Sean R. Moore, Sana Syed, Donald E. Brown

    Abstract: Deep convolutional neural networks(CNNs) have been successful for a wide range of computer vision tasks, including image classification. A specific area of the application lies in digital pathology for pattern recognition in the tissue-based diagnosis of gastrointestinal(GI) diseases. This domain can utilize CNNs to translate histopathological images into precise diagnostics. This is challenging s…

    Submitted 6 August, 2020; v1 submitted 8 May, 2020; originally announced May 2020.

    Comments: accepted at IEEE International Conference on Healthcare Informatics (ICHI 2020)

  28. BIT-VO: Visual Odometry at 300 FPS using Binary Features from the Focal Plane

    Authors: Riku Murai, Sajad Saeedi, Paul H. J. Kelly

    Abstract: Focal-plane Sensor-processor (FPSP) is a next-generation camera technology which enables every pixel on the sensor chip to perform computation in parallel, on the focal plane where the light intensity is captured. SCAMP-5 is a general-purpose FPSP used in this work and it carries out computations in the analog domain before analog to digital conversion. By extracting features from the image on the…

    Submitted 23 April, 2020; originally announced April 2020.

    Comments: 8 pages, 16 figures

    Journal ref: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, USA, 2020, pp. 8579-8586

  29. arXiv:2003.03396  [pdf, other]

    cs.CV cs.LG

    Scalable Uncertainty for Computer Vision with Functional Variational Inference

    Authors: Eduardo D C Carvalho, Ronald Clark, Andrea Nicastro, Paul H J Kelly

    Abstract: As Deep Learning continues to yield successful applications in Computer Vision, the ability to quantify all forms of uncertainty is a paramount requirement for its safe and reliable deployment in the real-world. In this work, we leverage the formulation of variational inference in function space, where we associate Gaussian Processes (GPs) to both Bayesian CNN priors and variational family. Since…

    Submitted 6 March, 2020; originally announced March 2020.

    Comments: CVPR 2020

  30. arXiv:1909.01963  [pdf, other]

    eess.IV cs.CV q-bio.QM

    Self-Attentive Adversarial Stain Normalization

    Authors: Aman Shrivastava, Will Adorno, Yash Sharma, Lubaina Ehsan, S. Asad Ali, Sean R. Moore, Beatrice C. Amadi, Paul Kelly, Sana Syed, Donald E. Brown

    Abstract: Hematoxylin and Eosin (H&E) stained Whole Slide Images (WSIs) are utilized for biopsy visualization-based diagnostic and prognostic assessment of diseases. Variation in the H&E staining process across different lab sites can lead to significant variations in biopsy image appearance. These variations introduce an undesirable bias when the slides are examined by pathologists or used for training dee…

    Submitted 22 November, 2020; v1 submitted 4 September, 2019; originally announced September 2019.

    Comments: Accepted at AIDP (ICPR 2021)

  31. arXiv:1908.03272  [pdf, other]

    q-bio.QM cs.CV eess.IV

    Deep Learning for Visual Recognition of Environmental Enteropathy and Celiac Disease

    Authors: Aman Shrivastava, Karan Kant, Saurav Sengupta, Sung-Jun Kang, Marium Khan, Asad Ali, Sean R. Moore, Beatrice C. Amadi, Paul Kelly, Donald E. Brown, Sana Syed

    Abstract: Physicians use biopsies to distinguish between different but histologically similar enteropathies. The range of syndromes and pathologies that could cause different gastrointestinal conditions makes this a difficult problem. Recently, deep learning has been used successfully in helping diagnose cancerous tissues in histopathological images. These successes motivated the research presented in this…

    Submitted 8 August, 2019; originally announced August 2019.

  32. arXiv:1906.00877  [pdf, other]

    cs.AR

    Pangloss: a novel Markov chain prefetcher

    Authors: Philippos Papaphilippou, Paul H. J. Kelly, Wayne Luk

    Abstract: We present Pangloss, an efficient high-performance data prefetcher that approximates a Markov chain on delta transitions. With a limited information scope and space/logic complexity, it is able to reconstruct a variety of both simple and complex access patterns. This is achieved by a highly-efficient representation of the Markov chain to provide accurate values for transition probabilities. In add…

    Submitted 3 June, 2019; originally announced June 2019.

    Comments: Accepted in The Third Data Prefetching Championship (DPC3), held in conjunction with ISCA 2019

  33. Model Based System Assurance Using the Structured Assurance Case Metamodel

    Authors: Ran Wei, Tim P. Kelly, Xiaotian Dai, Shuai Zhao, Richard Hawkins

    Abstract: Assurance cases are used to demonstrate confidence in system properties of interest (e.g. safety and/or security). A number of system assurance approaches are adopted by industries in the safety-critical domain. However, the task of constructing assurance cases remains a manual, trivial and informal process. The Structured Assurance Case Metamodel (SACM) is a standard specified by the Object Manag…

    Submitted 7 May, 2019; originally announced May 2019.

    Comments: 45 pages, 41 figures, Accepted by Journal of Systems and Software

  34. arXiv:1904.05773  [pdf, other]

    eess.IV cs.CV cs.LG q-bio.QM stat.ML

    Diagnosis of Celiac Disease and Environmental Enteropathy on Biopsy Images Using Color Balancing on Convolutional Neural Networks

    Authors: Kamran Kowsari, Rasoul Sali, Marium N. Khan, William Adorno, S. Asad Ali, Sean R. Moore, Beatrice C. Amadi, Paul Kelly, Sana Syed, Donald E. Brown

    Abstract: Celiac Disease (CD) and Environmental Enteropathy (EE) are common causes of malnutrition and adversely impact normal childhood development. CD is an autoimmune disorder that is prevalent worldwide and is caused by an increased sensitivity to gluten. Gluten exposure destructs the small intestinal epithelial barrier, resulting in nutrient mal-absorption and childhood under-nutrition. EE also results…

    Submitted 9 October, 2019; v1 submitted 10 April, 2019; originally announced April 2019.

  35. A study of vectorization for matrix-free finite element methods

    Authors: Tianjiao Sun, Lawrence Mitchell, Kaushik Kulkarni, Andreas Klöckner, David A. Ham, Paul H. J. Kelly

    Abstract: Vectorization is increasingly important to achieve high performance on modern hardware with SIMD instructions. Assembly of matrices and vectors in the finite element method, which is characterized by iterating a local assembly kernel over unstructured meshes, poses difficulties to effective vectorization. Maintaining a user-friendly high-level interface with a suitable degree of abstraction while…

    Submitted 19 May, 2020; v1 submitted 19 March, 2019; originally announced March 2019.

    Journal ref: International Journal of High Performance Computing Applications (2020)

  36. arXiv:1811.11874  [pdf, other]

    eess.IV cs.CV

    RetinaMatch: Efficient Template Matching of Retina Images for Teleophthalmology

    Authors: Chen Gong, N. Benjamin Erichson, John P. Kelly, Laura Trutoiu, Brian T. Schowengerdt, Steven L. Brunton, Eric J. Seibel

    Abstract: Retinal template matching and registration is an important challenge in teleophthalmology with low-cost imaging devices. However, the images from such devices generally have a small field of view (FOV) and image quality degradations, making matching difficult. In this work, we develop an efficient and accurate retinal matching technique that combines dimension reduction and mutual information (MI)…

    Submitted 28 November, 2018; originally announced November 2018.

  37. arXiv:1808.06820  [pdf, other]

    cs.RO

    SLAMBench2: Multi-Objective Head-to-Head Benchmarking for Visual SLAM

    Authors: Bruno Bodin, Harry Wagstaff, Sajad Saeedi, Luigi Nardi, Emanuele Vespa, John H Mayer, Andy Nisbet, Mikel Luján, Steve Furber, Andrew J Davison, Paul H. J. Kelly, Michael O'Boyle

    Abstract: SLAM is becoming a key component of robotics and augmented reality (AR) systems. While a large number of SLAM algorithms have been presented, there has been little effort to unify the interface of such algorithms, or to perform a holistic comparison of their capabilities. This is a problem since different SLAM applications can have different functional and non-functional requirements. For example,…

    Submitted 21 August, 2018; originally announced August 2018.

    Journal ref: 2018 IEEE International Conference on Robotics and Automation (ICRA'18)

  38. Navigating the Landscape for Real-time Localisation and Mapping for Robotics and Virtual and Augmented Reality

    Authors: Sajad Saeedi, Bruno Bodin, Harry Wagstaff, Andy Nisbet, Luigi Nardi, John Mawer, Nicolas Melot, Oscar Palomar, Emanuele Vespa, Tom Spink, Cosmin Gorgovan, Andrew Webb, James Clarkson, Erik Tomusk, Thomas Debrunner, Kuba Kaszyk, Pablo Gonzalez-de-Aledo, Andrey Rodchenko, Graham Riley, Christos Kotselidis, Björn Franke, Michael F. P. O'Boyle, Andrew J. Davison, Paul H. J. Kelly, Mikel Luján, et al. (1 additional author not shown)

    Abstract: Visual understanding of 3D environments in real-time, at low power, is a huge computational challenge. Often referred to as SLAM (Simultaneous Localisation and Mapping), it is central to applications spanning domestic and industrial robotics, autonomous vehicles, virtual and augmented reality. This paper describes the results of a major research effort to assemble the algorithms, architectures, to…

    Submitted 20 August, 2018; originally announced August 2018.

    Comments: Proceedings of the IEEE 2018

  39. arXiv:1807.03032  [pdf, other]

    cs.MS

    Architecture and performance of Devito, a system for automated stencil computation

    Authors: Fabio Luporini, Michael Lange, Mathias Louboutin, Navjot Kukreja, Jan Hückelheim, Charles Yount, Philipp Witte, Paul H. J. Kelly, Felix J. Herrmann, Gerard J. Gorman

    Abstract: Stencil computations are a key part of many high-performance computing applications, such as image processing, convolutional neural networks, and finite-difference solvers for partial differential equations. Devito is a framework capable of generating highly-optimized code given symbolic equations expressed in Python, specialized in, but not limited to, affine (stencil) codes. The lowering process…

    Submitted 7 February, 2020; v1 submitted 9 July, 2018; originally announced July 2018.

    Comments: Submitted to ACM Transactions on Mathematical Software

    MSC Class: 65N06; 68N20

  40. arXiv:1708.03183  [pdf, other]

    cs.CE physics.geo-ph

    Automated Tiling of Unstructured Mesh Computations with Application to Seismological Modelling

    Authors: Fabio Luporini, Michael Lange, Christian T. Jacobs, Gerard J. Gorman, J. Ramanujam, Paul H. J. Kelly

    Abstract: Sparse tiling is a technique to fuse loops that access common data, thus increasing data locality. Unlike traditional loop fusion or blocking, the loops may have different iteration spaces and access shared datasets through indirect memory accesses, such as A[map[i]] -- hence the name "sparse". One notable example of such loops arises in discontinuous-Galerkin finite element methods, because of th…

    Submitted 19 June, 2019; v1 submitted 10 August, 2017; originally announced August 2017.

    Comments: 29 pages including supplementary materials and references

    ACM Class: D.1.2; G.4

  41. arXiv:1705.09866  [pdf, other]

    physics.geo-ph cs.SI physics.data-an stat.ML

    Machine learning for graph-based representations of three-dimensional discrete fracture networks

    Authors: Manuel Valera, Zhengyang Guo, Priscilla Kelly, Sean Matz, Vito Adrian Cantu, Allon G. Percus, Jeffrey D. Hyman, Gowri Srinivasan, Hari S. Viswanathan

    Abstract: Structural and topological information play a key role in modeling flow and transport through fractured rock in the subsurface. Discrete fracture network (DFN) computational suites such as dfnWorks are designed to simulate flow and transport in such porous media. Flow and transport calculations reveal that a small backbone of fractures exists, where most flow and transport occurs. Restricting the…

    Submitted 29 January, 2018; v1 submitted 27 May, 2017; originally announced May 2017.

    Comments: Computational Geosciences (2018)

    Report number: LA-UR-17-24300

    Journal ref: Computational Geosciences 22, 695-710 (2018)

  42. arXiv:1702.00505  [pdf, other]

    cs.CV cs.DC cs.LG cs.PF

    Algorithmic Performance-Accuracy Trade-off in 3D Vision Applications Using HyperMapper

    Authors: Luigi Nardi, Bruno Bodin, Sajad Saeedi, Emanuele Vespa, Andrew J. Davison, Paul H. J. Kelly

    Abstract: In this paper we investigate an emerging application, 3D scene understanding, likely to be significant in the mobile space in the near future. The goal of this exploration is to reduce execution time while meeting our quality of result objectives. In previous work we showed for the first time that it is possible to map this application to power constrained embedded systems, highlighting that decis…

    Submitted 21 March, 2017; v1 submitted 1 February, 2017; originally announced February 2017.

    Comments: 10 pages, Keywords: design space exploration, machine learning, computer vision, SLAM, embedded systems, GPU, crowd-sourcing

    Journal ref: 31st IEEE International Parallel and Distributed Processing Symposium May 29 - June 2, 2017 Orlando, Florida USA

  43. A structure-exploiting numbering algorithm for finite elements on extruded meshes, and its performance evaluation in Firedrake

    Authors: Gheorghe-Teodor Bercea, Andrew T. T. McRae, David A. Ham, Lawrence Mitchell, Florian Rathgeber, Luigi Nardi, Fabio Luporini, Paul H. J. Kelly

    Abstract: We present a generic algorithm for numbering and then efficiently iterating over the data values attached to an extruded mesh. An extruded mesh is formed by replicating an existing mesh, assumed to be unstructured, to form layers of prismatic cells. Applications of extruded meshes include, but are not limited to, the representation of 3D high aspect ratio domains employed by geophysical finite ele…

    Submitted 28 October, 2016; v1 submitted 20 April, 2016; originally announced April 2016.

    Comments: Bibliography fixes, 23 pages

    Journal ref: Geoscientific Model Development 9:3803-3815 (2016)

  44. An algorithm for the optimization of finite element integration loops

    Authors: Fabio Luporini, David A. Ham, Paul H. J. Kelly

    Abstract: We present an algorithm for the optimization of a class of finite element integration loop nests. This algorithm, which exploits fundamental mathematical properties of finite element operators, is proven to achieve a locally optimal operation count. In specified circumstances the optimum achieved is global. Extensive numerical experiments demonstrate significant performance improvements over the s…

    Submitted 20 April, 2016; originally announced April 2016.

    ACM Class: G.1.8; G.4

  45. arXiv:1512.06282  [pdf, ps, other]

    cs.LO

    Contributions to the compositional semantics of first-order predicate logic

    Authors: Philip Kelly, M. H. van Emden

    Abstract: Henkin, Monk and Tarski gave a compositional semantics for first-order predicate logic. We extend this work by including function symbols in the language and by giving the denotation of the atomic formula as a composition of the denotations of its predicate symbol and of its tuple of arguments. In addition we give the denotation of a term as a composition of the denotations of its function symbol…

    Submitted 19 December, 2015; originally announced December 2015.

    Comments: 14 pages, 1 figure

    Report number: DCS-356-IR

  46. Comparative Design Space Exploration of Dense and Semi-Dense SLAM

    Authors: M. Zeeshan Zia, Luigi Nardi, Andrew Jack, Emanuele Vespa, Bruno Bodin, Paul H. J. Kelly, Andrew J. Davison

    Abstract: SLAM has matured significantly over the past few years, and is beginning to appear in serious commercial products. While new SLAM systems are being proposed at every conference, evaluation is often restricted to qualitative visualizations or accuracy estimation against a ground truth. This is due to the lack of benchmarking methodologies which can holistically and quantitatively evaluate these sys…

    Submitted 3 March, 2016; v1 submitted 15 September, 2015; originally announced September 2015.

    Comments: IEEE International Conference on Robotics and Automation 2016

  47. arXiv:1505.04694  [pdf, other]

    cs.DC

    Thread Parallelism for Highly Irregular Computation in Anisotropic Mesh Adaptation

    Authors: Georgios Rokos, Gerard J. Gorman, Kristian Ejlebjerg Jensen, Paul H. J. Kelly

    Abstract: Thread-level parallelism in irregular applications with mutable data dependencies presents challenges because the underlying data is extensively modified during execution of the algorithm and a high degree of parallelism must be realized while keeping the code race-free. In this article we describe a methodology for exploiting thread parallelism for a class of graph-mutating worklist algorithms, w…

    Submitted 18 May, 2015; originally announced May 2015.

    Comments: To appear in the proceedings of EASC 2015

  48. arXiv:1505.04134  [pdf, ps, other]

    cs.DC

    An Interrupt-Driven Work-Sharing For-Loop Scheduler

    Authors: Georgios Rokos, Gerard J. Gorman, Paul H. J. Kelly

    Abstract: In this paper we present a parallel for-loop scheduler which is based on work-stealing principles but runs under a completely cooperative scheme. POSIX signals are used by idle threads to interrupt left-behind workers, which in turn decide what portion of their workload can be given to the requester. We call this scheme Interrupt-Driven Work-Sharing (IDWS). This article describes how IDWS works, h…

    Submitted 18 May, 2015; v1 submitted 15 May, 2015; originally announced May 2015.

  49. arXiv:1505.04086  [pdf, ps, other]

    cs.DC

    A Fast and Scalable Graph Coloring Algorithm for Multi-core and Many-core Architectures

    Authors: Georgios Rokos, Gerard Gorman, Paul H J Kelly

    Abstract: Irregular computations on unstructured data are an important class of problems for parallel programming. Graph coloring is often an important preprocessing step, e.g. as a way to perform dependency analysis for safe parallel execution. The total run time of a coloring algorithm adds to the overall parallel overhead of the application whereas the number of colors used determines the amount of expos…

    Submitted 18 May, 2015; v1 submitted 15 May, 2015; originally announced May 2015.

    Comments: To appear in the proceedings of Euro-Par 2015

  50. arXiv:1501.01809  [pdf, other]

    cs.MS math.NA

    Firedrake: automating the finite element method by composing abstractions

    Authors: Florian Rathgeber, David A. Ham, Lawrence Mitchell, Michael Lange, Fabio Luporini, Andrew T. T. McRae, Gheorghe-Teodor Bercea, Graham R. Markall, Paul H. J. Kelly

    Abstract: Firedrake is a new tool for automating the numerical solution of partial differential equations. Firedrake adopts the domain-specific language for the finite element method of the FEniCS project, but with a pure Python runtime-only implementation centred on the composition of several existing and new abstractions for particular aspects of scientific computing. The result is a more complete separat…

    Submitted 1 July, 2016; v1 submitted 8 January, 2015; originally announced January 2015.

    Comments: Minor revisions to v2

    ACM Class: G.1.8; G.4

    Journal ref: ACM Transactions on Mathematical Software 43(3):24:1--24:27 (2016)