Showing 1–43 of 43 results for author: Kelly, M

Searching in archive cs.
  1. arXiv:2508.04995  [pdf]

    cs.HC cs.AI cs.DL

    Situated Epistemic Infrastructures: A Diagnostic Framework for Post-Coherence Knowledge

    Authors: Matthew Kelly

    Abstract: Large Language Models (LLMs) such as ChatGPT have rendered visible the fragility of contemporary knowledge infrastructures by simulating coherence while bypassing traditional modes of citation, authority, and validation. This paper introduces the Situated Epistemic Infrastructures (SEI) framework as a diagnostic tool for analyzing how knowledge becomes authoritative across hybrid human-machine sys…

    Submitted 12 August, 2025; v1 submitted 6 August, 2025; originally announced August 2025.

    Comments: 22 pages including references. Draft prepared for submission to Science, Technology & Human Values

    ACM Class: K.4.1; K.3; K.2

  2. arXiv:2506.05636  [pdf, other]

    cs.LG cs.AI

    Bayesian Inference for Correlated Human Experts and Classifiers

    Authors: Markelle Kelly, Alex Boyd, Sam Showalter, Mark Steyvers, Padhraic Smyth

    Abstract: Applications of machine learning often involve making predictions based on both model outputs and the opinions of human experts. In this context, we investigate the problem of querying experts for class label predictions, using as few human queries as possible, and leveraging the class probability estimates of pre-trained classifiers. We develop a general Bayesian framework for this problem, model…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: accepted to ICML 2025

  3. Understanding Gender Bias in AI-Generated Product Descriptions

    Authors: Markelle Kelly, Mohammad Tahaei, Padhraic Smyth, Lauren Wilcox

    Abstract: While gender bias in large language models (LLMs) has been extensively studied in many domains, uses of LLMs in e-commerce remain largely unexamined and may reveal novel forms of algorithmic bias and harm. Our work investigates this space, developing data-driven taxonomic categories of gender bias in the context of product description generation, which we situate with respect to existing general p…

    Submitted 3 June, 2025; originally announced June 2025.

    Comments: Accepted to FAccT 2025

  4. arXiv:2502.01525  [pdf, ps, other]

    cs.DL

    Archiving and Replaying Current Web Advertisements: Challenges and Opportunities

    Authors: Travis Reid, Alex H. Poole, Hyung Wook Choi, Christopher Rauch, Mat Kelly, Michael L. Nelson, Michele C. Weigle

    Abstract: Although web advertisements represent an inimitable part of digital cultural heritage, serious archiving and replay challenges persist. To explore these challenges, we created a dataset of 279 archived ads. We encountered five problems in archiving and replaying them. For one, prior to August 2023, Internet Archive's Save Page Now service excluded not only well-known ad services' ads, but also URL…

    Submitted 22 September, 2025; v1 submitted 3 February, 2025; originally announced February 2025.

  5. arXiv:2501.09951  [pdf, other]

    cs.HC

    Discord's Design Encourages "Third Place" Social Media Experiences

    Authors: JaeWon Kim, Thea Klein-Balajee, Ryan M. Kelly, Alexis Hiniker

    Abstract: In light of the diminishing presence of physical third places -- informal gathering spaces essential for social connection -- this study explores how the social media platform Discord fosters third-place experiences. Drawing on Oldenburg's conceptual framework, we analyze how Discord's design elements support the creation of virtual third places that foster both dyadic and community-based relation…

    Submitted 16 January, 2025; originally announced January 2025.

  6. arXiv:2410.24100  [pdf, other]

    cs.LG cs.DL

    Benchmark Data Repositories for Better Benchmarking

    Authors: Rachel Longjohn, Markelle Kelly, Sameer Singh, Padhraic Smyth

    Abstract: In machine learning research, it is common to evaluate algorithms via their performance on standard benchmark datasets. While a growing body of work establishes guidelines for -- and levies criticisms at -- data and benchmarking practices in machine learning, comparatively less attention has been paid to the data repositories where these datasets are stored, documented, and shared. In this paper,…

    Submitted 31 October, 2024; originally announced October 2024.

    Comments: Accepted to NeurIPS Datasets and Benchmarks 2024

  7. Envisioning New Futures of Positive Social Technology: Beyond Paradigms of Fixing, Protecting, and Preventing

    Authors: JaeWon Kim, Lindsay Popowski, Anna Fang, Cassidy Pyle, Guo Freeman, Ryan M. Kelly, Angela Y. Lee, Fannie Liu, Angela D. R. Smith, Alexandra To, Amy X. Zhang

    Abstract: Social technology research today largely focuses on mitigating the negative impacts of technology and, therefore, often misses the potential of technology to enhance human connections and well-being. However, we see a potential to shift towards a holistic view of social technology's impact on human flourishing. We introduce Positive Social Technology (Positech), a framework that shifts emphasis to…

    Submitted 14 October, 2024; v1 submitted 24 July, 2024; originally announced July 2024.

  8. arXiv:2407.15814  [pdf, other]

    cs.CL cs.AI cs.LG

    Perceptions of Linguistic Uncertainty by Language Models and Humans

    Authors: Catarina G Belem, Markelle Kelly, Mark Steyvers, Sameer Singh, Padhraic Smyth

    Abstract: _Uncertainty expressions_ such as "probably" or "highly unlikely" are pervasive in human language. While prior work has established that there is population-level agreement in terms of how humans quantitatively interpret these expressions, there has been little inquiry into the abilities of language models in the same context. In this paper, we investigate how language models map linguistic expres…

    Submitted 7 November, 2024; v1 submitted 22 July, 2024; originally announced July 2024.

    Comments: Accepted at EMNLP 2024 (Main)

  9. arXiv:2406.01076  [pdf, other]

    cs.CV cs.AI cs.LG

    Estimating Canopy Height at Scale

    Authors: Jan Pauls, Max Zimmer, Una M. Kelly, Martin Schwartz, Sassan Saatchi, Philippe Ciais, Sebastian Pokutta, Martin Brandt, Fabian Gieseke

    Abstract: We propose a framework for global-scale canopy height estimation based on satellite data. Our model leverages advanced data preprocessing techniques, resorts to a novel loss function designed to counter geolocation inaccuracies inherent in the ground-truth height measurements, and employs data from the Shuttle Radar Topography Mission to effectively filter out erroneous labels in mountainous regio…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: ICML Camera-Ready, 17 pages, 14 figures, 7 tables

  10. arXiv:2405.16426  [pdf, other]

    cs.CV

    Segmentation of Maya hieroglyphs through fine-tuned foundation models

    Authors: FNU Shivam, Megan Leight, Mary Kate Kelly, Claire Davis, Kelsey Clodfelter, Jacob Thrasher, Yenumula Reddy, Prashnna Gyawali

    Abstract: The study of Maya hieroglyphic writing unlocks the rich history of cultural and societal knowledge embedded within this ancient civilization's visual narrative. Artificial Intelligence (AI) offers a novel lens through which we can translate these inscriptions, with the potential to allow non-specialists access to reading these texts and to aid in the decipherment of those hieroglyphs which continu…

    Submitted 26 May, 2024; originally announced May 2024.

  11. arXiv:2404.08611  [pdf, other]

    cs.CV cs.AI physics.med-ph

    Automatic Quantification of Serial PET/CT Images for Pediatric Hodgkin Lymphoma Patients Using a Longitudinally-Aware Segmentation Network

    Authors: Xin Tie, Muheon Shin, Changhee Lee, Scott B. Perlman, Zachary Huemann, Amy J. Weisman, Sharon M. Castellino, Kara M. Kelly, Kathleen M. McCarten, Adina L. Alazraki, Junjie Hu, Steve Y. Cho, Tyler J. Bradshaw

    Abstract: Purpose: Automatic quantification of longitudinal changes in PET scans for lymphoma patients has proven challenging, as residual disease in interim-therapy scans is often subtle and difficult to detect. Our goal was to develop a longitudinally-aware segmentation network (LAS-Net) that can quantify serial PET/CT images for pediatric Hodgkin lymphoma patients. Materials and Metho…

    Submitted 30 September, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: There are 6 figures and 4 tables in the main text. The supplementary material is appended to the main text

  12. arXiv:2404.06784  [pdf]

    quant-ph cond-mat.mes-hall cs.AR eess.SY

    Statistical evaluation of 571 GaAs quantum point contact transistors showing the 0.7 anomaly in quantized conductance using millikelvin cryogenic on-chip multiplexing

    Authors: Pengcheng Ma, Kaveh Delfanazari, Reuben K. Puddy, Jiahui Li, Moda Cao, Teng Yi, Jonathan P. Griffiths, Harvey E. Beere, David A. Ritchie, Michael J. Kelly, Charles G. Smith

    Abstract: The mass production and the practical number of cryogenic quantum devices producible in a single chip are limited to the number of electrical contact pads and wiring of the cryostat or dilution refrigerator. It is, therefore, beneficial to contrast the measurements of hundreds of devices fabricated in a single chip in one cooldown process to promote the scalability, integrability, reliability, and…

    Submitted 10 April, 2024; originally announced April 2024.

  13. arXiv:2403.18827  [pdf, other]

    cs.AI cs.LG cs.NE q-bio.NC

    Bridging Generative Networks with the Common Model of Cognition

    Authors: Robert L. West, Spencer Eckler, Brendan Conway-Smith, Nico Turcas, Eilene Tomkins-Flanagan, Mary Alexandria Kelly

    Abstract: This article presents a theoretical framework for adapting the Common Model of Cognition to large generative network models within the field of artificial intelligence. This can be accomplished by restructuring modules within the Common Model into shadow production systems that are peripheral to a central production system, which handles higher-level reasoning based on the shadow productions' outp…

    Submitted 25 January, 2024; originally announced March 2024.

  14. The Effects of Generative AI on Design Fixation and Divergent Thinking

    Authors: Samangi Wadinambiarachchi, Ryan M. Kelly, Saumya Pareek, Qiushi Zhou, Eduardo Velloso

    Abstract: Generative AI systems have been heralded as tools for augmenting human creativity and inspiring divergent thinking, though with little empirical evidence for these claims. This paper explores the effects of exposure to AI-generated images on measures of design fixation and divergent thinking in a visual ideation task. Through a between-participants experiment (N=60), we found that support from an…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted at the CHI Conference on Human Factors in Computing Systems (CHI 24), 18 pages, 15 figures

  15. arXiv:2402.01040  [pdf]

    cs.HC

    Everyday Uses of Music Listening and Music Technologies by Caregivers and People with Dementia: Survey and Focus Group Study

    Authors: Dianna Vidas, Romina Carrasco, Ryan M. Kelly, Jenny Waycott, Jeanette Tamplin, Kate McMahon, Libby M. Flynn, Phoebe A. Stretton-Smith, Tanara Vieira Sousa, Felicity A. Baker

    Abstract: Music is a valuable non-pharmacological tool that provides benefits for people with dementia, and there is interest in designing technologies to support music use in dementia care. To ensure music technologies are appropriately designed for supporting caregivers and people living with dementia, there remains a need to better understand how music is currently used in everyday care at home. We aimed…

    Submitted 1 February, 2024; originally announced February 2024.

  16. arXiv:2401.11628  [pdf]

    cs.HC

    Older Adults Imagining Future Technologies in Participatory Design Workshops: Supporting Continuity in the Pursuit of Meaningful Activities

    Authors: Wei Zhao, Ryan M. Kelly, Melissa J. Rogerson, Jenny Waycott

    Abstract: Recent innovations in digital technology offer significant opportunities for older adults to engage in meaningful activities. To investigate older adults' perceptions of using existing and emerging technologies for meaningful activities, we conducted three participatory design workshops and follow-up interviews with adults aged over 65. The workshops encompassed discussions on existing technologie…

    Submitted 23 May, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

  17. arXiv:2310.15177  [pdf, other]

    q-bio.NC cs.AI

    A Neuro-mimetic Realization of the Common Model of Cognition via Hebbian Learning and Free Energy Minimization

    Authors: Alexander Ororbia, Mary Alexandria Kelly

    Abstract: Over the last few years, large neural generative models, capable of synthesizing semantically rich passages of text or producing complex images, have recently emerged as a popular representation of what has come to be known as "generative artificial intelligence" (generative AI). Beyond opening the door to new opportunities as well as challenges for the domain of statistical machine learning, th…

    Submitted 3 November, 2023; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: Additional section on hopfield functionals and CogNGen's full free energy, basal ganglia sub-circuit diagram integrated

  18. arXiv:2310.12369  [pdf, other]

    cs.IR

    On Identifying Points of Semantic Shift Across Domains

    Authors: Hyung Wook Choi, Mat Kelly

    Abstract: The semantics used for particular terms in an academic field organically evolve over time. Tracking this evolution through inspection of published literature has either been from the perspective of Linguistic scholars or has concentrated the focus of term evolution within a single domain of study. In this paper, we performed a case study to identify semantic evolution across different domains and…

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: In 17th International Conference on Metadata and Semantics Research, October 2023

  19. Worst-Case Morphs using Wasserstein ALI and Improved MIPGAN

    Authors: Una M. Kelly, Meike Nauta, Lu Liu, Luuk J. Spreeuwers, Raymond N. J. Veldhuis

    Abstract: A morph is a combination of two separate facial images and contains identity information of two different people. When used in an identity document, both people can be authenticated by a biometric Face Recognition (FR) system. Morphs can be generated using either a landmark-based approach or approaches based on deep learning such as Generative Adversarial Networks (GAN). In a recent paper, we intr…

    Submitted 13 October, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

  20. arXiv:2309.10066  [pdf, other]

    cs.AI cs.CL physics.med-ph

    Automatic Personalized Impression Generation for PET Reports Using Large Language Models

    Authors: Xin Tie, Muheon Shin, Ali Pirasteh, Nevein Ibrahim, Zachary Huemann, Sharon M. Castellino, Kara M. Kelly, John Garrett, Junjie Hu, Steve Y. Cho, Tyler J. Bradshaw

    Abstract: In this study, we aimed to determine if fine-tuned large language models (LLMs) can generate accurate, personalized impressions for whole-body PET reports. Twelve language models were trained on a corpus of PET reports using the teacher-forcing algorithm, with the report findings as input and the clinical impressions as reference. An extra input token encodes the reading physician's identity, allo…

    Submitted 17 October, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 25 pages in total. 6 figures and 3 tables in the main body. The manuscript has been submitted to a journal for potential publication

    Journal ref: J Digit Imaging. Inform. Med. (2024)

  21. arXiv:2305.09064  [pdf, other]

    cs.LG cs.AI cs.HC

    Capturing Humans' Mental Models of AI: An Item Response Theory Approach

    Authors: Markelle Kelly, Aakriti Kumar, Padhraic Smyth, Mark Steyvers

    Abstract: Improving our understanding of how humans perceive AI teammates is an important foundation for our general understanding of human-AI teams. Extending relevant work from cognitive science, we propose a framework based on item response theory for modeling these perceptions. We apply this framework to real-world experiments, in which each participant works alongside another person or an AI agent in a…

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: FAccT 2023

  22. Aggregator Reuse and Extension for Richer Web Archive Interaction

    Authors: Mat Kelly

    Abstract: Memento aggregators enable users to query multiple web archives for captures of a URI in time through a single HTTP endpoint. While this one-to-many access point is useful for researchers and end-users, aggregators are in a position to provide additional functionality to end-users beyond black box style aggregation. This paper identifies the state-of-the-art of Memento aggregation, abstracts its p…

    Submitted 3 October, 2022; originally announced October 2022.

    Comments: 16 pages, preprint accepted to appear in Proceedings of the 24th International Conference on Asia-Pacific Digital Libraries (ICADL 2022)

  23. arXiv:2209.15154  [pdf, other]

    cs.LG

    Variable-Based Calibration for Machine Learning Classifiers

    Authors: Markelle Kelly, Padhraic Smyth

    Abstract: The deployment of machine learning classifiers in high-stakes domains requires well-calibrated confidence scores for model predictions. In this paper we introduce the notion of variable-based calibration to characterize calibration properties of a model with respect to a variable of interest, generalizing traditional score-based metrics such as expected calibration error (ECE). In particular, we f…

    Submitted 5 April, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

  24. VisQuiz: Exploring Feedback Mechanisms to Improve Graphical Perception

    Authors: Ryan Birchfield, Maddison Caten, Errica Cheng, Madyson Kelly, Truman Larson, Hoan Phan Pham, Yiren Ding, Noëlle Rakotondravony, Lane Harrison

    Abstract: Graphical perception studies are a key element of visualization research, forming the basis of design recommendations and contributing to our understanding of how people make sense of visualizations. However, graphical perception studies typically include only brief training sessions, and the impact of longer and more in-depth feedback remains unclear. In this paper, we explore the design and eval…

    Submitted 2 October, 2023; v1 submitted 19 July, 2022; originally announced July 2022.

    Comments: 5 pages, 5 figures, short paper

    Journal ref: Proceedings of IEEE Visualization conference 2023

  25. arXiv:2204.00619  [pdf, other]

    cs.AI cs.LG cs.NE q-bio.NC

    Maze Learning using a Hyperdimensional Predictive Processing Cognitive Architecture

    Authors: Alexander Ororbia, M. Alex Kelly

    Abstract: We present the COGnitive Neural GENerative system (CogNGen), a cognitive architecture that combines two neurobiologically-plausible, computational models: predictive processing and hyperdimensional/vector-symbolic models. We draw inspiration from architectures such as ACT-R and Spaun/Nengo. CogNGen is in broad agreement with these, providing a level of detail between ACT-R's high-level symbolic de…

    Submitted 8 August, 2022; v1 submitted 31 March, 2022; originally announced April 2022.

    Comments: Revisions applied to reflect the version accepted to AGI 2022. Note that this includes the appendix mentioned in the AGI 2022 proceedings publication

  26. arXiv:2202.10194  [pdf]

    physics.chem-ph cs.DC cs.LG physics.comp-ph stat.ML

    Low-Dimensional High-Fidelity Kinetic Models for NOX Formation by a Compute Intensification Method

    Authors: Mark Kelly, Harry Dunne, Gilles Bourque, Stephen Dooley

    Abstract: A novel compute intensification methodology for the construction of low-dimensional, high-fidelity "compact" kinetic models for NOX formation is designed and demonstrated. The method adapts the data intensive Machine Learned Optimization of Chemical Kinetics (MLOCK) algorithm for compact model generation by the use of a Latin Square method for virtual reaction network generation. A set of logical r…

    Submitted 21 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: text overlap with arXiv:2202.08021

  27. arXiv:2202.08021  [pdf]

    physics.chem-ph cs.DC cs.LG physics.comp-ph stat.ML

    Toward Development of Machine Learned Techniques for Production of Compact Kinetic Models

    Authors: Mark Kelly, Mark Fortune, Gilles Bourque, Stephen Dooley

    Abstract: Chemical kinetic models are an essential component in the development and optimisation of combustion devices through their coupling to multi-dimensional simulations such as computational fluid dynamics (CFD). Low-dimensional kinetic models which retain good fidelity to the reality are needed, the production of which requires considerable human-time cost and expert knowledge. Here, we present a nov…

    Submitted 16 February, 2022; originally announced February 2022.

  28. arXiv:2111.15416  [pdf, other]

    cs.CV

    Worst-Case Morphs: a Theoretical and a Practical Approach

    Authors: Una M. Kelly, Raymond Veldhuis, Luuk Spreeuwers

    Abstract: Face Recognition (FR) systems have been shown to be vulnerable to morphing attacks. We examine exactly how challenging morphs can become. By showing a worst-case construction in the embedding space of an FR system and using a mapping from embedding space back to image space we generate images that show that this theoretical upper bound can be approximated if the FR system is known. The resulting m…

    Submitted 19 September, 2022; v1 submitted 30 November, 2021; originally announced November 2021.

  29. arXiv:2111.03910  [pdf]

    cs.DL cs.IR

    FAIR Metadata: A Community-driven Vocabulary Application

    Authors: Christopher B. Rauch, Mat Kelly, John A. Kunze, Jane Greenberg

    Abstract: FAIR metadata is critical to supporting FAIR data overall. Transparency, community engagement, and flexibility are key aspects of FAIR that apply to metadata. This paper presents YAMZ (Yet Another Metadata Zoo), a community-driven vocabulary application that supports FAIR. The history of YAMZ and its original features are reviewed, followed by a presentation of recent innovations and a discussion o…

    Submitted 6 November, 2021; originally announced November 2021.

    ACM Class: H.3.7

  30. arXiv:2109.13915  [pdf]

    cs.DL cs.IT

    Modeling Ephraim Chambers' Knowledge Structure from a Naive Standpoint

    Authors: Scott McClellan, Mat Kelly, Jane Greenberg

    Abstract: In the preface to his Cyclopaedia published in 1728 Ephraim Chambers offers readers a systematized structure of his attempt to produce a universal repository of human knowledge. Divided into an interconnected taxonomic tree and domain vocabulary, this structure forms the basis of one effort from the Metadata Research Center to study historical ontologies. The knowledge structure is being encoded i…

    Submitted 28 September, 2021; originally announced September 2021.

    Comments: NASKO 2021 Conference. 9 pages, 3 figures

    ACM Class: I.7

  31. arXiv:2109.06317  [pdf]

    cs.DL

    Project Pipeline: Preservation, Persistence, and Performance

    Authors: Jane Greenberg, Christopher B. Rauch, Mat Kelly

    Abstract: Preservation pipelines demonstrate extended value when digitized content is also computation ready. Expanding this to historical controlled vocabularies published in analog format requires additional steps if they are to be fully leveraged for research. This paper reports on work addressing this challenge. We report on a pipeline and project progress addressing three key goals: 1) transforming the…

    Submitted 18 September, 2021; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: 5 pages, 2 figures. 17th International Conference on Digital Preservation (iPRES) 2021, Beijing, China

  32. arXiv:2105.07308  [pdf, other]

    cs.AI cs.LG q-bio.NC

    Towards a Predictive Processing Implementation of the Common Model of Cognition

    Authors: Alexander Ororbia, M. A. Kelly

    Abstract: In this article, we present a cognitive architecture that is built from powerful yet simple neural models. Specifically, we describe an implementation of the common model of cognition grounded in neural generative coding and holographic associative memory. The proposed system creates the groundwork for developing agents that learn continually from diverse tasks as well as model human performance a…

    Submitted 18 May, 2021; v1 submitted 15 May, 2021; originally announced May 2021.

    Comments: 6 pages, 2 figures

  33. arXiv:2102.12899  [pdf, other]

    cs.NI

    Mobility for Cellular-Connected UAVs: challenges for the network provider

    Authors: Erika Fonseca, Boris Galkin, Marvin Kelly, Luiz A. DaSilva, Ivana Dusparic

    Abstract: Unmanned Aerial Vehicle (UAV) technology is becoming more prevalent and more diverse in its application. 5G and beyond networks must enable UAV connectivity. This will require the network operator to consider this new type of user in the planning and operation of the network. This work presents the challenges an operator will encounter and should consider in the future as UAVs become users of the…

    Submitted 25 February, 2021; originally announced February 2021.

    Comments: 6 pages, 4 figures

  34. A Computational Approach to Historical Ontologies

    Authors: Mat Kelly, Jane Greenberg, Christopher B. Rauch, Sam Grabus, Joan P. Boone, John A. Kunze, Peter Melville Logan

    Abstract: This paper presents a use case exploring the application of the Archival Resource Key (ARK) persistent identifier for promoting and maintaining ontologies. In particular, we look at improving computation with an in-house ontology server in the context of temporally aligned vocabularies. This effort demonstrates the utility of ARKs in preparing historical ontologies for computational archival scien…

    Submitted 25 November, 2020; originally announced November 2020.

    Comments: 6 pages, 5 figures. To be published in Proceedings of the 2020 IEEE International Conference on Big Data (IEEE Big Data 2020)

    ACM Class: H.3.7

    Journal ref: 2020 IEEE International Conference on Big Data (Big Data), Atlanta, GA, USA, 2020, pp. 1878-1883

  35. arXiv:2011.03236  [pdf, other]

    cs.NI

    Experimental Evaluation of a UAV User QoS from a Two-Tier 3.6GHz Spectrum Network

    Authors: Boris Galkin, Erika Fonseca, Gavin Lee, Conor Duff, Marvin Kelly, Edward Emmanuel, Ivana Dusparic

    Abstract: Unmanned Aerial Vehicle (UAV) technology is becoming increasingly used in a variety of applications such as video surveillance and deliveries. To enable safe and efficient use of UAVs, the devices will need to be connected into cellular networks. Existing research on UAV cellular connectivity shows that UAVs encounter significant issues with existing networks, such as strong interference and anten…

    Submitted 9 April, 2021; v1 submitted 6 November, 2020; originally announced November 2020.

  36. arXiv:2006.02487  [pdf, other]

    cs.DL

    Visualizing Webpage Changes Over Time

    Authors: Abigail Mabe, Dhruv Patel, Maheedhar Gunnam, Surbhi Shankar, Mat Kelly, Sawood Alam, Michael L. Nelson, Michele C. Weigle

    Abstract: We report on the development of TMVis, a web service to provide visualizations of how individual webpages have changed over time. We leverage past research on summarizing collections of webpages with thumbnail-sized screenshots and on choosing a small number of representative past archived webpages from a large collection. We offer four visualizations: image grid, image slider, timeline, and anima…

    Submitted 3 June, 2020; originally announced June 2020.

    Comments: 13 pages

  37. arXiv:1909.08663  [pdf, other]

    cs.CL cs.AI cs.LG

    Do We Need Neural Models to Explain Human Judgments of Acceptability?

    Authors: Wang Jing, M. A. Kelly, David Reitter

    Abstract: Native speakers can judge whether a sentence is an acceptable instance of their language. Acceptability provides a means of evaluating whether computational language models are processing language in a human-like manner. We test the ability of computational language models, simple language features, and word embeddings to predict native English speakers' judgments of acceptability on English-langua…

    Submitted 9 October, 2019; v1 submitted 18 September, 2019; originally announced September 2019.

    Comments: 10 pages (8 pages + 2 pages of references), 1 figure, 7 tables

  38. arXiv:1907.12214  [pdf, other]

    cs.SE

    A Case Study on Automated Fuzz Target Generation for Large Codebases

    Authors: Matthew Kelly, Christoph Treude, Alex Murray

    Abstract: Fuzz Testing is a largely automated testing technique that provides random and unexpected input to a program in an attempt to trigger failure conditions. Much of the research conducted thus far into Fuzz Testing has focused on developing improvements to available Fuzz Testing tools and frameworks in order to improve efficiency. In this paper, however, we instead look at a way in which we can reduce th…

    Submitted 29 July, 2019; originally announced July 2019.

    Comments: to appear as industry track paper at ESEM 2019, the 13th International Symposium on Empirical Software Engineering and Measurement

  39. arXiv:1810.02890  [pdf, other]

    cs.RO

    HG-DAgger: Interactive Imitation Learning with Human Experts

    Authors: Michael Kelly, Chelsea Sidrane, Katherine Driggs-Campbell, Mykel J. Kochenderfer

    Abstract: Imitation learning has proven to be useful for many real-world problems, but approaches such as behavioral cloning suffer from data mismatch and compounding error issues. One attempt to address these limitations is the DAgger algorithm, which uses the state distribution induced by the novice to sample corrective actions from the expert. Such sampling schemes, however, require the expert to provide…

    Submitted 11 March, 2019; v1 submitted 5 October, 2018; originally announced October 2018.

  40. A Framework for Aggregating Private and Public Web Archives

    Authors: Mat Kelly, Michael L. Nelson, Michele C. Weigle

    Abstract: Personal and private Web archives are proliferating due to the increase in the tools to create them and the realization that Internet Archive and other public Web archives are unable to capture personalized (e.g., Facebook) and private (e.g., banking) Web pages. We introduce a framework to mitigate issues of aggregation in private, personal, and public Web archives without compromising potential s…

    Submitted 3 June, 2018; originally announced June 2018.

    Comments: Preprint version of the ACM/IEEE Joint Conference on Digital Libraries (JCDL 2018) full paper, accessible at the DOI

  41. arXiv:1805.11546  [pdf, other]

    cs.CL cs.AI

    Like a Baby: Visually Situated Neural Language Acquisition

    Authors: Alexander G. Ororbia, Ankur Mali, Matthew A. Kelly, David Reitter

    Abstract: We examine the benefits of visual context in training neural language models to perform next-word prediction. A multi-modal neural architecture is introduced that outperforms its equivalent trained on language alone with a 2% decrease in perplexity, even when no visual context is available at test. Fine-tuning the embeddings of a pre-trained state-of-the-art bidirectional language model (BERT) in…

    Submitted 4 June, 2019; v1 submitted 29 May, 2018; originally announced May 2018.

    Comments: Final submission (camera-ready), accepted to ACL 2019

  42. Impact of URI Canonicalization on Memento Count

    Authors: Mat Kelly, Lulwah M. Alkwai, Michael L. Nelson, Michele C. Weigle, Herbert Van de Sompel

    Abstract: Quantifying the captures of a URI over time is useful for researchers to identify the extent to which a Web page has been archived. Memento TimeMaps provide a format to list mementos (URI-Ms) for captures along with brief metadata, like Memento-Datetime, for each URI-M. However, when some URI-Ms are dereferenced, they simply provide a redirect to a different URI-M (instead of a unique representati…

    Submitted 9 March, 2017; originally announced March 2017.

    Comments: 43 pages, 8 figures

  43. On the Change in Archivability of Websites Over Time

    Authors: Mat Kelly, Justin F. Brunelle, Michele C. Weigle, Michael L. Nelson

    Abstract: As web technologies evolve, web archivists work to keep up so that our digital history is preserved. Recent advances in web technologies have introduced client-side executed scripts that load data without a referential identifier or that require user interaction (e.g., content loading when the page has scrolled). These advances have made automating methods for capturing web pages more difficult. B…

    Submitted 30 July, 2013; originally announced July 2013.

    Comments: 12 pages, 8 figures, Theory and Practice of Digital Libraries (TPDL) 2013, Valletta, Malta