
Showing 1–41 of 41 results for author: Ross, B

  1. arXiv:2506.18845  [pdf, ps, other]

    cs.SI

    SocioXplorer: An Interactive Tool for Topic and Network Analysis in Social Data

    Authors: Sandrine Chausson, Youssef Al Hariri, Walid Magdy, Björn Ross

    Abstract: SocioXplorer is a powerful interactive tool that computational social science researchers can use to understand topics and networks in social data from Twitter (X) and YouTube. It integrates, among other things, artificial intelligence, natural language processing and social network analysis. It can be used with "live" datasets that receive regular updates. SocioXplorer is an extension of a previ…

    Submitted 23 June, 2025; originally announced June 2025.

  2. arXiv:2506.10060  [pdf, other]

    cs.LG cs.AI stat.ML

    Textual Bayes: Quantifying Uncertainty in LLM-Based Systems

    Authors: Brendan Leigh Ross, Noël Vouitsis, Atiyeh Ashari Ghomi, Rasa Hosseinzadeh, Ji Xin, Zhaoyan Liu, Yi Sui, Shiyi Hou, Kin Kwan Leung, Gabriel Loaiza-Ganem, Jesse C. Cresswell

    Abstract: Although large language models (LLMs) are becoming increasingly capable of solving challenging real-world tasks, accurately quantifying their uncertainty remains a critical open problem, which limits their applicability in high-stakes domains. This challenge is further compounded by the closed-source, black-box nature of many state-of-the-art LLMs. Moreover, LLM-based systems can be highly sensiti…

    Submitted 11 June, 2025; originally announced June 2025.

  3. arXiv:2506.03916  [pdf, ps, other]

    cs.CL

    Compositional Generalisation for Explainable Hate Speech Detection

    Authors: Agostina Calabrese, Tom Sherborne, Björn Ross, Mirella Lapata

    Abstract: Hate speech detection is key to online content moderation, but current models struggle to generalise beyond their training data. This has been linked to dataset biases and the use of sentence-level labels, which fail to teach models the underlying structure of hate speech. In this work, we show that even when models are trained with more fine-grained, span-level annotations (e.g., "artists" is lab…

    Submitted 4 June, 2025; originally announced June 2025.

  4. arXiv:2504.17200  [pdf, other]

    cs.CL

    A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation

    Authors: Yangxinyu Xie, Bowen Jiang, Tanwi Mallick, Joshua David Bergerson, John K. Hutchison, Duane R. Verner, Jordan Branham, M. Ross Alexander, Robert B. Ross, Yan Feng, Leslie-Anne Levy, Weijie Su, Camillo J. Taylor

    Abstract: Large language models (LLMs) are a transformational capability at the frontier of artificial intelligence and machine learning that can support decision-makers in addressing pressing societal challenges such as extreme natural hazard events. As generalized models, LLMs often struggle to provide context-specific information, particularly in areas requiring specialized knowledge. In this work we pro…

    Submitted 23 April, 2025; originally announced April 2025.

  5. "Till I can get my satisfaction": Open Questions in the Public Desire to Punish AI

    Authors: Eddie L. Ungless, Zachary Horne, Björn Ross

    Abstract: There are countless examples of how AI can cause harm, and increasing evidence that the public are willing to ascribe blame to the AI itself, regardless of how "illogical" this might seem. This raises the question of whether and how the public might expect AI to be punished for this harm. However, public expectations of the punishment of AI have been vastly underexplored. Understanding these expec…

    Submitted 5 March, 2025; originally announced March 2025.

    Comments: Accepted as an Extended Abstract to CHI 2025 'Late Breaking Work'

  6. arXiv:2412.20581  [pdf, other]

    cs.SI

    "The Prophet said so!": On Exploring Hadith Presence on Arabic Social Media

    Authors: Mahmoud Fawzi, Björn Ross, Walid Magdy

    Abstract: Hadith, the recorded words and actions of the prophet Muhammad, is a key source of the instructions and foundations of Islam, alongside the Quran. Interpreting individual hadiths and verifying their authenticity can be difficult, even controversial, and the subject has attracted the attention of many scholars who have established an entire science of Hadith criticism. Recent quantitative studies o…

    Submitted 29 December, 2024; originally announced December 2024.

    Comments: accepted at CSCW 2025 (to appear in ACM Library)

  7. arXiv:2412.16022  [pdf, other]

    cs.CL cs.AI

    The Only Way is Ethics: A Guide to Ethical Research with Large Language Models

    Authors: Eddie L. Ungless, Nikolas Vitsakis, Zeerak Talat, James Garforth, Björn Ross, Arno Onken, Atoosa Kasirzadeh, Alexandra Birch

    Abstract: There is a significant body of work looking at the ethical considerations of large language models (LLMs): critiquing tools to measure performance and harms; proposing toolkits to aid in ideation; discussing the risks to workers; considering legislation around privacy and security etc. As yet there is no work that integrates these resources into a single practical guide that focuses on LLMs; we at…

    Submitted 20 December, 2024; originally announced December 2024.

    Comments: Accepted to COLING '25. This paper is the condensed pocket guide to accompany our full LLM Ethics Whitepaper, available at arXiv:2410.19812, and at https://github.com/MxEddie/Ethics-Whitepaper for suggested revisions

  8. arXiv:2411.08954  [pdf, other]

    cs.LG cs.AI

    Inconsistencies In Consistency Models: Better ODE Solving Does Not Imply Better Samples

    Authors: Noël Vouitsis, Rasa Hosseinzadeh, Brendan Leigh Ross, Valentin Villecroze, Satya Krishna Gorti, Jesse C. Cresswell, Gabriel Loaiza-Ganem

    Abstract: Although diffusion models can generate remarkably high-quality samples, they are intrinsically bottlenecked by their expensive iterative sampling procedure. Consistency models (CMs) have recently emerged as a promising diffusion model distillation method, reducing the cost of sampling by generating high-fidelity samples in just a few iterations. Consistency model distillation aims to solve the pro…

    Submitted 15 November, 2024; v1 submitted 13 November, 2024; originally announced November 2024.

    Comments: NeurIPS 2024 ATTRIB Workshop

  9. arXiv:2411.00113  [pdf, other]

    stat.ML cs.LG

    A Geometric Framework for Understanding Memorization in Generative Models

    Authors: Brendan Leigh Ross, Hamidreza Kamkari, Tongzi Wu, Rasa Hosseinzadeh, Zhaoyan Liu, George Stein, Jesse C. Cresswell, Gabriel Loaiza-Ganem

    Abstract: As deep generative models have progressed, recent work has shown them to be capable of memorizing and reproducing training datapoints when deployed. These findings call into question the usability of generative models, especially in light of the legal and privacy risks brought about by memorization. To better understand this phenomenon, we propose the manifold memorization hypothesis (MMH), a geom…

    Submitted 12 March, 2025; v1 submitted 31 October, 2024; originally announced November 2024.

    Comments: Accepted to ICLR 2025 (Spotlight)

  10. arXiv:2410.21611  [pdf, other]

    physics.ins-det cs.LG hep-ex hep-ph

    CaloChallenge 2022: A Community Challenge for Fast Calorimeter Simulation

    Authors: Claudius Krause, Michele Faucci Giannelli, Gregor Kasieczka, Benjamin Nachman, Dalila Salamani, David Shih, Anna Zaborowska, Oz Amram, Kerstin Borras, Matthew R. Buckley, Erik Buhmann, Thorsten Buss, Renato Paulo Da Costa Cardoso, Anthony L. Caterini, Nadezda Chernyavskaya, Federico A. G. Corchia, Jesse C. Cresswell, Sascha Diefenbacher, Etienne Dreyer, Vijay Ekambaram, Engin Eren, Florian Ernst, Luigi Favaro, Matteo Franchini, Frank Gaede, et al. (44 additional authors not shown)

    Abstract: We present the results of the "Fast Calorimeter Simulation Challenge 2022" - the CaloChallenge. We study state-of-the-art generative models on four calorimeter shower datasets of increasing dimensionality, ranging from a few hundred voxels to a few tens of thousand voxels. The 31 individual submissions span a wide range of current popular generative architectures, including Variational AutoEncoder…

    Submitted 28 October, 2024; originally announced October 2024.

    Comments: 204 pages, 100+ figures, 30+ tables

    Report number: HEPHY-ML-24-05, FERMILAB-PUB-24-0728-CMS, TTK-24-43

  11. arXiv:2410.19812  [pdf]

    cs.CY cs.CL

    Ethics Whitepaper: Whitepaper on Ethical Research into Large Language Models

    Authors: Eddie L. Ungless, Nikolas Vitsakis, Zeerak Talat, James Garforth, Björn Ross, Arno Onken, Atoosa Kasirzadeh, Alexandra Birch

    Abstract: This whitepaper offers an overview of the ethical considerations surrounding research into or with large language models (LLMs). As LLMs become more integrated into widely used applications, their societal impact increases, bringing important ethical questions to the forefront. With a growing body of work examining the ethical development, deployment, and use of LLMs, this whitepaper provides a co…

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 47 pages

    ACM Class: I.2

  12. arXiv:2407.14164  [pdf, other]

    cs.HC

    Experiences of Censorship on TikTok Across Marginalised Identities

    Authors: Eddie L. Ungless, Nina Markl, Björn Ross

    Abstract: TikTok has seen exponential growth as a platform, fuelled by the success of its proprietary recommender algorithm which serves tailored content to every user - though not without controversy. Users complain of their content being unfairly suppressed by "the algorithm", particularly users with marginalised identities such as LGBTQ+ users. Together with content removal, this suppression acts to ce…

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: To appear at ICWSM '25

  13. arXiv:2406.04106  [pdf, other]

    cs.CL

    Explainability and Hate Speech: Structured Explanations Make Social Media Moderators Faster

    Authors: Agostina Calabrese, Leonardo Neves, Neil Shah, Maarten W. Bos, Björn Ross, Mirella Lapata, Francesco Barbieri

    Abstract: Content moderators play a key role in keeping the conversation on social media healthy. While the high volume of content they need to judge represents a bottleneck to the moderation pipeline, no studies have explored how models could support them to make faster decisions. There is, by now, a vast body of research into detecting hate speech, sometimes explicitly motivated by a desire to help improv…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 11 pages, 14 figures, to be published at ACL 2024

  14. arXiv:2406.03537  [pdf, other]

    cs.LG cs.AI stat.ML

    A Geometric View of Data Complexity: Efficient Local Intrinsic Dimension Estimation with Diffusion Models

    Authors: Hamidreza Kamkari, Brendan Leigh Ross, Rasa Hosseinzadeh, Jesse C. Cresswell, Gabriel Loaiza-Ganem

    Abstract: High-dimensional data commonly lies on low-dimensional submanifolds, and estimating the local intrinsic dimension (LID) of a datum -- i.e. the dimension of the submanifold it belongs to -- is a longstanding problem. LID can be understood as the number of local factors of variation: the more factors of variation a datum has, the more complex it tends to be. Estimating this quantity has proven usefu…

    Submitted 24 October, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: NeurIPS 2024 (spotlight)

  15. arXiv:2405.05705  [pdf, other]

    cs.CL

    Detecting Statements in Text: A Domain-Agnostic Few-Shot Solution

    Authors: Sandrine Chausson, Björn Ross

    Abstract: Many tasks related to Computational Social Science and Web Content Analysis involve classifying pieces of text based on the claims they contain. State-of-the-art approaches usually involve fine-tuning models on large annotated datasets, which are costly to produce. In light of this, we propose and release a qualitative and versatile few-shot learning methodology as a common paradigm for any claim-…

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: Paper accepted for publication at NOCAPS workshop at ICWSM 2024 conference

  16. arXiv:2404.02954  [pdf, other]

    cs.LG cs.AI stat.ML

    Deep Generative Models through the Lens of the Manifold Hypothesis: A Survey and New Connections

    Authors: Gabriel Loaiza-Ganem, Brendan Leigh Ross, Rasa Hosseinzadeh, Anthony L. Caterini, Jesse C. Cresswell

    Abstract: In recent years there has been increased interest in understanding the interplay between deep generative models (DGMs) and the manifold hypothesis. Research in this area focuses on understanding the reasons why commonly-used DGMs succeed or fail at learning distributions supported on unknown low-dimensional manifolds, as well as developing new models explicitly designed to account for manifold-sup…

    Submitted 25 September, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: TMLR 2024 (survey certification, expert certification)

  17. arXiv:2403.18910  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    A Geometric Explanation of the Likelihood OOD Detection Paradox

    Authors: Hamidreza Kamkari, Brendan Leigh Ross, Jesse C. Cresswell, Anthony L. Caterini, Rahul G. Krishnan, Gabriel Loaiza-Ganem

    Abstract: Likelihood-based deep generative models (DGMs) commonly exhibit a puzzling behaviour: when trained on a relatively complex dataset, they assign higher likelihood values to out-of-distribution (OOD) data from simpler sources. Adding to the mystery, OOD samples are never generated by these DGMs despite having higher likelihoods. This two-pronged paradox has yet to be conclusively explained, making l…

    Submitted 11 June, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: ICML 2024

  18. Union: An Automatic Workload Manager for Accelerating Network Simulation

    Authors: Xin Wang, Misbah Mubarak, Yao Kang, Robert B. Ross, Zhiling Lan

    Abstract: With the rapid growth of the machine learning applications, the workloads of future HPC systems are anticipated to be a mix of scientific simulation, big data analytics, and machine learning applications. Simulation is a great research vehicle to understand the performance implications of co-running scientific applications with big data and machine learning workloads on large-scale systems. In thi…

    Submitted 3 April, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  19. arXiv:2402.07877  [pdf, other]

    cs.AI

    WildfireGPT: Tailored Large Language Model for Wildfire Analysis

    Authors: Yangxinyu Xie, Bowen Jiang, Tanwi Mallick, Joshua David Bergerson, John K. Hutchison, Duane R. Verner, Jordan Branham, M. Ross Alexander, Robert B. Ross, Yan Feng, Leslie-Anne Levy, Weijie Su, Camillo J. Taylor

    Abstract: Recent advancement of large language models (LLMs) represents a transformational capability at the frontier of artificial intelligence. However, LLMs are generalized models, trained on extensive text corpus, and often struggle to provide context-specific information, particularly in areas requiring specialized knowledge, such as wildfire details within the broader context of climate change. For de…

    Submitted 22 April, 2025; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: restoring content for arXiv:2402.07877v2 which was replaced in error

  20. arXiv:2310.16996  [pdf, other]

    cs.LG cs.DC

    Towards Continually Learning Application Performance Models

    Authors: Ray A. O. Sinurat, Anurag Daram, Haryadi S. Gunawi, Robert B. Ross, Sandeep Madireddy

    Abstract: Machine learning-based performance models are increasingly being used to build critical job scheduling and application optimization decisions. Traditionally, these models assume that data distribution does not change as more samples are collected over time. However, owing to the complexity and heterogeneity of production HPC systems, they are susceptible to hardware degradation, replacement, and/o…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Presented at Workshop on Machine Learning for Systems at 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  21. arXiv:2306.04675  [pdf, other]

    cs.LG cs.CV stat.ML

    Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

    Authors: George Stein, Jesse C. Cresswell, Rasa Hosseinzadeh, Yi Sui, Brendan Leigh Ross, Valentin Villecroze, Zhaoyan Liu, Anthony L. Caterini, J. Eric T. Taylor, Gabriel Loaiza-Ganem

    Abstract: We systematically study a wide variety of generative models spanning semantically-diverse image datasets to understand and improve the feature extractors and metrics used to evaluate them. Using best practices in psychophysics, we measure human perception of image realism for generated samples by conducting the largest experiment evaluating generative models to date, and find that no existing metr…

    Submitted 30 October, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023. 53 pages, 29 figures, 12 tables. Code at https://github.com/layer6ai-labs/dgm-eval, reviews at https://openreview.net/forum?id=08zf7kTOoh

    Journal ref: Thirty-seventh Conference on Neural Information Processing Systems (2023)

  22. arXiv:2305.17072  [pdf, other]

    cs.CL cs.CY

    Stereotypes and Smut: The (Mis)representation of Non-cisgender Identities by Text-to-Image Models

    Authors: Eddie L. Ungless, Björn Ross, Anne Lauscher

    Abstract: Cutting-edge image generation has been praised for producing high-quality images, suggesting a ubiquitous future in a variety of applications. However, initial studies have pointed to the potential for harm due to predictive bias, reflecting and potentially reinforcing cultural stereotypes. In this work, we are the first to investigate how multimodal models handle diverse gender identities. Concre…

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL Findings 2023

  23. arXiv:2305.12709  [pdf, other]

    cs.CL

    Cross-lingual Transfer Can Worsen Bias in Sentiment Analysis

    Authors: Seraphina Goldfarb-Tarrant, Björn Ross, Adam Lopez

    Abstract: Sentiment analysis (SA) systems are widely deployed in many of the world's languages, and there is well-documented evidence of demographic bias in these systems. In languages beyond English, scarcer training data is often supplemented with transfer learning using pre-trained models, including multilingual models trained on other languages. In some cases, even supervision data comes from other lang…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 8 pages, preprint

  24. arXiv:2303.18149  [pdf]

    cs.CL

    Can AI Chatbots Pass the Fundamentals of Engineering (FE) and Principles and Practice of Engineering (PE) Structural Exams?

    Authors: M. Z. Naser, Brandon Ross, Jennier Ogle, Venkatesh Kodur, Rami Hawileh, Jamal Abdalla, Huu-Tai Thai

    Abstract: The engineering community has recently witnessed the emergence of chatbot technology with the release of OpenAI ChatGPT-4 and Google Bard. While these chatbots have been reported to perform well and even pass various standardized tests, including medical and law exams, this forum paper explores whether these chatbots can also pass the Fundamentals of Engineering (FE) and Principles and Practice of…

    Submitted 2 April, 2023; v1 submitted 31 March, 2023; originally announced March 2023.

  25. arXiv:2212.01265  [pdf, other]

    cs.LG cs.AI

    Denoising Deep Generative Models

    Authors: Gabriel Loaiza-Ganem, Brendan Leigh Ross, Luhuan Wu, John P. Cunningham, Jesse C. Cresswell, Anthony L. Caterini

    Abstract: Likelihood-based deep generative models have recently been shown to exhibit pathological behaviour under the manifold hypothesis as a consequence of using high-dimensional densities to model data with low-dimensional structure. In this paper we propose two methodologies aimed at addressing this problem. Both are based on adding Gaussian noise to the data to remove the dimensionality mismatch durin…

    Submitted 4 January, 2023; v1 submitted 30 November, 2022; originally announced December 2022.

    Comments: NeurIPS 2022 ICBINB workshop (spotlight)

  26. arXiv:2211.15380  [pdf, other]

    hep-ph cs.LG hep-ex physics.data-an physics.ins-det

    CaloMan: Fast generation of calorimeter showers with density estimation on learned manifolds

    Authors: Jesse C. Cresswell, Brendan Leigh Ross, Gabriel Loaiza-Ganem, Humberto Reyes-Gonzalez, Marco Letizia, Anthony L. Caterini

    Abstract: Precision measurements and new physics searches at the Large Hadron Collider require efficient simulations of particle propagation and interactions within the detectors. The most computationally expensive simulations involve calorimeter showers. Advances in deep generative modelling - particularly in the realm of high-dimensional data - have opened the possibility of generating realistic calorimet…

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: Accepted to the Machine Learning and the Physical Sciences Workshop at NeurIPS 2022

  27. arXiv:2210.14552  [pdf, other]

    cs.CL

    A Robust Bias Mitigation Procedure Based on the Stereotype Content Model

    Authors: Eddie L. Ungless, Amy Rafferty, Hrichika Nag, Björn Ross

    Abstract: The Stereotype Content model (SCM) states that we tend to perceive minority groups as cold, incompetent or both. In this paper we adapt existing work to demonstrate that the Stereotype Content model holds for contextualised word embeddings, then use these results to evaluate a fine-tuning process designed to drive a language model away from stereotyped portrayals of minority groups. We find the SC…

    Submitted 26 October, 2022; originally announced October 2022.

  28. arXiv:2210.06597  [pdf, other]

    cs.LG

    Find Your Friends: Personalized Federated Learning with the Right Collaborators

    Authors: Yi Sui, Junfeng Wen, Yenson Lau, Brendan Leigh Ross, Jesse C. Cresswell

    Abstract: In the traditional federated learning setting, a central server coordinates a network of clients to train one global model. However, the global model may serve many clients poorly due to data heterogeneity. Moreover, there may not exist a trusted central party that can coordinate the clients to ensure that each of them can benefit from others. To address these concerns, we present a novel decentra…

    Submitted 14 October, 2022; v1 submitted 12 October, 2022; originally announced October 2022.

  29. arXiv:2210.02659  [pdf, other]

    cs.CL

    Explainable Abuse Detection as Intent Classification and Slot Filling

    Authors: Agostina Calabrese, Björn Ross, Mirella Lapata

    Abstract: To proactively offer social media users a safe online experience, there is a need for systems that can detect harmful posts and promptly alert platform moderators. In order to guarantee the enforcement of a consistent policy, moderators are provided with detailed guidelines. In contrast, most state-of-the-art models learn what abuse is from labelled examples and as a result base their predictions…

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: 14 pages, 2 figures, to be published in TACL (pre-MIT Press publication version)

    ACM Class: I.2.7

  30. arXiv:2207.02862  [pdf, other]

    stat.ML cs.AI cs.LG

    Verifying the Union of Manifolds Hypothesis for Image Data

    Authors: Bradley C. A. Brown, Anthony L. Caterini, Brendan Leigh Ross, Jesse C. Cresswell, Gabriel Loaiza-Ganem

    Abstract: Deep learning has had tremendous success at learning low-dimensional representations of high-dimensional data. This success would be impossible if there was no hidden low-dimensional structure in data of interest; this existence is posited by the manifold hypothesis, which states that the data lies on an unknown manifold of low intrinsic dimension. In this paper, we argue that this hypothesis does…

    Submitted 2 March, 2023; v1 submitted 6 July, 2022; originally announced July 2022.

    Comments: ICLR 2023

  31. arXiv:2206.11267  [pdf, other]

    stat.ML cs.LG

    Neural Implicit Manifold Learning for Topology-Aware Density Estimation

    Authors: Brendan Leigh Ross, Gabriel Loaiza-Ganem, Anthony L. Caterini, Jesse C. Cresswell

    Abstract: Natural data observed in $\mathbb{R}^n$ is often constrained to an $m$-dimensional manifold $\mathcal{M}$, where $m < n$. This work focuses on the task of building theoretically principled generative models for such data. Current generative models learn $\mathcal{M}$ by mapping an $m$-dimensional latent variable through a neural network $f_θ: \mathbb{R}^m \to \mathbb{R}^n$. These procedures, which…

    Submitted 21 December, 2023; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: Accepted to TMLR in 2023. Code: https://github.com/layer6ai-labs/implicit-manifolds

  32. arXiv:2204.08180  [pdf, other]

    cs.DC cs.PF

    A Taxonomy of Error Sources in HPC I/O Machine Learning Models

    Authors: Mihailo Isakov, Mikaela Currier, Eliakin del Rosario, Sandeep Madireddy, Prasanna Balaprakash, Philip Carns, Robert B. Ross, Glenn K. Lockwood, Michel A. Kinsy

    Abstract: I/O efficiency is crucial to productivity in scientific computing, but the increasing complexity of the system and the applications makes it difficult for practitioners to understand and optimize I/O behavior at scale. Data-driven machine learning-based I/O throughput models offer a solution: they can be used to identify bottlenecks, automate I/O tuning, or optimize job scheduling with minimal hum…

    Submitted 18 April, 2022; originally announced April 2022.

    Report number: STAM01

  33. arXiv:2204.07172  [pdf, other]

    stat.ML cs.AI cs.LG stat.ME

    Diagnosing and Fixing Manifold Overfitting in Deep Generative Models

    Authors: Gabriel Loaiza-Ganem, Brendan Leigh Ross, Jesse C. Cresswell, Anthony L. Caterini

    Abstract: Likelihood-based, or explicit, deep generative models use neural networks to construct flexible high-dimensional densities. This formulation directly contradicts the manifold hypothesis, which states that observed data lies on a low-dimensional manifold embedded in high-dimensional ambient space. In this paper we investigate the pathologies of maximum-likelihood training in the presence of this di…

    Submitted 28 November, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted for publication in TMLR

  34. arXiv:2202.07986  [pdf]

    cs.SI

    Towards a Better Understanding of Online Influence: Differences in Twitter Communication Between Companies and Influencers

    Authors: Diana C. Hernandez-Bocanegra, Angela Borchert, Felix Brünker, Gautam Kishore Shahi, Björn Ross

    Abstract: In the last decade, Social Media platforms such as Twitter have gained importance in the various marketing strategies of companies. This work aims to examine the presence of influential content on a textual level, by investigating characteristics of tweets in the context of social impact theory, and its dimension immediacy. To this end, we analysed influential Twitter communication data during Bla…

    Submitted 16 February, 2022; originally announced February 2022.

    Comments: Australasian Conference on Information Systems, 2020, Wellington

    MSC Class: 91D30 ACM Class: H.0

  35. arXiv:2106.05275  [pdf, other]

    stat.ML cs.LG

    Tractable Density Estimation on Learned Manifolds with Conformal Embedding Flows

    Authors: Brendan Leigh Ross, Jesse C. Cresswell

    Abstract: Normalizing flows are generative models that provide tractable density estimation via an invertible transformation from a simple base distribution to a complex target distribution. However, this technique cannot directly model data supported on an unknown low-dimensional manifold, a common occurrence in real-world domains such as image data. Recent attempts to remedy this limitation have introduce…

    Submitted 11 November, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 Camera-Ready. Code: https://github.com/layer6ai-labs/CEF

  36. arXiv:2001.09399  [pdf, other]

    cs.DC cs.HC cs.LG cs.PF

    A Visual Analytics Framework for Reviewing Streaming Performance Data

    Authors: Suraj P. Kesavan, Takanori Fujiwara, Jianping Kelvin Li, Caitlin Ross, Misbah Mubarak, Christopher D. Carothers, Robert B. Ross, Kwan-Liu Ma

    Abstract: Understanding and tuning the performance of extreme-scale parallel computing systems demands a streaming approach due to the computational cost of applying offline algorithms to vast amounts of performance log data. Analyzing large streaming data is challenging because the rate of receiving data and limited time to comprehend data make it difficult for the analysts to sufficiently examine the data…

    Submitted 25 January, 2020; originally announced January 2020.

    Comments: This is the author's preprint version that will be published in Proceedings of IEEE Pacific Visualization Symposium, 2020

  37. arXiv:1912.01077  [pdf]

    cs.AI cs.CY eess.SY

    Towards Successful Collaboration: Design Guidelines for AI-based Services enriching Information Systems in Organisations

    Authors: Nicholas R. J. Frick, Felix Brünker, Björn Ross, Stefan Stieglitz

    Abstract: Information systems (IS) are widely used in organisations to improve business performance. The steady progression in improving technologies like artificial intelligence (AI) and the need of securing future success of organisations lead to new requirements for IS. This research in progress firstly introduces the term AI-based services (AIBS) describing AI as a component enriching IS aiming at colla…

    Submitted 2 December, 2019; originally announced December 2019.

    Comments: Proceedings of the 30th Australasian Conference on Information Systems (ACIS), Fremantle, Australia

  38. arXiv:1710.04044  [pdf]

    cs.HC cs.CY cs.SI

    Do Social Bots Dream of Electric Sheep? A Categorisation of Social Media Bot Accounts

    Authors: Stefan Stieglitz, Florian Brachten, Björn Ross, Anna-Katharina Jung

    Abstract: So-called 'social bots' have garnered a lot of attention lately. Previous research showed that they attempted to influence political events such as the Brexit referendum and the US presidential elections. It remains, however, somewhat unclear what exactly can be understood by the term 'social bot'. This paper addresses the need to better understand the intentions of bots on social media and to dev…

    Submitted 11 October, 2017; originally announced October 2017.

    Comments: Accepted for publication in the Proceedings of the Australasian Conference on Information Systems, 2017

  39. Measuring the Reliability of Hate Speech Annotations: The Case of the European Refugee Crisis

    Authors: Björn Ross, Michael Rist, Guillermo Carbonell, Benjamin Cabrera, Nils Kurowsky, Michael Wojatzki

    Abstract: Some users of social media are spreading racist, sexist, and otherwise hateful content. For the purpose of training a hate speech detection system, the reliability of the annotations is crucial, but there is no universally agreed-upon definition. We collected potentially hateful messages and asked two groups of internet users to determine whether they were hate speech or not, whether they should b…

    Submitted 27 January, 2017; originally announced January 2017.

    Journal ref: Proceedings of NLP4CMC III: 3rd Workshop on Natural Language Processing for Computer-Mediated Communication (Bochum), Bochumer Linguistische Arbeitsberichte, vol. 17, sep 2016, pp. 6-9

  40. arXiv:1510.02135  [pdf, other]

    cs.DC

    A Remote Procedure Call Approach for Extreme-scale Services

    Authors: Jerome Soumagne, Philip H. Carns, Dries Kimpe, Quincey Koziol, Robert B. Ross

    Abstract: When working at exascale, the various constraints imposed by the extreme scale of the system bring new challenges for application users and software/middleware developers. In that context, and to provide best performance, resiliency and energy efficiency, software may be provided as a service oriented approach, adjusting resource utilization to best meet facility and user requirements. Remote proc…

    Submitted 5 October, 2015; originally announced October 2015.

    Comments: CSESSP 2015

  41. arXiv:1509.05492  [pdf, other]

    cs.DC

    Challenges and Considerations for Utilizing Burst Buffers in High-Performance Computing

    Authors: Melissa Romanus, Robert B. Ross, Manish Parashar

    Abstract: As high-performance computing (HPC) moves into the exascale era, computer scientists and engineers must find innovative ways of transferring and processing unprecedented amounts of data. As the scale and complexity of the applications running on these machines increases, the cost of their interactions and data exchanges (in terms of latency, energy, runtime, etc.) can increase exponentially. In or…

    Submitted 29 September, 2015; v1 submitted 17 September, 2015; originally announced September 2015.

    Comments: 18 pages, 2 figures

    ACM Class: B.4.3; D.4.2; C.1.4