Skip to main content

Showing 1–41 of 41 results for author: Keller, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.03307  [pdf, ps, other

    cs.LG

    Budgeted Online Active Learning with Expert Advice and Episodic Priors

    Authors: Kristen Goebel, William Solow, Paola Pesantez-Cabrera, Markus Keller, Alan Fern

    Abstract: This paper introduces a novel approach to budgeted online active learning from finite-horizon data streams with extremely limited labeling budgets. In agricultural applications, such streams might include daily weather data over a growing season, and labels require costly measurements of weather-dependent plant characteristics. Our method integrates two key sources of prior information: a collecti… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  2. arXiv:2504.13142  [pdf, ps, other

    cs.LG

    Transfer Learning via Auxiliary Labels with Application to Cold-Hardiness Prediction

    Authors: Kristen Goebel, Paola Pesantez-Cabrera, Markus Keller, Alan Fern

    Abstract: Cold temperatures can cause significant frost damage to fruit crops depending on their resilience, or cold hardiness, which changes throughout the dormancy season. This has led to the development of predictive cold-hardiness models, which help farmers decide when to deploy expensive frost-mitigation measures. Unfortunately, cold-hardiness data for model training is only available for some fruit cu… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

  3. arXiv:2503.08836  [pdf, other

    cs.HC

    A Critical Analysis of the Usage of Dimensionality Reduction in Four Domains

    Authors: Dylan Cashman, Mark Keller, Hyeon Jeon, Bum Chul Kwon, Qianwen Wang

    Abstract: Dimensionality reduction is used as an important tool for unraveling the complexities of high-dimensional datasets in many fields of science, such as cell biology, chemical informatics, and physics. Visualizations of the dimensionally reduced data enable scientists to delve into the intrinsic structures of their datasets and align them with established hypotheses. Visualization researchers have th… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: In submission to TVCG. Currently under minor revision

  4. arXiv:2502.00265  [pdf

    cs.DB

    RADx Data Hub: A Cloud Platform for FAIR, Harmonized COVID-19 Data

    Authors: Marcos Martinez-Romero, Matthew Horridge, Nilesh Mistry, Aubrie Weyhmiller, Jimmy K. Yu, Alissa Fujimoto, Aria Henry, Martin J. O'Connor, Ashley Sier, Stephanie Suber, Mete U. Akdogan, Yan Cao, Somu Valliappan, Joanna O. Mieczkowska, the RADx Data Hub team, Ashok Krishnamurthy, Michael A. Keller, Mark A. Musen

    Abstract: The COVID-19 pandemic highlighted the urgent need for robust systems to enable rapid data collection, integration, and analysis for public health responses. Existing approaches often relied on disparate, non-interoperable systems, creating bottlenecks in comprehensive analyses and timely decision-making. To address these challenges, the U.S. National Institutes of Health (NIH) launched the Rapid A… ▽ More

    Submitted 15 February, 2025; v1 submitted 31 January, 2025; originally announced February 2025.

  5. arXiv:2501.10600  [pdf, other

    cs.CV

    High Resolution Tree Height Mapping of the Amazon Forest using Planet NICFI Images and LiDAR-Informed U-Net Model

    Authors: Fabien H Wagner, Ricardo Dalagnol, Griffin Carter, Mayumi CM Hirye, Shivraj Gill, Le Bienfaiteur Sagang Takougoum, Samuel Favrichon, Michael Keller, Jean PHB Ometto, Lorena Alves, Cynthia Creze, Stephanie P George-Chacon, Shuang Li, Zhihua Liu, Adugna Mullissa, Yan Yang, Erone G Santos, Sarah R Worden, Martin Brandt, Philippe Ciais, Stephen C Hagen, Sassan Saatchi

    Abstract: Tree canopy height is one of the most important indicators of forest biomass, productivity, and ecosystem structure, but it is challenging to measure accurately from the ground and from space. Here, we used a U-Net model adapted for regression to map the mean tree canopy height in the Amazon forest from Planet NICFI images at ~4.78 m spatial resolution for the period 2020-2024. The U-Net model was… ▽ More

    Submitted 17 January, 2025; originally announced January 2025.

    Comments: will be submitted to the journal Remote Sensing of Environment in February 2025

    MSC Class: 92-08 ACM Class: I.4.8

  6. arXiv:2501.04630  [pdf, other

    cs.IR cs.SD eess.AS

    Evaluating Interval-based Tokenization for Pitch Representation in Symbolic Music Analysis

    Authors: Dinh-Viet-Toan Le, Louis Bigo, Mikaela Keller

    Abstract: Symbolic music analysis tasks are often performed by models originally developed for Natural Language Processing, such as Transformers. Such models require the input data to be represented as sequences, which is achieved through a process of tokenization. Tokenization strategies for symbolic music often rely on absolute MIDI values to represent pitch information. However, music research largely pr… ▽ More

    Submitted 8 January, 2025; originally announced January 2025.

    Comments: Accepted at Artificial Intelligence for Music Workshop at AAAI 2025 (https://ai4musicians.org/2025aaai.html)

  7. arXiv:2501.01256  [pdf, other

    cs.CL cs.LG

    Digital Guardians: Can GPT-4, Perspective API, and Moderation API reliably detect hate speech in reader comments of German online newspapers?

    Authors: Manuel Weber, Moritz Huber, Maximilian Auch, Alexander Döschl, Max-Emanuel Keller, Peter Mandl

    Abstract: In recent years, toxic content and hate speech have become widespread phenomena on the internet. Moderators of online newspapers and forums are now required, partly due to legal regulations, to carefully review and, if necessary, delete reader comments. This is a labor-intensive process. Some providers of large language models already offer solutions for automated hate speech detection or the iden… ▽ More

    Submitted 2 January, 2025; originally announced January 2025.

    ACM Class: I.2.7

  8. arXiv:2412.12954  [pdf, other

    cs.CL

    Recipient Profiling: Predicting Characteristics from Messages

    Authors: Martin Borquez, Mikaela Keller, Michael Perrot, Damien Sileo

    Abstract: It has been shown in the field of Author Profiling that texts may inadvertently reveal sensitive information about their authors, such as gender or age. This raises important privacy concerns that have been extensively addressed in the literature, in particular with the development of methods to hide such information. We argue that, when these texts are in fact messages exchanged between individua… ▽ More

    Submitted 17 December, 2024; originally announced December 2024.

    MSC Class: 68T50; 68P20; 94A60 ACM Class: I.2.7; K.4.1; H.3.3

  9. arXiv:2410.14625  [pdf

    cs.IR cs.LG

    Enhancing AI Accessibility in Veterinary Medicine: Linking Classifiers and Electronic Health Records

    Authors: Chun Yin Kong, Picasso Vasquez, Makan Farhoodimoghadam, Chris Brandt, Titus C. Brown, Krystle L. Reagan, Allison Zwingenberger, Stefan M. Keller

    Abstract: In the rapidly evolving landscape of veterinary healthcare, integrating machine learning (ML) clinical decision-making tools with electronic health records (EHRs) promises to improve diagnostic accuracy and patient care. However, the seamless integration of ML classifiers into existing EHRs in veterinary medicine is frequently hindered by the rigidity of EHR systems or the limited availability of… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

  10. arXiv:2410.01448  [pdf, other

    cs.IR cs.CL cs.SD eess.AS

    Analyzing Byte-Pair Encoding on Monophonic and Polyphonic Symbolic Music: A Focus on Musical Phrase Segmentation

    Authors: Dinh-Viet-Toan Le, Louis Bigo, Mikaela Keller

    Abstract: Byte-Pair Encoding (BPE) is an algorithm commonly used in Natural Language Processing to build a vocabulary of subwords, which has been recently applied to symbolic music. Given that symbolic music can differ significantly from text, particularly with polyphony, we investigate how BPE behaves with different types of musical content. This study provides a qualitative analysis of BPE's behavior acro… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

    Comments: Accepted to 3rd Workshop on NLP for Music and Audio (NLP4MusA, co-located with ISMIR 2024)

  11. Classification performance and reproducibility of GPT-4 omni for information extraction from veterinary electronic health records

    Authors: Judit M Wulcan, Kevin L Jacques, Mary Ann Lee, Samantha L Kovacs, Nicole Dausend, Lauren E Prince, Jonatan Wulcan, Sina Marsilio, Stefan M Keller

    Abstract: Large language models (LLMs) can extract information from veterinary electronic health records (EHRs), but performance differences between models, the effect of temperature settings, and the influence of text ambiguity have not been previously evaluated. This study addresses these gaps by comparing the performance of GPT-4 omni (GPT-4o) and GPT-3.5 Turbo under different conditions and investigatin… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

    Comments: 24 pages, 3 figures, 8 supplementary figures

    Journal ref: Frontiers in Veterinary Science, Vol. 11, 2025

  12. To Word Senses and Beyond: Inducing Concepts with Contextualized Language Models

    Authors: Bastien Liétard, Pascal Denis, Mikaella Keller

    Abstract: Polysemy and synonymy are two crucial interrelated facets of lexical ambiguity. While both phenomena are widely documented in lexical resources and have been studied extensively in NLP, leading to dedicated systems, they are often being considered independently in practical problems. While many tasks dealing with polysemy (e.g. Word Sense Disambiguiation or Induction) highlight the role of word's… ▽ More

    Submitted 19 December, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

    Comments: Published in EMNLP 2024 main conference proceedings

    Journal ref: In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 2684-2696 (2024)

  13. arXiv:2405.14521  [pdf, other

    cs.LG cs.CL

    Synthetic Data Generation for Intersectional Fairness by Leveraging Hierarchical Group Structure

    Authors: Gaurav Maheshwari, Aurélien Bellet, Pascal Denis, Mikaela Keller

    Abstract: In this paper, we introduce a data augmentation approach specifically tailored to enhance intersectional fairness in classification tasks. Our method capitalizes on the hierarchical structure inherent to intersectionality, by viewing groups as intersections of their parent categories. This perspective allows us to augment data for smaller groups by learning a transformation function that combines… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  14. arXiv:2405.07814  [pdf, other

    cs.CV

    NutritionVerse-Direct: Exploring Deep Neural Networks for Multitask Nutrition Prediction from Food Images

    Authors: Matthew Keller, Chi-en Amy Tai, Yuhao Chen, Pengcheng Xi, Alexander Wong

    Abstract: Many aging individuals encounter challenges in effectively tracking their dietary intake, exacerbating their susceptibility to nutrition-related health complications. Self-reporting methods are often inaccurate and suffer from substantial bias; however, leveraging intelligent prediction methods can automate and enhance precision in this process. Recent work has explored using computer vision predi… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  15. arXiv:2402.17467  [pdf, other

    cs.IR cs.AI cs.SD eess.AS

    Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a Survey

    Authors: Dinh-Viet-Toan Le, Louis Bigo, Mikaela Keller, Dorien Herremans

    Abstract: Several adaptations of Transformers models have been developed in various domains since its breakthrough in Natural Language Processing (NLP). This trend has spread into the field of Music Information Retrieval (MIR), including studies processing music data. However, the practice of leveraging NLP tools for symbolic music data is not novel in MIR. Music has been frequently compared to language, as… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 36 pages, 5 figures, 4 tables

    Journal ref: ACM Computing Surveys 2025, Volume 57, Issue 7

  16. arXiv:2401.08598  [pdf, other

    cs.CV

    NutritionVerse-Real: An Open Access Manually Collected 2D Food Scene Dataset for Dietary Intake Estimation

    Authors: Chi-en Amy Tai, Saeejith Nair, Olivia Markham, Matthew Keller, Yifan Wu, Yuhao Chen, Alexander Wong

    Abstract: Dietary intake estimation plays a crucial role in understanding the nutritional habits of individuals and populations, aiding in the prevention and management of diet-related health issues. Accurate estimation requires comprehensive datasets of food scenes, including images, segmentation masks, and accompanying dietary intake metadata. In this paper, we introduce NutritionVerse-Real, an open acces… ▽ More

    Submitted 20 November, 2023; originally announced January 2024.

  17. Hierarchical Multigrid Ansatz for Variational Quantum Algorithms

    Authors: Christo Meriwether Keller, Stephan Eidenbenz, Andreas Bärtschi, Daniel O'Malley, John Golden, Satyajayant Misra

    Abstract: Quantum computing is an emerging topic in engineering that promises to enhance supercomputing using fundamental physics. In the near term, the best candidate algorithms for achieving this advantage are variational quantum algorithms (VQAs). We design and numerically evaluate a novel ansatz for VQAs, focusing in particular on the variational quantum eigensolver (VQE). As our ansatz is inspired by c… ▽ More

    Submitted 16 July, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: 11 pages, 9 figures

    Report number: LA-UR-23-33674

    Journal ref: ISC High Performance 2024 Research Paper Proceedings (39th International Conference), 2024

  18. arXiv:2309.07704  [pdf, other

    cs.CV cs.AI

    NutritionVerse: Empirical Study of Various Dietary Intake Estimation Approaches

    Authors: Chi-en Amy Tai, Matthew Keller, Saeejith Nair, Yuhao Chen, Yifan Wu, Olivia Markham, Krish Parmar, Pengcheng Xi, Heather Keller, Sharon Kirkpatrick, Alexander Wong

    Abstract: Accurate dietary intake estimation is critical for informing policies and programs to support healthy eating, as malnutrition has been directly linked to decreased quality of life. However self-reporting methods such as food diaries suffer from substantial bias. Other conventional dietary assessment techniques and emerging alternative approaches such as mobile applications incur high time costs an… ▽ More

    Submitted 1 September, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: Corrections made to Tables 6, 7, and 8, and corrections made to Experiments Part C. Additional clarification made in Section 4

  19. arXiv:2306.03367  [pdf, other

    cs.RO cs.AI

    Bridging the Gap Between Multi-Step and One-Shot Trajectory Prediction via Self-Supervision

    Authors: Faris Janjoš, Max Keller, Maxim Dolgov, J. Marius Zöllner

    Abstract: Accurate vehicle trajectory prediction is an unsolved problem in autonomous driving with various open research questions. State-of-the-art approaches regress trajectories either in a one-shot or step-wise manner. Although one-shot approaches are usually preferred for their simplicity, they relinquish powerful self-supervision schemes that can be constructed by chaining multiple time-steps. We addr… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 8 pages, 6 figures, to be published in 34th IEEE Intelligent Vehicles Symposium (IV)

    ACM Class: I.1.2

  20. arXiv:2305.19143  [pdf, other

    cs.CL

    A Tale of Two Laws of Semantic Change: Predicting Synonym Changes with Distributional Semantic Models

    Authors: Bastien Liétard, Mikaela Keller, Pascal Denis

    Abstract: Lexical Semantic Change is the study of how the meaning of words evolves through time. Another related question is whether and how lexical relations over pairs of words, such as synonymy, change over time. There are currently two competing, apparently opposite hypotheses in the historical linguistic literature regarding how synonymous words evolve: the Law of Differentiation (LD) argues that synon… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: Accepted at The 12th Joint Conference on Lexical and Computational Semantics (*SEM 2023)

  21. arXiv:2305.12495  [pdf, other

    cs.LG cs.CL cs.CY

    Fair Without Leveling Down: A New Intersectional Fairness Definition

    Authors: Gaurav Maheshwari, Aurélien Bellet, Pascal Denis, Mikaela Keller

    Abstract: In this work, we consider the problem of intersectional group fairness in the classification setting, where the objective is to learn discrimination-free models in the presence of several intersecting sensitive groups. First, we illustrate various shortcomings of existing fairness measures commonly used to capture intersectional fairness. Then, we propose a new definition called the $α$-Intersecti… ▽ More

    Submitted 7 November, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: The paper has been accepted at: The 2023 Conference on Empirical Methods in Natural Language Processing

  22. arXiv:2304.05619  [pdf, other

    cs.CV

    NutritionVerse-3D: A 3D Food Model Dataset for Nutritional Intake Estimation

    Authors: Chi-en Amy Tai, Matthew Keller, Mattie Kerrigan, Yuhao Chen, Saeejith Nair, Pengcheng Xi, Alexander Wong

    Abstract: 77% of adults over 50 want to age in place today, presenting a major challenge to ensuring adequate nutritional intake. It has been reported that one in four older adults that are 65 years or older are malnourished and given the direct link between malnutrition and decreased quality of life, there have been numerous studies conducted on how to efficiently track nutritional intake of food. Recent a… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

  23. arXiv:2301.01815  [pdf, other

    cs.LG

    Multi-Task Learning for Budbreak Prediction

    Authors: Aseem Saxena, Paola Pesantez-Cabrera, Rohan Ballapragada, Markus Keller, Alan Fern

    Abstract: Grapevine budbreak is a key phenological stage of seasonal development, which serves as a signal for the onset of active growth. This is also when grape plants are most vulnerable to damage from freezing temperatures. Hence, it is important for winegrowers to anticipate the day of budbreak occurrence to protect their vineyards from late spring frost events. This work investigates deep learning for… ▽ More

    Submitted 4 January, 2023; originally announced January 2023.

    Comments: Accepted at AIFS Workshop AAAI 2023. arXiv admin note: text overlap with arXiv:2209.10585

  24. arXiv:2209.10585  [pdf, other

    cs.LG

    Grape Cold Hardiness Prediction via Multi-Task Learning

    Authors: Aseem Saxena, Paola Pesantez-Cabrera, Rohan Ballapragada, Kin-Ho Lam, Markus Keller, Alan Fern

    Abstract: Cold temperatures during fall and spring have the potential to cause frost damage to grapevines and other fruit plants, which can significantly decrease harvest yields. To help prevent these losses, farmers deploy expensive frost mitigation measures such as sprinklers, heaters, and wind machines when they judge that damage may occur. This judgment, however, is challenging because the cold hardines… ▽ More

    Submitted 4 January, 2023; v1 submitted 21 September, 2022; originally announced September 2022.

    Comments: 6 pages, 2 figures, accepted at IAAI-23

  25. arXiv:2205.06135  [pdf, other

    cs.CL cs.LG

    Fair NLP Models with Differentially Private Text Encoders

    Authors: Gaurav Maheshwari, Pascal Denis, Mikaela Keller, Aurélien Bellet

    Abstract: Encoded text representations often capture sensitive attributes about individuals (e.g., race or gender), which raise privacy concerns and can make downstream models unfair to certain groups. In this work, we propose FEDERATE, an approach that combines ideas from differential privacy and adversarial training to learn private text representations which also induces fairer models. We empirically eva… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: submitted to: ACL-ARR 2022 (February) - https://openreview.net/forum?id=BVgNSki6q1c

  26. arXiv:2204.10129  [pdf, other

    cs.CV

    OSSO: Obtaining Skeletal Shape from Outside

    Authors: Marilyn Keller, Silvia Zuffi, Michael J. Black, Sergi Pujades

    Abstract: We address the problem of inferring the anatomic skeleton of a person, in an arbitrary pose, from the 3D surface of the body; i.e. we predict the inside (bones) from the outside (skin). This has many applications in medicine and biomechanics. Existing state-of-the-art biomechanical skeletons are detailed but do not easily generalize to new subjects. Additionally, computer vision and graphics metho… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

    Comments: Project page: https://osso.is.tue.mpg.de/. Accepted in CVPR 2022

  27. arXiv:2107.00501  [pdf, other

    cs.LG

    Secure Quantized Training for Deep Learning

    Authors: Marcel Keller, Ke Sun

    Abstract: We implement training of neural networks in secure multi-party computation (MPC) using quantization commonly used in said setting. We are the first to present an MNIST classifier purely trained in MPC that comes within 0.2 percent of the accuracy of the same convolutional neural network trained via plaintext computation. More concretely, we have trained a network with two convolutional and two den… ▽ More

    Submitted 18 July, 2022; v1 submitted 1 July, 2021; originally announced July 2021.

    Comments: 27 pages

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:10912-10938, 2022

  28. Data Augmentation in Natural Language Processing: A Novel Text Generation Approach for Long and Short Text Classifiers

    Authors: Markus Bayer, Marc-André Kaufhold, Björn Buchhold, Marcel Keller, Jörg Dallmeyer, Christian Reuter

    Abstract: In many cases of machine learning, research suggests that the development of training data might have a higher relevance than the choice and modelling of classifiers themselves. Thus, data augmentation methods have been developed to improve classifiers by artificially created training data. In NLP, there is the challenge of establishing universal rules for text transformations which provide new li… ▽ More

    Submitted 22 July, 2022; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: 17 pages, 3 figure, 5 tables

    Journal ref: International Journal of Machine Learning and Cybernetics (2022)

  29. arXiv:2011.11202  [pdf, ps, other

    cs.LG cs.CR

    Effectiveness of MPC-friendly Softmax Replacement

    Authors: Marcel Keller, Ke Sun

    Abstract: Softmax is widely used in deep learning to map some representation to a probability distribution. As it is based on exp/log functions that are relatively expensive in multi-party computation, Mohassel and Zhang (2017) proposed a simpler replacement based on ReLU to be used in secure computation. However, we could not reproduce the accuracy they reported for training on MNIST with three fully conne… ▽ More

    Submitted 6 July, 2021; v1 submitted 22 November, 2020; originally announced November 2020.

    Comments: 6 pages, PPML/PriML workshop at NeurIPS 2020; updated accuracy figures after bug fix

  30. arXiv:2010.00635  [pdf

    cs.CV eess.IV

    StreamSoNG: A Soft Streaming Classification Approach

    Authors: Wenlong Wu, James M. Keller, Jeffrey Dale, James C. Bezdek

    Abstract: Examining most streaming clustering algorithms leads to the understanding that they are actually incremental classification models. They model existing and newly discovered structures via summary information that we call footprints. Incoming data is normally assigned a crisp label (into one of the structures) and that structure's footprint is incrementally updated. There is no reason that these as… ▽ More

    Submitted 13 July, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

  31. arXiv:2002.00815  [pdf, other

    cs.CV cs.LG

    Learning Extremal Representations with Deep Archetypal Analysis

    Authors: Sebastian Mathias Keller, Maxim Samarin, Fabricio Arend Torres, Mario Wieser, Volker Roth

    Abstract: Archetypes are typical population representatives in an extremal sense, where typicality is understood as the most extreme manifestation of a trait or feature. In linear feature space, archetypes approximate the data convex hull allowing all data points to be expressed as convex mixtures of archetypes. However, it might not always be possible to identify meaningful archetypes in a given feature sp… ▽ More

    Submitted 3 February, 2020; originally announced February 2020.

    Comments: Under review for publication at the International Journal of Computer Vision (IJCV). Extended version of our GCPR2019 paper "Deep Archetypal Analysis"

  32. Secure Evaluation of Quantized Neural Networks

    Authors: Anders Dalskov, Daniel Escudero, Marcel Keller

    Abstract: We investigate two questions in this paper: First, we ask to what extent "MPC friendly" models are already supported by major Machine Learning frameworks such as TensorFlow or PyTorch. Prior works provide protocols that only work on fixed-point integers and specialized activation functions, two aspects that are not supported by popular Machine Learning frameworks, and the need for these specialize… ▽ More

    Submitted 28 February, 2021; v1 submitted 28 October, 2019; originally announced October 2019.

    Comments: 22 pages

    Journal ref: Proceedings on Privacy Enhancing Technologies 4 (2020): 355-375

  33. arXiv:1910.11680  [pdf, ps, other

    cs.CR cs.LG

    A Note on Our Submission to Track 4 of iDASH 2019

    Authors: Marcel Keller, Ke Sun

    Abstract: iDASH is a competition soliciting implementations of cryptographic schemes of interest in the context of biology. In 2019, one track asked for multi-party computation implementations of training of a machine learning model suitable for two datasets from cancer research. In this note, we describe our solution submitted to the competition. We found that the training can be run on three AWS c5.9xlarg… ▽ More

    Submitted 24 October, 2019; originally announced October 2019.

    Comments: 4 pages

  34. Enabling Explainable Fusion in Deep Learning with Fuzzy Integral Neural Networks

    Authors: Muhammad Aminul Islam, Derek T. Anderson, Anthony J. Pinar, Timothy C. Havens, Grant Scott, James M. Keller

    Abstract: Information fusion is an essential part of numerous engineering systems and biological functions, e.g., human cognition. Fusion occurs at many levels, ranging from the low-level combination of signals to the high-level aggregation of heterogeneous decision-making processes. While the last decade has witnessed an explosion of research in deep learning, fusion in neural networks has not observed the… ▽ More

    Submitted 10 May, 2019; originally announced May 2019.

    Comments: IEEE Transactions on Fuzzy Systems

  35. arXiv:1901.10799  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Deep Archetypal Analysis

    Authors: Sebastian Mathias Keller, Maxim Samarin, Mario Wieser, Volker Roth

    Abstract: "Deep Archetypal Analysis" generates latent representations of high-dimensional datasets in terms of fractions of intuitively understandable basic entities called archetypes. The proposed method is an extension of linear "Archetypal Analysis" (AA), an unsupervised method to represent multivariate data points as sparse convex combinations of extremal elements of the dataset. Unlike the original for… ▽ More

    Submitted 24 January, 2020; v1 submitted 30 January, 2019; originally announced January 2019.

    Comments: Published at the German Conference on Pattern Recognition 2019 (GCPR)

    Journal ref: 41th German Conference on Pattern Recognition, GCPR 2019

  36. arXiv:1812.06594  [pdf, other

    q-bio.NC cs.LG eess.SP q-bio.QM stat.ML

    Computational EEG in Personalized Medicine: A study in Parkinson's Disease

    Authors: Sebastian Mathias Keller, Maxim Samarin, Antonia Meyer, Vitalii Kosak, Ute Gschwandtner, Peter Fuhr, Volker Roth

    Abstract: Recordings of electrical brain activity carry information about a person's cognitive health. For recording EEG signals, a very common setting is for a subject to be at rest with its eyes closed. Analysis of these recordings often involve a dimensionality reduction step in which electrodes are grouped into 10 or more regions (depending on the number of electrodes available). Then an average over ea… ▽ More

    Submitted 2 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:811.07216

  37. arXiv:1601.06262  [pdf, other

    cs.NI

    Response-Time-Optimized Distributed Cloud Resource Allocation

    Authors: Matthias Keller, Holger Karl

    Abstract: A current trend in networking and cloud computing is to provide compute resources over widely dispersed places exemplified by initiatives like Network Function Virtualisation. This paves the way for a widespread service deployment and can improve service quality; a nearby server can reduce the user-perceived response times. But always using the nearest server is a bad decision if that server is al… ▽ More

    Submitted 29 May, 2016; v1 submitted 23 January, 2016; originally announced January 2016.

  38. arXiv:1511.05789  [pdf, ps, other

    cs.LG

    Metric learning approach for graph-based label propagation

    Authors: Pauline Wauquier, Mikaela Keller

    Abstract: The efficiency of graph-based semi-supervised algorithms depends on the graph of instances on which they are applied. The instances are often in a vectorial form before a graph linking them is built. The construction of the graph relies on a metric over the vectorial space that help define the weight of the connection between entities. The classic choice for this metric is usually a distance measu… ▽ More

    Submitted 18 February, 2016; v1 submitted 18 November, 2015; originally announced November 2015.

    Comments: Workshop track submission ICLR 2016

  39. arXiv:1507.08834  [pdf, other

    cs.NI cs.DC

    Response-Time-Optimised Service Deployment: MILP Formulations of Piece-wise Linear Functions Approximating Non-linear Bivariate Mixed-integer Functions

    Authors: Matthias Keller, Holger Karl

    Abstract: A current trend in networking and cloud computing is to provide compute resources at widely dispersed places; this is exemplified by developments such as Network Function Virtualisation. This paves the way for wide-area service deployments with improved service quality: e.g, a nearby server can reduce the user-perceived response times. But always using the nearest server can be a bad decision if t… ▽ More

    Submitted 30 August, 2016; v1 submitted 31 July, 2015; originally announced July 2015.

  40. Specifying and Placing Chains of Virtual Network Functions

    Authors: Sevil Mehraghdam, Matthias Keller, Holger Karl

    Abstract: Network appliances perform different functions on network flows and constitute an important part of an operator's network. Normally, a set of chained network functions process network flows. Following the trend of virtualization of networks, virtualization of the network functions has also become a topic of interest. We define a model for formalizing the chaining of network functions using a conte… ▽ More

    Submitted 4 June, 2014; originally announced June 2014.

  41. arXiv:1210.4860  [pdf

    cs.SI cs.LG physics.soc-ph stat.ML

    Spectral Estimation of Conditional Random Graph Models for Large-Scale Network Data

    Authors: Antonino Freno, Mikaela Keller, Gemma C. Garriga, Marc Tommasi

    Abstract: Generative models for graphs have been typically committed to strong prior assumptions concerning the form of the modeled distributions. Moreover, the vast majority of currently available models are either only suitable for characterizing some particular network properties (such as degree distribution or clustering coefficient), or they are aimed at estimating joint probability distributions, whic… ▽ More

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-265-274