Search | arXiv e-print repository

doi 10.1109/ACCESS.2023.3247564

Autoencoder Based Iterative Modeling and Multivariate Time-Series Subsequence Clustering Algorithm

Authors: Jonas Köhne, Lars Henning, Clemens Gühmann

Abstract: This paper introduces an algorithm for the detection of change-points and the identification of the corresponding subsequences in transient multivariate time-series data (MTSD). The analysis of such data has become more and more important due to the increase of availability in many industrial fields. Labeling, sorting or filtering highly transient measurement data for training condition based main… ▽ More This paper introduces an algorithm for the detection of change-points and the identification of the corresponding subsequences in transient multivariate time-series data (MTSD). The analysis of such data has become more and more important due to the increase of availability in many industrial fields. Labeling, sorting or filtering highly transient measurement data for training condition based maintenance (CbM) models is cumbersome and error-prone. For some applications it can be sufficient to filter measurements by simple thresholds or finding change-points based on changes in mean value and variation. But a robust diagnosis of a component within a component group for example, which has a complex non-linear correlation between multiple sensor values, a simple approach would not be feasible. No meaningful and coherent measurement data which could be used for training a CbM model would emerge. Therefore, we introduce an algorithm which uses a recurrent neural network (RNN) based Autoencoder (AE) which is iteratively trained on incoming data. The scoring function uses the reconstruction error and latent space information. A model of the identified subsequence is saved and used for recognition of repeating subsequences as well as fast offline clustering. For evaluation, we propose a new similarity measure based on the curvature for a more intuitive time-series subsequence clustering metric. A comparison with seven other state-of-the-art algorithms and eight datasets shows the capability and the increased performance of our algorithm to cluster MTSD online and offline in conjunction with mechatronic systems. △ Less

Submitted 23 September, 2022; v1 submitted 9 September, 2022; originally announced September 2022.

Comments: 26 pages, 11 figures, for associated python code repositories see https://github.com/Jokonu/mt3scm and https://github.com/Jokonu/abimca; Minor spelling and grammar corrections, fixed wrong bibtex entry for SOStream, some improvements and corrections in formulas of section 4

arXiv:2004.12764 [pdf, other]

"Call me sexist, but...": Revisiting Sexism Detection Using Psychological Scales and Adversarial Samples

Authors: Mattia Samory, Indira Sen, Julian Kohne, Fabian Floeck, Claudia Wagner

Abstract: Research has focused on automated methods to effectively detect sexism online. Although overt sexism seems easy to spot, its subtle forms and manifold expressions are not. In this paper, we outline the different dimensions of sexism by grounding them in their implementation in psychological scales. From the scales, we derive a codebook for sexism in social media, which we use to annotate existing… ▽ More Research has focused on automated methods to effectively detect sexism online. Although overt sexism seems easy to spot, its subtle forms and manifold expressions are not. In this paper, we outline the different dimensions of sexism by grounding them in their implementation in psychological scales. From the scales, we derive a codebook for sexism in social media, which we use to annotate existing and novel datasets, surfacing their limitations in breadth and validity with respect to the construct of sexism. Next, we leverage the annotated datasets to generate adversarial examples, and test the reliability of sexism detection methods. Results indicate that current machine learning models pick up on a very narrow set of linguistic markers of sexism and do not generalize well to out-of-domain examples. Yet, including diverse data and adversarial examples at training time results in models that generalize better and that are more robust to artifacts of data collection. By providing a scale-based codebook and insights regarding the shortcomings of the state-of-the-art, we hope to contribute to the development of better and broader models for sexism detection, including reflections on theory-driven approaches to data collection. △ Less

Submitted 2 June, 2021; v1 submitted 27 April, 2020; originally announced April 2020.

Comments: Indira Sen and Julian Kohne contributed equally to this work

Journal ref: Proceedings of the 15th International AAAI Conference on Web and Social Media (ICWSM), 2021

arXiv:1907.00888 [pdf, ps, other]

The Role of Network Structure and Initial Group Norm Distributions in Norm Conflict

Authors: Julian Kohne, Natalie Gallagher, Zeynep Melis Kirgil, Rocco Paolillo, Lars Padmos, Fariba Karimi

Abstract: Social norms can facilitate societal coexistence in groups by providing an implicitly shared set of expectations and behavioral guidelines. However, different social groups can hold different norms, and lacking an overarching normative consensus can lead to conflict within and between groups. In this paper, we present an agent-based model that simulates the adoption of norms in two interacting gro… ▽ More Social norms can facilitate societal coexistence in groups by providing an implicitly shared set of expectations and behavioral guidelines. However, different social groups can hold different norms, and lacking an overarching normative consensus can lead to conflict within and between groups. In this paper, we present an agent-based model that simulates the adoption of norms in two interacting groups. We explore this phenomenon while varying relative group sizes and homophily/heterophily (two features of network structure), and initial group norm distributions. Agents update their norm according to an adapted version of Granovetter's threshold model, using a uniform distribution of thresholds. We study the impact of network structure and initial norm distributions on the process of achieving normative consensus and the resulting potential for intragroup and intergroup conflict. Our results show that norm change is most likely when norms are strongly tied to group membership. Groups end up with the most similar norm distributions when networks are heterophilic, with small to middling minority groups. High homophilic networks show high potential intergroup conflict and low potential intragroup conflict, while the opposite pattern emerges for high heterophilic networks. △ Less

Submitted 1 July, 2019; originally announced July 2019.

Comments: 29 pages, 5 figures

arXiv:1311.5904 [pdf, ps, other]

doi 10.1016/j.jpdc.2014.08.001

The IceProd Framework: Distributed Data Processing for the IceCube Neutrino Observatory

Authors: M. G. Aartsen, R. Abbasi, M. Ackermann, J. Adams, J. A. Aguilar, M. Ahlers, D. Altmann, C. Arguelles, J. Auffenberg, X. Bai, M. Baker, S. W. Barwick, V. Baum, R. Bay, J. J. Beatty, J. Becker Tjus, K. -H. Becker, S. BenZvi, P. Berghaus, D. Berley, E. Bernardini, A. Bernhard, D. Z. Besson, G. Binder, D. Bindig , et al. (262 additional authors not shown)

Abstract: IceCube is a one-gigaton instrument located at the geographic South Pole, designed to detect cosmic neutrinos, iden- tify the particle nature of dark matter, and study high-energy neutrinos themselves. Simulation of the IceCube detector and processing of data require a significant amount of computational resources. IceProd is a distributed management system based on Python, XML-RPC and GridFTP. It… ▽ More IceCube is a one-gigaton instrument located at the geographic South Pole, designed to detect cosmic neutrinos, iden- tify the particle nature of dark matter, and study high-energy neutrinos themselves. Simulation of the IceCube detector and processing of data require a significant amount of computational resources. IceProd is a distributed management system based on Python, XML-RPC and GridFTP. It is driven by a central database in order to coordinate and admin- ister production of simulations and processing of data produced by the IceCube detector. IceProd runs as a separate layer on top of other middleware and can take advantage of a variety of computing resources, including grids and batch systems such as CREAM, Condor, and PBS. This is accomplished by a set of dedicated daemons that process job submission in a coordinated fashion through the use of middleware plugins that serve to abstract the details of job submission and job management from the framework. △ Less

Submitted 22 August, 2014; v1 submitted 22 November, 2013; originally announced November 2013.

Journal ref: Journal of Parallel & Distributed Computing 75:198,2015

Showing 1–4 of 4 results for author: Kohne, J