-
PyScrew: A Comprehensive Dataset Collection from Industrial Screw Driving Experiments
Authors:
Nikolai West,
Jochen Deuse
Abstract:
This paper presents a comprehensive collection of industrial screw driving datasets designed to advance research in manufacturing process monitoring and quality control. The collection comprises six distinct datasets with over 34,000 individual screw driving operations conducted under controlled experimental conditions, capturing the multifaceted nature of screw driving processes in plastic compon…
▽ More
This paper presents a comprehensive collection of industrial screw driving datasets designed to advance research in manufacturing process monitoring and quality control. The collection comprises six distinct datasets with over 34,000 individual screw driving operations conducted under controlled experimental conditions, capturing the multifaceted nature of screw driving processes in plastic components. Each dataset systematically investigates specific aspects: natural thread degradation patterns through repeated use (s01), variations in surface friction conditions including contamination and surface treatments (s02), diverse assembly faults with up to 27 error types (s03-s04), and fabrication parameter variations in both upper and lower workpieces through modified injection molding settings (s05-s06). We detail the standardized experimental setup used across all datasets, including hardware specifications, process phases, and data acquisition methods. The hierarchical data model preserves the temporal and operational structure of screw driving processes, facilitating both exploratory analysis and the development of machine learning models. To maximize accessibility, we provide dual access pathways: raw data through Zenodo with a persistent DOI, and a purpose-built Python library (PyScrew) that offers consistent interfaces for data loading, preprocessing, and integration with common analysis workflows. These datasets serve diverse research applications including anomaly detection, predictive maintenance, quality control system development, feature extraction methodology evaluation, and classification of specific error conditions. By addressing the scarcity of standardized, comprehensive datasets in industrial manufacturing, this collection enables reproducible research and fair comparison of analytical approaches in an area of growing importance for industrial automation.
△ Less
Submitted 17 May, 2025;
originally announced May 2025.
-
Encoder-Decoder Networks for Self-Supervised Pretraining and Downstream Signal Bandwidth Regression on Digital Antenna Arrays
Authors:
Rajib Bhattacharjea,
Nathan West
Abstract:
This work presents the first applications of self-supervised learning applied to data from digital antenna arrays. Encoder-decoder networks are pretrained on digital array data to perform a self-supervised noisy-reconstruction task called channel in-painting, in which the network infers the contents of array data that has been masked with zeros. The self-supervised step requires no human-labeled d…
▽ More
This work presents the first applications of self-supervised learning applied to data from digital antenna arrays. Encoder-decoder networks are pretrained on digital array data to perform a self-supervised noisy-reconstruction task called channel in-painting, in which the network infers the contents of array data that has been masked with zeros. The self-supervised step requires no human-labeled data. The encoder architecture and weights from pretraining are then transferred to a new network with a task-specific decoder, and the new network is trained on a small volume of labeled data. We show that pretraining on the unlabeled data allows the new network to perform the task of bandwidth regression on the digital array data better than an equivalent network that is trained on the same labeled data from random initialization.
△ Less
Submitted 6 July, 2023;
originally announced July 2023.
-
Data-driven approach for diagnostic analysis of dynamic bottlenecks in serial manufacturing systems
Authors:
Nikolai West,
Joern Schwenken,
Jochen Deuse
Abstract:
A variety of established approaches exist for the detection of dynamic bottlenecks. Furthermore, the prediction of bottlenecks is experiencing a growing scientific interest, quantifiable by the increasing number of publications in recent years. Neglected, on the other hand, is the diagnosis of occurring bottlenecks. Detection methods may determine the current location of a bottleneck, while predic…
▽ More
A variety of established approaches exist for the detection of dynamic bottlenecks. Furthermore, the prediction of bottlenecks is experiencing a growing scientific interest, quantifiable by the increasing number of publications in recent years. Neglected, on the other hand, is the diagnosis of occurring bottlenecks. Detection methods may determine the current location of a bottleneck, while predictive approaches may indicate the location of an upcoming bottleneck. However, mere knowledge of current and future bottlenecks does not enable concrete actions to be taken to avoid the bottlenecks, nor does it open up any immediate advantage for manufacturing companies. Since small and medium-sized companies in particular have limited resources, they cannot implement improvement measures for every bottleneck that occurs. Due to the shifts of dynamic bottlenecks, the selection of the mostsuitable stations in the value stream becomes more difficult. This paper therefore contributes to the neglected field of bottleneck diagnosis. First, we propose two data-driven metrics, relative bottleneck frequency and relative bottleneck severity, which allow a quantitative assessment of the respective bottleneck situations. For validation purposes, we apply these metrics in nine selected scenarios generated using discrete event simulation in a value stream with a serial manufacturing line. Finally, we evaluate and discuss the results.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
DiME and AGVis: A Distributed Messaging Environment and Geographical Visualizer for Large-scale Power System Simulation
Authors:
Nicholas Parsly,
Jinning Wang,
Nick West,
Qiwei Zhang,
Hantao Cui,
Fangxing Li
Abstract:
This paper introduces the messaging environment and the geographical visualization tool of the CURENT Large-scale Testbed (LTB) that can be used for large-scale power system closed-loop simulation. First, Distributed Messaging Environment (DiME) implements an asynchronous shared workspace to enable high-concurrent data exchange. Second, Another Grid Visualizer (AGVis) is presented as a geovisualiz…
▽ More
This paper introduces the messaging environment and the geographical visualization tool of the CURENT Large-scale Testbed (LTB) that can be used for large-scale power system closed-loop simulation. First, Distributed Messaging Environment (DiME) implements an asynchronous shared workspace to enable high-concurrent data exchange. Second, Another Grid Visualizer (AGVis) is presented as a geovisualization tool that facilitates the visualization of real-time power system simulation. Third, case studies show the use of DiME and AGVis. The results demonstrate that, with the modular structure, the LTB is capable of not only federal use for real-time, large-scale power system simulation, but also independent use for customized power system research.
△ Less
Submitted 17 October, 2023; v1 submitted 21 November, 2022;
originally announced November 2022.
-
Wideband Signal Localization with Spectral Segmentation
Authors:
Nathan West,
Tamoghna Roy,
Timothy O'Shea
Abstract:
Signal localization is a spectrum sensing problem that jointly detects the presence of a signal and estimates a center frequency and bandwidth. This is a step beyond most spectrum sensing work which estimates "present" or "not present" detections for either a single channel or fixed sized channels. We define the signal localization task, present the metrics of precision and recall, and establish b…
▽ More
Signal localization is a spectrum sensing problem that jointly detects the presence of a signal and estimates a center frequency and bandwidth. This is a step beyond most spectrum sensing work which estimates "present" or "not present" detections for either a single channel or fixed sized channels. We define the signal localization task, present the metrics of precision and recall, and establish baselines for traditional energy detection on this task. We introduce a new dataset that is useful for training neural networks to perform this task and show a training framework to train signal detectors to achieve the task and present precision and recall curves over SNR. This neural network based approach shows an 8 dB improvement in recall over the traditional energy detection approach with minor improvements in precision.
△ Less
Submitted 1 October, 2021;
originally announced October 2021.
-
A Wideband Signal Recognition Dataset
Authors:
Nathan West,
Timothy O'Shea,
Tamoghna Roy
Abstract:
Signal recognition is a spectrum sensing problem that jointly requires detection, localization in time and frequency, and classification. This is a step beyond most spectrum sensing work which involves signal detection to estimate "present" or "not present" detections for either a single channel or fixed sized channels or classification which assumes a signal is present. We define the signal recog…
▽ More
Signal recognition is a spectrum sensing problem that jointly requires detection, localization in time and frequency, and classification. This is a step beyond most spectrum sensing work which involves signal detection to estimate "present" or "not present" detections for either a single channel or fixed sized channels or classification which assumes a signal is present. We define the signal recognition task, present the metrics of precision and recall to the RF domain, and review recent machine-learning based approaches to this problem. We introduce a new dataset that is useful for training neural networks to perform these tasks and show a training framework to train wideband signal recognizers.
△ Less
Submitted 1 October, 2021;
originally announced October 2021.
-
Approximating the Void: Learning Stochastic Channel Models from Observation with Variational Generative Adversarial Networks
Authors:
Timothy J. O'Shea,
Tamoghna Roy,
Nathan West
Abstract:
Channel modeling is a critical topic when considering designing, learning, or evaluating the performance of any communications system. Most prior work in designing or learning new modulation schemes has focused on using highly simplified analytic channel models such as additive white Gaussian noise (AWGN), Rayleigh fading channels or similar. Recently, we proposed the usage of a generative adversa…
▽ More
Channel modeling is a critical topic when considering designing, learning, or evaluating the performance of any communications system. Most prior work in designing or learning new modulation schemes has focused on using highly simplified analytic channel models such as additive white Gaussian noise (AWGN), Rayleigh fading channels or similar. Recently, we proposed the usage of a generative adversarial networks (GANs) to jointly approximate a wireless channel response model (e.g. from real black box measurements) and optimize for an efficient modulation scheme over it using machine learning. This approach worked to some degree, but was unable to produce accurate probability distribution functions (PDFs) representing the stochastic channel response. In this paper, we focus specifically on the problem of accurately learning a channel PDF using a variational GAN, introducing an architecture and loss function which can accurately capture stochastic behavior. We illustrate where our prior method failed and share results capturing the performance of such as system over a range of realistic channel distributions.
△ Less
Submitted 20 August, 2018; v1 submitted 16 May, 2018;
originally announced May 2018.
-
Physical Layer Communications System Design Over-the-Air Using Adversarial Networks
Authors:
Timothy J. O'Shea,
Tamoghna Roy,
Nathan West,
Benjamin C. Hilburn
Abstract:
This paper presents a novel method for synthesizing new physical layer modulation and coding schemes for communications systems using a learning-based approach which does not require an analytic model of the impairments in the channel. It extends prior work published on the channel autoencoder to consider the case where the channel response is not known or can not be easily modeled in a closed for…
▽ More
This paper presents a novel method for synthesizing new physical layer modulation and coding schemes for communications systems using a learning-based approach which does not require an analytic model of the impairments in the channel. It extends prior work published on the channel autoencoder to consider the case where the channel response is not known or can not be easily modeled in a closed form analytic expression. By adopting an adversarial approach for channel response approximation and information encoding, we can jointly learn a good solution to both tasks over a wide range of channel environments. We describe the operation of the proposed adversarial system, share results for its training and validation over-the-air, and discuss implications and future work in the area.
△ Less
Submitted 8 March, 2018;
originally announced March 2018.