-
Unintended Bias in 2D+ Image Segmentation and Its Effect on Attention Asymmetry
Authors:
Zsófia Molnár,
Gergely Szabó,
András Horváth
Abstract:
Supervised pretrained models have become widely used in deep learning, especially for image segmentation tasks. However, when applied to specialized datasets such as biomedical imaging, pretrained weights often introduce unintended biases. These biases cause models to assign different levels of importance to different slices, leading to inconsistencies in feature utilization, which can be observed as asymmetries in saliency map distributions. This transfer of color distributions from natural images to non-natural datasets can compromise model performance and reduce the reliability of results. In this study, we investigate the effects of these biases and propose strategies to mitigate them. Through a series of experiments, we test both pretrained and randomly initialized models, comparing their performance and saliency map distributions. Our proposed methods, which aim to neutralize the bias introduced by pretrained color channel weights, demonstrate promising results, offering a practical approach to improving model explainability while maintaining the benefits of pretrained models. This publication presents our findings, providing insights into addressing pretrained weight biases across various deep learning tasks.
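One simple way to picture "neutralizing" channel-specific pretrained weights is to replace every input-channel kernel of the first convolution with the mean over input channels, so that no slice is favored by construction. The NumPy sketch below is an assumption made for illustration only: the abstract does not state which neutralization strategies the authors use, and the function names, the weight layout, and the random stand-in for pretrained weights are all hypothetical.

```python
import numpy as np


def neutralize_channel_bias(weights: np.ndarray) -> np.ndarray:
    """Replace each input-channel kernel with the mean over input channels.

    `weights` is assumed to have shape (out_channels, in_channels, kh, kw).
    """
    mean_kernel = weights.mean(axis=1, keepdims=True)
    return np.repeat(mean_kernel, weights.shape[1], axis=1)


def channel_importance(weights: np.ndarray) -> np.ndarray:
    """Sum of absolute weights per input channel (a crude importance proxy)."""
    return np.abs(weights).sum(axis=(0, 2, 3))


rng = np.random.default_rng(0)
# Hypothetical stand-in for pretrained first-layer weights that favor channel 0.
channel_scale = np.array([2.0, 1.0, 0.5]).reshape(1, 3, 1, 1)
w = rng.normal(size=(8, 3, 3, 3)) * channel_scale
w_neutral = neutralize_channel_bias(w)

print(channel_importance(w))          # unequal across channels
print(channel_importance(w_neutral))  # identical across channels
```

Averaging keeps the per-filter mean kernel, so part of the pretrained structure survives, while the per-channel asymmetry is removed.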
Submitted 20 May, 2025;
originally announced May 2025.
-
The iToBoS dataset: skin region images extracted from 3D total body photographs for lesion detection
Authors:
Anup Saha,
Joseph Adeola,
Nuria Ferrera,
Adam Mothershaw,
Gisele Rezze,
Séraphin Gaborit,
Brian D'Alessandro,
James Hudson,
Gyula Szabó,
Balazs Pataki,
Hayat Rajani,
Sana Nazari,
Hassan Hayat,
Clare Primiero,
H. Peter Soyer,
Josep Malvehy,
Rafael Garcia
Abstract:
Artificial intelligence has significantly advanced skin cancer diagnosis by enabling rapid and accurate detection of malignant lesions. In this domain, most publicly available image datasets consist of single, isolated skin lesions positioned at the center of the image. While these lesion-centric datasets have been fundamental for developing diagnostic algorithms, they lack the context of the surrounding skin, which is critical for improving lesion detection. The iToBoS dataset was created to address this challenge. It includes 16,954 images of skin regions from 100 participants, captured using 3D total body photography. Each image roughly corresponds to a $7 \times 9$ cm section of skin with all suspicious lesions annotated using bounding boxes. Additionally, the dataset provides metadata such as anatomical location, age group, and sun damage score for each image. This dataset aims to facilitate training and benchmarking of algorithms, with the goal of enabling early detection of skin cancer and deployment of this technology in non-clinical environments.
Submitted 30 January, 2025;
originally announced January 2025.
-
Post-Hoc MOTS: Exploring the Capabilities of Time-Symmetric Multi-Object Tracking
Authors:
Gergely Szabó,
Zsófia Molnár,
András Horváth
Abstract:
Temporal forward-tracking has been the dominant approach for multi-object segmentation and tracking (MOTS). However, a novel time-symmetric tracking methodology has recently been introduced for the detection, segmentation, and tracking of budding yeast cells in pre-recorded samples. Although this architecture has demonstrated a unique perspective on stable and consistent tracking, as well as missed instance re-interpolation, its evaluation has so far been largely confined to settings related to videomicroscopic environments. In this work, we aim to reveal the broader capabilities, advantages, and potential challenges of this architecture across various specifically designed scenarios, including a pedestrian tracking dataset. We also conduct an ablation study comparing the model against its restricted variants and the widely used Kalman filter. Furthermore, we present an attention analysis of the tracking architecture for both pretrained and non-pretrained models.
Submitted 11 December, 2024;
originally announced December 2024.
-
Network Resource Management For Cyber-Physical Production Systems Based On Quality of Experience
Authors:
Attila Vidács,
Zalán Trombitás,
Géza Szabó
Abstract:
Current industrial trends point toward agile, wirelessly connected robots whose intelligence and control elements are implemented in the edge cloud. This paper outlines the roles of three key participants in the value chain of an industrial process: the network provider, the robot operator, and the customer. It proposes a scheme where the Quality of Service (QoS) parameters of the robot are fed into the network to inform network resource management. A sanding process use case is simulated to demonstrate quantitatively the relationship between QoS and Quality of Experience (QoE) for each participant.
Submitted 10 November, 2023;
originally announced November 2023.
-
On The Effects of The Variations In Network Characteristics In Cyber Physical Systems
Authors:
Géza Szabó,
Sándor Rácz,
József Pető,
Rafael Roque Aschoff
Abstract:
The popular robotic simulator Gazebo cannot simulate the effects of control latency, a feature that would make it a fully-fledged cyber-physical system (CPS) simulator. The CPS we measure is a robotic arm (UR5) controlled remotely with velocity commands. The main goal is to measure Quality of Control (QoC) related KPIs under various network conditions in a simulated environment. We propose a Gazebo plugin that makes this measurement feasible by enabling Gazebo to delay internal control and status messages and to interface with external network simulators to derive even more advanced network effects. Our preliminary evaluation shows that the introduced network latency affects the behavior of the robotic arm in line with our expectations, but a more detailed study is needed.
Submitted 26 September, 2023;
originally announced September 2023.
-
FATHER: FActory on THE Road
Authors:
Géza Szabó,
Balázs Tárnok,
Levente Vajda,
József Pető,
Attila Vidács
Abstract:
In most factories today, robotic cells are deployed on well enforced bases to avoid any external impact on the accuracy of production. In contrast, we evaluate a futuristic concept where the whole robotic cell could work on a moving platform. Imagine a trailer of a truck moving along the motorway while exposed to heavy physical impacts due to maneuvering. The key question here is how the robotic cell behaves and how productivity is affected. We propose a system architecture (FATHER) and show some solutions, including network-related information and artificial intelligence, that make the proposed concept feasible to implement.
Submitted 22 September, 2023;
originally announced September 2023.
-
Advancing Hungarian Text Processing with HuSpaCy: Efficient and Accurate NLP Pipelines
Authors:
György Orosz,
Gergő Szabó,
Péter Berkecz,
Zsolt Szántó,
Richárd Farkas
Abstract:
This paper presents a set of industrial-grade text processing models for Hungarian that achieve near state-of-the-art performance while balancing resource efficiency and accuracy. Models have been implemented in the spaCy framework, extending the HuSpaCy toolkit with several improvements to its architecture. Compared to existing NLP tools for Hungarian, all of our pipelines feature all basic text processing steps including tokenization, sentence-boundary detection, part-of-speech tagging, morphological feature tagging, lemmatization, dependency parsing and named entity recognition with high accuracy and throughput. We thoroughly evaluated the proposed enhancements, compared the pipelines with state-of-the-art tools and demonstrated the competitive performance of the new models in all text preprocessing steps. All experiments are reproducible and the pipelines are freely available under a permissive license.
Submitted 24 August, 2023;
originally announced August 2023.
-
Enhancing Cell Tracking with a Time-Symmetric Deep Learning Approach
Authors:
Gergely Szabó,
Paolo Bonaiuti,
Andrea Ciliberto,
András Horváth
Abstract:
The accurate tracking of live cells using video microscopy recordings remains a challenging task for popular state-of-the-art image processing based object tracking methods. In recent years, several existing and new applications have attempted to integrate deep-learning based frameworks for this task, but most of them still heavily rely on consecutive frame based tracking embedded in their architecture or other premises that hinder generalized learning. To address this issue, we aimed to develop a new deep-learning based tracking method that relies solely on the assumption that cells can be tracked based on their spatio-temporal neighborhood, without restricting it to consecutive frames. The proposed method has the additional benefit that the motion patterns of the cells can be learned completely by the predictor without any prior assumptions, and it has the potential to handle a large number of video frames with heavy artifacts. The efficacy of the proposed method is demonstrated through biologically motivated validation strategies and compared against multiple state-of-the-art cell tracking methods.
Submitted 31 January, 2025; v1 submitted 4 August, 2023;
originally announced August 2023.
-
Hybrid lemmatization in HuSpaCy
Authors:
Péter Berkecz,
György Orosz,
Zsolt Szántó,
Gergő Szabó,
Richárd Farkas
Abstract:
Lemmatization is still not a trivial task for morphologically rich languages. Previous studies showed that hybrid architectures usually work better for these languages and can yield great results. This paper presents a hybrid lemmatizer that combines a neural model, dictionaries, and hand-crafted rules. We introduce the hybrid architecture along with empirical results on a widely used Hungarian dataset. The presented methods are published as three HuSpaCy models.
Submitted 13 June, 2023;
originally announced June 2023.
-
HuSpaCy: an industrial-strength Hungarian natural language processing toolkit
Authors:
György Orosz,
Zsolt Szántó,
Péter Berkecz,
Gergő Szabó,
Richárd Farkas
Abstract:
Although there are a couple of open-source language processing pipelines available for Hungarian, none of them satisfies the requirements of today's NLP applications. A language processing pipeline should consist of close to state-of-the-art lemmatization, morphosyntactic analysis, entity recognition and word embeddings. Industrial text processing applications have to satisfy non-functional software quality requirements; moreover, frameworks supporting multiple languages are increasingly favored. This paper introduces HuSpaCy, an industry-ready Hungarian language processing toolkit. The presented tool provides components for the most important basic linguistic analysis tasks. It is open-source and is available under a permissive license. Our system is built upon spaCy's NLP components, resulting in an easily usable, fast yet accurate application. Experiments confirm that HuSpaCy has high accuracy while maintaining resource-efficient prediction capabilities.
Submitted 10 January, 2022; v1 submitted 6 January, 2022;
originally announced January 2022.
-
Mitigating the Bias of Centered Objects in Common Datasets
Authors:
Gergely Szabo,
Andras Horvath
Abstract:
Convolutional networks are considered shift invariant, but it was demonstrated that their response may vary according to the exact location of the objects. In this paper we will demonstrate that most commonly investigated datasets have a bias, where objects are over-represented at the center of the image during training. This bias and the boundary condition of these networks can have a significant effect on the performance of these architectures and their accuracy drops significantly as an object approaches the boundary. We will also demonstrate how this effect can be mitigated with data augmentation techniques.
Submitted 4 August, 2023; v1 submitted 16 December, 2021;
originally announced December 2021.
-
System-aware dynamic partitioning for batch and streaming workloads
Authors:
Zoltán Zvara,
Péter G. N. Szabó,
Balázs Barnabás Lóránt,
András A. Benczúr
Abstract:
When processing data streams with highly skewed and nonstationary key distributions, we often observe overloaded partitions when hash partitioning fails to balance data correctly. To avoid slow tasks that delay the completion of the whole stage of computation, it is necessary to apply adaptive, on-the-fly partitioning that continuously recomputes an optimal partitioner, given the observed key distribution. While such solutions exist for batch processing of static data sets and stateless stream processing, the task is difficult for long-running stateful streaming jobs where the key distribution changes over time. Careful checkpointing and operator state migration are necessary to change the partitioning while the operation is running.
Our key result is a lightweight on-the-fly Dynamic Repartitioning (DR) module for distributed data processing systems (DDPS), including Apache Spark and Flink, which improves the performance with negligible overhead. DR can adaptively repartition data during execution using our Key Isolator Partitioner (KIP). In our experiments with real workloads and power-law distributions, we reach a speedup of 1.5-6 for a variety of Spark and Flink jobs.
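The general idea of isolating heavy keys from a skewed distribution can be sketched as follows: the most frequent keys are placed explicitly on the currently least-loaded partitions, and all remaining keys fall back to hash partitioning. This is a minimal illustration under stated assumptions, not the authors' Key Isolator Partitioner; the function names, the `num_heavy` parameter, and the example key counts are hypothetical, and no Spark or Flink API is used.

```python
import zlib
from collections import Counter


def build_partitioner(key_counts, num_partitions, num_heavy):
    """Return a function mapping a key to a partition id.

    The `num_heavy` most frequent keys are isolated: each is assigned
    greedily to the partition with the smallest load so far.
    """
    loads = [0] * num_partitions
    explicit = {}
    for key, count in Counter(key_counts).most_common(num_heavy):
        target = min(range(num_partitions), key=loads.__getitem__)
        explicit[key] = target
        loads[target] += count

    def partition(key):
        if key in explicit:
            return explicit[key]
        # Deterministic hash fallback for the long tail of light keys.
        return zlib.crc32(str(key).encode()) % num_partitions

    return partition


def partition_loads(key_counts, partition, num_partitions):
    """Total record count that lands on each partition."""
    loads = [0] * num_partitions
    for key, count in key_counts.items():
        loads[partition(key)] += count
    return loads


# Hypothetical skewed histogram: three heavy keys and a tail of light keys.
key_counts = {"a": 1000, "b": 900, "c": 800}
key_counts.update({f"k{i}": 1 for i in range(30)})

partition = build_partitioner(key_counts, num_partitions=4, num_heavy=3)
print(partition_loads(key_counts, partition, 4))
```

In a streaming setting the histogram would have to be re-estimated as the key distribution drifts, which is where the checkpointing and state migration mentioned in the abstract come in; the sketch only covers a single static snapshot.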
Submitted 31 May, 2021;
originally announced May 2021.
-
Receptive Field Size Optimization with Continuous Time Pooling
Authors:
Dóra Babicz,
Soma Kontár,
Márk Pető,
András Fülöp,
Gergely Szabó,
András Horváth
Abstract:
The pooling operation is a cornerstone element of convolutional neural networks. These elements generate receptive fields for neurons, in which local perturbations should have minimal effect on the output activations, increasing robustness and invariance of the network. In this paper we will present an altered version of the most commonly applied method, maximum pooling, where pooling in theory is substituted by a continuous time differential equation, which generates a location sensitive pooling operation, more similar to biological receptive fields. We will present how this continuous method can be approximated numerically using discrete operations which fit ideally on a GPU. In our approach the kernel size is substituted by diffusion strength, which is a continuous-valued parameter and can therefore be optimized by gradient descent algorithms. We will evaluate the effect of continuous pooling on accuracy and computational need using commonly applied network architectures and datasets.
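To give a feel for how a continuous "strength" can take over the role of a discrete kernel size, the sketch below integrates a plain linear 1D heat equation with explicit Euler steps. It is an assumption chosen for illustration: the abstract does not give the authors' differential equation (which is derived from maximum pooling), and the function name, the periodic boundary, and the step count are hypothetical choices.

```python
import numpy as np


def diffuse(x, strength, steps=10):
    """Spread activations by integrating u_t = u_xx up to time `strength`.

    Uses explicit Euler steps on a unit grid with periodic boundaries.
    A larger `strength` acts like a wider receptive field.
    """
    dt = strength / steps
    if dt > 0.5:
        raise ValueError("explicit scheme is unstable for dt > 0.5; use more steps")
    u = np.asarray(x, dtype=float).copy()
    for _ in range(steps):
        laplacian = np.roll(u, 1) - 2.0 * u + np.roll(u, -1)
        u = u + dt * laplacian
    return u


impulse = np.zeros(16)
impulse[8] = 1.0
narrow = diffuse(impulse, strength=1.0)
wide = diffuse(impulse, strength=2.0)
print(narrow.max(), wide.max())
```

Because `strength` enters only through arithmetic on the activations, the output varies smoothly with it, which is the property that makes such a parameter amenable to gradient-based optimization.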
Submitted 6 November, 2020; v1 submitted 2 November, 2020;
originally announced November 2020.
-
On the notion of number in humans and machines
Authors:
Norbert Bátfai,
Dávid Papp,
Gergő Bogacsovics,
Máté Szabó,
Viktor Szilárd Simkó,
Márió Bersenszki,
Gergely Szabó,
Lajos Kovács,
Ferencz Kovács,
Erik Szilveszter Varga
Abstract:
In this paper, we performed two types of software experiments to study numerosity classification (subitizing) in humans and machines. The experiments focus on a particular kind of task, referred to as Semantic MNIST or simply SMNIST, where the numerosity of objects placed in an image must be determined. The experiments called SMNIST for Humans are intended to measure the capacity of the Object File System (OFS) in humans. In this type of experiment, the measurement result is in good agreement with the value known from the cognitive psychology literature. The experiments called SMNIST for Machines serve similar purposes, but they investigate deep learning computer programs, both existing, well-known ones (originally developed for other purposes) and ones under development. These measurement results can be interpreted similarly to the results from SMNIST for Humans. The main thesis of this paper can be formulated as follows: in machines, image classification artificial neural networks can learn to distinguish numerosities with better accuracy when these numerosities are smaller than the capacity of the OFS in humans. Finally, we outline a conceptual framework to investigate the notion of number in humans and machines.
Submitted 27 June, 2019;
originally announced June 2019.
-
Congestion phenomena caused by matching pennies in evolutionary games
Authors:
György Szabó,
Attila Szolnoki
Abstract:
Evolutionary social dilemma games are extended by an additional matching-pennies game that modifies the collected payoffs. In a spatial version players are distributed on a square lattice and interact with their neighbors. Firstly, we show that the matching-pennies game can be considered as the microscopic force of the Red Queen effect that breaks the detailed balance and induces eddies in the microscopic probability currents if the strategy update is analogous to the Glauber dynamics for the kinetic Ising models. The resulting loops in the probability current break the symmetry between the chessboard-like arrangements of strategies via a bottleneck effect occurring along the four-edge loops in the microscopic states. The impact of this congestion is analogous to the application of a staggered magnetic field in the Ising model, that is, the order-disorder critical transition is wiped out by noise. It is illustrated that the congestion-induced symmetry breaking can be beneficial for the whole community within a certain region of parameters.
Submitted 6 March, 2015;
originally announced March 2015.
-
Selfishness, fraternity, and other-regarding preference in spatial evolutionary games
Authors:
Gyorgy Szabo,
Attila Szolnoki
Abstract:
Spatial evolutionary games are studied with myopic players whose payoff interest, as a personal character, is tuned from selfishness to other-regarding preference via fraternity. The players are located on a square lattice and collect income from symmetric two-person two-strategy (called cooperation and defection) games with their nearest neighbors. During the elementary steps of evolution a randomly chosen player modifies her strategy in order to maximize stochastically her utility function composed from her own and the co-players' income with weight factors $1-Q$ and $Q$. These models are studied within a wide range of payoff parameters using Monte Carlo simulations for noisy strategy updates and by spatial stability analysis in the low noise limit. For fraternal players ($Q=1/2$) the system evolves into ordered arrangements of strategies in the low noise limit in a way providing optimum payoff for the whole society. Dominance of defectors, representing the "tragedy of the commons", is found within the regions of prisoner's dilemma and stag hunt game for selfish players ($Q=0$). Due to the symmetry in the effective utility function the system exhibits similar behavior even for $Q=1$ that can be interpreted as the "lovers' dilemma".
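The weighted utility described above can be written as $U = (1-Q)\,\text{own income} + Q\,\text{co-player income}$. The sketch below evaluates it for a single pair of players and picks the deterministic best response, which corresponds to a zero-noise, two-player caricature rather than the stochastic lattice dynamics studied in the paper. The payoff values are hypothetical prisoner's dilemma numbers chosen for illustration, and the function names are assumptions.

```python
def utility(own_income, coplayer_income, q):
    """Utility mixing own and co-player income with weights 1 - q and q."""
    return (1.0 - q) * own_income + q * coplayer_income


# Hypothetical prisoner's dilemma: PAYOFF[(mine, other)] is my income.
PAYOFF = {
    ("C", "C"): 1.0,
    ("C", "D"): -0.5,
    ("D", "C"): 1.5,
    ("D", "D"): 0.0,
}


def pair_utility(mine, other, q):
    """Utility of playing `mine` against `other` in a single pair."""
    return utility(PAYOFF[(mine, other)], PAYOFF[(other, mine)], q)


def best_response(other, q):
    """Strategy maximizing the pair utility against a fixed co-player."""
    return max("CD", key=lambda s: pair_utility(s, other, q))


for q in (0.0, 0.5):
    print(q, best_response("C", q), best_response("D", q))
```

With these payoffs a selfish player ($Q=0$) defects against either strategy, while a fraternal player ($Q=1/2$) maximizes the pair's total income and cooperates.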
Submitted 22 March, 2011;
originally announced March 2011.
-
Trends in Social Media: Persistence and Decay
Authors:
Sitaram Asur,
Bernardo A. Huberman,
Gabor Szabo,
Chunyan Wang
Abstract:
Social media generates a prodigious wealth of real-time content at an incessant rate. From all the content that people create and share, only a few topics manage to attract enough attention to rise to the top and become temporal trends which are displayed to users. The question of what factors cause the formation and persistence of trends is an important one that has not been answered yet. In this paper, we conduct an intensive study of trending topics on Twitter and provide a theoretical basis for the formation, persistence and decay of trends. We also demonstrate empirically how factors such as user activity and number of followers do not contribute strongly to trend creation and its propagation. In fact, we find that the resonance of the content with the users of the social network plays a major role in causing trends.
Submitted 7 February, 2011;
originally announced February 2011.
-
Predicting the popularity of online content
Authors:
Gabor Szabo,
Bernardo A. Huberman
Abstract:
We present a method for accurately predicting the long time popularity of online content from early measurements of user access. Using two content sharing portals, Youtube and Digg, we show that by modeling the accrual of views and votes on content offered by these services we can predict the long-term dynamics of individual submissions from initial data. In the case of Digg, measuring access to given stories during the first two hours allows us to forecast their popularity 30 days ahead with remarkable accuracy, while downloads of Youtube videos need to be followed for 10 days to attain the same performance. The differing time scales of the predictions are shown to be due to differences in how content is consumed on the two portals: Digg stories quickly become outdated, while Youtube videos are still found long after they are initially submitted to the portal. We show that predictions are more accurate for submissions for which attention decays quickly, whereas predictions for evergreen content will be prone to larger errors.
Submitted 4 November, 2008;
originally announced November 2008.
-
Characterization and Modeling of an Electro-thermal MEMS Structure
Authors:
P. G. Szabo,
Vladimir Szekely
Abstract:
Thermal functional circuits are an interesting and promising group of MEMS elements. One practical realization is the Quadratic Transfer Characteristic (QTC) element, whose operating principle is the Seebeck effect. In this paper we present analyses of a QTC element from different perspectives. To check the real behavior of the device, we measured a few secondary properties of the structure that correspond to special behavior, because these properties cannot be easily derived from the main characteristics.
Submitted 7 May, 2008;
originally announced May 2008.
-
Diversity of Online Community Activities
Authors:
Tad Hogg,
Gabor Szabo
Abstract:
Web sites where users create and rate content as well as form networks with other users display long-tailed distributions in many aspects of behavior. Using behavior on one such community site, Essembly, we propose and evaluate plausible mechanisms to explain these behaviors. Unlike purely descriptive models, these mechanisms rely on user behaviors based on information available locally to each user. For Essembly, we find the long tails arise from large differences among user activity rates and qualities of the rated content, as well as the extensive variability in the time users devote to the site. We show that the models not only explain overall behavior but also allow estimating the quality of content from their early behaviors.
Submitted 24 March, 2008;
originally announced March 2008.