-
HiLoTs: High-Low Temporal Sensitive Representation Learning for Semi-Supervised LiDAR Segmentation in Autonomous Driving
Authors:
R. D. Lin,
Pengcheng Weng,
Yinqiao Wang,
Han Ding,
Jinsong Han,
Fei Wang
Abstract:
LiDAR point cloud semantic segmentation plays a crucial role in autonomous driving. In recent years, semi-supervised methods have gained popularity due to their significant reduction in annotation labor and time costs. Current semi-supervised methods typically focus on point cloud spatial distribution or consider short-term temporal representations, e.g., only two adjacent frames, often overlooking the rich long-term temporal properties inherent in autonomous driving scenarios. From driving experience, we observe that nearby objects, such as roads and vehicles, remain stable while driving, whereas distant objects exhibit greater variability in category and shape. This natural phenomenon is also captured by LiDAR, which reflects lower temporal sensitivity for nearby objects and higher sensitivity for distant ones. To leverage these characteristics, we propose HiLoTs, which learns high-temporal-sensitivity and low-temporal-sensitivity representations from continuous LiDAR frames. These representations are further enhanced and fused using a cross-attention mechanism. Additionally, we employ a teacher-student framework to align the representations learned by the labeled and unlabeled branches, effectively utilizing the large amounts of unlabeled data. Experimental results on the SemanticKITTI and nuScenes datasets demonstrate that our proposed HiLoTs outperforms state-of-the-art semi-supervised methods and achieves performance close to LiDAR+Camera multimodal approaches. Code is available at https://github.com/rdlin118/HiLoTs
Submitted 22 March, 2025;
originally announced March 2025.
-
Score-Based Generative Models for PET Image Reconstruction
Authors:
Imraj RD Singh,
Alexander Denker,
Riccardo Barbano,
Željko Kereta,
Bangti Jin,
Kris Thielemans,
Peter Maass,
Simon Arridge
Abstract:
Score-based generative models have demonstrated highly promising results for medical image reconstruction tasks in magnetic resonance imaging or computed tomography. However, their application to Positron Emission Tomography (PET) is still largely unexplored. PET image reconstruction involves a variety of challenges, including Poisson noise with high variance and a wide dynamic range. To address these challenges, we propose several PET-specific adaptations of score-based generative models. The proposed framework is developed for both 2D and 3D PET. In addition, we provide an extension to guided reconstruction using magnetic resonance images. We validate the approach through extensive 2D and 3D $\textit{in-silico}$ experiments with a model trained on patient-realistic data without lesions, and evaluate on data without lesions as well as out-of-distribution data with lesions. This demonstrates the proposed method's robustness and significant potential for improved PET reconstruction.
Submitted 23 January, 2024; v1 submitted 27 August, 2023;
originally announced August 2023.
-
Development and evaluation of intraoperative ultrasound segmentation with negative image frames and multiple observer labels
Authors:
Liam F Chalcroft,
Jiongqi Qu,
Sophie A Martin,
Iani JMB Gayo,
Giulio V Minore,
Imraj RD Singh,
Shaheer U Saeed,
Qianye Yang,
Zachary MC Baum,
Andre Altmann,
Yipeng Hu
Abstract:
When developing deep neural networks for segmenting intraoperative ultrasound images, several practical issues are encountered frequently, such as the presence of ultrasound frames that do not contain regions of interest and the high variance in ground-truth labels. In this study, we evaluate the utility of a pre-screening classification network placed before the segmentation network. Experimental results demonstrate that such a classifier, by minimising frame classification errors, directly affects the number of false positive and false negative frames. Importantly, the segmentation accuracy on the classifier-selected frames (those that would be segmented) remains comparable to or better than that of standalone segmentation networks. Interestingly, the efficacy of the pre-screening classifier was affected by the sampling methods for training labels from multiple observers, a seemingly independent problem. We show experimentally that a previously proposed approach, combining random sampling and consensus labels, may need to be adapted to perform well in our application. Furthermore, this work aims to share practical experience in developing a machine learning application that assists highly variable interventional imaging for prostate cancer patients, to present robust and reproducible open-source implementations, and to report a comprehensive set of results and analysis comparing these practical, yet important, options in a real-world clinical application.
Submitted 28 July, 2021;
originally announced August 2021.
-
A Perspective Study on Content Management in E-Learning and M-Learning
Authors:
RD. Balaji,
Fatma Al-Mahri,
R. Malathi
Abstract:
This is the era of Information and Communication Technology (ICT). Nowadays there is no limit to learning: people can learn anywhere and at any time thanks to advances in technology. Electronic Learning (E-learning) and Mobile Learning (M-learning) are two key terms in modern education, particularly in technology-enhanced education and technology-supported learning. E-learning is defined as the instructional content or learning experience delivered or enabled by electronic technologies, whereas M-learning is defined simply as learning via mobile devices such as cell phones, smartphones, palmtops, and handheld computers. The two technologies have many similarities, as both are modern learning tools. Moreover, the latter is an extension and a subset of the former. However, a few limitations and differences still exist in mobile learning tools, especially in design, development, and technology usability. In this paper we focus mainly on how digital content is administered (content management) in these two technologies. Additionally, content management in E-learning and M-learning is compared, and the similarities and differences are identified.
Submitted 26 April, 2016;
originally announced May 2016.
-
An Appropriate Sensor Distribution Technique in Wireless Sensor Networks
Authors:
R. Malathi,
Alagarsamy,
V. Veeramani,
RD. Balaji
Abstract:
Wireless Sensor Networks (WSNs) are pertinent to many applications with varied network parameters. Sensor node placement in the application region, whether indoor or outdoor, is a major task and plays a significant role in network performance. Node placement is carried out according to the region where it is applied and is either deterministic or non-deterministic. Because applications need different sensing or detection probabilities, the same sensor placement approach may not suit all of them. Some applications are well formed and perform better with a uniform distribution of sensors, but a few need a dense distribution of nodes in particular sensitive places, especially applications meant for intrusion detection. This paper sets forth an application that needs high-level intrusion detection, together with a suitable sensor distribution methodology known as Half-Normal Distribution (Half-Gaussian) based deployment. We also discuss a theoretical comparison of detection probability against the uniform distribution in terms of the number of sensor nodes.
Submitted 26 April, 2016;
originally announced April 2016.
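As a rough illustration of what a half-normal deployment could look like, the sketch below draws each sensor's distance from a protected point as the absolute value of a zero-mean Gaussian, so that node density is highest near that point and thins out with distance. The function name `half_normal_deployment`, the choice of a single protected point at the origin, the uniform random angle, and the value of `sigma` are all assumptions made for this example; the abstract does not specify the paper's geometry or parameters.

```python
import math
import random


def half_normal_deployment(n_sensors, sigma, seed=None):
    """Return (x, y) sensor positions around a protected point at the origin.

    Assumption for illustration: the distance of each sensor from the origin
    follows a half-normal distribution with scale sigma, and the direction is
    uniform in [0, 2*pi).
    """
    rng = random.Random(seed)
    positions = []
    for _ in range(n_sensors):
        # |N(0, sigma^2)| is half-normal: small distances are the most likely.
        distance = abs(rng.gauss(0.0, sigma))
        angle = rng.uniform(0.0, 2.0 * math.pi)
        positions.append((distance * math.cos(angle), distance * math.sin(angle)))
    return positions


if __name__ == "__main__":
    sensors = half_normal_deployment(n_sensors=1000, sigma=50.0, seed=1)
    near = sum(1 for x, y in sensors if math.hypot(x, y) <= 50.0)
    print(f"{near} of {len(sensors)} sensors lie within one sigma of the origin")
```

Under these assumptions roughly two thirds of the nodes fall within one `sigma` of the protected point, whereas a uniform deployment over the same region would spread them evenly.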
-
Confidence Decision Trees via Online and Active Learning for Streaming (BIG) Data
Authors:
Rocco De Rosa
Abstract:
Decision tree classifiers are a widely used tool in data stream mining. The use of confidence intervals to estimate the gain associated with each split leads to very effective methods, like the popular Hoeffding tree algorithm. From a statistical viewpoint, the analysis of decision tree classifiers in a streaming setting requires knowing when enough new information has been collected to justify splitting a leaf. Although some of the issues in the statistical analysis of Hoeffding trees have already been clarified, a general and rigorous study of confidence intervals for splitting criteria is missing. We fill this gap by deriving accurate confidence intervals to estimate the splitting gain in decision tree learning with respect to three criteria: entropy, Gini index, and a third index proposed by Kearns and Mansour. Our confidence intervals depend in a more detailed way on the tree parameters. We also extend our confidence analysis to a selective sampling setting, in which the decision tree learner adaptively decides which labels to query in the stream. We furnish a theoretical guarantee bounding the probability that the classification is non-optimal when learning the decision tree via our selective sampling strategy. Experiments on real and synthetic data in a streaming setting show that our trees are indeed more accurate than trees with the same number of leaves generated by other techniques, and that our active learning module saves labeling cost. In addition, comparing our labeling strategy with recent methods, we show that our approach is more robust and consistent than all the other techniques applied to incremental decision trees.
Submitted 12 April, 2016;
originally announced April 2016.
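For readers unfamiliar with the Hoeffding tree algorithm mentioned in this abstract, the sketch below shows the classic split test it is usually described with: a leaf is split once the observed gain of the best attribute exceeds that of the runner-up by more than the Hoeffding bound. This is the textbook bound, not the refined confidence intervals derived in the paper, and the function names and example numbers are assumptions chosen for illustration.

```python
import math


def hoeffding_bound(value_range, delta, n):
    """Classic Hoeffding bound epsilon = sqrt(R^2 * ln(1/delta) / (2n)).

    With probability at least 1 - delta, the mean of n independent
    observations with range R is within epsilon of its expectation.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    if not 0.0 < delta < 1.0:
        raise ValueError("delta must lie in (0, 1)")
    return math.sqrt(value_range ** 2 * math.log(1.0 / delta) / (2.0 * n))


def should_split(best_gain, second_gain, value_range, delta, n):
    """Split when the gap between the two best gains exceeds the bound."""
    return (best_gain - second_gain) > hoeffding_bound(value_range, delta, n)


if __name__ == "__main__":
    # Hypothetical gains observed at a leaf after 1000 examples.
    print(hoeffding_bound(value_range=1.0, delta=0.05, n=1000))
    print(should_split(0.30, 0.20, value_range=1.0, delta=0.05, n=1000))
    print(should_split(0.30, 0.29, value_range=1.0, delta=0.05, n=1000))
```

Because the bound shrinks as `n` grows, a small gap between two candidate splits simply means the leaf waits for more examples before committing.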
-
Active Learning for Online Recognition of Human Activities from Streaming Videos
Authors:
Rocco De Rosa,
Ilaria Gori,
Fabio Cuzzolin,
Barbara Caputo,
Nicolò Cesa-Bianchi
Abstract:
Recognising human activities from streaming videos poses unique challenges to learning algorithms: predictive models need to be scalable, incrementally trainable, and must remain bounded in size even when the data stream is arbitrarily long. Furthermore, as parameter tuning is problematic in a streaming setting, suitable approaches should be parameterless, and make no assumptions on what class labels may occur in the stream. We present here an approach to the recognition of human actions from streaming data which meets all these requirements by: (1) incrementally learning a model which adaptively covers the feature space with simple local classifiers; (2) employing an active learning strategy to reduce annotation requests; (3) achieving promising accuracy within a fixed model size. Extensive experiments on standard benchmarks show that our approach is competitive with state-of-the-art non-incremental methods, and outperforms the existing active incremental baselines.
Submitted 11 April, 2016;
originally announced April 2016.
-
Online Open World Recognition
Authors:
Rocco De Rosa,
Thomas Mensink,
Barbara Caputo
Abstract:
As we enter the big data age and an avalanche of images has become readily available, recognition systems face the need to move from closed, lab settings where the number of classes and training data are fixed, to dynamic scenarios where the number of categories to be recognized grows continuously over time and new data provide useful information to update the system. Recent attempts, like the open world recognition framework, tried to inject dynamics into the system by detecting new unknown classes and adding them incrementally, while at the same time continuously updating the models for the known classes. In this paper we argue that to properly capture the intrinsic dynamics of open world recognition, it is necessary to add to these aspects (a) the incremental learning of the underlying metric, (b) the incremental estimate of confidence thresholds for the unknown classes, and (c) the use of local learning to precisely describe the space of classes. We extend three existing metric learning algorithms towards these goals by using online metric learning. Experimentally we validate our approach on two large-scale datasets in different learning scenarios. For all these scenarios our proposed methods outperform their non-online counterparts. We conclude that local and online learning is important to capture the full dynamics of open world recognition.
Submitted 8 April, 2016;
originally announced April 2016.
-
The ABACOC Algorithm: a Novel Approach for Nonparametric Classification of Data Streams
Authors:
Rocco De Rosa,
Francesco Orabona,
Nicolò Cesa-Bianchi
Abstract:
Stream mining poses unique challenges to machine learning: predictive models are required to be scalable, incrementally trainable, must remain bounded in size (even when the data stream is arbitrarily long), and be nonparametric in order to achieve high accuracy even in complex and dynamic environments. Moreover, the learning system must be parameterless (traditional tuning methods are problematic in streaming settings) and avoid requiring prior knowledge of the number of distinct class labels occurring in the stream. In this paper, we introduce a new algorithmic approach for nonparametric learning in data streams. Our approach addresses all the above-mentioned challenges by learning a model that covers the input space using simple local classifiers. The distribution of these classifiers dynamically adapts to the local (unknown) complexity of the classification problem, thus achieving a good balance between model complexity and predictive accuracy. We design four variants of our approach of increasing adaptivity. By means of an extensive empirical evaluation against standard nonparametric baselines, we show state-of-the-art results in terms of accuracy versus model size. For the variant that imposes a strict bound on the model size, we show better performance than all other methods measured at the same model size value. Our empirical analysis is complemented by a theoretical performance guarantee which does not rely on any stochastic assumption on the source generating the stream.
Submitted 20 August, 2015;
originally announced August 2015.
-
Augmented Reality in ICT for Minimum Knowledge Loss
Authors:
RamKumar Lakshminarayanan,
RD. Balaji,
Binod kumar,
Malathi Balaji
Abstract:
The informatics world digitizes human beings, with contributions from people across industry. A recent survey showed that people are not accustomed to, or are not able to use, electronic devices to their full extent. People are also increasingly dependent on technologies, which rule their day-to-day activities. In this paper we discuss one advanced technology that will soon rule the world and make people more creative and, at the same time, hassle-free. This concept was introduced as 6th sense technology by an IIT Mumbai student who is presently a Ph.D. scholar at MIT, USA. Similar research is under way under the title Augmented Reality. This research creates a new association between the real world and the digital world and allows us to share and manipulate information directly with our mental thoughts. Higher College of Technology, Muscat (HCT), a college that implements state-of-the-art technology for teaching and learning, is trying to identify the opportunities and limitations of implementing augmented reality for teaching and learning. The HCT research team presents two scenarios in which augmented reality can fit. Since this research is at the conceptual level, we illustrate the history of this technology and how it can be adopted in the teaching environment.
Submitted 11 May, 2013;
originally announced May 2013.
-
A short proof that adding some permutation rules to $β$ preserves $SN$
Authors:
Rene David
Abstract:
I show that, if a term is $SN$ for $β$, it remains $SN$ when some permutation rules are added.
Submitted 5 November, 2010;
originally announced November 2010.