-
A Maximum Modulus Theorem for functions admitting Stokes phenomena, and specific cases of Dulac's Theorem
Authors:
Jesús Palma-Márquez,
Melvin Yeung
Abstract:
We study large classes of real-valued analytic functions that naturally emerge in the understanding of Dulac's problem, which addresses the finiteness of limit cycles in planar differential equations. Our main statement essentially follows from a Maximum Modulus-type result that we establish: for any function belonging to these classes, the following dichotomy holds. Either it has isolated zeros or it coincides with the identity. As an application, we prove that the non-accumulation of limit cycles holds around a specific class of the so-called superreal polycycles.
Submitted 9 October, 2024;
originally announced October 2024.
-
Natural levels in return maps of elementary polycycles
Authors:
Melvin Yeung
Abstract:
We will provide a proof of a known specific case of Dulac's Theorem in the style of Ilyashenko. From this we derive a quasi-analyticity result for some return maps of polycycles and we give a Structural Theorem for the formal asymptotics of such a polycycle.
Submitted 20 September, 2024;
originally announced September 2024.
-
On the monograph "Finiteness Theorems for limit cycles" and a special case of alternant cycles
Authors:
Melvin Yeung
Abstract:
We provide evidence that the approach of [Ilyashenko 1991] to the proof of Dulac's theorem has a gap. Although the asymptotics of [Ilyashenko 1991] capture far more than the asymptotics of Dulac, we prove that the arguments for why the asymptotics in [Ilyashenko 1991] are not themselves oscillatory are insufficient. We give an explicit counterexample and delineate the setting to which Ilyashenko's result may be restricted so that it remains valid.
Submitted 19 February, 2024;
originally announced February 2024.
-
Focal Attention Networks: optimising attention for biomedical image segmentation
Authors:
Michael Yeung,
Leonardo Rundo,
Evis Sala,
Carola-Bibiane Schönlieb,
Guang Yang
Abstract:
In recent years, there has been increasing interest in incorporating attention into deep learning architectures for biomedical image segmentation. The modular design of attention mechanisms enables flexible integration into convolutional neural network architectures, such as the U-Net. Whether attention is appropriate to use, what type of attention to use, and where in the network to incorporate attention modules are all important considerations that are currently overlooked. In this paper, we investigate the role of the Focal parameter in modulating attention, revealing a link between attention in loss functions and networks. By incorporating a Focal distance penalty term, we extend the Unified Focal loss framework to include boundary-based losses. Furthermore, we develop a simple, interpretable, dataset- and model-specific heuristic to integrate the Focal parameter into the Squeeze-and-Excitation block and Attention Gate, achieving optimal performance with fewer attention modules on three well-validated biomedical imaging datasets, suggesting that judicious use of attention modules results in better performance and efficiency.
Submitted 31 October, 2021;
originally announced November 2021.
-
Incorporating Boundary Uncertainty into loss functions for biomedical image segmentation
Authors:
Michael Yeung,
Guang Yang,
Evis Sala,
Carola-Bibiane Schönlieb,
Leonardo Rundo
Abstract:
Manual segmentation is used as the gold standard for evaluating neural networks on automated image segmentation tasks. Due to considerable heterogeneity in shapes, colours and textures, demarcating object boundaries is particularly difficult in biomedical images, resulting in significant inter- and intra-rater variability. Approaches such as soft labelling and distance penalty terms apply a global transformation to the ground truth, redefining the loss function with respect to uncertainty. However, global operations are computationally expensive, and neither approach accurately reflects the uncertainty underlying manual annotation. In this paper, we propose Boundary Uncertainty, which uses morphological operations to restrict soft labelling to object boundaries, providing an appropriate representation of uncertainty in ground truth labels, and which may be adapted to enable robust model training where systematic manual segmentation errors are present. We incorporate Boundary Uncertainty with the Dice loss, achieving consistently improved performance across three well-validated biomedical imaging datasets compared to soft labelling and distance-weighted penalty. Boundary Uncertainty not only reflects the segmentation process more accurately, but is also efficient, robust to segmentation errors and exhibits better generalisation.
Submitted 31 October, 2021;
originally announced November 2021.
-
Calibrating the Dice loss to handle neural network overconfidence for biomedical image segmentation
Authors:
Michael Yeung,
Leonardo Rundo,
Yang Nan,
Evis Sala,
Carola-Bibiane Schönlieb,
Guang Yang
Abstract:
The Dice similarity coefficient (DSC) is both a widely used metric and loss function for biomedical image segmentation due to its robustness to class imbalance. However, it is well known that the DSC loss is poorly calibrated, resulting in overconfident predictions that cannot be usefully interpreted in biomedical and clinical practice. Performance is often the only metric used to evaluate segmentations produced by deep neural networks, and calibration is often neglected. However, calibration is important for translation into biomedical and clinical practice, providing crucial contextual information to model predictions for interpretation by scientists and clinicians. In this study, we provide a simple yet effective extension of the DSC loss, named the DSC++ loss, that selectively modulates the penalty associated with overconfident, incorrect predictions. As a standalone loss function, the DSC++ loss achieves significantly improved calibration over the conventional DSC loss across six well-validated open-source biomedical imaging datasets, including both 2D binary and 3D multi-class segmentation tasks. Similarly, we observe significantly improved calibration when integrating the DSC++ loss into four DSC-based loss functions. Finally, we use softmax thresholding to illustrate that well calibrated outputs enable tailoring of recall-precision bias, which is an important post-processing technique to adapt the model predictions to suit the biomedical or clinical task. The DSC++ loss overcomes the major limitation of the DSC loss, providing a suitable loss function for training deep learning segmentation models for use in biomedical and clinical practice. Source code is available at: https://github.com/mlyg/DicePlusPlus.
Submitted 1 November, 2022; v1 submitted 31 October, 2021;
originally announced November 2021.
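For readers unfamiliar with the Dice loss that the abstract above builds on, here is a minimal sketch of the conventional soft Dice (DSC) loss for a binary segmentation. It is not the DSC++ loss and is not taken from the authors' repository; the function name `soft_dice_loss`, the flat-list input format, and the smoothing constant `eps` are assumptions made for illustration.

```python
def soft_dice_loss(pred, target, eps=1e-7):
    """Conventional soft Dice loss for one binary segmentation.

    pred   -- flat list of predicted foreground probabilities in [0, 1]
    target -- flat list of ground-truth labels (0 or 1), same length
    eps    -- hypothetical smoothing constant that avoids division by zero
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")

    # Soft overlap between prediction and ground truth.
    intersection = sum(p * g for p, g in zip(pred, target))
    # Sum of predicted and true foreground mass.
    denominator = sum(pred) + sum(target)

    dice = (2.0 * intersection + eps) / (denominator + eps)
    return 1.0 - dice


if __name__ == "__main__":
    # A perfect prediction gives a loss of (approximately) zero,
    # a completely wrong one gives a loss close to one.
    print(soft_dice_loss([1.0, 0.0, 1.0], [1, 0, 1]))
    print(soft_dice_loss([0.0, 1.0], [1, 0]))
```

Because the loss depends only on the overlap and the foreground sums, a large number of correctly predicted background pixels does not dilute it, which is the robustness to class imbalance the abstract refers to.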
-
A FEAST variant incorporated with a power iteration
Authors:
Man-Chung Yeung,
Long Lee
Abstract:
We present a variant of the FEAST matrix eigensolver for solving restricted real and symmetric eigenvalue problems. The method is derived from a combination of a variant of the FEAST method, which employs two contour integrals per iteration, and a power subspace iteration process. Compared with the original FEAST method, our new method does not require the search subspace dimension to be greater than or equal to the number of eigenvalues inside a search interval, and can deal with narrow search intervals more effectively. Empirically, the FEAST iteration and the power subspace iteration complement each other, making the new method stable and robust.
Submitted 6 March, 2022; v1 submitted 3 December, 2019;
originally announced December 2019.
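As background for the power subspace iteration mentioned in the abstract above, the following is a minimal sketch of plain single-vector power iteration on a small real symmetric matrix. It is not the authors' FEAST variant and involves no contour integration; the function name `power_iteration`, the list-of-lists matrix format, and the fixed iteration count are assumptions made for illustration.

```python
import math


def power_iteration(matrix, start, num_iters=100):
    """Approximate the dominant eigenpair of a real symmetric matrix.

    matrix    -- square matrix given as a list of rows
    start     -- nonzero starting vector (list of floats)
    num_iters -- hypothetical fixed number of iterations
    """
    vec = list(start)
    for _ in range(num_iters):
        # Multiply by the matrix, then renormalise to unit length.
        product = [sum(a * x for a, x in zip(row, vec)) for row in matrix]
        norm = math.sqrt(sum(x * x for x in product))
        if norm == 0.0:
            raise ValueError("iterate collapsed to the zero vector")
        vec = [x / norm for x in product]

    # Rayleigh quotient of the unit vector estimates the eigenvalue.
    product = [sum(a * x for a, x in zip(row, vec)) for row in matrix]
    eigenvalue = sum(x * y for x, y in zip(vec, product))
    return eigenvalue, vec


if __name__ == "__main__":
    lam, v = power_iteration([[2.0, 0.0], [0.0, 1.0]], [1.0, 1.0])
    print(lam, v)
```

A subspace version applies the same multiply-and-normalise step to several vectors at once and re-orthogonalises them after each step.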
-
A Spectral Projection Preconditioner for Solving Ill Conditioned Linear Systems
Authors:
Man-Chung Yeung,
Craig C. Douglas,
Long Lee
Abstract:
We present a preconditioner based on spectral projection that is combined with a deflated Krylov subspace method for solving ill-conditioned linear systems of equations. Our results show that the proposed algorithm requires far fewer iterations than a Krylov subspace solver to meet the convergence criterion on an ill-conditioned problem. In our numerical experiments, the solution obtained by the proposed algorithm is more accurate in terms of the norm of the distance to the exact solution of the linear system of equations.
Submitted 22 September, 2016;
originally announced September 2016.
-
On Solving Ill-Conditioned Linear Systems
Authors:
Craig C. Douglas,
Long Lee,
Man-Chung Yeung
Abstract:
This paper presents the first results on combining two theoretically sound methods (spectral projection and multigrid methods) to attack ill-conditioned linear systems. Our preliminary results show that the proposed algorithm, applied to a Krylov subspace method, takes far fewer iterations to solve an ill-conditioned problem downloaded from a popular online sparse matrix collection.
Submitted 16 February, 2016;
originally announced February 2016.
-
A FEAST Algorithm with oblique projection for generalized eigenvalue problems
Authors:
Guojian Yin,
Raymond H. Chan,
Man-Chung Yeung
Abstract:
Contour-integral based eigensolvers are a recent approach to computing the eigenvalues inside a given region in the complex plane. The best-known members are the Sakurai-Sugiura (SS) method, its stable version CIRR, and the FEAST algorithm. An attractive computational advantage of these methods is that they are easily parallelizable. The FEAST algorithm was developed for generalized Hermitian eigenvalue problems. It is stable and accurate. However, it may fail when applied to non-Hermitian problems. In this paper, we extend the FEAST algorithm to non-Hermitian problems. The approach can be summarized as follows: (i) construct a particular contour integral to form a subspace containing the desired eigenspace, and (ii) use the oblique projection technique to extract the desired eigenpairs with an appropriately chosen test subspace. The related mathematical framework is established. We also address some implementation issues, such as how to choose a suitable starting matrix and design good stopping criteria. Numerical experiments are provided to illustrate that our method is stable and efficient.
Submitted 9 June, 2015; v1 submitted 7 April, 2014;
originally announced April 2014.
-
ML(n)BiCGStabt: A ML(n)BiCGStab variant with A-transpose
Authors:
Man-Chung Yeung
Abstract:
The 1980 IDR method plays an important role in the history of Krylov subspace methods. It started the research of transpose-free Krylov subspace methods. In this paper, we make a first attempt to bring back A-transpose to the research area by presenting a new ML(n)BiCGStab variant that involves A-transpose in its implementation. Comparisons of this new algorithm with the existing ML(n)BiCGStab algorithms will be presented.
Submitted 3 November, 2013;
originally announced November 2013.
-
An introduction to ML(n)BiCGStab
Authors:
Man-Chung Yeung
Abstract:
ML(n)BiCGStab is a Krylov subspace method for the solution of large, sparse and non-symmetric linear systems. In theory, it is a method that lies between the well-known BiCGStab and GMRES/FOM. In fact, when n = 1, ML(1)BiCGStab is BiCGStab and when n = N, ML(N)BiCGStab is GMRES/FOM, where N is the size of the linear system. Therefore, ML(n)BiCGStab is a bridge that connects the Lanczos-based BiCGStab and the Arnoldi-based GMRES/FOM. In computation, ML(n)BiCGStab can be much more stable and converge much faster than BiCGStab when an ill-conditioned problem is solved. We have tested ML(n)BiCGStab on the standard oil reservoir simulation test data called SPE9 and found that ML(n)BiCGStab reduced the total computational time by more than 60% when compared to BiCGStab. Tests made on the data from Matrix Market also support the superiority of ML(n)BiCGStab over BiCGStab. Because of the O(N^2) storage requirement of full GMRES, one has to adopt a restart strategy to keep the storage under control when GMRES is implemented. In comparison, ML(n)BiCGStab has only an O(nN) storage requirement and therefore does not need a restart strategy. In this paper, we introduce ML(n)BiCGStab (in particular, a new algorithm involving A-transpose), its relations to some existing methods and its implementations.
Submitted 18 June, 2011;
originally announced June 2011.
-
ML(n)BiCGStab: Reformulation, Analysis and Implementation
Authors:
Man-Chung Yeung
Abstract:
With the aid of index functions, we re-derive the ML(n)BiCGStab algorithm from a 1999 paper by Yeung and Chan in a more systematic way. It turns out that there are n ways to define the ML(n)BiCGStab residual vector. Each definition leads to a different ML(n)BiCGStab algorithm. We demonstrate this by presenting a second algorithm which requires less storage. In theory, this second algorithm serves as a bridge that connects the Lanczos-based BiCGStab and the Arnoldi-based FOM, while ML(n)BiCG is a bridge connecting BiCG and FOM. We also analyze the breakdown situations from the probabilistic point of view and summarize some useful properties of ML(n)BiCGStab. Implementation issues are also addressed.
Submitted 24 November, 2010;
originally announced November 2010.