Search | arXiv e-print repository

A Maximum Modulus Theorem for functions admitting Stokes phenomena, and specific cases of Dulac's Theorem

Authors: Jesús Palma-Márquez, Melvin Yeung

Abstract: We study large classes of real-valued analytic functions that naturally emerge in the understanding of Dulac's problem, which addresses the finiteness of limit cycles in planar differential equations. Building on a Maximum Modulus-type result we got, our main statement essentially follows. Namely, for any function belonging to these classes, the following dichotomy holds: either it has isolated zeros or it coincides with the identity. As an application, we prove that the non-accumulation of limit cycles holds around a specific class of the so-called superreal polycycles.

Submitted 9 October, 2024; originally announced October 2024.

Comments: 19 pages, 4 figures

MSC Class: 30C80; 34C05; 34C07; 40A30

arXiv:2409.13630 [pdf, ps, other]

Natural levels in return maps of elementary polycycles

Authors: Melvin Yeung

Abstract: We will provide a proof of a known specific case of Dulac's Theorem in the style of Ilyashenko. From this we derive a quasi-analyticity result for some return maps of polycycles and we give a Structural Theorem for the formal asymptotics of such a polycycle.

Submitted 20 September, 2024; originally announced September 2024.

MSC Class: 34E99

arXiv:2402.12506 [pdf, other]

On the monograph "Finiteness Theorems for limit cycles" and a special case of alternant cycles

Authors: Melvin Yeung

Abstract: We provide evidence that the approach of [Ilyashenko 1991] to the proof of Dulac's theorem has a gap. Although the asymptotics of [Ilyashenko 1991] capture far more than the asymptotics of Dulac, we prove that the arguments for why the asymptotics in [Ilyashenko 1991] are not themselves oscillatory are insufficient. We give an explicit counterexample and we draw confines to which Ilyashenko's result may be restricted in order to keep the validity.

Submitted 19 February, 2024; originally announced February 2024.

MSC Class: 34C07 (Primary) 34E05 (Secondary)

arXiv:2111.00534 [pdf, other]

Focal Attention Networks: optimising attention for biomedical image segmentation

Authors: Michael Yeung, Leonardo Rundo, Evis Sala, Carola-Bibiane Schönlieb, Guang Yang

Abstract: In recent years, there has been increasing interest to incorporate attention into deep learning architectures for biomedical image segmentation. The modular design of attention mechanisms enables flexible integration into convolutional neural network architectures, such as the U-Net. Whether attention is appropriate to use, what type of attention to use, and where in the network to incorporate attention modules, are all important considerations that are currently overlooked. In this paper, we investigate the role of the Focal parameter in modulating attention, revealing a link between attention in loss functions and networks. By incorporating a Focal distance penalty term, we extend the Unified Focal loss framework to include boundary-based losses. Furthermore, we develop a simple and interpretable, dataset and model-specific heuristic to integrate the Focal parameter into the Squeeze-and-Excitation block and Attention Gate, achieving optimal performance with fewer number of attention modules on three well-validated biomedical imaging datasets, suggesting judicious use of attention modules results in better performance and efficiency.

Submitted 31 October, 2021; originally announced November 2021.

arXiv:2111.00533 [pdf, other]

Incorporating Boundary Uncertainty into loss functions for biomedical image segmentation

Authors: Michael Yeung, Guang Yang, Evis Sala, Carola-Bibiane Schönlieb, Leonardo Rundo

Abstract: Manual segmentation is used as the gold-standard for evaluating neural networks on automated image segmentation tasks. Due to considerable heterogeneity in shapes, colours and textures, demarcating object boundaries is particularly difficult in biomedical images, resulting in significant inter and intra-rater variability. Approaches, such as soft labelling and distance penalty term, apply a global transformation to the ground truth, redefining the loss function with respect to uncertainty. However, global operations are computationally expensive, and neither approach accurately reflects the uncertainty underlying manual annotation. In this paper, we propose the Boundary Uncertainty, which uses morphological operations to restrict soft labelling to object boundaries, providing an appropriate representation of uncertainty in ground truth labels, and may be adapted to enable robust model training where systematic manual segmentation errors are present. We incorporate Boundary Uncertainty with the Dice loss, achieving consistently improved performance across three well-validated biomedical imaging datasets compared to soft labelling and distance-weighted penalty. Boundary Uncertainty not only more accurately reflects the segmentation process, but it is also efficient, robust to segmentation errors and exhibits better generalisation.

Submitted 31 October, 2021; originally announced November 2021.

arXiv:2111.00528 [pdf, other]

Calibrating the Dice loss to handle neural network overconfidence for biomedical image segmentation

Authors: Michael Yeung, Leonardo Rundo, Yang Nan, Evis Sala, Carola-Bibiane Schönlieb, Guang Yang

Abstract: The Dice similarity coefficient (DSC) is both a widely used metric and loss function for biomedical image segmentation due to its robustness to class imbalance. However, it is well known that the DSC loss is poorly calibrated, resulting in overconfident predictions that cannot be usefully interpreted in biomedical and clinical practice. Performance is often the only metric used to evaluate segmentations produced by deep neural networks, and calibration is often neglected. However, calibration is important for translation into biomedical and clinical practice, providing crucial contextual information to model predictions for interpretation by scientists and clinicians. In this study, we provide a simple yet effective extension of the DSC loss, named the DSC++ loss, that selectively modulates the penalty associated with overconfident, incorrect predictions. As a standalone loss function, the DSC++ loss achieves significantly improved calibration over the conventional DSC loss across six well-validated open-source biomedical imaging datasets, including both 2D binary and 3D multi-class segmentation tasks. Similarly, we observe significantly improved calibration when integrating the DSC++ loss into four DSC-based loss functions. Finally, we use softmax thresholding to illustrate that well calibrated outputs enable tailoring of recall-precision bias, which is an important post-processing technique to adapt the model predictions to suit the biomedical or clinical task. The DSC++ loss overcomes the major limitation of the DSC loss, providing a suitable loss function for training deep learning segmentation models for use in biomedical and clinical practice. Source code is available at: https://github.com/mlyg/DicePlusPlus.

Submitted 1 November, 2022; v1 submitted 31 October, 2021; originally announced November 2021.

arXiv:1912.01642 [pdf, ps, other]

A FEAST variant incorporated with a power iteration

Authors: Man-Chung Yeung, Long Lee

Abstract: We present a variant of the FEAST matrix eigensolver for solving restricted real and symmetric eigenvalue problems. The method is derived from a combination of a variant of the FEAST method, which employs two contour integrals per iteration, and a power subspace iteration process. Compared with the original FEAST method, our new method does not require that the search subspace dimension must be greater than or equal to the number of eigenvalues inside a search interval, and can deal with narrow search intervals more effectively. Empirically, the FEAST iteration and the power subspace iteration are in a mutually beneficial collaboration to make the new method stable and robust.

Submitted 6 March, 2022; v1 submitted 3 December, 2019; originally announced December 2019.

Comments: Algorithm 4 in Section 3.3 revised, Sections 1 and 2 rewritten, Table 4.7 added to Experiment 4 in Section 4.1, references changed, typos corrected

MSC Class: 15A18; 58C40; 65F15

ACM Class: G.1

arXiv:1609.06839 [pdf, other]

A Spectral Projection Preconditioner for Solving Ill Conditioned Linear Systems

Authors: Man-Chung Yeung, Craig C. Douglas, Long Lee

Abstract: We present a preconditioner based on spectral projection that is combined with a deflated Krylov subspace method for solving ill conditioned linear systems of equations. Our results show that the proposed algorithm requires many fewer iterations to achieve the convergence criterion for solving an ill conditioned problem than a Krylov subspace solver. In our numerical experiments, the solution obtained by the proposed algorithm is more accurate in terms of the norm of the distance to the exact solution of the linear system of equations.

Submitted 22 September, 2016; originally announced September 2016.

Comments: ICCS 2016 Workshop. arXiv admin note: text overlap with arXiv:1602.05211

MSC Class: 65F08; 65F10; 65F15; 65F22; 65F50

arXiv:1602.05211 [pdf, other]

On Solving Ill-Conditioned Linear Systems

Authors: Craig C. Douglas, Long Lee, Man-Chung Yeung

Abstract: This paper presents the first results to combine two theoretically sound methods (spectral projection and multigrid methods) together to attack ill-conditioned linear systems. Our preliminary results show that the proposed algorithm applied to a Krylov subspace method takes much fewer iterations for solving an ill-conditioned problem downloaded from a popular online sparse matrix collection.

Submitted 16 February, 2016; originally announced February 2016.

arXiv:1404.1768 [pdf, ps, other]

A FEAST Algorithm with oblique projection for generalized eigenvalue problems

Authors: Guojian Yin, Raymond H. Chan, Man-Chung Yeung

Abstract: The contour-integral based eigensolvers are the recent efforts for computing the eigenvalues inside a given region in the complex plane. The best-known members are the Sakurai-Sugiura (SS) method, its stable version CIRR, and the FEAST algorithm. An attractive computational advantage of these methods is that they are easily parallelizable. The FEAST algorithm was developed for the generalized Hermitian eigenvalue problems. It is stable and accurate. However, it may fail when applied to non-Hermitian problems. In this paper, we extend the FEAST algorithm to non-Hermitian problems. The approach can be summarized as follows: (i) to construct a particular contour integral to form a subspace containing the desired eigenspace, and (ii) to use the oblique projection technique to extract desired eigenpairs with appropriately chosen test subspace. The related mathematical framework is established. We also address some implementation issues such as how to choose a suitable starting matrix and design good stopping criteria. Numerical experiments are provided to illustrate that our method is stable and efficient.

Submitted 9 June, 2015; v1 submitted 7 April, 2014; originally announced April 2014.

arXiv:1311.0583 [pdf, ps, other]

ML(n)BiCGStabt: A ML(n)BiCGStab variant with A-transpose

Authors: Man-Chung Yeung

Abstract: The 1980 IDR method plays an important role in the history of Krylov subspace methods. It started the research of transpose-free Krylov subspace methods. In this paper, we make a first attempt to bring back A-transpose to the research area by presenting a new ML(n)BiCGStab variant that involves A-transpose in its implementation. Comparisons of this new algorithm with the existing ML(n)BiCGStab algorithms will be presented.

Submitted 3 November, 2013; originally announced November 2013.

MSC Class: 65F10; 65F15 (Primary) 65F25; 65F30 (Secondary)

arXiv:1106.3678 [pdf, ps, other]

An introduction to ML(n)BiCGStab

Authors: Man-Chung Yeung

Abstract: ML(n)BiCGStab is a Krylov subspace method for the solution of large, sparse and non-symmetric linear systems. In theory, it is a method that lies between the well-known BiCGStab and GMRES/FOM. In fact, when n = 1, ML(1)BiCGStab is BiCGStab and when n = N, ML(N)BiCGStab is GMRES/FOM where N is the size of the linear system. Therefore, ML(n)BiCGStab is a bridge that connects the Lanczos-based BiCGStab and the Arnoldi-based GMRES/FOM. In computation, ML(n)BiCGStab can be much more stable and converge much faster than BiCGStab when a problem with ill-condition is solved. We have tested ML(n)BiCGStab on the standard oil reservoir simulation test data called SPE9 and found that ML(n)BiCGStab reduced the total computational time by more than 60% when compared to BiCGStab. Tests made on the data from Matrix Market also support the superiority of ML(n)BiCGStab over BiCGStab. Because of the O(N^2) storage requirement in the full GMRES, one has to adopt a restart strategy to get the storage under control when GMRES is implemented. In comparison, ML(n)BiCGStab is a method with only O(nN) storage requirement and therefore it does not need a restart strategy. In this paper, we introduce ML(n)BiCGStab (in particular, a new algorithm involving A-transpose), its relations to some existing methods and its implementations.

Submitted 18 June, 2011; originally announced June 2011.

Comments: This paper introduces the three algorithms of the ML(n)BiCGStab method for the solution of large, nonsymmetric linear systems. One of the algorithms involves A-transpose in its implementation. This paper will be presented in the 33rd International Conference on Boundary Elements and other Mesh Reduction Methods, 28-30 June 2011, New Forest, UK

Report number: Report 11-06, Math Dept., UW

MSC Class: 65D15; 65F30; 65G20; 68W10

arXiv:1011.5314 [pdf, ps, other]

ML(n)BiCGStab: Reformulation, Analysis and Implementation

Authors: Man-Chung Yeung

Abstract: With the aid of index functions, we re-derive the ML(n)BiCGStab algorithm in a paper by Yeung and Chan in 1999 in a more systematic way. It turns out that there are n ways to define the ML(n)BiCGStab residual vector. Each definition will lead to a different ML(n)BiCGStab algorithm. We demonstrate this by presenting a second algorithm which requires less storage. In theory, this second algorithm serves as a bridge that connects the Lanczos-based BiCGStab and the Arnoldi-based FOM while ML(n)BiCG a bridge connecting BiCG and FOM. We also analyze the breakdown situations from the probabilistic point of view and summarize some useful properties of ML(n)BiCGStab. Implementation issues are also addressed.

Submitted 24 November, 2010; originally announced November 2010.

Comments: This paper is dedicated to the memory of Prof. Gene Golub. Most part of the paper was presented in Gene Golub Memorial Conference, Feb. 29-Mar. 1, 2008, University of Massachusetts, Dartmouth, U.S.A

MSC Class: Numerical Analysis

Showing 1–13 of 13 results for author: Yeung, M