-
Tooth morphometry using quasi-conformal theory
Authors:
Gary P. T. Choi,
Hei Long Chan,
Robin Yong,
Sarbin Ranjitkar,
Alan Brook,
Grant Townsend,
Ke Chen,
Lok Ming Lui
Abstract:
Shape analysis is important in anthropology, bioarchaeology and forensic science for interpreting useful information from human remains. In particular, teeth are morphologically stable and hence well-suited for shape analysis. In this work, we propose a framework for tooth morphometry using quasi-conformal theory. Landmark-matching Teichmüller maps are used for establishing a 1-1 correspondence be…
▽ More
Shape analysis is important in anthropology, bioarchaeology and forensic science for interpreting useful information from human remains. In particular, teeth are morphologically stable and hence well-suited for shape analysis. In this work, we propose a framework for tooth morphometry using quasi-conformal theory. Landmark-matching Teichmüller maps are used for establishing a 1-1 correspondence between tooth surfaces with prescribed anatomical landmarks. Then, a quasi-conformal statistical shape analysis model based on the Teichmüller mapping results is proposed for building a tooth classification scheme. We deploy our framework on a dataset of human premolars to analyze the tooth shape variation among genders and ancestries. Experimental results show that our method achieves much higher classification accuracy with respect to both gender and ancestry when compared to the existing methods. Furthermore, our model reveals the underlying tooth shape difference between different genders and ancestries in terms of the local geometric distortion and curvatures.
△ Less
Submitted 6 January, 2019;
originally announced January 2019.
-
Dual Long Short-Term Memory Networks for Sub-Character Representation Learning
Authors:
Han He,
Lei Wu,
Xiaokun Yang,
Hua Yan,
Zhimin Gao,
Yi Feng,
George Townsend
Abstract:
Characters have commonly been regarded as the minimal processing unit in Natural Language Processing (NLP). But many non-latin languages have hieroglyphic writing systems, involving a big alphabet with thousands or millions of characters. Each character is composed of even smaller parts, which are often ignored by the previous work. In this paper, we propose a novel architecture employing two stac…
▽ More
Characters have commonly been regarded as the minimal processing unit in Natural Language Processing (NLP). But many non-latin languages have hieroglyphic writing systems, involving a big alphabet with thousands or millions of characters. Each character is composed of even smaller parts, which are often ignored by the previous work. In this paper, we propose a novel architecture employing two stacked Long Short-Term Memory Networks (LSTMs) to learn sub-character level representation and capture deeper level of semantic meanings. To build a concrete study and substantiate the efficiency of our neural architecture, we take Chinese Word Segmentation as a research case example. Among those languages, Chinese is a typical case, for which every character contains several components called radicals. Our networks employ a shared radical level embedding to solve both Simplified and Traditional Chinese Word Segmentation, without extra Traditional to Simplified Chinese conversion, in such a highly end-to-end way the word segmentation can be significantly simplified compared to the previous work. Radical level embeddings can also capture deeper semantic meaning below character level and improve the system performance of learning. By tying radical and character embeddings together, the parameter count is reduced whereas semantic knowledge is shared and transferred between two levels, boosting the performance largely. On 3 out of 4 Bakeoff 2005 datasets, our method surpassed state-of-the-art results by up to 0.4%. Our results are reproducible, source codes and corpora are available on GitHub.
△ Less
Submitted 4 January, 2018; v1 submitted 23 December, 2017;
originally announced December 2017.
-
Effective Neural Solution for Multi-Criteria Word Segmentation
Authors:
Han He,
Lei Wu,
Hua Yan,
Zhimin Gao,
Yi Feng,
George Townsend
Abstract:
We present a simple yet elegant solution to train a single joint model on multi-criteria corpora for Chinese Word Segmentation (CWS). Our novel design requires no private layers in model architecture, instead, introduces two artificial tokens at the beginning and ending of input sentence to specify the required target criteria. The rest of the model including Long Short-Term Memory (LSTM) layer an…
▽ More
We present a simple yet elegant solution to train a single joint model on multi-criteria corpora for Chinese Word Segmentation (CWS). Our novel design requires no private layers in model architecture, instead, introduces two artificial tokens at the beginning and ending of input sentence to specify the required target criteria. The rest of the model including Long Short-Term Memory (LSTM) layer and Conditional Random Fields (CRFs) layer remains unchanged and is shared across all datasets, keeping the size of parameter collection minimal and constant. On Bakeoff 2005 and Bakeoff 2008 datasets, our innovative design has surpassed both single-criterion and multi-criteria state-of-the-art learning results. To the best knowledge, our design is the first one that has achieved the latest high performance on such large scale datasets. Source codes and corpora of this paper are available on GitHub.
△ Less
Submitted 4 January, 2018; v1 submitted 7 December, 2017;
originally announced December 2017.
-
Complex Trkalian Fields and Solutions to Euler's Equations for the Ideal Fluid
Authors:
P. R. Baldwin,
G. M. Townsend
Abstract:
We consider solutions to the complex Trkalian equation,~$ \vec{\nabla} \times \vc = \vc ,$ where~$\vc$ is a 3 component vector function with each component in the complex field, and may be expressed in the form~$ \vc = e^{ig} \vec{\nabla} F, $ with~$g$ real and~$F$ complex. We find, there are precisely two classes of solutions; one where~$g$ is a Cartesian variable and one where~$g$ is the spheric…
▽ More
We consider solutions to the complex Trkalian equation,~$ \vec{\nabla} \times \vc = \vc ,$ where~$\vc$ is a 3 component vector function with each component in the complex field, and may be expressed in the form~$ \vc = e^{ig} \vec{\nabla} F, $ with~$g$ real and~$F$ complex. We find, there are precisely two classes of solutions; one where~$g$ is a Cartesian variable and one where~$g$ is the spherical radial coordinate. We consider these flows to be the simplest of all exact 3-d solutions to the Euler's equation for the ideal incompressible fluid. The novel approach we use in solving for these classes of solutions to these 3-dimensional vector pdes involves differential geometric techniques: one may employ the method to generate solutions to other classes of vector pdes.
△ Less
Submitted 13 February, 1995;
originally announced February 1995.