Extraction of Templates from Phrases Using Sequence Binary Decision Diagrams
Authors:
Daiki Hirano,
Kumiko Tanaka-Ishii,
Andrew Finch
Abstract:
The extraction of templates such as "regard X as Y" from a set of related phrases requires the identification of their internal structures. This paper presents an unsupervised approach for extracting templates on the fly from tagged text alone by using a novel relaxed variant of the Sequence Binary Decision Diagram (SeqBDD). A SeqBDD can compress a set of sequences into a graphical structure equivalent to a minimal DFA, but more compact and better suited to the task of template extraction. The main contribution of this paper is a relaxed form of the SeqBDD construction algorithm that enables it to form general representations from a small amount of data. Compressing shared structures in the text during Relaxed SeqBDD construction naturally induces the templates we wish to extract. Experiments show that the method is capable of high-quality extraction on tasks based on verb+preposition templates from corpora and phrasal templates from short social media messages.
Submitted 28 January, 2020;
originally announced January 2020.
A machine learning-based selective sampling procedure for identifying the low energy region in a potential energy surface: a case study on proton conduction in oxides
Authors:
Kazuaki Toyoura,
Daisuke Hirano,
Atsuto Seko,
Motoki Shiga,
Akihide Kuwabara,
Masayuki Karasuyama,
Kazuki Shitara,
Ichiro Takeuchi
Abstract:
In this paper, we propose a selective sampling procedure to preferentially evaluate a potential energy surface (PES) in the part of the configuration space that governs a physical property of interest. The proposed sampling procedure is based on a machine learning method called the Gaussian process (GP), which is used to construct a statistical model of the PES for identifying the region of interest in the configuration space. We demonstrate the efficacy of the proposed procedure for atomic diffusion and ionic conduction, specifically proton conduction in a well-studied proton-conducting oxide, barium zirconate (BaZrO3). The results of the demonstration study indicate that our procedure can efficiently identify the low-energy region characterizing the proton conduction in the host crystal lattice, and that the descriptors used for the statistical PES model strongly influence its performance.
Submitted 3 December, 2015; v1 submitted 2 December, 2015;
originally announced December 2015.
Observations of 6.7 GHz Methanol Masers with EAVN I: VLBI Images of the first Epoch of Observations
Authors:
Kenta Fujisawa,
Koichiro Sugiyama,
Kazuhito Motogi,
Kazuya Hachisuka,
Yoshinori Yonekura,
Satoko Sawada-Satoh,
Naoko Matsumoto,
Kazuo Sorai,
Munetake Momose,
Yu Saito,
Hiroshi Takaba,
Hideo Ogawa,
Kimihiro Kimura,
Kotaro Niinuma,
Daiki Hirano,
Toshihiro Omodaka,
Hideyuki Kobayashi,
Noriyuki Kawaguchi,
Katsunori M. Shibata,
Mareki Honma,
Tomoya Hirota,
Yasuhiro Murata,
Akihiro Doi,
Nanako Mochizuki,
Zhiqiang Shen
, et al. (4 additional authors not shown)
Abstract:
Very long baseline interferometry (VLBI) monitoring of the 6.7 GHz methanol maser allows us to measure the internal proper motions of the maser spots and therefore study the gas motion around high-mass young stellar objects. To this end, we have begun monitoring observations with the East-Asian VLBI Network. In this paper we present the results of the first-epoch observation for 36 sources, including 35 VLBI images of the methanol maser. Since two independent sources were found in each of three images, images of 38 sources were obtained. In 34 sources, 10 or more spots were detected. The observed spatial scale of the maser distribution ranged from 9 to 4900 astronomical units, and the following morphological categories were observed: elliptical, arched, linear, paired, and complex. The position of each maser spot was determined to an accuracy of approximately 0.1 mas, sufficiently high to measure the internal proper motion from two years of monitoring observations. The VLBI observation, however, detected only approximately 20% of all maser emission, suggesting that the remaining 80% of the total flux was spread into an undetectable extended distribution. Therefore, in addition to high-resolution observations, it is important to observe the whole structure of the maser emission, including extended low-brightness structures, to reveal the associated site of the maser and gas motion.
Submitted 14 November, 2013;
originally announced November 2013.