Showing 1–2 of 2 results for author: Yueh, C
-
Stress Testing BERT Anaphora Resolution Models for Reaction Extraction in Chemical Patents
Authors:
Chieling Yueh,
Evangelos Kanoulas,
Bruno Martins,
Camilo Thorne,
Saber Akhondi
Abstract:
The high volume of published chemical patents and the importance of a timely acquisition of their information gives rise to automating information extraction from chemical patents. Anaphora resolution is an important component of comprehensive information extraction, and is critical for extracting reactions. In chemical patents, there are five anaphoric relations of interest: co-reference, transfo…
▽ More
The high volume of published chemical patents and the importance of a timely acquisition of their information gives rise to automating information extraction from chemical patents. Anaphora resolution is an important component of comprehensive information extraction, and is critical for extracting reactions. In chemical patents, there are five anaphoric relations of interest: co-reference, transformed, reaction associated, work up, and contained. Our goal is to investigate how the performance of anaphora resolution models for reaction texts in chemical patents differs in a noise-free and noisy environment and to what extent we can improve the robustness against noise of the model.
△ Less
Submitted 23 June, 2023;
originally announced June 2023.
-
Sample Complexity of Kernel-Based Q-Learning
Authors:
Sing-Yuan Yeh,
Fu-Chieh Chang,
Chang-Wei Yueh,
Pei-Yuan Wu,
Alberto Bernacchia,
Sattar Vakili
Abstract:
Modern reinforcement learning (RL) often faces an enormous state-action space. Existing analytical results are typically for settings with a small number of state-actions, or simple models such as linearly modeled Q-functions. To derive statistically efficient RL policies handling large state-action spaces, with more general Q-functions, some recent works have considered nonlinear function approxi…
▽ More
Modern reinforcement learning (RL) often faces an enormous state-action space. Existing analytical results are typically for settings with a small number of state-actions, or simple models such as linearly modeled Q-functions. To derive statistically efficient RL policies handling large state-action spaces, with more general Q-functions, some recent works have considered nonlinear function approximation using kernel ridge regression. In this work, we derive sample complexities for kernel based Q-learning when a generative model exists. We propose a nonparametric Q-learning algorithm which finds an $ε$-optimal policy in an arbitrarily large scale discounted MDP. The sample complexity of the proposed algorithm is order optimal with respect to $ε$ and the complexity of the kernel (in terms of its information gain). To the best of our knowledge, this is the first result showing a finite sample complexity under such a general model.
△ Less
Submitted 1 February, 2023;
originally announced February 2023.