We gratefully acknowledge support from
the Simons Foundation and member institutions.

Mohammad Kachuee Mr. is qualified to endorse.

Constrained Policy Optimization for Controlled Self-Learning in Conversational AI Systems

Mohammad Kachuee Mr.: Is registered as an author of this paper.
Can endorse for cs.AI, cs.LG. (why?)

Sungjin Lee is not registered as an owner of this paper. (why?)