Variational reward estimator bottleneck: Towards robust reward estimator for multidomain task-oriented dialogue

Jeiyoon Park, Chanhee Lee, Chanjun Park, Kuekyeng Kim, Heuiseok Lim

Research output: Contribution to journal › Article › peer-review
