natural-language-processing-7

Task-Completion Dialogue Policy Learning

3260 papers • 126 benchmarks • 313 datasets

This task has no description! Would you like to contribute one?

(Image credit: Papersgraph)

Benchmarks

These leaderboards are used to track progress in task-completion-dialogue-policy-learning-14

Trend

Dataset

Best Model

Actions

No benchmarks available.

Libraries

i

Use these libraries to find task-completion-dialogue-policy-learning-14 models and implementations

Datasets

No datasets available.

Subtasks

No subtasks available.

Most implemented papers

Switch-based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning

Yuexin Wu, Yiming Yang, Jianfeng Gao, Jingjing Liu, Xiujun Li•Sun Nov 18 2018

By combining switcher and active learning, the new framework named as Switch-based Active Deep Dyna-Q (Switch-DDQ), leads to significant improvement over DDQ and Q-learning baselines in both simulation and human evaluations.

48

Content

0

Paper Graph

Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning

Baolin Peng, Xiujun Li, Jianfeng Gao, Jingjing Liu, Kam-Fai Wong•Wed Jan 17 2018

Deep Dyna-Q is presented, which to the authors' knowledge is the first deep RL framework that integrates planning for task-completion dialogue policy learning and incorporates into the dialogue agent a model of the environment, referred to as the world model, to mimic real user response and generate simulated experience.

74 0

Paper Graph

Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning

Jianfeng Gao, Yun-Nung (Vivian) Chen, Jingjing Liu, Xiujun Li, Shang-Yu Su•Fri Aug 10 2018

Experiments show that D3Q significantly outperforms DDQ by controlling the quality of simulated experience used for planning and is further demonstrated in a domain extension setting, where the agent’s capability of adapting to a changing environment is tested.

67 0

Paper Graph

Adding a benchmark result helps the community track progress.