Keywords: dialogue policy optimization
Journal Articles
APSIPA Transactions on Signal and Information Processing (2024) 13 (1): 1–26.
Published: 09 September 2024
... system; dialogue policy optimization; student-teacher learning; offline reinforcement learning. Designing a faultless dialogue system is challenging, especially in the case of multi-domain multi-turn dialogue tasks where each conversation with multiple turns may comprise multiple domains...
Journal Articles
APSIPA Transactions on Signal and Information Processing (2023) 12 (1): 1–52.
Published: 05 September 2023
... within a short conversation. Furthermore, offering the precise answers to satisfy the user requirements makes the task even more challenging. This paper surveys recent advances in multi-domain task-oriented dialogue policy optimization and summarizes a number of solutions to policy learning...
