Overview of the proposed learning strategy for robust dialogue policy consisting of the stages of data pre-processing, data augmentation and dialogue policy training. Perfect NLU means that the true user dialogue acts are directly passed to the DST component.