Vision and Language Navigation Tasks | State-of-the-Art