Natural Language Visual Grounding Tasks | State-of-the-Art