Learning to Generate Text-Grounded Mask for Open-World Semantic Segmentation from Only Image-Text Pairs (2022-12-01T00:00:00.000000Z)