Vision-Language Transformer and Query Generation for Referring Segmentation - Citation Graph | Papersgraph