Zero-shot audio captioning with audio-language model guidance and audio context keywords - Citation Graph | Papersgraph