Zero-shot audio captioning with audio-language model guidance and audio context keywords (2023-11-14T00:00:00.000000Z)