This study presents a large multi-modal Bangla YouTube clickbait dataset consisting of 253,070 data points collected through an automated process using the YouTube API and Python web automation frameworks, providing significant value for natural language processing and data science researchers seeking to advance modeling of clickbait phenomena in low-resource languages.