Towards Diverse and Efficient Audio Captioning via Diffusion ModelsJan 1, 2024·Manjie Xu,Chenxing Li,Xinyi Tu,Yong Ren,Ruibo Fu,Wei Liang,Dong Yu· 0 min read CiteTypeJournal articlePublicationarXiv preprint arXiv:2409.09401Last updated on Jan 1, 2024 ← Text Prompt is Not Enough: Sound Event Enhanced Prompt Adapter for Target Style Audio Generation Jan 1, 2024Transferring Personality Knowledge to Multimodal Sentiment Analysis Jan 1, 2024 →