VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing

Jan 1, 2024 · Chunyu Qiang, Wang Geng, Yi Zhao, Ruibo Fu, Tao Wang, Cheng Gong, Tianrui Wang, Qiuyu Liu, Jiangyan Yi, Zhengqi Wen, and others
Publication: arXiv preprint arXiv:2408.05758