CORC

浏览/检索结果: 共285条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Cogeneration of Innovative Audio-visual Content: A New Challenge for Computing Art 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 1, 页码: 4-28
作者:  Mengting Liu
收藏  |  浏览/下载:3/0  |  提交时间:2024/01/25
Visually Guided Sound Source Separation With Audio-Visual Predictive Coding 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 15
作者:  Song, Zengjie;  Zhang, Zhaoxiang
收藏  |  浏览/下载:1/0  |  提交时间:2023/11/17
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2963-2973
作者:  Yi, Jiangyan;  Tao, Jianhua;  Fu, Ruibo;  Wang, Tao;  Zhang, Chu Yuan
收藏  |  浏览/下载:1/0  |  提交时间:2023/11/17
Audio-driven Dubbing for User Generated Contents via Style-aware Semi-parametric Synthesis 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2022, 卷号: 33, 期号: 3, 页码: 1247 - 1261
作者:  Song LS(宋林森);  Wu WY(吴文岩);  Fu CY(傅朝友);  Loy, Chen Change;  He R(赫然)
收藏  |  浏览/下载:8/0  |  提交时间:2023/06/29
A retrieval method for encrypted speech based on improved power normalized cepstrum coefficients and perceptual hashing 期刊论文
Multimedia Tools and Applications, 2022, 卷号: 81, 期号: 11, 页码: 15127-15151
作者:  Zhang, Qiu-yu;  Bai, Jian;  Xu, Fu-jiu
收藏  |  浏览/下载:20/0  |  提交时间:2022/06/20
VAG: A Uniform Model for Cross-Modal Visual-Audio Mutual Generation 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 13
作者:  Hao, Wangli;  Guan, He;  Zhang, Zhaoxiang
收藏  |  浏览/下载:18/0  |  提交时间:2022/06/10
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:36/0  |  提交时间:2022/09/19
Cross-model retrieval with deep learning for business application 会议论文
Busan, Korea, Republic of, 2020-11-14
作者:  Wang, Yufei;  Wang, Huanting;  Yang, Jiating;  Chen, Jianbo
收藏  |  浏览/下载:28/0  |  提交时间:2021/04/06
Audio fingerprint retrieval method based on feature dimension reduction and feature combination 期刊论文
KSII Transactions on Internet and Information Systems, 2021, 卷号: 15, 期号: 2, 页码: 522-539
作者:  Zhang, Qiu-Yu;  Xu, Fu-Jiu;  Bai, Jian
收藏  |  浏览/下载:12/0  |  提交时间:2021/06/03
Listen, understand and translate: triple supervision decouples end-to-endspeech-to-text translation 会议论文
Virtual, 2021-2
作者:  Dong QQ(董倩倩)
收藏  |  浏览/下载:33/0  |  提交时间:2021/06/24


©版权所有 ©2017 CSpace - Powered by CSpace