Browse/search results: 285 records in total, showing 1-10
Cogeneration of Innovative Audio-visual Content: A New Challenge for Computing Art
Journal article
Machine Intelligence Research, 2024, Volume: 21, Issue: 1, Pages: 4-28
Authors: Mengting Liu
Views/Downloads: 3/0 | Submitted: 2024/01/25
Artificial intelligence (AI) art, audio-visual, artificial intelligence generated content (AIGC), multimodal, artistic evaluation
Visually Guided Sound Source Separation With Audio-Visual Predictive Coding
Journal article
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Pages: 15
Authors: Song, Zengjie; Zhang, Zhaoxiang
Views/Downloads: 1/0 | Submitted: 2023/11/17
Feature fusion
multimodal learning
predictive coding (PC)
self-supervised learning
sound source separation
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings
Journal article
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, Volume: 31, Pages: 2963-2973
Authors: Yi, Jiangyan; Tao, Jianhua; Fu, Ruibo; Wang, Tao; Zhang, Chu Yuan
Views/Downloads: 1/0 | Submitted: 2023/11/17
Adversarial training
multi-task learning
prosodic boundaries
speech synthesis
multi-modal embeddings
Audio-driven Dubbing for User Generated Contents via Style-aware Semi-parametric Synthesis
Journal article
IEEE Transactions on Circuits and Systems for Video Technology, 2022, Volume: 33, Issue: 3, Pages: 1247-1261
Authors: Song LS(宋林森); Wu WY(吴文岩); Fu CY(傅朝友); Loy, Chen Change; He R(赫然)
Views/Downloads: 8/0 | Submitted: 2023/06/29
Talking Face Generation
Video Generation
GAN
Thin-plate Spline
A retrieval method for encrypted speech based on improved power normalized cepstrum coefficients and perceptual hashing
Journal article
Multimedia Tools and Applications, 2022, Volume: 81, Issue: 11, Pages: 15127-15151
Authors: Zhang, Qiu-yu; Bai, Jian; Xu, Fu-jiu
Views/Downloads: 20/0 | Submitted: 2022/06/20
Authentication
Chaotic systems
Discrete wavelet transforms
Efficiency
Extraction
Hamming distance
Hash functions
Information retrieval
Principal component analysis
Speech
Cepstrum
Chaotic mapping
Encrypted speech
Encrypted speech retrieval
Features extraction
Henon chaotic mapping
Perceptual hashing
Power
Power normalized cepstrum coefficient
Speech feature extraction
Speech features
Speech retrieval
VAG: A Uniform Model for Cross-Modal Visual-Audio Mutual Generation
Journal article
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Pages: 13
Authors: Hao, Wangli; Guan, He; Zhang, Zhaoxiang
Views/Downloads: 18/0 | Submitted: 2022/06/10
Task analysis
Instruments
Visualization
Image reconstruction
Generators
Decoding
Generative adversarial networks
Cross modality
cross-modal generation
mutual generation
visual and audio
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing
Journal article
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, Volume: 30, Pages: 2241-2254
Authors: Wang, Tao; Yi, Jiangyan; Fu, Ruibo; Tao, Jianhua; Wen, Zhengqi
Views/Downloads: 36/0 | Submitted: 2022/09/19
Speech processing
Decoding
Predictive models
Acoustics
Transfer learning
Training
Task analysis
Coarse-to-fine decoding
mask prediction
one-shot learning
text-based speech editing
text-to-speech
Cross-model retrieval with deep learning for business application
Conference paper
Busan, Korea, Republic of, 2020-11-14
Authors: Wang, Yufei; Wang, Huanting; Yang, Jiating; Chen, Jianbo
Views/Downloads: 28/0 | Submitted: 2021/04/06
Cross-modal retrieval
Audio features
Deep hashing
Useful information
Audio fingerprint retrieval method based on feature dimension reduction and feature combination
Journal article
KSII Transactions on Internet and Information Systems, 2021, Volume: 15, Issue: 2, Pages: 522-539
Authors: Zhang, Qiu-Yu; Xu, Fu-Jiu; Bai, Jian
Views/Downloads: 12/0 | Submitted: 2021/06/03
Hamming distance
Information retrieval
Speech analysis
Dimension reduction
Distance algorithm
Feature combination
Feature dimensions
Information entropy
Mel-frequency cepstral coefficients
Retrieval accuracy
Retrieval efficiency
Listen, understand and translate: triple supervision decouples end-to-end speech-to-text translation
Conference paper
Virtual, 2021-2
Authors: Dong QQ(董倩倩)
Views/Downloads: 33/0 | Submitted: 2021/06/24