×
验证码:
换一张
忘记密码?
记住我
CORC
首页
科研机构
检索
知识图谱
申请加入
托管服务
登录
注册
在结果中检索
科研机构
自动化研究所 [15]
计算技术研究所 [4]
兰州理工大学 [2]
清华大学 [1]
北京大学 [1]
内容类型
期刊论文 [22]
会议论文 [1]
发表日期
2024 [6]
2023 [7]
2022 [1]
2020 [1]
2019 [2]
2018 [2]
更多...
×
知识图谱
CORC
开始提交
已提交作品
待认领作品
已认领作品
未提交全文
收藏管理
QQ客服
官方微博
反馈留言
浏览/检索结果:
共23条,第1-10条
帮助
已选(
0
)
清除
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
作者升序
作者降序
题名升序
题名降序
发表日期升序
发表日期降序
提交时间升序
提交时间降序
VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis
期刊论文
KNOWLEDGE-BASED SYSTEMS, 2024, 卷号: 283, 页码: 9
作者:
Yi, Guofeng
;
Fan, Cunhang
;
Zhu, Kang
;
Lv, Zhao
;
Liang, Shan
收藏
  |  
浏览/下载:4/0
  |  
提交时间:2024/02/22
Multimodal sentiment analysis
Vision-language
Multimodal fusion
CLIP-VG: Self-Paced Curriculum Adapting of CLIP for Visual Grounding
期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 4334-4347
作者:
Xiao, Linhui
;
Yang, Xiaoshan
;
Peng, Fang
;
Yan, Ming
;
Wang, Yaowei
收藏
  |  
浏览/下载:1/0
  |  
提交时间:2024/05/30
Grounding
Reliability
Adaptation models
Task analysis
Visualization
Data models
Annotations
Visual grounding
curriculum learning
pseudo-language label
and vision-language models
ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments
期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 页码: 1-16
作者:
Dong An
;
Hanqing Wang
;
Wenguan Wang
;
Zun Wang
;
Yan Huang
收藏
  |  
浏览/下载:0/0
  |  
提交时间:2024/05/27
Vision-Language Navigation
Topological Map
Obstacle Avoidance
Memory-Adaptive Vision-and-Language Navigation
期刊论文
Pattern Recognition, 2024, 卷号: 153, 页码: 110511
作者:
Keji He
;
Ya Jing
;
Yan Huang
;
Zhihe Lu
;
Dong An
收藏
  |  
浏览/下载:2/0
  |  
提交时间:2024/06/26
Vision-and-Language Navigation
Memory bank
History noises
Memory-Adaptive Model
SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification
期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 3469-3480
作者:
Peng, Fang
;
Yang, Xiaoshan
;
Xiao, Linhui
;
Wang, Yaowei
;
Xu, Changsheng
收藏
  |  
浏览/下载:0/0
  |  
提交时间:2024/07/03
Few-shot
image classification
vision-language models
CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation
期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 6906-6916
作者:
Wang, Wenxuan
;
He, Xingjian
;
Zhang, Yisi
;
Guo, Longteng
;
Shen, Jiachen
收藏
  |  
浏览/下载:0/0
  |  
提交时间:2024/07/03
Referring image segmentation
cross-modality guidance
masked self-distillation
vision and language
PCEN: Potential Correlation-Enhanced Network for Multimodal Named Entity Recognition
会议论文
Charlotte, NC, USA, 02-03 October 2023
作者:
Jiakai Geng
;
Chenyang Zhang
;
Linjing Li
;
Qing Yang
;
Daniel Zeng
收藏
  |  
浏览/下载:1/0
  |  
提交时间:2024/05/31
named entity recognition
multimodal learning
vision-language pre-trained model
inconsistency loss
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models
期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:
Ma, Chengcheng
;
Liu, Yang
;
Deng, Jiankang
;
Xie, Lingxi
;
Dong, Weiming
收藏
  |  
浏览/下载:5/0
  |  
提交时间:2023/11/16
Vision-language model
prompt tuning
over-fitting
subspace learning
gradient projection
Masked Vision-language Transformer in Fashion
期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 421-434
作者:
Ge-Peng Ji
收藏
  |  
浏览/下载:9/0
  |  
提交时间:2023/05/29
Vision-language, masked image reconstruction, transformer, fashion, e-commercial
VLP: A Survey on Vision-language Pre-training
期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:
Fei-Long Chen
收藏
  |  
浏览/下载:16/0
  |  
提交时间:2023/01/18
Vision and language
pre-training
transformers
multimodal learning
representation learning
©版权所有 ©2017 CSpace - Powered by
CSpace