CORC

浏览/检索结果: 共62条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Investigating Compositional Challenges in Vision-Language Models for Visual Grounding 会议论文
Seattle WA, USA, 17-21 June 2024
作者:  Yunan Zeng;  Yan Huang;  Jinjin Zhang;  Zequn Jie;  Zhenhua Chai
收藏  |  浏览/下载:1/0  |  提交时间:2024/06/05
The organization of the semantic network as reflected by the neural correlates of six semantic dimensions 期刊论文
BRAIN AND LANGUAGE, 2024, 卷号: 250, 页码: 13
作者:  Lin, Nan;  Zhang, Xiaohan;  Wang, Xiuyi;  Wang, Shaonan
收藏  |  浏览/下载:0/0  |  提交时间:2024/07/03
AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models 会议论文
VANCOUVER, CANADA, 2024-2-20至2024-2-27
作者:  Zhaopeng Gu;  Bingke Zhu;  Guibo Zhu;  Yingying Chen;  Ming Tang
收藏  |  浏览/下载:0/0  |  提交时间:2024/06/06
AnyFace++: A Unified Framework for Free-style Text-to-Face Synthesis and Manipulation 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 页码: 1-15
作者:  Sun, Jianxin;  Deng, Qiyao;  Li, Qi;  Sun, Muyi;  Liu, Yunfan
收藏  |  浏览/下载:2/0  |  提交时间:2024/02/23
CLIP-VG: Self-Paced Curriculum Adapting of CLIP for Visual Grounding 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 4334-4347
作者:  Xiao, Linhui;  Yang, Xiaoshan;  Peng, Fang;  Yan, Ming;  Wang, Yaowei
收藏  |  浏览/下载:1/0  |  提交时间:2024/05/30
SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 3469-3480
作者:  Peng, Fang;  Yang, Xiaoshan;  Xiao, Linhui;  Wang, Yaowei;  Xu, Changsheng
收藏  |  浏览/下载:0/0  |  提交时间:2024/07/03
ESTATE: Expert-Guided State Text Enhancement for Zero-Shot Industrial Anomaly Detection 会议论文
Abu Dhabi, UAE, 2024.10.27-2024.10.30
作者:  Bingke Zhu;  Hao Li;  Changlin Chen;  Liujie Hua;  Jinqiao Wang
收藏  |  浏览/下载:0/0  |  提交时间:2024/06/21
Large sequence models for sequential decision-making: a survey 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying
收藏  |  浏览/下载:9/0  |  提交时间:2023/11/17
Frequency-Enhanced Data Augmentation for Vision-and-Language Navigation 会议论文
新奥尔良, 2023-12-9 至 2023-12-15
作者:  Keji He;  Chenyang Si;  Zhihe Lu;  Yan Huang;  Liang Wang
收藏  |  浏览/下载:0/0  |  提交时间:2024/06/26
So Many Heads, So Many Wits: Multimodal Graph Reasoning for Text-Based Visual Question Answering 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 12
作者:  Zheng, Wenbo;  Yan, Lan;  Wang, Fei-Yue
收藏  |  浏览/下载:4/0  |  提交时间:2023/12/21


©版权所有 ©2017 CSpace - Powered by CSpace