Follow
Gengyuan Zhang
Title
Cited by
Cited by
Year
A systematic survey of prompt engineering on vision-language foundation models
J Gu, Z Han, S Chen, A Beirami, B He, G Zhang, R Liao, Y Qin, V Tresp, ...
arXiv preprint arXiv:2307.12980, 2023
412023
Time-dependent entity embedding is not all you need: A re-evaluation of temporal knowledge graph completion models under a unified framework
Z Han, G Zhang, Y Ma, V Tresp
Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021
162021
Cl-crossvqa: A continual learning benchmark for cross-domain visual question answering
Y Zhang, H Chen, A Frikha, Y Yang, D Krompass, G Zhang, J Gu, V Tresp
arXiv preprint arXiv:2211.10567, 2022
62022
Multi-event Video-Text Retrieval
G Zhang, J Ren, J Gu, V Tresp
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
32023
SPOT! Revisiting Video-Language Models for Event Understanding
G Zhang, J Bi, J Gu, V Tresp
arXiv preprint arXiv:2311.12919, 2023
12023
Can Vision-Language Models be a Good Guesser? Exploring VLMs for Times and Location Reasoning
G Zhang, Y Zhang, K Zhang, V Tresp
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024
2024
Can Vision-Language Models be a Good Guesser? Exploring VLMs for Times and Location Reasoning Supplementary Materials
G Zhang, Y Zhang, K Zhang, V Tresp, AD WikiTiLo
Middle East 11, 16, 0
The system can't perform the operation now. Try again later.
Articles 1–7