Xiangyang Li
Title
Cited by
Year
Scene recognition with CNNs: objects, scales and dataset bias
L Herranz, S Jiang, X Li
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2016
Cited by: 218; Year: 2016
Know more say less: Image captioning based on scene graphs
X Li, S Jiang
IEEE Transactions on Multimedia 21 (8), 2117-2130, 2019
Cited by: 149; Year: 2019
Learning object context for dense captioning
X Li, S Jiang, J Han
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 8650-8657, 2019
Cited by: 41; Year: 2019
Visual relationship detection with object spatial distribution
Y Zhu, S Jiang, X Li
2017 IEEE International Conference on Multimedia and Expo (ICME), 379-384, 2017
Cited by: 27; Year: 2017
Image captioning with both object and scene information
X Li, X Song, L Herranz, Y Zhu, S Jiang
Proceedings of the 24th ACM International Conference on Multimedia, 1107-1110, 2016
Cited by: 24; Year: 2016
Bundled Object Context for Referring Expressions
X Li, S Jiang
IEEE Transactions on Multimedia, 2018
Cited by: 20; Year: 2018
Class agnostic image common object detection
S Jiang, S Liang, C Chen, Y Zhu, X Li
IEEE Transactions on Image Processing 28 (6), 2836-2846, 2019
Cited by: 19; Year: 2019
Where and what to eat: Simultaneous restaurant and dish recognition from food image
H Wang, W Min, X Li, S Jiang
Advances in Multimedia Information Processing-PCM 2016: 17th Pacific-Rim …, 2016
Cited by: 17; Year: 2016
ISIA at the ImageCLEF 2017 Image Caption Task.
S Liang, X Li, Y Zhu, X Li, S Jiang
CLEF (Working Notes), 2017
Cited by: 16; Year: 2017
The retrieval of shoeprint images based on the integral histogram of the Gabor transform domain
X Li, M Wu, Z Shi
Intelligent Information Processing VII: 8th IFIP TC 12 International …, 2014
Cited by: 14; Year: 2014
Modality-specific and hierarchical feature learning for RGB-D hand-held object recognition
X Lv, X Liu, X Li, X Li, S Jiang, Z He
Multimedia Tools and Applications 76, 4273-4290, 2017
Cited by: 12; Year: 2017
Dataset bias in few-shot image recognition
S Jiang, Y Zhu, C Liu, X Song, X Li, W Min
IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (1), 229-246, 2022
Cited by: 9; Year: 2022
Multifaceted Analysis of Fine-Tuning in a Deep Model for Visual Recognition
X Li, L Herranz, S Jiang
ACM Transactions on Data Science 1 (1), 1-22, 2020
Cited by: 6; Year: 2020
Joint Learning of CNN and LSTM for Image Captioning.
Y Zhu, X Li, X Li, J Sun, X Song, S Jiang
CLEF (Working Notes), 421-427, 2016
Cited by: 6; Year: 2016
Heterogeneous convolutional neural networks for visual recognition
X Li, L Herranz, S Jiang
Advances in Multimedia Information Processing-PCM 2016: 17th Pacific-Rim …, 2016
Cited by: 3; Year: 2016
Combining heterogeneous features for 3D hand-held object recognition
X Lv, S Wang, X Li, S Jiang
Optoelectronic Imaging and Multimedia Technology III 9273, 472-481, 2014
Cited by: 3; Year: 2014
KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation
X Li, Z Wang, J Yang, Y Wang, S Jiang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 2; Year: 2023
GridMM: Grid Memory Map for Vision-and-Language Navigation
Z Wang, X Li, J Yang, Y Liu, S Jiang
arXiv preprint arXiv:2307.12907, 2023
Cited by: 1; Year: 2023
MemBridge: Video-Language Pre-Training With Memory-Augmented Inter-Modality Bridge
J Yang, X Li, M Zheng, Z Wang, Y Zhu, X Guo, Y Yuan, Z Chai, S Jiang
IEEE Transactions on Image Processing, 2023
Cited by: 1; Year: 2023
Expressional region retrieval
X Guo, X Li, S Jiang
Proceedings of the 28th ACM International Conference on Multimedia, 2581-2589, 2020
Cited by: 1; Year: 2020