Bin Zhu
Assistant Professor, Singapore Management University
Verified email at smu.edu.sg
Title
Cited by
Year
R2GAN: Cross-modal recipe retrieval with generative adversarial network
B Zhu, CW Ngo, J Chen, Y Hao
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
Cited by: 153; Year: 2019
A study of multi-task and region-wise deep learning for food ingredient recognition
J Chen, B Zhu, CW Ngo, TS Chua, YG Jiang
IEEE Transactions on Image Processing 30, 1514-1526, 2020
Cited by: 90; Year: 2020
EPIC-KITCHENS VISOR benchmark: Video segmentations and object relations
A Darkhalil, D Shan, B Zhu, J Ma, A Kar, R Higgins, S Fidler, D Fouhey, ...
Advances in Neural Information Processing Systems 35, 13745-13758, 2022
Cited by: 88; Year: 2022
CookGAN: Causality based Text-to-Image Synthesis
B Zhu, CW Ngo
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2020
Cited by: 84; Year: 2020
FoodLMM: A versatile food assistant using large multi-modal model
Y Yin, H Qi, B Zhu, J Chen, YG Jiang, CW Ngo
arXiv preprint arXiv:2312.14991, 2023
Cited by: 14; Year: 2023
Cross-domain cross-modal food transfer
B Zhu, CW Ngo, J Chen
Proceedings of the 28th ACM International Conference on Multimedia, 3762-3770, 2020
Cited by: 13; Year: 2020
Learning from web recipe-image pairs for food recognition: Problem, baselines and performance
B Zhu, CW Ngo, WK Chan
IEEE Transactions on Multimedia 24, 1175-1185, 2021
Cited by: 12; Year: 2021
Person-level action recognition in complex events via TSD-TSM networks
Y Hao, ZN Liu, H Zhang, B Zhu, J Chen, YG Jiang, CW Ngo
Proceedings of the 28th ACM International Conference on Multimedia, 4699-4702, 2020
Cited by: 12; Year: 2020
CgT-GAN: CLIP-guided Text GAN for Image Captioning
J Yu, H Li, Y Hao, B Zhu, T Xu, X He
Proceedings of the 31st ACM International Conference on Multimedia, 2252-2263, 2023
Cited by: 9; Year: 2023
Unsupervised video hashing with multi-granularity contextualization and multi-structure preservation
Y Hao, J Duan, H Zhang, B Zhu, P Zhou, X He
Proceedings of the 30th ACM International Conference on Multimedia, 3754-3763, 2022
Cited by: 8; Year: 2022
Mix-DANN and dynamic-modal-distillation for video domain adaptation
Y Yin, B Zhu, J Chen, L Cheng, YG Jiang
Proceedings of the 30th ACM International Conference on Multimedia, 3224-3233, 2022
Cited by: 7; Year: 2022
Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective
F Song, B Zhu, Y Hao, S Wang
European Conference on Computer Vision (ECCV), 2024
Cited by: 6*; Year: 2024
From Canteen Food to Daily Meals: Generalizing Food Recognition to More Practical Scenarios
G Liu, Y Jiao, J Chen, B Zhu, YG Jiang
IEEE Transactions on Multimedia, 2024
Cited by: 6; Year: 2024
Pyramid fusion dark channel prior for single image dehazing
Q Liang, B Zhu, CW Ngo
arXiv preprint arXiv:2105.10192, 2021
Cited by: 6; Year: 2021
RoDE: Linear rectified mixture of diverse experts for food large multi-modal models
P Jiao, X Wu, B Zhu, J Chen, CW Ngo, Y Jiang
arXiv preprint arXiv:2407.12730, 2024
Cited by: 5; Year: 2024
Learning to match anchor-target video pairs with dual attentional holographic networks
Y Hao, CW Ngo, B Zhu
IEEE Transactions on Image Processing 30, 8130-8143, 2021
Cited by: 5; Year: 2021
Cross-lingual adaptation for recipe retrieval with mixup
B Zhu, CW Ngo, J Chen, WK Chan
Proceedings of the 2022 International Conference on Multimedia Retrieval …, 2022
Cited by: 4; Year: 2022
Text-driven Video Prediction
X Song, J Chen, B Zhu, Y Jiang
ACM Transactions on Multimedia Computing, Communications, and Applications …, 2024
Cited by: 3; Year: 2024
Hand1000: Generating realistic hands from text with only 1,000 images
H Zhang, B Zhu, Y Cao, Y Hao
Proceedings of the AAAI Conference on Artificial Intelligence 39, 2025
Cited by: 2; Year: 2025
Navigating Weight Prediction with Diet Diary
Y Gui, B Zhu, J Chen, CW Ngo, YG Jiang
Proceedings of the 32nd ACM International Conference on Multimedia, 127-136, 2024
Cited by: 1; Year: 2024