Rethinking the Evaluation of Video Summaries M Otani, Y Nakashima, E Rahtu, J Heikkilä IEEE Computer Society Conference on Computer Vision and Pattern Recognition …, 2019 | 162 | 2019 |
Video summarization using deep semantic features M Otani, Y Nakashima, E Rahtu, J Heikkilä, N Yokoya Computer Vision–ACCV 2016: 13th Asian Conference on Computer Vision, Taipei …, 2017 | 157 | 2017 |
Bert representations for video question answering Z Yang, N Garcia, C Chu, M Otani, Y Nakashima, H Takemura Proceedings of the IEEE/CVF winter conference on applications of computer …, 2020 | 124 | 2020 |
Learning joint representations of videos and sentences with web image search M Otani, Y Nakashima, E Rahtu, J Heikkilä, N Yokoya European Conference on Computer Vision Workshop, 651-667, 2016 | 101 | 2016 |
KnowIT VQA: Answering knowledge-based questions about videos N Garcia, M Otani, C Chu, Y Nakashima Proceedings of the AAAI conference on artificial intelligence 34 (07), 10826 …, 2020 | 94 | 2020 |
Constrained graphic layout generation via latent optimization K Kikuchi, E Simo-Serra, M Otani, K Yamaguchi Proceedings of the 29th ACM International Conference on Multimedia, 88-96, 2021 | 86 | 2021 |
Layoutdm: Discrete diffusion model for controllable layout generation N Inoue, K Kikuchi, E Simo-Serra, M Otani, K Yamaguchi Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 80 | 2023 |
Uncovering Hidden Challenges in Query-Based Video Moment Retrieval M Otani, Y Nakashima, E Rahtu, J Heikkilä British Machine Vision Conference, 2020 | 73 | 2020 |
Toward verifiable and reproducible human evaluation for text-to-image generation M Otani, R Togashi, Y Sawai, R Ishigami, Y Nakashima, E Rahtu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 60 | 2023 |
A dataset and baselines for visual question answering on art N Garcia, C Ye, Z Liu, Q Hu, M Otani, C Chu, Y Nakashima, T Mitamura Computer Vision–ECCV 2020 Workshops: Glasgow, UK, August 23–28, 2020 …, 2020 | 59 | 2020 |
Alleviating cold-start problems in recommendation through pseudo-labelling over knowledge graph R Togashi, M Otani, S Satoh Proceedings of the 14th ACM international conference on web search and data …, 2021 | 42 | 2021 |
Does robustness on imagenet transfer to downstream tasks? Y Yamada, M Otani Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 31 | 2022 |
Modeling visual containment for web page layout optimization K Kikuchi, M Otani, K Yamaguchi, E Simo‐Serra Computer Graphics Forum 40 (7), 33-44, 2021 | 19 | 2021 |
A comparative study of language transformers for video question answering Z Yang, N Garcia, C Chu, M Otani, Y Nakashima, H Takemura Neurocomputing 445, 121-133, 2021 | 19 | 2021 |
The laughing machine: Predicting humor in video Y Kayatani, Z Yang, M Otani, N Garcia, C Chu, Y Nakashima, H Takemura Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2021 | 17 | 2021 |
Video summarization using textual descriptions for authoring video blogs M Otani, Y Nakashima, T Sato, N Yokoya Multimedia Tools and Applications 76, 12097-12115, 2017 | 17 | 2017 |
Optimal correction cost for object detection evaluation M Otani, R Togashi, Y Nakashima, E Rahtu, J Heikkilä, S Satoh Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 16 | 2022 |
Textual description-based video summarization for video blogs M Otani, Y Nakashima, T Sato, N Yokoya 2015 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2015 | 12 | 2015 |
iParaphrasing: Extracting Visually Grounded Paraphrases via an Image C Chu, M Otani, Y Nakashima International Conference on Computational Linguistics, 3479–3492, 2018 | 11 | 2018 |
Towards flexible multi-modal document models N Inoue, K Kikuchi, E Simo-Serra, M Otani, K Yamaguchi Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 10 | 2023 |