Follow
Gedas Bertasius
Gedas Bertasius
Verified email at cs.unc.edu - Homepage
Title
Cited by
Cited by
Year
Is space-time attention all you need for video understanding?
G Bertasius, H Wang, L Torresani
ICML 2 (3), 4, 2021
16282021
Deepedge: A multi-scale bifurcated deep network for top-down contour detection
G Bertasius, J Shi, L Torresani
Proceedings of the IEEE conference on computer vision and pattern …, 2015
5672015
Object detection in video with spatiotemporal sampling networks
G Bertasius, L Torresani, J Shi
Proceedings of the European Conference on Computer Vision (ECCV), 331-346, 2018
2682018
Semantic segmentation with boundary neural fields
G Bertasius, J Shi, L Torresani
Proceedings of the IEEE conference on computer vision and pattern …, 2016
2262016
High-for-low and low-for-high: Efficient boundary detection from deep object features and its applications to high-level vision
G Bertasius, J Shi, L Torresani
Proceedings of the IEEE international conference on computer vision, 504-512, 2015
2042015
Classifying, segmenting, and tracking object instances in video with mask propagation
G Bertasius, L Torresani
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
1852020
Convolutional random walk networks for semantic image segmentation
G Bertasius, L Torresani, SX Yu, J Shi
Proceedings of the IEEE conference on computer vision and pattern …, 2017
1602017
Learning temporal pose estimation from sparsely-labeled videos
G Bertasius, C Feichtenhofer, D Tran, J Shi, L Torresani
Advances in neural information processing systems 32, 2019
96*2019
Am I a baller? basketball performance assessment from first-person videos
G Bertasius, H Soo Park, SX Yu, J Shi
Proceedings of the IEEE international conference on computer vision, 2177-2185, 2017
862017
Automatic lymph node cluster segmentation using holistically-nested neural networks and structured optimization in CT images
I Nogues, L Lu, X Wang, H Roth, G Bertasius, N Lay, J Shi, Y Tsehay, ...
International Conference on Medical Image Computing and Computer-Assisted …, 2016
692016
TallFormer: Temporal Action Localization with a Long-Memory Transformer
F Cheng, G Bertasius
European Conference on Computer Vision, 503-521, 2022
622022
Vx2text: End-to-end learning of video-based text generation from multimodal inputs
X Lin, G Bertasius, J Wang, SF Chang, D Parikh, L Torresani
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
602021
First person action-object detection with egonet
G Bertasius, HS Park, SX Yu, J Shi
arXiv preprint arXiv:1603.04908, 2016
592016
Simpleclick: Interactive image segmentation with simple vision transformers
Q Liu, Z Xu, G Bertasius, M Niethammer
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
512023
Learning to recognize procedural activities with distant supervision
X Lin, F Petroni, G Bertasius, M Rohrbach, SF Chang, L Torresani
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
492022
Long movie clip classification with state-space video models
MM Islam, G Bertasius
European Conference on Computer Vision, 87-104, 2022
482022
Vindlu: A recipe for effective video-and-language pretraining
F Cheng, X Wang, J Lei, D Crandall, M Bansal, G Bertasius
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
452023
Long-short temporal contrastive learning of video transformers
J Wang, G Bertasius, D Tran, L Torresani
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
452022
Unsupervised learning of important objects from first-person videos
G Bertasius, H Soo Park, SX Yu, J Shi
Proceedings of the IEEE International Conference on Computer Vision, 1956-1964, 2017
32*2017
Vision transformers are parameter-efficient audio-visual learners
YB Lin, YL Sung, J Lei, M Bansal, G Bertasius
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
312023
The system can't perform the operation now. Try again later.
Articles 1–20