Main » ICMR » 2021 »

MMPT@ICMR2021: Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding, Taipei, Taiwan, August 21, 2021

Bei Liu, Jianlong Fu, Shizhe Chen, Qin Jin, Alexander G. Hauptmann, Yong Rui (Editors)


Anthology ID:
2021.mir_workshop-2021mmpt
Year:
2021
Venue:
mir_workshop
Publisher:
ACM
URL:
https://doi.org/10.1145/3463945
DOI:
10.1145/3463945
DBLP:
conf/mir/2021mmpt
BibTeX:
Download

doi dblp
MMPT@ICMR2021: Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding, Taipei, Taiwan, August 21, 2021

doi dblp
Cross-modal Pretraining and Matching for Video Understanding
Limin Wang

doi dblp
WenLan: Efficient Large-Scale Multi-Modal Pre-Training on Real World Data
Ruihua Song

doi dblp
Be Specific, Be Clear: Bridging Machine and Human Captions by Scene-Guided Transformer
Yupan Huang | Zhaoyang Zeng | Yutong Lu

doi dblp
Language-Conditioned Region Proposal and Retrieval Network for Referring Expression Comprehension
Yanwei Xie | Daqing Liu | Xuejin Chen | Zheng-Jun Zha

doi dblp
Residual Recurrent CRNN for End-to-End Optical Music Recognition on Monophonic Scores
Aozhi Liu | Lipei Zhang | Yaqi Mei | Baoqiang Han | Zifeng Cai | Zhaohua Zhu | Jing Xiao

doi dblp
Style-Guided Image-to-Image Translation for Multiple Domains
Tingting Li | Huan Zhao | Song Wang | Jing Huang

doi dblp
A Fair and Comprehensive Comparison of Multimodal Tweet Sentiment Analysis Methods
Gullal S. Cheema | Sherzod Hakimov | Eric Müller-Budack | Ralph Ewerth

doi dblp
Unsupervised Training Data Generation of Handwritten Formulas using Generative Adversarial Networks with Self-Attention
Matthias Springstein | Eric Müller-Budack | Ralph Ewerth