MMPT@ICMR2021: Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding, Taipei, Taiwan, August 21, 2021
Bei Liu, Jianlong Fu, Shizhe Chen, Qin Jin, Alexander G. Hauptmann, Yong Rui (Editors)
- Anthology ID:
- 2021.mir_workshop-2021mmpt
- Year:
- 2021
- Venue:
- mir_workshop
- Publisher:
- ACM
- URL:
- https://doi.org/10.1145/3463945
- DOI:
- 10.1145/3463945
- DBLP:
- conf/mir/2021mmpt
Cross-modal Pretraining and Matching for Video Understanding
Limin Wang
WenLan: Efficient Large-Scale Multi-Modal Pre-Training on Real World Data
Ruihua Song
Be Specific, Be Clear: Bridging Machine and Human Captions by Scene-Guided Transformer
Yupan Huang
|
Zhaoyang Zeng
|
Yutong Lu
Language-Conditioned Region Proposal and Retrieval Network for Referring Expression Comprehension
Yanwei Xie
|
Daqing Liu
|
Xuejin Chen
|
Zheng-Jun Zha
Residual Recurrent CRNN for End-to-End Optical Music Recognition on Monophonic Scores
Aozhi Liu
|
Lipei Zhang
|
Yaqi Mei
|
Baoqiang Han
|
Zifeng Cai
|
Zhaohua Zhu
|
Jing Xiao
Style-Guided Image-to-Image Translation for Multiple Domains
Tingting Li
|
Huan Zhao
|
Song Wang
|
Jing Huang
A Fair and Comprehensive Comparison of Multimodal Tweet Sentiment Analysis Methods
Gullal S. Cheema
|
Sherzod Hakimov
|
Eric Müller-Budack
|
Ralph Ewerth
Unsupervised Training Data Generation of Handwritten Formulas using Generative Adversarial Networks with Self-Attention
Matthias Springstein
|
Eric Müller-Budack
|
Ralph Ewerth