Multi-level, multi-modal interactions for visual question answering over text in images
Jincai Chen, Sheng Zhang, Jiangfeng Zeng, Fuhao Zou, Yuan-Fang Li, Tao Liu, Ping Lu
- Anthology ID:
- DBLP:journals/www/ChenZZZLLL22
- Volume:
- 2022 Volume 25 Issue 4
- Year:
- 2022
- Venue:
- wwwjournals_journal
- Pages:
- 1607–1623
- URL:
- https://doi.org/10.1007/s11280-021-00976-2
- DOI:
- 10.1007/s11280-021-00976-2
- DBLP:
- journals/www/ChenZZZLLL22