International Conference on Multimedia Retrieval (2021)
Workshops
- MMPT@ICMR2021: Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding, Taipei, Taiwan, August 21, 2021 9 papers
- MMArt-ACM '21: Proceedings of the 2021 International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia 2021, Taipei, Taiwan, August 21, 2021 5 papers
- ICDAR@ICMR 2021: Proceedings of the 2021 Workshop on Intelligent Cross-Data Analysis and Retrieval, Taipei, Taiwan, 21 August 2021 15 papers
MMPT@ICMR2021: Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding, Taipei, Taiwan, August 21, 2021
Cross-modal Pretraining and Matching for Video Understanding
Limin Wang
WenLan: Efficient Large-Scale Multi-Modal Pre-Training on Real World Data
Ruihua Song
Be Specific, Be Clear: Bridging Machine and Human Captions by Scene-Guided Transformer
Yupan Huang
|
Zhaoyang Zeng
|
Yutong Lu
Language-Conditioned Region Proposal and Retrieval Network for Referring Expression Comprehension
Yanwei Xie
|
Daqing Liu
|
Xuejin Chen
|
Zheng-Jun Zha
Residual Recurrent CRNN for End-to-End Optical Music Recognition on Monophonic Scores
Aozhi Liu
|
Lipei Zhang
|
Yaqi Mei
|
Baoqiang Han
|
Zifeng Cai
|
Zhaohua Zhu
|
Jing Xiao
Style-Guided Image-to-Image Translation for Multiple Domains
Tingting Li
|
Huan Zhao
|
Song Wang
|
Jing Huang
A Fair and Comprehensive Comparison of Multimodal Tweet Sentiment Analysis Methods
Gullal S. Cheema
|
Sherzod Hakimov
|
Eric Müller-Budack
|
Ralph Ewerth
Unsupervised Training Data Generation of Handwritten Formulas using Generative Adversarial Networks with Self-Attention
Matthias Springstein
|
Eric Müller-Budack
|
Ralph Ewerth
MMArt-ACM '21: Proceedings of the 2021 International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia 2021, Taipei, Taiwan, August 21, 2021
Automatic Music Composition with Transformers
Yi-Hsuan Yang
Color-Grayscale-Pair Image Sentiment Dataset and Its Application to Sentiment-Driven Image Color Conversion
Atsushi Takada
|
Xueting Wang
|
Toshihiko Yamasaki
Ketchup GAN: A New Dataset for Realistic Synthesis of Letters on Food
Gibran Benitez-Garcia
|
Keiji Yanai
Estimating Groups of Featured Characters in Comics with Sequence of Characters' Appearance
Kodai Imaizumi
|
Ryosuke Yamanishi
|
Yoko Nishihara
|
Takahiro Ozawa
ICDAR@ICMR 2021: Proceedings of the 2021 Workshop on Intelligent Cross-Data Analysis and Retrieval, Taipei, Taiwan, 21 August 2021
Discovering Knowledge Hidden in Raster Images using RasterMiner
R. Uday Kiran
Multimodal Virtual Avatars for Investigative Interviews with Children
Gunn Astrid Baugerud
|
Miriam S. Johnson
|
Ragnhild Klingenberg Røed
|
Michael E. Lamb
|
Martine B. Powell
|
Vajira Thambawita
|
Steven Alexander Hicks
|
Pegah Salehi
|
Syed Zohaib Hassan
|
Pål Halvorsen
|
Michael A. Riegler
ST-HOI: A Spatial-Temporal Baseline for Human-Object Interaction Detection in Videos
Meng-Jiun Chiou
|
Chun-Yu Liao
|
Li-Wei Wang
|
Roger Zimmermann
|
Jiashi Feng
Temperature Forecasting using Tower Networks
Siri S. Eide
|
Michael A. Riegler
|
Hugo Lewi Hammer
|
John Bjørnar Bremnes
Scattering Transform Based Image Clustering using Projection onto Orthogonal Complement
Angel Villar-Corrales
|
Veniamin I. Morgenshtern
Pyramidal Segmentation of Medical Images using Adversarial Training
Espen Naess
|
Vajira Thambawita
|
Steven Alexander Hicks
|
Michael A. Riegler
|
Pål Halvorsen
Two-Faced Humans on Twitter and Facebook: Harvesting Social Multimedia for Human Personality Profiling
Qi Yang
|
Aleksandr Farseev
|
Andrey Filchenkov
Cross-Modal Deep Neural Networks based Smartphone Authentication for Intelligent Things System
Tran Anh Khoa
|
Dinh Nguyen The Truong
|
Duc Ngoc Minh Dang
Models to Predict Sleeping Quality from Activities and Environment: Current Status, Challenges and Opportunities
Thi Phuoc Van Nguyen
|
Do Van Nguyen
|
Koji Zettsu
Dutkat: A Multimedia System for Catching Illegal Catchers in a Privacy-Preserving Manner
Tor-Arne S. Nordmo
|
Aril B. Ovesen
|
Håvard D. Johansen
|
Michael A. Riegler
|
Pål Halvorsen
|
Dag Johansen
Investigation on Privacy-Preserving Techniques For Personal Data
Rafik Hamza
|
Koji Zettsu
Session details: Keynote & Invited Talks
Minh-Son Dao
Proceedings of the 4th Annual on Lifelog Search Challenge, LSC@ICMR 2021, Taipei, Taiwan, 21 August 2021
Lifelogging as a Memory Prosthetic
Alan F. Smeaton
Exquisitor at the Lifelog Search Challenge 2021: Relationships Between Semantic Classifiers
Omar Shahbaz Khan
|
Aaron Duane
|
Björn Þór Jónsson
|
Jan Zahálka
|
Stevan Rudinac
|
Marcel Worring
Exploring Graph-querying approaches in LifeGraph
Luca Rossetto
|
Matthias Baumgartner
|
Ralph Gasser
|
Lucien Heitz
|
Ruijie Wang
|
Abraham Bernstein
Myscéal 2.0: A Revised Experimental Interactive Lifelog Retrieval System for LSC'21
Ly-Duyen Tran
|
Manh-Duy Nguyen
|
Nguyen Thanh Binh
|
Hyowon Lee
|
Cathal Gurrin
Exploring Intuitive Lifelog Retrieval and Interaction Modes in Virtual Reality with vitrivr-VR
Florian Spiess
|
Ralph Gasser
|
Silvan Heller
|
Luca Rossetto
|
Loris Sauter
|
Milan van Zanten
|
Heiko Schuldt
lifeXplore at the Lifelog Search Challenge 2021
Andreas Leibetseder
|
Klaus Schoeffmann
ViRMA: Virtual Reality Multimedia Analytics at LSC 2021
Aaron Duane
|
Björn Þór Jónsson
Interactive Multimodal Lifelog Retrieval with vitrivr at LSC 2021
Silvan Heller
|
Ralph Gasser
|
Mahnaz Parian-Scherb
|
Sanja Popovic
|
Luca Rossetto
|
Loris Sauter
|
Florian Spiess
|
Heiko Schuldt
LifeSeeker 3.0: An Interactive Lifelog Search Engine for LSC'21
Thao-Nhu Nguyen
|
Tu-Khiem Le
|
Van-Tu Ninh
|
Minh-Triet Tran
|
Nguyen Thanh Binh
|
Graham Healy
|
Annalina Caputo
|
Cathal Gurrin
LifeConcept: An Interactive Approach for Multimodal Lifelog Retrieval through Concept Recommendation
Wei-Hong Ang
|
An-Zi Yen
|
Tai-Te Chu
|
Hen-Hsen Huang
|
Hsin-Hsi Chen
Memento: A Prototype Lifelog Search Engine for LSC'21
Naushad Alam
|
Yvette Graham
|
Cathal Gurrin
PhotoCube at the Lifelog Search Challenge 2021
Jihye Shin
|
Alexandra Waldau
|
Aaron Duane
|
Björn Þór Jónsson
Voxento 2.0: A Prototype Voice-controlled Interactive Search Engine for Lifelogs
Ahmed Alateeq
|
Mark Roantree
|
Cathal Gurrin
Enhanced SOMHunter for Known-item Search in Lifelog Data
Jakub Lokoc
|
Frantisek Mejzlík
|
Patrik Veselý
|
Tomás Soucek
LifeMon: A MongoDB-Based Lifelog Retrieval Prototype
Alexander Christian Faisst
|
Björn Þór Jónsson
Flexible Interactive Retrieval SysTem 2.0 for Visual Lifelog Exploration at LSC 2021
Hoang-Phuc Trang-Trung
|
Thanh-Cong Le
|
Mai-Khiem Tran
|
Van-Tu Ninh
|
Tu-Khiem Le
|
Cathal Gurrin
|
Minh-Triet Tran
XQC at the Lifelog Search Challenge 2021: Interactive Learning on a Mobile Device
Emil Knudsen
|
Thomas Holstein Qvortrup
|
Omar Shahbaz Khan
|
Björn Þór Jónsson
ICMR '21: International Conference on Multimedia Retrieval, Taipei, Taiwan, August 21-24, 2021
ICMR '21: International Conference on Multimedia Retrieval, Taipei, Taiwan, August 21-24, 2021
Combining Adversarial and Reinforcement Learning for Video Thumbnail Selection
Evlampios E. Apostolidis
|
Eleni Adamantidou
|
Vasileios Mezaris
|
Ioannis Patras
Efficient Indexing of 3D Human Motions
Petra Budíková
|
Jan Sedmidubský
|
Pavel Zezula
Global Relation-Aware Attention Network for Image-Text Retrieval
Jie Cao
|
Shengsheng Qian
|
Huaiwen Zhang
|
Quan Fang
|
Changsheng Xu
MS-SincResNet: Joint Learning of 1D and 2D Kernels Using Multi-scale SincNet and ResNet for Music Genre Classification
Pei-Chun Chang
|
Yong-Sheng Chen
|
Chang-Hsing Lee
MLFont: Few-Shot Chinese Font Generation via Deep Meta-Learning
Xu Chen
|
Lei Wu
|
Minggang He
|
Lei Meng
|
Xiangxu Meng
Facial Structure Guided GAN for Identity-preserved Face Image De-occlusion
Yiu-Ming Cheung
|
Mengke Li
|
Rong Zou
Heterogeneous Side Information-based Iterative Guidance Model for Recommendation
Feifei Dai
|
Xiaoyan Gu
|
Zhuo Wang
|
Mingda Qian
|
Bo Li
|
Weiping Wang
Dense Scale Network for Crowd Counting
Feng Dai
|
Hao Liu
|
Yike Ma
|
Xi Zhang
|
Qiang Zhao
Leveraging Two Types of Global Graph for Sequential Fashion Recommendation
Yujuan Ding
|
Yunshan Ma
|
Wai Keung Wong
|
Tat-Seng Chua
HSGMP: Heterogeneous Scene Graph Message Passing for Cross-modal Retrieval
Yu Duan
|
Yun Xiong
|
Yao Zhang
|
Yuwei Fu
|
Yangyong Zhu
GCNBoost: Artwork Classification by Label Propagation through a Knowledge Graph
Cheikh Brahim El Vaigh
|
Noa Garcia
|
Benjamin Renoust
|
Chenhui Chu
|
Yuta Nakashima
|
Hajime Nagahara
Can Action be Imitated? Learn to Reconstruct and Transfer Human Dynamics from Videos
Yuqian Fu
|
Yanwei Fu
|
Yu-Gang Jiang
SAGN: Semantic Adaptive Graph Network for Skeleton-Based Human Action Recognition
Ziwang Fu
|
Feng Liu
|
Jiahao Zhang
|
Hanyang Wang
|
Chengyi Yang
|
Qing Xu
|
Jiayin Qi
|
Xiangling Fu
|
Aimin Zhou
Text-Guided Visual Feature Refinement for Text-Based Person Search
Liying Gao
|
Kai Niu
|
Zehong Ma
|
Bingliang Jiao
|
Tonghao Tan
|
Peng Wang
RGB-D Scene Recognition based on Object-Scene Relation and Semantics-Preserving Attention
Yuhui Guo
|
Xun Liang
Multi-Feature Graph Attention Network for Cross-Modal Video-Text Retrieval
Xiaoshuai Hao
|
Yucan Zhou
|
Dayan Wu
|
Wanqian Zhang
|
Bo Li
|
Weiping Wang
HPOF: 3D Human Pose Recovery from Monocular Video with Optical Flow
Bin Ji
|
Chen Yang
|
Shunyu Yao
|
Ye Pan
Leveraging EfficientNet and Contrastive Learning for Accurate Global-scale Location Estimation
Giorgos Kordopatis-Zilos
|
Panagiotis Galopoulos
|
Symeon Papadopoulos
|
Ioannis Kompatsiaris
Relation-aware Hierarchical Attention Framework for Video Question Answering
Fangtao Li
|
Ting Bai
|
Chenyu Cao
|
Zihe Liu
|
Chenghao Yan
|
Bin Wu
Cross-Modal Image-Recipe Retrieval via Intra- and Inter-Modality Hybrid Fusion
Jiao Li
|
Jialiang Sun
|
Xing Xu
|
Wei Yu
|
Fumin Shen
Unsupervised Deep Cross-Modal Hashing by Knowledge Distillation for Large-scale Cross-modal Retrieval
Mingyong Li
|
Hongya Wang
A Unified-Model via Block Coordinate Descent for Learning the Importance of Filter
Qinghua Li
|
Xue Zhang
|
Cuiping Li
|
Hong Chen
Local-enhanced Interaction for Temporal Moment Localization
Guoqiang Liang
|
Shiyu Ji
|
Yanning Zhang
Reading Scene Text by Fusing Visual Attention with Semantic Representations
Zhiguang Liu
|
Liangwei Wang
|
Jian Qiao
Generative Adversarial Networks with Bi-directional Normalization for Semantic Image Synthesis
Jia Long
|
Hongtao Lu
A Smart Adversarial Attack on Deep Hashing Based Image Retrieval
Junda Lu
|
Mingyang Chen
|
Yifang Sun
|
Wei Wang
|
Yi Wang
|
Xiaochun Yang
Image-to-Image Transfer Makes Chaos to Order
Sanbi Luo
|
Tao Guo
Summary of the 2021 Embedded Deep Learning Object Detection Model Compression Competition for Traffic in Asian Countries
Yu-Shu Ni
|
Chia-Chi Tsai
|
Jiun-In Guo
|
Jenq-Neng Hwang
|
Bo-Xun Wu
|
Po-Chi Hu
|
Ted T. Kuo
|
Po-Yu Chen
|
Hsien-Kai Kuo
Nested Dense Attention Network for Single Image Super-Resolution
Cheng Qiu
|
Yirong Yao
|
Yuntao Du
Multi-scale Dynamic Network for Temporal Action Detection
Yifan Ren
|
Xing Xu
|
Fumin Shen
|
Zheng Wang
|
Yang Yang
|
Heng Tao Shen
Distractor-Aware Tracker with a Domain-Special Optimized Benchmark for Soccer Player Tracking
Zikai Song
|
Zhiwen Wan
|
Wei Yuan
|
Ying Tang
|
Junqing Yu
|
Yi-Ping Phoebe Chen
Efficient Nearest Neighbor Search by Removing Anti-hub
Kimihiro Tanaka
|
Yusuke Matsui
|
Shin'ichi Satoh
A Denoising Convolutional Neural Network for Self-Supervised Rank Effectiveness Estimation on Image Retrieval
Lucas Pascotti Valem
|
Daniel Carlos Guimarães Pedronette
Know Yourself and Know Others: Efficient Common Representation Learning for Few-shot Cross-modal Retrieval
Shaoying Wang
|
Hanjiang Lai
|
Zhenyu Shi
Neural Symbolic Representation Learning for Image Captioning
Xiaomei Wang
|
Lin Ma
|
Yanwei Fu
|
Xiangyang Xue
G-CAM: Graph Convolution Network Based Class Activation Mapping for Multi-label Image Recognition
Yangtao Wang
|
Yanzhao Xie
|
Yu Liu
|
Lisheng Fan
NASTER: Non-local Attentional Scene Text Recognizer
Lei Wu
|
Xueliang Liu
|
Yanbin Hao
|
Yunjie Ma
|
Richang Hong
Few-Shot Action Localization without Knowing Boundaries
Ting-Ting Xie
|
Christos Tzelepis
|
Fan Fu
|
Ioannis Patras
Learning Hierarchical Visual-Semantic Representation with Phrase Alignment
Baoming Yan
|
Qingheng Zhang
|
Liyu Chen
|
Lin Wang
|
Leihao Pei
|
Jiang Yang
|
Enyun Yu
|
Xiaobo Li
|
Binqiang Zhao
Social Relation Analysis from Videos via Multi-entity Reasoning
Chenghao Yan
|
Zihe Liu
|
Fangtao Li
|
Chenyu Cao
|
Zheng Wang
|
Bin Wu
Aligning Visual Prototypes with BERT Embeddings for Few-Shot Learning
Kun Yan
|
Zied Bouraoui
|
Ping Wang
|
Shoaib Jameel
|
Steven Schockaert
TEACH: Attention-Aware Deep Cross-Modal Hashing
Hong-Lei Yao
|
Yu-Wei Zhan
|
Zhen-Duo Chen
|
Xin Luo
|
Xin-Shun Xu
Scene Text Recognition with Cascade Attention Network
Min Zhang
|
Meng Ma
|
Ping Wang
Multi-Attention Audio-Visual Fusion Network for Audio Spatialization
Wen Zhang
|
Jie Shao
Multi-Initialization Graph Meta-Learning for Node Classification
Feng Zhao
|
Donglin Wang
|
Xintao Xiang
Question-Guided Semantic Dual-Graph Visual Reasoning with Novel Answers
Xinzhe Zhou
|
Yadong Mu
Joint Hand-Object Pose Estimation with Differentiably-Learned Physical Contact Point Analysis
Nan Zhuang
|
Yadong Mu
HINFShot: A Challenge Dataset for Few-Shot Node Classification in Heterogeneous Information Network
Zifeng Zhuang
|
Xintao Xiang
|
Siteng Huang
|
Donglin Wang
Learning to Select: A Fully Attentive Approach for Novel Object Captioning
Marco Cagrandi
|
Marcella Cornia
|
Matteo Stefanini
|
Lorenzo Baraldi
|
Rita Cucchiara
Semi-supervised Many-to-many Music Timbre Transfer
Yu-Chen Chang
|
Wen-Cheng Chen
|
Min-Chun Hu
Text-Enhanced Attribute-Based Attention for Generalized Zero-Shot Fine-Grained Image Classification
Yan-He Chen
|
Mei-Chen Yeh
Spatio-Temporal Activity Detection and Recognition in Untrimmed Surveillance Videos
Konstantinos Gkountakos
|
Despoina Touska
|
Konstantinos Ioannidis
|
Theodora Tsikrika
|
Stefanos Vrochidis
|
Ioannis Kompatsiaris
Cross-Modal Self-Attention with Multi-Task Pre-Training for Medical Visual Question Answering
Haifan Gong
|
Guanqi Chen
|
Sishuo Liu
|
Yizhou Yu
|
Guanbin Li
Body Shape Calculator: Understanding the Type of Body Shapes from Anthropometric Measurements
Shintami Chusnul Hidayati
|
Yeni Anistyasari
Unsupervised Video Summarization via Multi-source Features
Hussain Kanafani
|
Junaid Ahmed Ghauri
|
Sherzod Hakimov
|
Ralph Ewerth
Evaluating Contrastive Models for Instance-based Image Retrieval
Tarun Krishna
|
Kevin McGuinness
|
Noel E. O'Connor
AWFA-LPD: Adaptive Weight Feature Aggregation for Multi-frame License Plate Detection
Xiaocheng Lu
|
Yuan Yuan
|
Qi Wang
NMS-Loss: Learning with Non-Maximum Suppression for Crowded Pedestrian Detection
Zekun Luo
|
Zheng Fang
|
Sixiao Zheng
|
Yabiao Wang
|
Yanwei Fu
Image Retrieval by Hierarchy-aware Deep Hashing Based on Multi-task Learning
Bowen Wang
|
Liangzhi Li
|
Yuta Nakashima
|
Takehiro Yamamoto
|
Hiroaki Ohshima
|
Yoshiyuki Shoji
|
Kenro Aihara
|
Noriko Kando
Weakly Supervised Sketch Based Person Search
Lan Yan
|
Wenbo Zheng
|
Fei-Yue Wang
|
Chao Gou
Personal Knowledge Base Construction from Multimodal Data
An-Zi Yen
|
Chia-Chung Chang
|
Hen-Hsen Huang
|
Hsin-Hsi Chen
2.5D Pose Guided Human Image Generation
Kang Yuan
|
Sheng Li
Collaborative Representation for Deep Meta Metric Learning
Min Zhu
|
Weifeng Liu
|
Kai Zhang
|
Ye Li
|
Peng Liu
|
Baodi Liu
Ten Questions in Lifelog Mining and Information Recall
An-Zi Yen
|
Hen-Hsen Huang
|
Hsin-Hsi Chen
Bag of Tricks for Building an Accurate and Slim Object Detector for Embedded Applications
Yongkun Du
|
Zhineng Chen
|
Caiyan Jia
|
Xuanya Li
|
Yu-Gang Jiang
Efficient-ROD: Efficient Radar Object Detection based on Densely Connected Residual Network
Chih-Chung Hsu
|
Chieh Lee
|
Lin Chen
|
Min-Kai Hung
|
Andy Yu-Lun Lin
|
Xian-Yu Wang
DANet: Dimension Apart Network for Radar Object Detection
Bo Ju
|
Wei Yang
|
Jinrang Jia
|
Xiaoqing Ye
|
Qu Chen
|
Xiao Tan
|
Hao Sun
|
Yifeng Shi
|
Errui Ding
Object Detection on Embedded Systems for Traffic in Asian Countries
Bao-Hong Lai
|
Hsun-Ping Hsieh
Squeeze-and-Excitation network-Based Radar Object Detection With Weighted Location Fusion
Pengliang Sun
|
Xuetong Niu
|
Pengfei Sun
|
Kele Xu
ROD2021 Challenge: A Summary for Radar Object Detection Challenge for Autonomous Driving Applications
Yizhou Wang
|
Jenq-Neng Hwang
|
Gaoang Wang
|
Hui Liu
|
Kwang-Ju Kim
|
Hung-Min Hsu
|
Jiarui Cai
|
Haotian Zhang
|
Zhongyu Jiang
|
Renshu Gu
Embedded YOLO: Faster and Lighter Object Detection
Wen-Kai Wu
|
Chien-Yu Chen
|
Jiann-Shu Lee
Radar Object Detection Using Data Merging, Enhancement and Fusion
Jun Yu
|
Xinlong Hao
|
Xinjian Gao
|
Qiang Sun
|
Yuyu Liu
|
Peng Chang
|
Zhong Zhang
|
Fang Gao
|
Feng Shuang
Scene-aware Learning Network for Radar Object Detection
Zangwei Zheng
|
Xiangyu Yue
|
Kurt Keutzer
|
Alberto L. Sangiovanni-Vincentelli
GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization
Jia-Hong Huang
|
Luka Murn
|
Marta Mrak
|
Marcel Worring
Impact of Interaction Strategies on User Relevance Feedback
Omar Shahbaz Khan
|
Björn Þór Jónsson
|
Jan Zahálka
|
Stevan Rudinac
|
Marcel Worring
Automatic Baseball Pitch Overlay
Ting-Hsuan Chou
|
Wei-Ta Chu
Video Action Retrieval Using Action Recognition Model
Yuko Iinuma
|
Shin'ichi Satoh
MeTILDA: Platform for Melodic Transcription in Language Documentation and Application
Mitchell Lee
|
Praveena Avula
|
Min Chen
IR Questioner: QA-based Interactive Retrieval System
Rintaro Yanagi
|
Ren Togo
|
Takahiro Ogawa
|
Miki Haseyama
Reproducibility Companion Paper: Knowledge Enhanced Neural Fashion Trend Forecasting
Yunshan Ma
|
Yujuan Ding
|
Xun Yang
|
Lizi Liao
|
Wai Keung Wong
|
Tat-Seng Chua
|
Jinyoung Moon
|
Hong-Han Shuai
A Beneficial Dual Transformation Approach for Deep Learning Networks Used in Steel Surface Defect Detection
Fityanul Akhyar
|
Chih-Yang Lin
|
Gugan S. Kathiresan
Discrete Tchebichef Transform for Versatile Video Coding
Ka-Hou Chan
|
Sio Kei Im
Fire Detection using Transformer Network
Mohammad Shahid
|
Kai-Lung Hua
Visible-infrared Person Re-identification with Human Body Parts Assistance
Huangpeng Dai
|
Qing Xie
|
Jiachen Li
|
Yanchun Ma
|
Lin Li
|
Yongjian Liu
Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition
Zilong Fu
|
Hongtao Xie
|
Guoqing Jin
|
Junbo Guo
Contextualized Keyword Representations for Multi-modal Retinal Image Captioning
Jia-Hong Huang
|
Ting-Wei Wu
|
Marcel Worring
MSAV: An Unified Framework for Multi-view Subspace Analysis with View Consistence
Huibing Wang
|
Guangqi Jiang
|
Jinjia Peng
|
Xianping Fu
A Tensor Sparse Representation-Based CBMIR System for Computer-Aided Diagnosis of Focal Liver Lesions and its Pilot Trial
Jian Wang
|
Xian-Hua Han
|
Lanfen Lin
|
Hongjie Hu
|
Yen-Wei Chen
M-DFNet: Multi-phase Discriminative Feature Network for Retrieval of Focal Liver Lesions
Yingying Xu
|
Jing Liu
|
Lanfen Lin
|
Hongjie Hu
|
Ruofeng Tong
|
Jingsong Li
|
Yen-Wei Chen
M2GUDA: Multi-Metrics Graph-Based Unsupervised Domain Adaptation for Cross-Modal Hashing
Chengyuan Zhang
|
Zhi Zhong
|
Lei Zhu
|
Shichao Zhang
|
Da Cao
|
Jianfeng Zhang
Human Pose Estimation based on Attention Multi-resolution Network
Congcong Zhang
|
Ning He
|
Qixiang Sun
|
Xiaojie Yin
|
Ke Lu
ICDAR'21: Intelligent Cross-Data Analysis and Retrieval
Minh-Son Dao
|
Michael Alexander Riegler
|
Duc-Tien Dang-Nguyen
|
Cathal Gurrin
|
Minh-Triet Tran
|
Thanh-Binh Nguyen
Introduction to the Fourth Annual Lifelog Search Challenge, LSC'21
Cathal Gurrin
|
Björn Þór Jónsson
|
Klaus Schöffmann
|
Duc-Tien Dang-Nguyen
|
Jakub Lokoc
|
Minh-Triet Tran
|
Wolfgang Hürst
|
Luca Rossetto
|
Graham Healy
MMArt-ACM'21: International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia 2021
Min-Chun Hu
|
Ichiro Ide
|
Kensuke Tobitani
MMPT'21: International Joint Workshop on Multi-Modal Pre-Training for Multimedia Understanding
Bei Liu
|
Jianlong Fu
|
Shizhe Chen
|
Qin Jin
|
Alexander G. Hauptmann
|
Yong Rui
CEA'21: The 13th Workshop on Multimedia for Cooking and Eating Activities
Yoko Yamakata
|
Atsushi Hashimoto