ICMR '21: International Conference on Multimedia Retrieval, Taipei, Taiwan, August 21-24, 2021
Wen-Huang Cheng, Mohan S. Kankanhalli, Meng Wang, Wei-Ta Chu, Jiaying Liu, Marcel Worring (Editors)
- Anthology ID:
- 2021.mir_conference-2021
- Year:
- 2021
- Venue:
- mir_conference
- Publisher:
- ACM
- URL:
- https://doi.org/10.1145/3460426
- DOI:
- 10.1145/3460426
- DBLP:
- conf/mir/2021
ICMR '21: International Conference on Multimedia Retrieval, Taipei, Taiwan, August 21-24, 2021
Combining Adversarial and Reinforcement Learning for Video Thumbnail Selection
Evlampios E. Apostolidis
|
Eleni Adamantidou
|
Vasileios Mezaris
|
Ioannis Patras
Efficient Indexing of 3D Human Motions
Petra Budíková
|
Jan Sedmidubský
|
Pavel Zezula
Global Relation-Aware Attention Network for Image-Text Retrieval
Jie Cao
|
Shengsheng Qian
|
Huaiwen Zhang
|
Quan Fang
|
Changsheng Xu
MS-SincResNet: Joint Learning of 1D and 2D Kernels Using Multi-scale SincNet and ResNet for Music Genre Classification
Pei-Chun Chang
|
Yong-Sheng Chen
|
Chang-Hsing Lee
MLFont: Few-Shot Chinese Font Generation via Deep Meta-Learning
Xu Chen
|
Lei Wu
|
Minggang He
|
Lei Meng
|
Xiangxu Meng
Facial Structure Guided GAN for Identity-preserved Face Image De-occlusion
Yiu-Ming Cheung
|
Mengke Li
|
Rong Zou
Heterogeneous Side Information-based Iterative Guidance Model for Recommendation
Feifei Dai
|
Xiaoyan Gu
|
Zhuo Wang
|
Mingda Qian
|
Bo Li
|
Weiping Wang
Dense Scale Network for Crowd Counting
Feng Dai
|
Hao Liu
|
Yike Ma
|
Xi Zhang
|
Qiang Zhao
Leveraging Two Types of Global Graph for Sequential Fashion Recommendation
Yujuan Ding
|
Yunshan Ma
|
Wai Keung Wong
|
Tat-Seng Chua
HSGMP: Heterogeneous Scene Graph Message Passing for Cross-modal Retrieval
Yu Duan
|
Yun Xiong
|
Yao Zhang
|
Yuwei Fu
|
Yangyong Zhu
GCNBoost: Artwork Classification by Label Propagation through a Knowledge Graph
Cheikh Brahim El Vaigh
|
Noa Garcia
|
Benjamin Renoust
|
Chenhui Chu
|
Yuta Nakashima
|
Hajime Nagahara
Can Action be Imitated? Learn to Reconstruct and Transfer Human Dynamics from Videos
Yuqian Fu
|
Yanwei Fu
|
Yu-Gang Jiang
SAGN: Semantic Adaptive Graph Network for Skeleton-Based Human Action Recognition
Ziwang Fu
|
Feng Liu
|
Jiahao Zhang
|
Hanyang Wang
|
Chengyi Yang
|
Qing Xu
|
Jiayin Qi
|
Xiangling Fu
|
Aimin Zhou
Text-Guided Visual Feature Refinement for Text-Based Person Search
Liying Gao
|
Kai Niu
|
Zehong Ma
|
Bingliang Jiao
|
Tonghao Tan
|
Peng Wang
RGB-D Scene Recognition based on Object-Scene Relation and Semantics-Preserving Attention
Yuhui Guo
|
Xun Liang
Multi-Feature Graph Attention Network for Cross-Modal Video-Text Retrieval
Xiaoshuai Hao
|
Yucan Zhou
|
Dayan Wu
|
Wanqian Zhang
|
Bo Li
|
Weiping Wang
HPOF: 3D Human Pose Recovery from Monocular Video with Optical Flow
Bin Ji
|
Chen Yang
|
Shunyu Yao
|
Ye Pan
Leveraging EfficientNet and Contrastive Learning for Accurate Global-scale Location Estimation
Giorgos Kordopatis-Zilos
|
Panagiotis Galopoulos
|
Symeon Papadopoulos
|
Ioannis Kompatsiaris
Relation-aware Hierarchical Attention Framework for Video Question Answering
Fangtao Li
|
Ting Bai
|
Chenyu Cao
|
Zihe Liu
|
Chenghao Yan
|
Bin Wu
Cross-Modal Image-Recipe Retrieval via Intra- and Inter-Modality Hybrid Fusion
Jiao Li
|
Jialiang Sun
|
Xing Xu
|
Wei Yu
|
Fumin Shen
Unsupervised Deep Cross-Modal Hashing by Knowledge Distillation for Large-scale Cross-modal Retrieval
Mingyong Li
|
Hongya Wang
A Unified-Model via Block Coordinate Descent for Learning the Importance of Filter
Qinghua Li
|
Xue Zhang
|
Cuiping Li
|
Hong Chen
Local-enhanced Interaction for Temporal Moment Localization
Guoqiang Liang
|
Shiyu Ji
|
Yanning Zhang
Reading Scene Text by Fusing Visual Attention with Semantic Representations
Zhiguang Liu
|
Liangwei Wang
|
Jian Qiao
Generative Adversarial Networks with Bi-directional Normalization for Semantic Image Synthesis
Jia Long
|
Hongtao Lu
A Smart Adversarial Attack on Deep Hashing Based Image Retrieval
Junda Lu
|
Mingyang Chen
|
Yifang Sun
|
Wei Wang
|
Yi Wang
|
Xiaochun Yang
Image-to-Image Transfer Makes Chaos to Order
Sanbi Luo
|
Tao Guo
Summary of the 2021 Embedded Deep Learning Object Detection Model Compression Competition for Traffic in Asian Countries
Yu-Shu Ni
|
Chia-Chi Tsai
|
Jiun-In Guo
|
Jenq-Neng Hwang
|
Bo-Xun Wu
|
Po-Chi Hu
|
Ted T. Kuo
|
Po-Yu Chen
|
Hsien-Kai Kuo
Nested Dense Attention Network for Single Image Super-Resolution
Cheng Qiu
|
Yirong Yao
|
Yuntao Du
Multi-scale Dynamic Network for Temporal Action Detection
Yifan Ren
|
Xing Xu
|
Fumin Shen
|
Zheng Wang
|
Yang Yang
|
Heng Tao Shen
Distractor-Aware Tracker with a Domain-Special Optimized Benchmark for Soccer Player Tracking
Zikai Song
|
Zhiwen Wan
|
Wei Yuan
|
Ying Tang
|
Junqing Yu
|
Yi-Ping Phoebe Chen
Efficient Nearest Neighbor Search by Removing Anti-hub
Kimihiro Tanaka
|
Yusuke Matsui
|
Shin'ichi Satoh
A Denoising Convolutional Neural Network for Self-Supervised Rank Effectiveness Estimation on Image Retrieval
Lucas Pascotti Valem
|
Daniel Carlos Guimarães Pedronette
Know Yourself and Know Others: Efficient Common Representation Learning for Few-shot Cross-modal Retrieval
Shaoying Wang
|
Hanjiang Lai
|
Zhenyu Shi
Neural Symbolic Representation Learning for Image Captioning
Xiaomei Wang
|
Lin Ma
|
Yanwei Fu
|
Xiangyang Xue
G-CAM: Graph Convolution Network Based Class Activation Mapping for Multi-label Image Recognition
Yangtao Wang
|
Yanzhao Xie
|
Yu Liu
|
Lisheng Fan
NASTER: Non-local Attentional Scene Text Recognizer
Lei Wu
|
Xueliang Liu
|
Yanbin Hao
|
Yunjie Ma
|
Richang Hong
Few-Shot Action Localization without Knowing Boundaries
Ting-Ting Xie
|
Christos Tzelepis
|
Fan Fu
|
Ioannis Patras
Learning Hierarchical Visual-Semantic Representation with Phrase Alignment
Baoming Yan
|
Qingheng Zhang
|
Liyu Chen
|
Lin Wang
|
Leihao Pei
|
Jiang Yang
|
Enyun Yu
|
Xiaobo Li
|
Binqiang Zhao
Social Relation Analysis from Videos via Multi-entity Reasoning
Chenghao Yan
|
Zihe Liu
|
Fangtao Li
|
Chenyu Cao
|
Zheng Wang
|
Bin Wu
Aligning Visual Prototypes with BERT Embeddings for Few-Shot Learning
Kun Yan
|
Zied Bouraoui
|
Ping Wang
|
Shoaib Jameel
|
Steven Schockaert
TEACH: Attention-Aware Deep Cross-Modal Hashing
Hong-Lei Yao
|
Yu-Wei Zhan
|
Zhen-Duo Chen
|
Xin Luo
|
Xin-Shun Xu
Scene Text Recognition with Cascade Attention Network
Min Zhang
|
Meng Ma
|
Ping Wang
Multi-Attention Audio-Visual Fusion Network for Audio Spatialization
Wen Zhang
|
Jie Shao
Multi-Initialization Graph Meta-Learning for Node Classification
Feng Zhao
|
Donglin Wang
|
Xintao Xiang
Question-Guided Semantic Dual-Graph Visual Reasoning with Novel Answers
Xinzhe Zhou
|
Yadong Mu
Joint Hand-Object Pose Estimation with Differentiably-Learned Physical Contact Point Analysis
Nan Zhuang
|
Yadong Mu
HINFShot: A Challenge Dataset for Few-Shot Node Classification in Heterogeneous Information Network
Zifeng Zhuang
|
Xintao Xiang
|
Siteng Huang
|
Donglin Wang
Learning to Select: A Fully Attentive Approach for Novel Object Captioning
Marco Cagrandi
|
Marcella Cornia
|
Matteo Stefanini
|
Lorenzo Baraldi
|
Rita Cucchiara
Semi-supervised Many-to-many Music Timbre Transfer
Yu-Chen Chang
|
Wen-Cheng Chen
|
Min-Chun Hu
Text-Enhanced Attribute-Based Attention for Generalized Zero-Shot Fine-Grained Image Classification
Yan-He Chen
|
Mei-Chen Yeh
Spatio-Temporal Activity Detection and Recognition in Untrimmed Surveillance Videos
Konstantinos Gkountakos
|
Despoina Touska
|
Konstantinos Ioannidis
|
Theodora Tsikrika
|
Stefanos Vrochidis
|
Ioannis Kompatsiaris
Cross-Modal Self-Attention with Multi-Task Pre-Training for Medical Visual Question Answering
Haifan Gong
|
Guanqi Chen
|
Sishuo Liu
|
Yizhou Yu
|
Guanbin Li
Body Shape Calculator: Understanding the Type of Body Shapes from Anthropometric Measurements
Shintami Chusnul Hidayati
|
Yeni Anistyasari
Unsupervised Video Summarization via Multi-source Features
Hussain Kanafani
|
Junaid Ahmed Ghauri
|
Sherzod Hakimov
|
Ralph Ewerth
Evaluating Contrastive Models for Instance-based Image Retrieval
Tarun Krishna
|
Kevin McGuinness
|
Noel E. O'Connor
AWFA-LPD: Adaptive Weight Feature Aggregation for Multi-frame License Plate Detection
Xiaocheng Lu
|
Yuan Yuan
|
Qi Wang
NMS-Loss: Learning with Non-Maximum Suppression for Crowded Pedestrian Detection
Zekun Luo
|
Zheng Fang
|
Sixiao Zheng
|
Yabiao Wang
|
Yanwei Fu
Image Retrieval by Hierarchy-aware Deep Hashing Based on Multi-task Learning
Bowen Wang
|
Liangzhi Li
|
Yuta Nakashima
|
Takehiro Yamamoto
|
Hiroaki Ohshima
|
Yoshiyuki Shoji
|
Kenro Aihara
|
Noriko Kando
Weakly Supervised Sketch Based Person Search
Lan Yan
|
Wenbo Zheng
|
Fei-Yue Wang
|
Chao Gou
Personal Knowledge Base Construction from Multimodal Data
An-Zi Yen
|
Chia-Chung Chang
|
Hen-Hsen Huang
|
Hsin-Hsi Chen
2.5D Pose Guided Human Image Generation
Kang Yuan
|
Sheng Li
Collaborative Representation for Deep Meta Metric Learning
Min Zhu
|
Weifeng Liu
|
Kai Zhang
|
Ye Li
|
Peng Liu
|
Baodi Liu
Ten Questions in Lifelog Mining and Information Recall
An-Zi Yen
|
Hen-Hsen Huang
|
Hsin-Hsi Chen
Bag of Tricks for Building an Accurate and Slim Object Detector for Embedded Applications
Yongkun Du
|
Zhineng Chen
|
Caiyan Jia
|
Xuanya Li
|
Yu-Gang Jiang
Efficient-ROD: Efficient Radar Object Detection based on Densely Connected Residual Network
Chih-Chung Hsu
|
Chieh Lee
|
Lin Chen
|
Min-Kai Hung
|
Andy Yu-Lun Lin
|
Xian-Yu Wang
DANet: Dimension Apart Network for Radar Object Detection
Bo Ju
|
Wei Yang
|
Jinrang Jia
|
Xiaoqing Ye
|
Qu Chen
|
Xiao Tan
|
Hao Sun
|
Yifeng Shi
|
Errui Ding
Object Detection on Embedded Systems for Traffic in Asian Countries
Bao-Hong Lai
|
Hsun-Ping Hsieh
Squeeze-and-Excitation network-Based Radar Object Detection With Weighted Location Fusion
Pengliang Sun
|
Xuetong Niu
|
Pengfei Sun
|
Kele Xu
ROD2021 Challenge: A Summary for Radar Object Detection Challenge for Autonomous Driving Applications
Yizhou Wang
|
Jenq-Neng Hwang
|
Gaoang Wang
|
Hui Liu
|
Kwang-Ju Kim
|
Hung-Min Hsu
|
Jiarui Cai
|
Haotian Zhang
|
Zhongyu Jiang
|
Renshu Gu
Embedded YOLO: Faster and Lighter Object Detection
Wen-Kai Wu
|
Chien-Yu Chen
|
Jiann-Shu Lee
Radar Object Detection Using Data Merging, Enhancement and Fusion
Jun Yu
|
Xinlong Hao
|
Xinjian Gao
|
Qiang Sun
|
Yuyu Liu
|
Peng Chang
|
Zhong Zhang
|
Fang Gao
|
Feng Shuang
Scene-aware Learning Network for Radar Object Detection
Zangwei Zheng
|
Xiangyu Yue
|
Kurt Keutzer
|
Alberto L. Sangiovanni-Vincentelli
GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization
Jia-Hong Huang
|
Luka Murn
|
Marta Mrak
|
Marcel Worring
Impact of Interaction Strategies on User Relevance Feedback
Omar Shahbaz Khan
|
Björn Þór Jónsson
|
Jan Zahálka
|
Stevan Rudinac
|
Marcel Worring
Automatic Baseball Pitch Overlay
Ting-Hsuan Chou
|
Wei-Ta Chu
Video Action Retrieval Using Action Recognition Model
Yuko Iinuma
|
Shin'ichi Satoh
MeTILDA: Platform for Melodic Transcription in Language Documentation and Application
Mitchell Lee
|
Praveena Avula
|
Min Chen
IR Questioner: QA-based Interactive Retrieval System
Rintaro Yanagi
|
Ren Togo
|
Takahiro Ogawa
|
Miki Haseyama
Reproducibility Companion Paper: Knowledge Enhanced Neural Fashion Trend Forecasting
Yunshan Ma
|
Yujuan Ding
|
Xun Yang
|
Lizi Liao
|
Wai Keung Wong
|
Tat-Seng Chua
|
Jinyoung Moon
|
Hong-Han Shuai
A Beneficial Dual Transformation Approach for Deep Learning Networks Used in Steel Surface Defect Detection
Fityanul Akhyar
|
Chih-Yang Lin
|
Gugan S. Kathiresan
Discrete Tchebichef Transform for Versatile Video Coding
Ka-Hou Chan
|
Sio Kei Im
Fire Detection using Transformer Network
Mohammad Shahid
|
Kai-Lung Hua
Visible-infrared Person Re-identification with Human Body Parts Assistance
Huangpeng Dai
|
Qing Xie
|
Jiachen Li
|
Yanchun Ma
|
Lin Li
|
Yongjian Liu
Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition
Zilong Fu
|
Hongtao Xie
|
Guoqing Jin
|
Junbo Guo
Contextualized Keyword Representations for Multi-modal Retinal Image Captioning
Jia-Hong Huang
|
Ting-Wei Wu
|
Marcel Worring
MSAV: An Unified Framework for Multi-view Subspace Analysis with View Consistence
Huibing Wang
|
Guangqi Jiang
|
Jinjia Peng
|
Xianping Fu
A Tensor Sparse Representation-Based CBMIR System for Computer-Aided Diagnosis of Focal Liver Lesions and its Pilot Trial
Jian Wang
|
Xian-Hua Han
|
Lanfen Lin
|
Hongjie Hu
|
Yen-Wei Chen
M-DFNet: Multi-phase Discriminative Feature Network for Retrieval of Focal Liver Lesions
Yingying Xu
|
Jing Liu
|
Lanfen Lin
|
Hongjie Hu
|
Ruofeng Tong
|
Jingsong Li
|
Yen-Wei Chen
M2GUDA: Multi-Metrics Graph-Based Unsupervised Domain Adaptation for Cross-Modal Hashing
Chengyuan Zhang
|
Zhi Zhong
|
Lei Zhu
|
Shichao Zhang
|
Da Cao
|
Jianfeng Zhang
Human Pose Estimation based on Attention Multi-resolution Network
Congcong Zhang
|
Ning He
|
Qixiang Sun
|
Xiaojie Yin
|
Ke Lu
ICDAR'21: Intelligent Cross-Data Analysis and Retrieval
Minh-Son Dao
|
Michael Alexander Riegler
|
Duc-Tien Dang-Nguyen
|
Cathal Gurrin
|
Minh-Triet Tran
|
Thanh-Binh Nguyen
Introduction to the Fourth Annual Lifelog Search Challenge, LSC'21
Cathal Gurrin
|
Björn Þór Jónsson
|
Klaus Schöffmann
|
Duc-Tien Dang-Nguyen
|
Jakub Lokoc
|
Minh-Triet Tran
|
Wolfgang Hürst
|
Luca Rossetto
|
Graham Healy
MMArt-ACM'21: International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia 2021
Min-Chun Hu
|
Ichiro Ide
|
Kensuke Tobitani
MMPT'21: International Joint Workshop on Multi-Modal Pre-Training for Multimedia Understanding
Bei Liu
|
Jianlong Fu
|
Shizhe Chen
|
Qin Jin
|
Alexander G. Hauptmann
|
Yong Rui
CEA'21: The 13th Workshop on Multimedia for Cooking and Eating Activities
Yoko Yamakata
|
Atsushi Hashimoto