ICMR '21: International Conference on Multimedia Retrieval, Taipei, Taiwan, August 21-24, 2021

Wen-Huang Cheng, Mohan S. Kankanhalli, Meng Wang, Wei-Ta Chu, Jiaying Liu, Marcel Worring (Editors)


Anthology ID:
2021.mir_conference-2021
Year:
2021
Venue:
mir_conference
Publisher:
ACM
URL:
https://doi.org/10.1145/3460426
DOI:
10.1145/3460426
DBLP:
conf/mir/2021

Combining Adversarial and Reinforcement Learning for Video Thumbnail Selection
Evlampios E. Apostolidis | Eleni Adamantidou | Vasileios Mezaris | Ioannis Patras

Efficient Indexing of 3D Human Motions
Petra Budíková | Jan Sedmidubský | Pavel Zezula

Global Relation-Aware Attention Network for Image-Text Retrieval
Jie Cao | Shengsheng Qian | Huaiwen Zhang | Quan Fang | Changsheng Xu

MS-SincResNet: Joint Learning of 1D and 2D Kernels Using Multi-scale SincNet and ResNet for Music Genre Classification
Pei-Chun Chang | Yong-Sheng Chen | Chang-Hsing Lee

MLFont: Few-Shot Chinese Font Generation via Deep Meta-Learning
Xu Chen | Lei Wu | Minggang He | Lei Meng | Xiangxu Meng

Facial Structure Guided GAN for Identity-preserved Face Image De-occlusion
Yiu-Ming Cheung | Mengke Li | Rong Zou

Heterogeneous Side Information-based Iterative Guidance Model for Recommendation
Feifei Dai | Xiaoyan Gu | Zhuo Wang | Mingda Qian | Bo Li | Weiping Wang

Dense Scale Network for Crowd Counting
Feng Dai | Hao Liu | Yike Ma | Xi Zhang | Qiang Zhao

Leveraging Two Types of Global Graph for Sequential Fashion Recommendation
Yujuan Ding | Yunshan Ma | Wai Keung Wong | Tat-Seng Chua

HSGMP: Heterogeneous Scene Graph Message Passing for Cross-modal Retrieval
Yu Duan | Yun Xiong | Yao Zhang | Yuwei Fu | Yangyong Zhu

GCNBoost: Artwork Classification by Label Propagation through a Knowledge Graph
Cheikh Brahim El Vaigh | Noa Garcia | Benjamin Renoust | Chenhui Chu | Yuta Nakashima | Hajime Nagahara

Can Action be Imitated? Learn to Reconstruct and Transfer Human Dynamics from Videos
Yuqian Fu | Yanwei Fu | Yu-Gang Jiang

SAGN: Semantic Adaptive Graph Network for Skeleton-Based Human Action Recognition
Ziwang Fu | Feng Liu | Jiahao Zhang | Hanyang Wang | Chengyi Yang | Qing Xu | Jiayin Qi | Xiangling Fu | Aimin Zhou

Text-Guided Visual Feature Refinement for Text-Based Person Search
Liying Gao | Kai Niu | Zehong Ma | Bingliang Jiao | Tonghao Tan | Peng Wang

RGB-D Scene Recognition based on Object-Scene Relation and Semantics-Preserving Attention
Yuhui Guo | Xun Liang

Multi-Feature Graph Attention Network for Cross-Modal Video-Text Retrieval
Xiaoshuai Hao | Yucan Zhou | Dayan Wu | Wanqian Zhang | Bo Li | Weiping Wang

HPOF: 3D Human Pose Recovery from Monocular Video with Optical Flow
Bin Ji | Chen Yang | Shunyu Yao | Ye Pan

Leveraging EfficientNet and Contrastive Learning for Accurate Global-scale Location Estimation
Giorgos Kordopatis-Zilos | Panagiotis Galopoulos | Symeon Papadopoulos | Ioannis Kompatsiaris

Relation-aware Hierarchical Attention Framework for Video Question Answering
Fangtao Li | Ting Bai | Chenyu Cao | Zihe Liu | Chenghao Yan | Bin Wu

Cross-Modal Image-Recipe Retrieval via Intra- and Inter-Modality Hybrid Fusion
Jiao Li | Jialiang Sun | Xing Xu | Wei Yu | Fumin Shen

Unsupervised Deep Cross-Modal Hashing by Knowledge Distillation for Large-scale Cross-modal Retrieval
Mingyong Li | Hongya Wang

A Unified-Model via Block Coordinate Descent for Learning the Importance of Filter
Qinghua Li | Xue Zhang | Cuiping Li | Hong Chen

Local-enhanced Interaction for Temporal Moment Localization
Guoqiang Liang | Shiyu Ji | Yanning Zhang

Reading Scene Text by Fusing Visual Attention with Semantic Representations
Zhiguang Liu | Liangwei Wang | Jian Qiao

Generative Adversarial Networks with Bi-directional Normalization for Semantic Image Synthesis
Jia Long | Hongtao Lu

A Smart Adversarial Attack on Deep Hashing Based Image Retrieval
Junda Lu | Mingyang Chen | Yifang Sun | Wei Wang | Yi Wang | Xiaochun Yang

Image-to-Image Transfer Makes Chaos to Order
Sanbi Luo | Tao Guo

Summary of the 2021 Embedded Deep Learning Object Detection Model Compression Competition for Traffic in Asian Countries
Yu-Shu Ni | Chia-Chi Tsai | Jiun-In Guo | Jenq-Neng Hwang | Bo-Xun Wu | Po-Chi Hu | Ted T. Kuo | Po-Yu Chen | Hsien-Kai Kuo

Nested Dense Attention Network for Single Image Super-Resolution
Cheng Qiu | Yirong Yao | Yuntao Du

Multi-scale Dynamic Network for Temporal Action Detection
Yifan Ren | Xing Xu | Fumin Shen | Zheng Wang | Yang Yang | Heng Tao Shen

Distractor-Aware Tracker with a Domain-Special Optimized Benchmark for Soccer Player Tracking
Zikai Song | Zhiwen Wan | Wei Yuan | Ying Tang | Junqing Yu | Yi-Ping Phoebe Chen

Efficient Nearest Neighbor Search by Removing Anti-hub
Kimihiro Tanaka | Yusuke Matsui | Shin'ichi Satoh

A Denoising Convolutional Neural Network for Self-Supervised Rank Effectiveness Estimation on Image Retrieval
Lucas Pascotti Valem | Daniel Carlos Guimarães Pedronette

Know Yourself and Know Others: Efficient Common Representation Learning for Few-shot Cross-modal Retrieval
Shaoying Wang | Hanjiang Lai | Zhenyu Shi

Neural Symbolic Representation Learning for Image Captioning
Xiaomei Wang | Lin Ma | Yanwei Fu | Xiangyang Xue

G-CAM: Graph Convolution Network Based Class Activation Mapping for Multi-label Image Recognition
Yangtao Wang | Yanzhao Xie | Yu Liu | Lisheng Fan

NASTER: Non-local Attentional Scene Text Recognizer
Lei Wu | Xueliang Liu | Yanbin Hao | Yunjie Ma | Richang Hong

Few-Shot Action Localization without Knowing Boundaries
Ting-Ting Xie | Christos Tzelepis | Fan Fu | Ioannis Patras

Learning Hierarchical Visual-Semantic Representation with Phrase Alignment
Baoming Yan | Qingheng Zhang | Liyu Chen | Lin Wang | Leihao Pei | Jiang Yang | Enyun Yu | Xiaobo Li | Binqiang Zhao

Social Relation Analysis from Videos via Multi-entity Reasoning
Chenghao Yan | Zihe Liu | Fangtao Li | Chenyu Cao | Zheng Wang | Bin Wu

Aligning Visual Prototypes with BERT Embeddings for Few-Shot Learning
Kun Yan | Zied Bouraoui | Ping Wang | Shoaib Jameel | Steven Schockaert

TEACH: Attention-Aware Deep Cross-Modal Hashing
Hong-Lei Yao | Yu-Wei Zhan | Zhen-Duo Chen | Xin Luo | Xin-Shun Xu

Scene Text Recognition with Cascade Attention Network
Min Zhang | Meng Ma | Ping Wang

Multi-Attention Audio-Visual Fusion Network for Audio Spatialization
Wen Zhang | Jie Shao

Multi-Initialization Graph Meta-Learning for Node Classification
Feng Zhao | Donglin Wang | Xintao Xiang

Question-Guided Semantic Dual-Graph Visual Reasoning with Novel Answers
Xinzhe Zhou | Yadong Mu

Joint Hand-Object Pose Estimation with Differentiably-Learned Physical Contact Point Analysis
Nan Zhuang | Yadong Mu

HINFShot: A Challenge Dataset for Few-Shot Node Classification in Heterogeneous Information Network
Zifeng Zhuang | Xintao Xiang | Siteng Huang | Donglin Wang

Learning to Select: A Fully Attentive Approach for Novel Object Captioning
Marco Cagrandi | Marcella Cornia | Matteo Stefanini | Lorenzo Baraldi | Rita Cucchiara

Semi-supervised Many-to-many Music Timbre Transfer
Yu-Chen Chang | Wen-Cheng Chen | Min-Chun Hu

Text-Enhanced Attribute-Based Attention for Generalized Zero-Shot Fine-Grained Image Classification
Yan-He Chen | Mei-Chen Yeh

Spatio-Temporal Activity Detection and Recognition in Untrimmed Surveillance Videos
Konstantinos Gkountakos | Despoina Touska | Konstantinos Ioannidis | Theodora Tsikrika | Stefanos Vrochidis | Ioannis Kompatsiaris

Cross-Modal Self-Attention with Multi-Task Pre-Training for Medical Visual Question Answering
Haifan Gong | Guanqi Chen | Sishuo Liu | Yizhou Yu | Guanbin Li

Body Shape Calculator: Understanding the Type of Body Shapes from Anthropometric Measurements
Shintami Chusnul Hidayati | Yeni Anistyasari

Unsupervised Video Summarization via Multi-source Features
Hussain Kanafani | Junaid Ahmed Ghauri | Sherzod Hakimov | Ralph Ewerth

Evaluating Contrastive Models for Instance-based Image Retrieval
Tarun Krishna | Kevin McGuinness | Noel E. O'Connor

AWFA-LPD: Adaptive Weight Feature Aggregation for Multi-frame License Plate Detection
Xiaocheng Lu | Yuan Yuan | Qi Wang

NMS-Loss: Learning with Non-Maximum Suppression for Crowded Pedestrian Detection
Zekun Luo | Zheng Fang | Sixiao Zheng | Yabiao Wang | Yanwei Fu

Image Retrieval by Hierarchy-aware Deep Hashing Based on Multi-task Learning
Bowen Wang | Liangzhi Li | Yuta Nakashima | Takehiro Yamamoto | Hiroaki Ohshima | Yoshiyuki Shoji | Kenro Aihara | Noriko Kando

Weakly Supervised Sketch Based Person Search
Lan Yan | Wenbo Zheng | Fei-Yue Wang | Chao Gou

Personal Knowledge Base Construction from Multimodal Data
An-Zi Yen | Chia-Chung Chang | Hen-Hsen Huang | Hsin-Hsi Chen

2.5D Pose Guided Human Image Generation
Kang Yuan | Sheng Li

Collaborative Representation for Deep Meta Metric Learning
Min Zhu | Weifeng Liu | Kai Zhang | Ye Li | Peng Liu | Baodi Liu

Ten Questions in Lifelog Mining and Information Recall
An-Zi Yen | Hen-Hsen Huang | Hsin-Hsi Chen

Bag of Tricks for Building an Accurate and Slim Object Detector for Embedded Applications
Yongkun Du | Zhineng Chen | Caiyan Jia | Xuanya Li | Yu-Gang Jiang

Efficient-ROD: Efficient Radar Object Detection based on Densely Connected Residual Network
Chih-Chung Hsu | Chieh Lee | Lin Chen | Min-Kai Hung | Andy Yu-Lun Lin | Xian-Yu Wang

DANet: Dimension Apart Network for Radar Object Detection
Bo Ju | Wei Yang | Jinrang Jia | Xiaoqing Ye | Qu Chen | Xiao Tan | Hao Sun | Yifeng Shi | Errui Ding

Object Detection on Embedded Systems for Traffic in Asian Countries
Bao-Hong Lai | Hsun-Ping Hsieh

Squeeze-and-Excitation network-Based Radar Object Detection With Weighted Location Fusion
Pengliang Sun | Xuetong Niu | Pengfei Sun | Kele Xu

ROD2021 Challenge: A Summary for Radar Object Detection Challenge for Autonomous Driving Applications
Yizhou Wang | Jenq-Neng Hwang | Gaoang Wang | Hui Liu | Kwang-Ju Kim | Hung-Min Hsu | Jiarui Cai | Haotian Zhang | Zhongyu Jiang | Renshu Gu

Embedded YOLO: Faster and Lighter Object Detection
Wen-Kai Wu | Chien-Yu Chen | Jiann-Shu Lee

Radar Object Detection Using Data Merging, Enhancement and Fusion
Jun Yu | Xinlong Hao | Xinjian Gao | Qiang Sun | Yuyu Liu | Peng Chang | Zhong Zhang | Fang Gao | Feng Shuang

Scene-aware Learning Network for Radar Object Detection
Zangwei Zheng | Xiangyu Yue | Kurt Keutzer | Alberto L. Sangiovanni-Vincentelli

GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization
Jia-Hong Huang | Luka Murn | Marta Mrak | Marcel Worring

Impact of Interaction Strategies on User Relevance Feedback
Omar Shahbaz Khan | Björn Þór Jónsson | Jan Zahálka | Stevan Rudinac | Marcel Worring

Automatic Baseball Pitch Overlay
Ting-Hsuan Chou | Wei-Ta Chu

Video Action Retrieval Using Action Recognition Model
Yuko Iinuma | Shin'ichi Satoh

MeTILDA: Platform for Melodic Transcription in Language Documentation and Application
Mitchell Lee | Praveena Avula | Min Chen

IR Questioner: QA-based Interactive Retrieval System
Rintaro Yanagi | Ren Togo | Takahiro Ogawa | Miki Haseyama

Reproducibility Companion Paper: Knowledge Enhanced Neural Fashion Trend Forecasting
Yunshan Ma | Yujuan Ding | Xun Yang | Lizi Liao | Wai Keung Wong | Tat-Seng Chua | Jinyoung Moon | Hong-Han Shuai

A Beneficial Dual Transformation Approach for Deep Learning Networks Used in Steel Surface Defect Detection
Fityanul Akhyar | Chih-Yang Lin | Gugan S. Kathiresan

Discrete Tchebichef Transform for Versatile Video Coding
Ka-Hou Chan | Sio Kei Im

Fire Detection using Transformer Network
Mohammad Shahid | Kai-Lung Hua

Visible-infrared Person Re-identification with Human Body Parts Assistance
Huangpeng Dai | Qing Xie | Jiachen Li | Yanchun Ma | Lin Li | Yongjian Liu

Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition
Zilong Fu | Hongtao Xie | Guoqing Jin | Junbo Guo

Contextualized Keyword Representations for Multi-modal Retinal Image Captioning
Jia-Hong Huang | Ting-Wei Wu | Marcel Worring

MSAV: An Unified Framework for Multi-view Subspace Analysis with View Consistence
Huibing Wang | Guangqi Jiang | Jinjia Peng | Xianping Fu

A Tensor Sparse Representation-Based CBMIR System for Computer-Aided Diagnosis of Focal Liver Lesions and its Pilot Trial
Jian Wang | Xian-Hua Han | Lanfen Lin | Hongjie Hu | Yen-Wei Chen

M-DFNet: Multi-phase Discriminative Feature Network for Retrieval of Focal Liver Lesions
Yingying Xu | Jing Liu | Lanfen Lin | Hongjie Hu | Ruofeng Tong | Jingsong Li | Yen-Wei Chen

M2GUDA: Multi-Metrics Graph-Based Unsupervised Domain Adaptation for Cross-Modal Hashing
Chengyuan Zhang | Zhi Zhong | Lei Zhu | Shichao Zhang | Da Cao | Jianfeng Zhang

Human Pose Estimation based on Attention Multi-resolution Network
Congcong Zhang | Ning He | Qixiang Sun | Xiaojie Yin | Ke Lu

ICDAR'21: Intelligent Cross-Data Analysis and Retrieval
Minh-Son Dao | Michael Alexander Riegler | Duc-Tien Dang-Nguyen | Cathal Gurrin | Minh-Triet Tran | Thanh-Binh Nguyen

Introduction to the Fourth Annual Lifelog Search Challenge, LSC'21
Cathal Gurrin | Björn Þór Jónsson | Klaus Schöffmann | Duc-Tien Dang-Nguyen | Jakub Lokoc | Minh-Triet Tran | Wolfgang Hürst | Luca Rossetto | Graham Healy

MMArt-ACM'21: International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia 2021
Min-Chun Hu | Ichiro Ide | Kensuke Tobitani

MMPT'21: International Joint Workshop on Multi-Modal Pre-Training for Multimedia Understanding
Bei Liu | Jianlong Fu | Shizhe Chen | Qin Jin | Alexander G. Hauptmann | Yong Rui

CEA'21: The 13th Workshop on Multimedia for Cooking and Eating Activities
Yoko Yamakata | Atsushi Hashimoto