ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27 - 30, 2022
Vincent Oria, Maria Luisa Sapino, Shin'ichi Satoh, Brigitte Kerhervé, Wen-Huang Cheng, Ichiro Ide, Vivek K. Singh (Editors)
- Anthology ID:
- 2022.mir_conference-2022
- Year:
- 2022
- Venue:
- mir_conference
- Publisher:
- ACM
- URL:
- https://doi.org/10.1145/3512527
- DOI:
- 10.1145/3512527
- DBLP:
- conf/mir/2022
ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27 - 30, 2022
TransPCC: Towards Deep Point Cloud Compression via Transformers
Zujie Liang
|
Fan Liang
The Impact of Dataset Splits on Classification Performance in Medical Videos
Markus Fox
|
Klaus Schoeffmann
OSCARS: An Outlier-Sensitive Content-Based Radiography Retrieval System
Xiaoyuan Guo
|
Jiali Duan
|
Saptarshi Purkayastha
|
Hari Trivedi
|
Judy Wawira Gichoya
|
Imon Banerjee
Unseen Food Segmentation
Yuma Honbu
|
Keiji Yanai
DMPCANet: A Low Dimensional Aggregation Network for Visual Place Recognition
Yinghao Wang
|
Haonan Chen
|
Jiong Wang
|
Yingying Zhu
VideoCLIP: A Cross-Attention Model for Fast Video-Text Retrieval Task with Image CLIP
Yikang Li
|
Jenhao Hsiao
|
Chiuman Ho
Music-to-Dance Generation with Multiple Conformer
Mingao Zhang
|
Changhong Liu
|
Yong Chen
|
Zhenchun Lei
|
Mingwen Wang
OCR-oriented Master Object for Text Image Captioning
Wenliang Tang
|
Zhenzhen Hu
|
Zijie Song
|
Richang Hong
Supervised Contrastive Vehicle Quantization for Efficient Vehicle Retrieval
Yongbiao Chen
|
Kaicheng Guo
|
Fangxin Liu
|
Yusheng Huang
|
Zhengwei Qi
Fashion Style-Aware Embeddings for Clothing Image Retrieval
Rino Naka
|
Marie Katsurai
|
Keisuke Yanagi
|
Ryosuke Goto
Multiple Biological Granularities Network for Person Re-Identification
Shuyuan Tu
|
Tianzhen Guan
|
Li Kuang
TriReID: Towards Multi-Modal Person Re-Identification via Descriptive Fusion Model
Yajing Zhai
|
Yawen Zeng
|
Da Cao
|
Shaofei Lu
Temporal-Consistent Visual Clue Attentive Network for Video-Based Person Re-Identification
Bingliang Jiao
|
Liying Gao
|
Peng Wang
Pluggable Weakly-Supervised Cross-View Learning for Accurate Vehicle Re-Identification
Lu Yang
|
Hongbang Liu
|
Lingqiao Liu
|
Jinghao Zhou
|
Lei Zhang
|
Peng Wang
|
Yanning Zhang
An Effective Two-way Metapath Encoder over Heterogeneous Information Network for Recommendation
Yanbin Jiang
|
Huifang Ma
|
Xiaohui Zhang
|
Zhixin Li
|
Liang Chang
Multi-Modal Contrastive Pre-training for Recommendation
Zhuang Liu
|
Yunpu Ma
|
Matthias Schubert
|
Yuanxin Ouyang
|
Zhang Xiong
Flexible Order Aware Sequential Recommendation
Mingda Qian
|
Xiaoyan Gu
|
Lingyang Chu
|
Feifei Dai
|
Haihui Fan
|
Bo Li
Sequential Intention-aware Recommender based on User Interaction Graph
Jinpeng Chen
|
Yuan Cao
|
Fan Zhang
|
Pengfei Sun
|
Kaimin Wei
TransHash: Transformer-based Hamming Hashing for Efficient Image Retrieval
Yongbiao Chen
|
Sheng Zhang
|
Fangxin Liu
|
Zhigang Chang
|
Mang Ye
|
Zhengwei Qi
Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval
Zhihao Fan
|
Zhongyu Wei
|
Zejun Li
|
Siyuan Wang
|
Haijun Shan
|
Xuanjing Huang
|
Jianqing Fan
Relevance-based Margin for Contrastively-trained Video Retrieval Models
Alex Falcon
|
Swathikiran Sudhakaran
|
Giuseppe Serra
|
Sergio Escalera
|
Oswald Lanz
CLIP4Hashing: Unsupervised Deep Hashing for Cross-Modal Video-Text Retrieval
Yaoxin Zhuo
|
Yikang Li
|
Jenhao Hsiao
|
Chiuman Ho
|
Baoxin Li
Nearest Neighbor Search with Compact Codes: A Decoder Perspective
Kenza Amara
|
Matthijs Douze
|
Alexandre Sablayrolles
|
Hervé Jégou
Teaching a New Dog Old Tricks: Contrastive Random Walks in Videos with Unsupervised Priors
Jan Schutte
|
Pascal Mettes
FedNKD: A Dependable Federated Learning Using Fine-tuned Random Noise and Knowledge Distillation
Shaoxiong Zhu
|
Qi Qi
|
Zirui Zhuang
|
Jingyu Wang
|
Haifeng Sun
|
Jianxin Liao
Weakly Supervised Fine-grained Recognition based on Combined Learning for Small Data and Coarse Label
Anqi Hu
|
Zhengxing Sun
|
Qian Li
Real-Time Deepfake System for Live Streaming
Yifei Fan
|
Modan Xie
|
Peihan Wu
|
Gang Yang
EmoMTB: Emotion-aware Music Tower Blocks
Alessandro B. Melchiorre
|
David Penz
|
Christian Ganhör
|
Oleg Lesota
|
Vasco Fragoso
|
Florian Friztl
|
Emilia Parada-Cabaleiro
|
Franz Schubert
|
Markus Schedl
ViRMA: Virtual Reality Multimedia Analytics
Aaron Duane
|
Björn Þór Jónsson
Person Search by Uncertain Attributes
Tingting Dong
|
Jianquan Liu
Dual-Level Decoupled Transformer for Video Captioning
Yiqi Gao
|
Xinglin Hou
|
Wei Suo
|
Mengyang Sun
|
Tiezheng Ge
|
Yuning Jiang
|
Peng Wang
Cross-Modal Retrieval between Event-Dense Text and Image
Zhongwei Xie
|
Lin Li
|
Luo Zhong
|
Jianquan Liu
|
Ling Liu
Learning Hierarchical Semantic Correspondences for Cross-Modal Image-Text Retrieval
Sheng Zeng
|
Changhong Liu
|
Jun Zhou
|
Yong Chen
|
Aiwen Jiang
|
Hanxi Li
Ingredient-enriched Recipe Generation from Cooking Videos
Jianlong Wu
|
Liangming Pan
|
Jingjing Chen
|
Yu-Gang Jiang
Cross-lingual Adaptation for Recipe Retrieval with Mixup
Bin Zhu
|
Chong-Wah Ngo
|
Jingjing Chen
|
Wing Kwong Chan
Disentangled Representations and Hierarchical Refinement of Multi-Granularity Features for Text-to-Image Synthesis
Pei Dong
|
Lei Wu
|
Lei Meng
|
Xiangxu Meng
Style-woven Attention Network for Zero-shot Ink Wash Painting Style Transfer
Haochen Sun
|
Lei Wu
|
Xiang Li
|
Xiangxu Meng
Automatic Visual Recognition of Unexploded Ordnances Using Supervised Deep Learning
Georgios Begkas
|
Panagiotis Giannakeris
|
Konstantinos Ioannidis
|
Georgios Kalpakis
|
Theodora Tsikrika
|
Stefanos Vrochidis
|
Ioannis Kompatsiaris
Generating Topological Structure of Floorplans from Room Attributes
Yu Yin
|
Will Hutchcroft
|
Naji Khosravan
|
Ivaylo Boyadzhiev
|
Yun Fu
|
Sing Bing Kang
MultiCLU: Multi-stage Context Learning and Utilization for Storefront Accessibility Detection and Evaluation
Xuan Wang
|
Jiajun Chen
|
Hao Tang
|
Zhigang Zhu
UF-VTON: Toward User-Friendly Virtual Try-On Network
Yuan Chang
|
Tao Peng
|
Ruhan He
|
Xinrong Hu
|
Junping Liu
|
Zili Zhang
|
Minghua Jiang
Learning Sample Importance for Cross-Scenario Video Temporal Grounding
Peijun Bao
|
Yadong Mu
Efficient Linear Attention for Fast and Accurate Keypoint Matching
Suwichaya Suwanwimolkul
|
Satoshi Komorita
Video2Subtitle: Matching Weakly-Synchronized Sequences via Dynamic Temporal Alignment
Ben Xue
|
Chenchen Liu
|
Yadong Mu
Dual-Channel Localization Networks for Moment Retrieval with Natural Language
Bolin Zhang
|
Bin Jiang
|
Chao Yang
|
Liang Pang
Phrase-level Prediction for Video Temporal Localization
Sizhe Li
|
Chang Li
|
Minghang Zheng
|
Yang Liu
Joint Modality Synergy and Spatio-temporal Cue Purification for Moment Localization
Xingyu Shen
|
Long Lan
|
Huibin Tan
|
Xiang Zhang
|
Xurui Ma
|
Zhigang Luo
HybridVocab: Towards Multi-Modal Machine Translation via Multi-Aspect Alignment
Ru Peng
|
Yawen Zeng
|
Junbo Zhao
Improving Image Captioning via Enhancing Dual-Side Context Awareness
Yiqi Gao
|
Ning Wang
|
Wei Suo
|
Mengyang Sun
|
Peng Wang
Improve Image Captioning by Modeling Dynamic Scene Graph Extension
Minghao Geng
|
Qingjie Zhao
Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of the Video Frames
Evlampios E. Apostolidis
|
Georgios Balaouras
|
Vasileios Mezaris
|
Ioannis Patras
Fashion Image Search via Anchor-Free Detector
Shanchuan Gao
|
Fankai Zeng
|
Lu Cheng
|
Jicong Fan
|
Mingbo Zhao
Unsupervised Contrastive Masking for Visual Haze Classification
Jingyu Li
|
Haokai Ma
|
Xiangxian Li
|
Zhuang Qi
|
Lei Meng
|
Xiangxu Meng
MuLER: Multiplet-Loss for Emotion Recognition
Anwer Slimi
|
Mounir Zrigui
|
Henri Nicolas
STAFNet: Swin Transformer Based Anchor-Free Network for Detection of Forward-looking Sonar Imagery
Xingyu Zhu
|
Yingshuo Liang
|
Jianlei Zhang
|
Zengqiang Chen
Camouflaged Poisoning Attack on Graph Neural Networks
Chao Jiang
|
Yi He
|
Richard Chapman
|
Hongyi Wu
Accelerated Sign Hunter: A Sign-based Black-box Attack via Branch-Prune Strategy and Stabilized Hierarchical Search
Siyuan Li
|
Guangji Huang
|
Xing Xu
|
Yang Yang
|
Fumin Shen
DiGAN: Directional Generative Adversarial Network for Object Transfiguration
Zhen Luo
|
Yingfang Zhang
|
Peihao Zhong
|
Jingjing Chen
|
Donglong Chen
GIO: A Timbre-informed Approach for Pitch Tracking in Highly Noisy Environments
Xiaoheng Sun
|
Xia Liang
|
Qiqi He
|
Bilei Zhu
|
Zejun Ma
Source-free Temporal Attentive Domain Adaptation for Video Action Recognition
Peipeng Chen
|
Andy J. Ma
Review of Deep Learning Models for Spine Segmentation
Neng Zhou
|
Hairu Wen
|
Yi Wang
|
Yang Liu
|
Longfei Zhou
3D-Augmented Contrastive Knowledge Distillation for Image-based Object Pose Estimation
Zhidan Liu
|
Zhen Xing
|
Xiangdong Zhou
|
Yijiang Chen
|
Guichun Zhou
Selective Hypergraph Convolutional Networks for Skeleton-based Action Recognition
Yiran Zhu
|
Guangji Huang
|
Xing Xu
|
Yanli Ji
|
Fumin Shen
Self-Lifting: A Novel Framework for Unsupervised Voice-Face Association Learning
Guangyu Chen
|
Deyuan Zhang
|
Tao Liu
|
Xiaoyong Du
Revisiting Performance Measures for Cross-Modal Hashing
Hongya Wang
|
Shunxin Dai
|
Ming Du
|
Bo Xu
|
Mingyong Li
Local Slot Attention for Vision and Language Navigation
Yifeng Zhuang
|
Qiang Sun
|
Yanwei Fu
|
Lifeng Chen
|
Xiangyang Xue
Cross-Pixel Dependency with Boundary-Feature Transformation for Weakly Supervised Semantic Segmentation
Yuhui Guo
|
Xun Liang
|
Tang Hui
|
Bo Wu
|
Xiangping Zheng
Mobile Emotion Recognition via Multiple Physiological Signals using Convolution-augmented Transformer
Kangning Yang
|
Benjamin Tag
|
Yue Gu
|
Chaofan Wang
|
Tilman Dingler
|
Greg Wadley
|
Jorge Gonçalves
VAC-Net: Visual Attention Consistency Network for Person Re-identification
Weidong Shi
|
Yunzhou Zhang
|
Shangdong Zhu
|
Yixiu Liu
|
Sonya Coleman
|
Dermot Kerr
MFGAN: A Lightweight Fast Multi-task Multi-scale Feature-fusion Model based on GAN
Lijia Deng
|
Yu-Dong Zhang
Adaptive Temporal Grouping for Black-box Adversarial Attacks on Videos
Zhipeng Wei
|
Jingjing Chen
|
Hao Zhang
|
Linxi Jiang
|
Yu-Gang Jiang
Parallelism Network with Partial-aware and Cross-correlated Transformer for Vehicle Re-identification
Guangqi Jiang
|
Huibing Wang
|
Jinjia Peng
|
Xianping Fu
Motor Learning based on Presentation of a Tentative Goal
Siqi Sun
|
Yongqing Sun
|
Mitsuhiro Goto
|
Shigekuni Kondo
|
Dan Mikami
|
Susumu Yamamoto
Extracting Precedence Relations between Video Lectures in MOOCs
Kui Xiao
|
Youheng Bai
|
Yan Zhang
M2TR: Multi-modal Multi-scale Transformers for Deepfake Detection
Junke Wang
|
Zuxuan Wu
|
Wenhao Ouyang
|
Xintong Han
|
Jingjing Chen
|
Yu-Gang Jiang
|
Ser-Nam Li
Blindfold Attention: Novel Mask Strategy for Facial Expression Recognition
Bo Fu
|
Yuanxin Mao
|
Shilin Fu
|
Yonggong Ren
|
Zhongxuan Luo
MSSPQ: Multiple Semantic Structure-Preserving Quantization for Cross-Modal Retrieval
Lei Zhu
|
Liewu Cai
|
Jiayu Song
|
Xinghui Zhu
|
Chengyuan Zhang
|
Shichao Zhang
Lesion Localization in OCT by Semi-Supervised Object Detection
Yue Wu
|
Yang Zhou
|
Jianchun Zhao
|
Jingyuan Yang
|
Weihong Yu
|
Youxin Chen
|
Xirong Li
Weakly Supervised Pediatric Bone Age Assessment Using Ultrasonic Images via Automatic Anatomical RoI Detection
Yunyan Yan
|
Chuanbin Liu
|
Hongtao Xie
|
Sicheng Zhang
|
Zhendong Mao
I2-Net: Intra- and Inter-scale Collaborative Learning Network for Abdominal Multi-organ Segmentation
Chao Suo
|
Xuanya Li
|
Donghui Tan
|
Yuan Zhang
|
Xieping Gao
SA-NAS-BFNR: Spatiotemporal Attention Neural Architecture Search for Task-based Brain Functional Network Representation
Fenxia Duan
|
Chunhong Cao
|
Xieping Gao
Weakly-supervised Cerebrovascular Segmentation Network with Shape Prior and Model Indicator
Qian Wu
|
Yufei Chen
|
Ning Huang
|
Xiaodong Yue
FreqCAM: Frequent Class Activation Map for Weakly Supervised Object Localization
Runsheng Zhang
Reproducibility Companion Paper: Human Object Interaction Detection via Multi-level Conditioned Network
Yunqing He
|
Xu Sun
|
Hui Jiang
|
Tongwei Ren
|
Gangshan Wu
|
Maria Sinziana Astefanoaei
|
Andreas Leibetseder
Introduction to the Fifth Annual Lifelog Search Challenge, LSC'22
Cathal Gurrin
|
Liting Zhou
|
Graham Healy
|
Björn Þór Jónsson
|
Duc-Tien Dang-Nguyen
|
Jakub Lokoc
|
Minh-Triet Tran
|
Wolfgang Hürst
|
Luca Rossetto
|
Klaus Schöffmann
MAD '22 Workshop: Multimedia AI against Disinformation
Bogdan Ionescu
|
Giorgos Kordopatis-Zilos
|
Adrian Popescu
|
Luca Cuccovillo
|
Symeon Papadopoulos
ICDAR'22: Intelligent Cross-Data Analysis and Retrieval
Minh-Son Dao
|
Michael Alexander Riegler
|
Duc-Tien Dang-Nguyen
|
Cathal Gurrin
|
Yuta Nakashima
|
Mianxiong Dong
MMArt-ACM 2022: 5th Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia
Naoko Nitta
|
Anita Min-Chun Hu
|
Kensuke Tobitani