International Conference on Multimedia Retrieval (2022)
Workshops
MAD@ICMR 2022: Proceedings of the 1st International Workshop on Multimedia AI against Disinformation, Newark, NJ, USA, June 27 - 30, 2022
Towards Generalization in Deepfake Detection
Luisa Verdoliva
Let the Chatbot Speak! Freedom of Expression and Synthetic Media
Katja De Vries
On the Generalizability of Two-dimensional Convolutional Neural Networks for Fake Speech Detection
Christoforos Papastergiopoulos
|
Anastasios Vafeiadis
|
Ioannis Papadimitriou
|
Konstantinos Votis
|
Dimitrios Tzovaras
Spectral Denoising for Microphone Classification
Luca Cuccovillo
|
Antonio Giganti
|
Paolo Bestagini
|
Patrick Aichroth
|
Stefano Tubaro
Automatic and Manual Detection of Generated News: Case Study, Limitations and Challenges
Jérémie Bogaert
|
Marie-Catherine de Marneffe
|
Antonin Descampe
|
François-Xavier Standaert
Extractive-Boolean Question Answering for Scientific Fact Checking
Loïc Rakotoson
|
Charles Letaillieur
|
Sylvain Massip
|
Fréjus A. A. Laleye
How Did Europe's Press Cover Covid-19 Vaccination News? A Five-Country Analysis
David Alonso del Barrio
|
Daniel Gática-Pérez
Automatic Detection of Bot-generated Tweets
Julien Tourille
|
Babacar Sow
|
Adrian Popescu
Cross-Forgery Analysis of Vision Transformers and CNNs for Deepfake Image Detection
Davide Alessandro Coccomini
|
Roberto Caldelli
|
Fabrizio Falchi
|
Claudio Gennaro
|
Giuseppe Amato
The MeVer DeepFake Detection Service: Lessons Learnt from Developing and Deploying in the Wild
Spiros Baxevanakis
|
Giorgos Kordopatis-Zilos
|
Panagiotis Galopoulos
|
Lazaros Apostolidis
|
Killian Levacher
|
Ipek Baris Schlicht
|
Denis Teyssou
|
Ioannis Kompatsiaris
|
Symeon Papadopoulos
Uncovering the Strength of Capsule Networks in Deepfake Detection
Dan-Cristian Stanciu
|
Bogdan Ionescu
Fake News Detection Based on Multi-Modal Classifier Ensemble
Yi Shao
|
Jiande Sun
|
Tianlin Zhang
|
Ye Jiang
|
Jianhua Ma
|
Jing Li
ICDAR@ICMR 2022: Proceedings of the 3rd ACM Workshop on Intelligent Cross-Data Analysis and Retrieval, Newark, NJ, USA, June 27 - 30, 2022
Explainable Artificial Intelligence for Human Embryo Cell Cleavage Stages Analysis
Akriti Sharma
|
Mette H. Stensen
|
Erwan Delbarre
|
Trine B. Haugen
|
Hugo Lewi Hammer
Multimodal Cheapfakes Detection by Utilizing Image Captioning for Global Context
Tuan-Vinh La
|
Quang-Tien Tran
|
Thanh-Phuc Tran
|
Anh-Duy Tran
|
Duc-Tien Dang-Nguyen
|
Minh-Son Dao
Tone Classification for Political Advertising Video using Multimodal Cues
Anh-Khoa Vo
|
Yuta Nakashima
A Hybrid Transformer Network for Detection of Risk Situations on Multimodal Life-Log Health Data
Rupayan Mallick
|
Jenny Benois-Pineau
|
Akka Zemmari
|
Marion Pech
|
Thinhinane Yebda
|
Hélène Amieva
|
Laura Middleton
Predicting High-risk Congestion Areas During Heavy Rain Using Multi Prediction Model and Maximum Periodic Frequent Pattern Algorithms
Minh-Dang Tran
|
Nazmudeen Mohamed Saleem
IoT-based Multimodal Analysis for Smart Education: Current Status, Challenges and Opportunities
Wenbin Gan
|
Minh-Son Dao
|
Koji Zettsu
|
Yuan Sun
Is More Realistic Better? A Comparison of Game Engine and GAN-based Avatars for Investigative Interviews of Children
Pegah Salehi
|
Syed Zohaib Hassan
|
Saeed Shafiee Sabet
|
Gunn Astrid Baugerud
|
Miriam Sinkerud Johnson
|
Pål Halvorsen
|
Michael A. Riegler
Towards Intellectual Property Rights Protection in Big Data
Rafik Hamza
|
Minh-Son Dao
|
Sadanori Ito
|
Koji Zettsu
DeDigi: A Privacy-by-Design Platform for Image Forensics
Chi-Hao Tran
|
Quoc-Thang Tran
|
Quynh-Chau Long-Vu
|
Hai-Son Nguyen
|
Anh-Duy Tran
|
Duc-Tien Dang-Nguyen
FedMCRNN: Federated Learning using Multiple Convolutional Recurrent Neural Networks for Sleep Quality Prediction
Tran Anh Khoa
|
Do-Van Nguyen
|
Phuoc Van Nguyen Thi
|
Koji Zettsu
Efficient Resource Allocation using Federated Learning in Cellular Networks
Son Cao Nguyen
|
Minh Hoang
|
Tinh Phuc Vo
|
Duc Ngoc Minh Dang
LSC@ICMR 2022: Proceedings of the 5th Annual on Lifelog Search Challenge, Newark, NJ, USA, June 27 - 30, 2022
An Introduction to Retrieval and Reminiscence from Lifelog Archives at NTCIR
Frank Hopfgartner
Memento 2.0: An Improved Lifelog Search Engine for LSC'22
Naushad Alam
|
Yvette Graham
|
Cathal Gurrin
MEMORIA: A Memory Enhancement and MOment RetrIeval Application for LSC 2022
Ricardo Ribiero
|
Alina Trifan
|
António J. R. Neves
LifeSeeker 4.0: An Interactive Lifelog Search Engine for LSC'22
Thao-Nhu Nguyen
|
Tu-Khiem Le
|
Van-Tu Ninh
|
Minh-Triet Tran
|
Thanh-Binh Nguyen
|
Graham Healy
|
Sinéad Smyth
|
Annalina Caputo
|
Cathal Gurrin
Flexible Interactive Retrieval SysTem 3.0 for Visual Lifelog Exploration at LSC 2022
Nhat Hoang-Xuan
|
Hoang-Phuc Trang-Trung
|
E.-Ro Nguyen
|
Thanh-Cong Le
|
Mai-Khiem Tran
|
Tu-Khiem Le
|
Van-Tu Ninh
|
Cathal Gurrin
|
Minh-Triet Tran
vitrivr at the Lifelog Search Challenge 2022
Silvan Heller
|
Luca Rossetto
|
Loris Sauter
|
Heiko Schuldt
E-Myscéal: Embedding-based Interactive Lifelog Retrieval System for LSC'22
Ly-Duyen Tran
|
Manh-Duy Nguyen
|
Binh T. Nguyen
|
Hyowon Lee
|
Liting Zhou
|
Cathal Gurrin
Multimodal Interactive Lifelog Retrieval with vitrivr-VR
Florian Spiess
|
Heiko Schuldt
Voxento 3.0: A Prototype Voice-Controlled Interactive Search Engine for Lifelog
Ahmed Alateeq
|
Mark Roantree
|
Cathal Gurrin
lifeXplore at the Lifelog Search Challenge 2022
Andreas Leibetseder
|
Daniela Stefanics
|
Klaus Schoeffmann
ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27 - 30, 2022
ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27 - 30, 2022
TransPCC: Towards Deep Point Cloud Compression via Transformers
Zujie Liang
|
Fan Liang
The Impact of Dataset Splits on Classification Performance in Medical Videos
Markus Fox
|
Klaus Schoeffmann
OSCARS: An Outlier-Sensitive Content-Based Radiography Retrieval System
Xiaoyuan Guo
|
Jiali Duan
|
Saptarshi Purkayastha
|
Hari Trivedi
|
Judy Wawira Gichoya
|
Imon Banerjee
Unseen Food Segmentation
Yuma Honbu
|
Keiji Yanai
DMPCANet: A Low Dimensional Aggregation Network for Visual Place Recognition
Yinghao Wang
|
Haonan Chen
|
Jiong Wang
|
Yingying Zhu
VideoCLIP: A Cross-Attention Model for Fast Video-Text Retrieval Task with Image CLIP
Yikang Li
|
Jenhao Hsiao
|
Chiuman Ho
Music-to-Dance Generation with Multiple Conformer
Mingao Zhang
|
Changhong Liu
|
Yong Chen
|
Zhenchun Lei
|
Mingwen Wang
OCR-oriented Master Object for Text Image Captioning
Wenliang Tang
|
Zhenzhen Hu
|
Zijie Song
|
Richang Hong
Supervised Contrastive Vehicle Quantization for Efficient Vehicle Retrieval
Yongbiao Chen
|
Kaicheng Guo
|
Fangxin Liu
|
Yusheng Huang
|
Zhengwei Qi
Fashion Style-Aware Embeddings for Clothing Image Retrieval
Rino Naka
|
Marie Katsurai
|
Keisuke Yanagi
|
Ryosuke Goto
Multiple Biological Granularities Network for Person Re-Identification
Shuyuan Tu
|
Tianzhen Guan
|
Li Kuang
TriReID: Towards Multi-Modal Person Re-Identification via Descriptive Fusion Model
Yajing Zhai
|
Yawen Zeng
|
Da Cao
|
Shaofei Lu
Temporal-Consistent Visual Clue Attentive Network for Video-Based Person Re-Identification
Bingliang Jiao
|
Liying Gao
|
Peng Wang
Pluggable Weakly-Supervised Cross-View Learning for Accurate Vehicle Re-Identification
Lu Yang
|
Hongbang Liu
|
Lingqiao Liu
|
Jinghao Zhou
|
Lei Zhang
|
Peng Wang
|
Yanning Zhang
An Effective Two-way Metapath Encoder over Heterogeneous Information Network for Recommendation
Yanbin Jiang
|
Huifang Ma
|
Xiaohui Zhang
|
Zhixin Li
|
Liang Chang
Multi-Modal Contrastive Pre-training for Recommendation
Zhuang Liu
|
Yunpu Ma
|
Matthias Schubert
|
Yuanxin Ouyang
|
Zhang Xiong
Flexible Order Aware Sequential Recommendation
Mingda Qian
|
Xiaoyan Gu
|
Lingyang Chu
|
Feifei Dai
|
Haihui Fan
|
Bo Li
Sequential Intention-aware Recommender based on User Interaction Graph
Jinpeng Chen
|
Yuan Cao
|
Fan Zhang
|
Pengfei Sun
|
Kaimin Wei
TransHash: Transformer-based Hamming Hashing for Efficient Image Retrieval
Yongbiao Chen
|
Sheng Zhang
|
Fangxin Liu
|
Zhigang Chang
|
Mang Ye
|
Zhengwei Qi
Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval
Zhihao Fan
|
Zhongyu Wei
|
Zejun Li
|
Siyuan Wang
|
Haijun Shan
|
Xuanjing Huang
|
Jianqing Fan
Relevance-based Margin for Contrastively-trained Video Retrieval Models
Alex Falcon
|
Swathikiran Sudhakaran
|
Giuseppe Serra
|
Sergio Escalera
|
Oswald Lanz
CLIP4Hashing: Unsupervised Deep Hashing for Cross-Modal Video-Text Retrieval
Yaoxin Zhuo
|
Yikang Li
|
Jenhao Hsiao
|
Chiuman Ho
|
Baoxin Li
Nearest Neighbor Search with Compact Codes: A Decoder Perspective
Kenza Amara
|
Matthijs Douze
|
Alexandre Sablayrolles
|
Hervé Jégou
Teaching a New Dog Old Tricks: Contrastive Random Walks in Videos with Unsupervised Priors
Jan Schutte
|
Pascal Mettes
FedNKD: A Dependable Federated Learning Using Fine-tuned Random Noise and Knowledge Distillation
Shaoxiong Zhu
|
Qi Qi
|
Zirui Zhuang
|
Jingyu Wang
|
Haifeng Sun
|
Jianxin Liao
Weakly Supervised Fine-grained Recognition based on Combined Learning for Small Data and Coarse Label
Anqi Hu
|
Zhengxing Sun
|
Qian Li
Real-Time Deepfake System for Live Streaming
Yifei Fan
|
Modan Xie
|
Peihan Wu
|
Gang Yang
EmoMTB: Emotion-aware Music Tower Blocks
Alessandro B. Melchiorre
|
David Penz
|
Christian Ganhör
|
Oleg Lesota
|
Vasco Fragoso
|
Florian Friztl
|
Emilia Parada-Cabaleiro
|
Franz Schubert
|
Markus Schedl
ViRMA: Virtual Reality Multimedia Analytics
Aaron Duane
|
Björn Þór Jónsson
Person Search by Uncertain Attributes
Tingting Dong
|
Jianquan Liu
Dual-Level Decoupled Transformer for Video Captioning
Yiqi Gao
|
Xinglin Hou
|
Wei Suo
|
Mengyang Sun
|
Tiezheng Ge
|
Yuning Jiang
|
Peng Wang
Cross-Modal Retrieval between Event-Dense Text and Image
Zhongwei Xie
|
Lin Li
|
Luo Zhong
|
Jianquan Liu
|
Ling Liu
Learning Hierarchical Semantic Correspondences for Cross-Modal Image-Text Retrieval
Sheng Zeng
|
Changhong Liu
|
Jun Zhou
|
Yong Chen
|
Aiwen Jiang
|
Hanxi Li
Ingredient-enriched Recipe Generation from Cooking Videos
Jianlong Wu
|
Liangming Pan
|
Jingjing Chen
|
Yu-Gang Jiang
Cross-lingual Adaptation for Recipe Retrieval with Mixup
Bin Zhu
|
Chong-Wah Ngo
|
Jingjing Chen
|
Wing Kwong Chan
Disentangled Representations and Hierarchical Refinement of Multi-Granularity Features for Text-to-Image Synthesis
Pei Dong
|
Lei Wu
|
Lei Meng
|
Xiangxu Meng
Style-woven Attention Network for Zero-shot Ink Wash Painting Style Transfer
Haochen Sun
|
Lei Wu
|
Xiang Li
|
Xiangxu Meng
Automatic Visual Recognition of Unexploded Ordnances Using Supervised Deep Learning
Georgios Begkas
|
Panagiotis Giannakeris
|
Konstantinos Ioannidis
|
Georgios Kalpakis
|
Theodora Tsikrika
|
Stefanos Vrochidis
|
Ioannis Kompatsiaris
Generating Topological Structure of Floorplans from Room Attributes
Yu Yin
|
Will Hutchcroft
|
Naji Khosravan
|
Ivaylo Boyadzhiev
|
Yun Fu
|
Sing Bing Kang
MultiCLU: Multi-stage Context Learning and Utilization for Storefront Accessibility Detection and Evaluation
Xuan Wang
|
Jiajun Chen
|
Hao Tang
|
Zhigang Zhu
UF-VTON: Toward User-Friendly Virtual Try-On Network
Yuan Chang
|
Tao Peng
|
Ruhan He
|
Xinrong Hu
|
Junping Liu
|
Zili Zhang
|
Minghua Jiang
Learning Sample Importance for Cross-Scenario Video Temporal Grounding
Peijun Bao
|
Yadong Mu
Efficient Linear Attention for Fast and Accurate Keypoint Matching
Suwichaya Suwanwimolkul
|
Satoshi Komorita
Video2Subtitle: Matching Weakly-Synchronized Sequences via Dynamic Temporal Alignment
Ben Xue
|
Chenchen Liu
|
Yadong Mu
Dual-Channel Localization Networks for Moment Retrieval with Natural Language
Bolin Zhang
|
Bin Jiang
|
Chao Yang
|
Liang Pang
Phrase-level Prediction for Video Temporal Localization
Sizhe Li
|
Chang Li
|
Minghang Zheng
|
Yang Liu
Joint Modality Synergy and Spatio-temporal Cue Purification for Moment Localization
Xingyu Shen
|
Long Lan
|
Huibin Tan
|
Xiang Zhang
|
Xurui Ma
|
Zhigang Luo
HybridVocab: Towards Multi-Modal Machine Translation via Multi-Aspect Alignment
Ru Peng
|
Yawen Zeng
|
Junbo Zhao
Improving Image Captioning via Enhancing Dual-Side Context Awareness
Yiqi Gao
|
Ning Wang
|
Wei Suo
|
Mengyang Sun
|
Peng Wang
Improve Image Captioning by Modeling Dynamic Scene Graph Extension
Minghao Geng
|
Qingjie Zhao
Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of the Video Frames
Evlampios E. Apostolidis
|
Georgios Balaouras
|
Vasileios Mezaris
|
Ioannis Patras
Fashion Image Search via Anchor-Free Detector
Shanchuan Gao
|
Fankai Zeng
|
Lu Cheng
|
Jicong Fan
|
Mingbo Zhao
Unsupervised Contrastive Masking for Visual Haze Classification
Jingyu Li
|
Haokai Ma
|
Xiangxian Li
|
Zhuang Qi
|
Lei Meng
|
Xiangxu Meng
MuLER: Multiplet-Loss for Emotion Recognition
Anwer Slimi
|
Mounir Zrigui
|
Henri Nicolas
STAFNet: Swin Transformer Based Anchor-Free Network for Detection of Forward-looking Sonar Imagery
Xingyu Zhu
|
Yingshuo Liang
|
Jianlei Zhang
|
Zengqiang Chen
Camouflaged Poisoning Attack on Graph Neural Networks
Chao Jiang
|
Yi He
|
Richard Chapman
|
Hongyi Wu
Accelerated Sign Hunter: A Sign-based Black-box Attack via Branch-Prune Strategy and Stabilized Hierarchical Search
Siyuan Li
|
Guangji Huang
|
Xing Xu
|
Yang Yang
|
Fumin Shen
DiGAN: Directional Generative Adversarial Network for Object Transfiguration
Zhen Luo
|
Yingfang Zhang
|
Peihao Zhong
|
Jingjing Chen
|
Donglong Chen
GIO: A Timbre-informed Approach for Pitch Tracking in Highly Noisy Environments
Xiaoheng Sun
|
Xia Liang
|
Qiqi He
|
Bilei Zhu
|
Zejun Ma
Source-free Temporal Attentive Domain Adaptation for Video Action Recognition
Peipeng Chen
|
Andy J. Ma
Review of Deep Learning Models for Spine Segmentation
Neng Zhou
|
Hairu Wen
|
Yi Wang
|
Yang Liu
|
Longfei Zhou
3D-Augmented Contrastive Knowledge Distillation for Image-based Object Pose Estimation
Zhidan Liu
|
Zhen Xing
|
Xiangdong Zhou
|
Yijiang Chen
|
Guichun Zhou
Selective Hypergraph Convolutional Networks for Skeleton-based Action Recognition
Yiran Zhu
|
Guangji Huang
|
Xing Xu
|
Yanli Ji
|
Fumin Shen
Self-Lifting: A Novel Framework for Unsupervised Voice-Face Association Learning
Guangyu Chen
|
Deyuan Zhang
|
Tao Liu
|
Xiaoyong Du
Revisiting Performance Measures for Cross-Modal Hashing
Hongya Wang
|
Shunxin Dai
|
Ming Du
|
Bo Xu
|
Mingyong Li
Local Slot Attention for Vision and Language Navigation
Yifeng Zhuang
|
Qiang Sun
|
Yanwei Fu
|
Lifeng Chen
|
Xiangyang Xue
Cross-Pixel Dependency with Boundary-Feature Transformation for Weakly Supervised Semantic Segmentation
Yuhui Guo
|
Xun Liang
|
Tang Hui
|
Bo Wu
|
Xiangping Zheng
Mobile Emotion Recognition via Multiple Physiological Signals using Convolution-augmented Transformer
Kangning Yang
|
Benjamin Tag
|
Yue Gu
|
Chaofan Wang
|
Tilman Dingler
|
Greg Wadley
|
Jorge Gonçalves
VAC-Net: Visual Attention Consistency Network for Person Re-identification
Weidong Shi
|
Yunzhou Zhang
|
Shangdong Zhu
|
Yixiu Liu
|
Sonya Coleman
|
Dermot Kerr
MFGAN: A Lightweight Fast Multi-task Multi-scale Feature-fusion Model based on GAN
Lijia Deng
|
Yu-Dong Zhang
Adaptive Temporal Grouping for Black-box Adversarial Attacks on Videos
Zhipeng Wei
|
Jingjing Chen
|
Hao Zhang
|
Linxi Jiang
|
Yu-Gang Jiang
Parallelism Network with Partial-aware and Cross-correlated Transformer for Vehicle Re-identification
Guangqi Jiang
|
Huibing Wang
|
Jinjia Peng
|
Xianping Fu
Motor Learning based on Presentation of a Tentative Goal
Siqi Sun
|
Yongqing Sun
|
Mitsuhiro Goto
|
Shigekuni Kondo
|
Dan Mikami
|
Susumu Yamamoto
Extracting Precedence Relations between Video Lectures in MOOCs
Kui Xiao
|
Youheng Bai
|
Yan Zhang
M2TR: Multi-modal Multi-scale Transformers for Deepfake Detection
Junke Wang
|
Zuxuan Wu
|
Wenhao Ouyang
|
Xintong Han
|
Jingjing Chen
|
Yu-Gang Jiang
|
Ser-Nam Li
Blindfold Attention: Novel Mask Strategy for Facial Expression Recognition
Bo Fu
|
Yuanxin Mao
|
Shilin Fu
|
Yonggong Ren
|
Zhongxuan Luo
MSSPQ: Multiple Semantic Structure-Preserving Quantization for Cross-Modal Retrieval
Lei Zhu
|
Liewu Cai
|
Jiayu Song
|
Xinghui Zhu
|
Chengyuan Zhang
|
Shichao Zhang
Lesion Localization in OCT by Semi-Supervised Object Detection
Yue Wu
|
Yang Zhou
|
Jianchun Zhao
|
Jingyuan Yang
|
Weihong Yu
|
Youxin Chen
|
Xirong Li
Weakly Supervised Pediatric Bone Age Assessment Using Ultrasonic Images via Automatic Anatomical RoI Detection
Yunyan Yan
|
Chuanbin Liu
|
Hongtao Xie
|
Sicheng Zhang
|
Zhendong Mao
I2-Net: Intra- and Inter-scale Collaborative Learning Network for Abdominal Multi-organ Segmentation
Chao Suo
|
Xuanya Li
|
Donghui Tan
|
Yuan Zhang
|
Xieping Gao
SA-NAS-BFNR: Spatiotemporal Attention Neural Architecture Search for Task-based Brain Functional Network Representation
Fenxia Duan
|
Chunhong Cao
|
Xieping Gao
Weakly-supervised Cerebrovascular Segmentation Network with Shape Prior and Model Indicator
Qian Wu
|
Yufei Chen
|
Ning Huang
|
Xiaodong Yue
FreqCAM: Frequent Class Activation Map for Weakly Supervised Object Localization
Runsheng Zhang
Reproducibility Companion Paper: Human Object Interaction Detection via Multi-level Conditioned Network
Yunqing He
|
Xu Sun
|
Hui Jiang
|
Tongwei Ren
|
Gangshan Wu
|
Maria Sinziana Astefanoaei
|
Andreas Leibetseder
Introduction to the Fifth Annual Lifelog Search Challenge, LSC'22
Cathal Gurrin
|
Liting Zhou
|
Graham Healy
|
Björn Þór Jónsson
|
Duc-Tien Dang-Nguyen
|
Jakub Lokoc
|
Minh-Triet Tran
|
Wolfgang Hürst
|
Luca Rossetto
|
Klaus Schöffmann
MAD '22 Workshop: Multimedia AI against Disinformation
Bogdan Ionescu
|
Giorgos Kordopatis-Zilos
|
Adrian Popescu
|
Luca Cuccovillo
|
Symeon Papadopoulos
ICDAR'22: Intelligent Cross-Data Analysis and Retrieval
Minh-Son Dao
|
Michael Alexander Riegler
|
Duc-Tien Dang-Nguyen
|
Cathal Gurrin
|
Yuta Nakashima
|
Mianxiong Dong
MMArt-ACM 2022: 5th Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia
Naoko Nitta
|
Anita Min-Chun Hu
|
Kensuke Tobitani