International Journal of Multimedia Information Retrieval (2022) - IR Anthology

Main » IJMIR »

International Journal of Multimedia Information Retrieval (2022)

Content

2022 Volume 11 Issue 4 20 papers
2022 Volume 11 Issue 3 11 papers
2022 Volume 11 Issue 2 8 papers
2022 Volume 11 Issue 1 5 papers

2022 Volume 11 Issue 4

doi dblp
Contrastive self-supervised learning: review, progress, challenges and future research directions
Pranjal Kumar | Piyush Rawat | Siddhartha Chauhan

doi dblp
Human pose estimation using deep learning: review, methodologies, progress and future research directions
Pranjal Kumar | Siddhartha Chauhan | Lalit Kumar Awasthi

doi dblp
Special issue on cross-modal retrieval and analysis
Jianlong Wu | Richang Hong | Qi Tian

doi dblp
Prototype local-global alignment network for image-text retrieval
Lingtao Meng | Feifei Zhang | Xi Zhang | Changsheng Xu

doi dblp
Who is gambling? Finding cryptocurrency gamblers using multi-modal retrieval methods
Zhengjie Huang | Zhenguang Liu | Jianhai Chen | Qinming He | Shuang Wu | Lei Zhu | Meng Wang

doi dblp
Your heart rate betrays you: multimodal learning with spatio-temporal fusion networks for micro-expression recognition
Ren Zhang | Ning He | Shengjie Liu | Ying Wu | Kang Yan | Yuzhe He | Ke Lu

doi dblp
Multi-aware coreference relation network for visual dialog
Zefan Zhang | Tianling Jiang | Chunping Liu | Yi Ji

doi dblp
Video deblurring and flow-guided feature aggregation for obstacle detection in agricultural videos
Keyang Cheng | Xuesen Zhu | Yongzhao Zhan | Yunshen Pei

doi dblp
TCKGE: Transformers with contrastive learning for knowledge graph embedding
Xiaowei Zhang | Quan Fang | Jun Hu | Shengsheng Qian | Changsheng Xu

doi dblp
FDAM: full-dimension attention module for deep convolutional neural networks
Silin Cai | Changping Wang | Jiajun Ding | Jun Yu | Jianping Fan

doi dblp
FCT: fusing CNN and transformer for scene classification
Yuxiang Xie | Jie Yan | Lai Kang | Yanming Guo | Jiahui Zhang | Xidao Luan

doi dblp
Semantic-aware visual scene representation
Mohammad Javad Parseh | Mohammad Rahmanimanesh | Parviz Keshavarzi | Zohreh Azimifar

doi dblp
Generative adversarial networks for 2D-based CNN pose-invariant face recognition
Mohamed Kas | Youssef El Merabet | Yassine Ruichek | Rochdi Messoussi

doi dblp
A novel method for video shot boundary detection using CNN-LSTM approach
Benoughidene Abdel Halim | Titouna Faiza

doi dblp
Visual and semantic ensemble for scene text recognition with gated dual mutual attention
Zhiguang Liu | Liangwei Wang | Jian Qiao

doi dblp
MHA-WoML: Multi-head attention and Wasserstein-OT for few-shot learning
Junyan Yang | Jie Jiang | Yanming Guo

doi dblp
Gender classification from face images using central difference convolutional networks
Mohammadreza Sheikh Fathollahi | Rezvan Heidari

doi dblp
Tri-RAT: optimizing the attention scores for image captioning
You Yang | Yongzhi An | Juntao Hu | Longyue Pan

doi dblp
Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products
Stefanos-Iordanis Papadopoulos | Christos Koutlis | Symeon Papadopoulos | Ioannis Kompatsiaris

doi dblp
Similar interior coordination image retrieval with multi-view features
Ren Togo | Yuki Honma | Maiku Abe | Takahiro Ogawa | Miki Haseyama

2022 Volume 11 Issue 3

doi dblp
Cross-domain image retrieval: methods and applications
Xiaoping Zhou | Xiangyu Han | Haoran Li | Jia Wang | Xun Liang

doi dblp
A literature review and perspectives in deepfakes: generation, detection, and applications
Deepak Dagar | Dinesh Kumar Vishwakarma

doi dblp
Text detection, recognition, and script identification in natural scene images: a Review
Veronica Naosekpam | Nilkanta Sahu

doi dblp
Organ segmentation from computed tomography images using the 3D convolutional neural network: a systematic review
Ademola Enitan Ilesanmi | Taiwo Ilesanmi | Oluwagbenga Paul Idowu | Drew A. Torigian | Jayaram K. Udupa

doi dblp
Generative adversarial networks and its applications in the biomedical image segmentation: a comprehensive survey
Ahmed Iqbal | Muhammad Sharif | Mussarat Yasmin | Mudassar Raza | Shabib Aftab

doi dblp
Semantic-enhanced discriminative embedding learning for cross-modal retrieval
Hao Pan | Jun Huang

doi dblp
Music emotion recognition based on segment-level two-stage learning
Na He | Sam Ferguson

doi dblp
RGBD deep multi-scale network for background subtraction
Ihssane Houhou | Athmane Zitouni | Yassine Ruichek | Salah Eddine Bekhouche | Mohamed Kas | Abdelmalik Taleb-Ahmed

doi dblp
InceptionDepth-wiseYOLOv2: improved implementation of YOLO framework for pedestrian detection
Sweta Panigrahi | U. S. N. Raju

doi dblp
How can users' comments posted on social media videos be a source of effective tags?
Mehdi Ellouze

doi dblp
A unified approach of detecting misleading images via tracing its instances on web and analyzing its past context for the verification of multimedia content
Deepika Varshney | Dinesh Kumar Vishwakarma

2022 Volume 11 Issue 2

doi dblp
Anomaly detection using edge computing in video surveillance system: review
Devashree R. Patrikar | Mayur Rajaram Parate

doi dblp
Caption TLSTMs: combining transformer with LSTMs for image captioning
Jie Yan | Yuxiang Xie | Xidao Luan | Yanming Guo | Quanzhi Gong | Suru Feng

doi dblp
DC-GNN: drop channel graph neural network for object classification and part segmentation in the point cloud
Md. Meraz | Md Afzal Ansari | Mohammed Javed | Pavan Chakraborty

doi dblp
Multi-sensor human activity recognition using CNN and GRU
Ohoud Nafea | Wadood Abdul | Ghulam Muhammad

doi dblp
A local representation-enhanced recurrent convolutional network for image captioning
Xiaoyi Wang | Jun Huang

doi dblp
Siamese coding network and pair similarity prediction for near-duplicate image detection
Marco Fisichella

doi dblp
PDS-Net: A novel point and depth-wise separable convolution for real-time object detection
Masum Shah Junayed | Md Baharul Islam | Hassan Imani | Tarkan Aydin

doi dblp
Few2Decide: towards a robust model via using few neuron connections to decide
Jian Li | Yanming Guo | Songyang Lao | Xi Zhao | Liang Bai | Haoran Wang

2022 Volume 11 Issue 1

doi dblp
Interactive video retrieval evaluation at a distance: comparing sixteen interactive video search systems in a remote setting at the 10th Video Browser Showdown
Silvan Heller | Viktor Gsteiger | Werner Bailer | Cathal Gurrin | Björn Þór Jónsson | Jakub Lokoc | Andreas Leibetseder | Frantisek Mejzlík | Ladislav Peska | Luca Rossetto | Konstantin Schall | Klaus Schoeffmann | Heiko Schuldt | Florian Spiess | Ly-Duyen Tran | Lucia Vadicamo | Patrik Veselý | Stefanos Vrochidis | Jiaxin Wu

doi dblp
A review on deep learning in medical image analysis
S. Suganyadevi | V. Seethalakshmi | K. Balasamy

doi dblp
A fast and robust affine-invariant method for shape registration under partial occlusion
Sinda Elghoul | Faouzi Ghorbel

doi dblp
Enhancing the performance of 3D auto-correlation gradient features in depth action classification
Mohammad Farhad Bulbul | Saiful Islam | Zannatul Azme | Preksha Pareek | Md. Humaun Kabir | Hazrat Ali

doi dblp
Multimodal image and audio music transcription
Carlos de la Fuente | Jose J. Valero-Mas | Francisco J. Castellanos | Jorge Calvo-Zaragoza