International Journal of Multimedia Information Retrieval (2022)
Content
- 2022 Volume 11 Issue 4 20 papers
- 2022 Volume 11 Issue 3 11 papers
- 2022 Volume 11 Issue 2 8 papers
- 2022 Volume 11 Issue 1 5 papers
2022 Volume 11 Issue 4
Contrastive self-supervised learning: review, progress, challenges and future research directions
Pranjal Kumar
|
Piyush Rawat
|
Siddhartha Chauhan
Human pose estimation using deep learning: review, methodologies, progress and future research directions
Pranjal Kumar
|
Siddhartha Chauhan
|
Lalit Kumar Awasthi
Special issue on cross-modal retrieval and analysis
Jianlong Wu
|
Richang Hong
|
Qi Tian
Prototype local-global alignment network for image-text retrieval
Lingtao Meng
|
Feifei Zhang
|
Xi Zhang
|
Changsheng Xu
Who is gambling? Finding cryptocurrency gamblers using multi-modal retrieval methods
Zhengjie Huang
|
Zhenguang Liu
|
Jianhai Chen
|
Qinming He
|
Shuang Wu
|
Lei Zhu
|
Meng Wang
Your heart rate betrays you: multimodal learning with spatio-temporal fusion networks for micro-expression recognition
Ren Zhang
|
Ning He
|
Shengjie Liu
|
Ying Wu
|
Kang Yan
|
Yuzhe He
|
Ke Lu
Multi-aware coreference relation network for visual dialog
Zefan Zhang
|
Tianling Jiang
|
Chunping Liu
|
Yi Ji
Video deblurring and flow-guided feature aggregation for obstacle detection in agricultural videos
Keyang Cheng
|
Xuesen Zhu
|
Yongzhao Zhan
|
Yunshen Pei
TCKGE: Transformers with contrastive learning for knowledge graph embedding
Xiaowei Zhang
|
Quan Fang
|
Jun Hu
|
Shengsheng Qian
|
Changsheng Xu
FDAM: full-dimension attention module for deep convolutional neural networks
Silin Cai
|
Changping Wang
|
Jiajun Ding
|
Jun Yu
|
Jianping Fan
FCT: fusing CNN and transformer for scene classification
Yuxiang Xie
|
Jie Yan
|
Lai Kang
|
Yanming Guo
|
Jiahui Zhang
|
Xidao Luan
Semantic-aware visual scene representation
Mohammad Javad Parseh
|
Mohammad Rahmanimanesh
|
Parviz Keshavarzi
|
Zohreh Azimifar
Generative adversarial networks for 2D-based CNN pose-invariant face recognition
Mohamed Kas
|
Youssef El Merabet
|
Yassine Ruichek
|
Rochdi Messoussi
A novel method for video shot boundary detection using CNN-LSTM approach
Benoughidene Abdel Halim
|
Titouna Faiza
Visual and semantic ensemble for scene text recognition with gated dual mutual attention
Zhiguang Liu
|
Liangwei Wang
|
Jian Qiao
MHA-WoML: Multi-head attention and Wasserstein-OT for few-shot learning
Junyan Yang
|
Jie Jiang
|
Yanming Guo
Gender classification from face images using central difference convolutional networks
Mohammadreza Sheikh Fathollahi
|
Rezvan Heidari
Tri-RAT: optimizing the attention scores for image captioning
You Yang
|
Yongzhi An
|
Juntao Hu
|
Longyue Pan
Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products
Stefanos-Iordanis Papadopoulos
|
Christos Koutlis
|
Symeon Papadopoulos
|
Ioannis Kompatsiaris
Similar interior coordination image retrieval with multi-view features
Ren Togo
|
Yuki Honma
|
Maiku Abe
|
Takahiro Ogawa
|
Miki Haseyama
2022 Volume 11 Issue 3
Cross-domain image retrieval: methods and applications
Xiaoping Zhou
|
Xiangyu Han
|
Haoran Li
|
Jia Wang
|
Xun Liang
A literature review and perspectives in deepfakes: generation, detection, and applications
Deepak Dagar
|
Dinesh Kumar Vishwakarma
Text detection, recognition, and script identification in natural scene images: a Review
Veronica Naosekpam
|
Nilkanta Sahu
Organ segmentation from computed tomography images using the 3D convolutional neural network: a systematic review
Ademola Enitan Ilesanmi
|
Taiwo Ilesanmi
|
Oluwagbenga Paul Idowu
|
Drew A. Torigian
|
Jayaram K. Udupa
Generative adversarial networks and its applications in the biomedical image segmentation: a comprehensive survey
Ahmed Iqbal
|
Muhammad Sharif
|
Mussarat Yasmin
|
Mudassar Raza
|
Shabib Aftab
Semantic-enhanced discriminative embedding learning for cross-modal retrieval
Hao Pan
|
Jun Huang
Music emotion recognition based on segment-level two-stage learning
Na He
|
Sam Ferguson
RGBD deep multi-scale network for background subtraction
Ihssane Houhou
|
Athmane Zitouni
|
Yassine Ruichek
|
Salah Eddine Bekhouche
|
Mohamed Kas
|
Abdelmalik Taleb-Ahmed
InceptionDepth-wiseYOLOv2: improved implementation of YOLO framework for pedestrian detection
Sweta Panigrahi
|
U. S. N. Raju
How can users' comments posted on social media videos be a source of effective tags?
Mehdi Ellouze
A unified approach of detecting misleading images via tracing its instances on web and analyzing its past context for the verification of multimedia content
Deepika Varshney
|
Dinesh Kumar Vishwakarma
2022 Volume 11 Issue 2
Anomaly detection using edge computing in video surveillance system: review
Devashree R. Patrikar
|
Mayur Rajaram Parate
Caption TLSTMs: combining transformer with LSTMs for image captioning
Jie Yan
|
Yuxiang Xie
|
Xidao Luan
|
Yanming Guo
|
Quanzhi Gong
|
Suru Feng
DC-GNN: drop channel graph neural network for object classification and part segmentation in the point cloud
Md. Meraz
|
Md Afzal Ansari
|
Mohammed Javed
|
Pavan Chakraborty
Multi-sensor human activity recognition using CNN and GRU
Ohoud Nafea
|
Wadood Abdul
|
Ghulam Muhammad
A local representation-enhanced recurrent convolutional network for image captioning
Xiaoyi Wang
|
Jun Huang
Siamese coding network and pair similarity prediction for near-duplicate image detection
Marco Fisichella
PDS-Net: A novel point and depth-wise separable convolution for real-time object detection
Masum Shah Junayed
|
Md Baharul Islam
|
Hassan Imani
|
Tarkan Aydin
Few2Decide: towards a robust model via using few neuron connections to decide
Jian Li
|
Yanming Guo
|
Songyang Lao
|
Xi Zhao
|
Liang Bai
|
Haoran Wang
2022 Volume 11 Issue 1
Interactive video retrieval evaluation at a distance: comparing sixteen interactive video search systems in a remote setting at the 10th Video Browser Showdown
Silvan Heller
|
Viktor Gsteiger
|
Werner Bailer
|
Cathal Gurrin
|
Björn Þór Jónsson
|
Jakub Lokoc
|
Andreas Leibetseder
|
Frantisek Mejzlík
|
Ladislav Peska
|
Luca Rossetto
|
Konstantin Schall
|
Klaus Schoeffmann
|
Heiko Schuldt
|
Florian Spiess
|
Ly-Duyen Tran
|
Lucia Vadicamo
|
Patrik Veselý
|
Stefanos Vrochidis
|
Jiaxin Wu
A review on deep learning in medical image analysis
S. Suganyadevi
|
V. Seethalakshmi
|
K. Balasamy
A fast and robust affine-invariant method for shape registration under partial occlusion
Sinda Elghoul
|
Faouzi Ghorbel
Enhancing the performance of 3D auto-correlation gradient features in depth action classification
Mohammad Farhad Bulbul
|
Saiful Islam
|
Zannatul Azme
|
Preksha Pareek
|
Md. Humaun Kabir
|
Hazrat Ali
Multimodal image and audio music transcription
Carlos de la Fuente
|
Jose J. Valero-Mas
|
Francisco J. Castellanos
|
Jorge Calvo-Zaragoza