
International Journal of Multimedia Information Retrieval (2022)



2022 Volume 11 Issue 4

Contrastive self-supervised learning: review, progress, challenges and future research directions
Pranjal Kumar | Piyush Rawat | Siddhartha Chauhan

Human pose estimation using deep learning: review, methodologies, progress and future research directions
Pranjal Kumar | Siddhartha Chauhan | Lalit Kumar Awasthi

Special issue on cross-modal retrieval and analysis
Jianlong Wu | Richang Hong | Qi Tian

Prototype local-global alignment network for image-text retrieval
Lingtao Meng | Feifei Zhang | Xi Zhang | Changsheng Xu

Who is gambling? Finding cryptocurrency gamblers using multi-modal retrieval methods
Zhengjie Huang | Zhenguang Liu | Jianhai Chen | Qinming He | Shuang Wu | Lei Zhu | Meng Wang

Your heart rate betrays you: multimodal learning with spatio-temporal fusion networks for micro-expression recognition
Ren Zhang | Ning He | Shengjie Liu | Ying Wu | Kang Yan | Yuzhe He | Ke Lu

Multi-aware coreference relation network for visual dialog
Zefan Zhang | Tianling Jiang | Chunping Liu | Yi Ji

Video deblurring and flow-guided feature aggregation for obstacle detection in agricultural videos
Keyang Cheng | Xuesen Zhu | Yongzhao Zhan | Yunshen Pei

TCKGE: Transformers with contrastive learning for knowledge graph embedding
Xiaowei Zhang | Quan Fang | Jun Hu | Shengsheng Qian | Changsheng Xu

FDAM: full-dimension attention module for deep convolutional neural networks
Silin Cai | Changping Wang | Jiajun Ding | Jun Yu | Jianping Fan

FCT: fusing CNN and transformer for scene classification
Yuxiang Xie | Jie Yan | Lai Kang | Yanming Guo | Jiahui Zhang | Xidao Luan

Semantic-aware visual scene representation
Mohammad Javad Parseh | Mohammad Rahmanimanesh | Parviz Keshavarzi | Zohreh Azimifar

Generative adversarial networks for 2D-based CNN pose-invariant face recognition
Mohamed Kas | Youssef El Merabet | Yassine Ruichek | Rochdi Messoussi

A novel method for video shot boundary detection using CNN-LSTM approach
Benoughidene Abdel Halim | Titouna Faiza

Visual and semantic ensemble for scene text recognition with gated dual mutual attention
Zhiguang Liu | Liangwei Wang | Jian Qiao

MHA-WoML: Multi-head attention and Wasserstein-OT for few-shot learning
Junyan Yang | Jie Jiang | Yanming Guo

Gender classification from face images using central difference convolutional networks
Mohammadreza Sheikh Fathollahi | Rezvan Heidari

Tri-RAT: optimizing the attention scores for image captioning
You Yang | Yongzhi An | Juntao Hu | Longyue Pan

Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products
Stefanos-Iordanis Papadopoulos | Christos Koutlis | Symeon Papadopoulos | Ioannis Kompatsiaris

Similar interior coordination image retrieval with multi-view features
Ren Togo | Yuki Honma | Maiku Abe | Takahiro Ogawa | Miki Haseyama



2022 Volume 11 Issue 3

Cross-domain image retrieval: methods and applications
Xiaoping Zhou | Xiangyu Han | Haoran Li | Jia Wang | Xun Liang

A literature review and perspectives in deepfakes: generation, detection, and applications
Deepak Dagar | Dinesh Kumar Vishwakarma

Text detection, recognition, and script identification in natural scene images: a Review
Veronica Naosekpam | Nilkanta Sahu

Organ segmentation from computed tomography images using the 3D convolutional neural network: a systematic review
Ademola Enitan Ilesanmi | Taiwo Ilesanmi | Oluwagbenga Paul Idowu | Drew A. Torigian | Jayaram K. Udupa

Generative adversarial networks and its applications in the biomedical image segmentation: a comprehensive survey
Ahmed Iqbal | Muhammad Sharif | Mussarat Yasmin | Mudassar Raza | Shabib Aftab

Semantic-enhanced discriminative embedding learning for cross-modal retrieval
Hao Pan | Jun Huang

Music emotion recognition based on segment-level two-stage learning
Na He | Sam Ferguson

RGBD deep multi-scale network for background subtraction
Ihssane Houhou | Athmane Zitouni | Yassine Ruichek | Salah Eddine Bekhouche | Mohamed Kas | Abdelmalik Taleb-Ahmed

InceptionDepth-wiseYOLOv2: improved implementation of YOLO framework for pedestrian detection
Sweta Panigrahi | U. S. N. Raju

How can users' comments posted on social media videos be a source of effective tags?
Mehdi Ellouze

A unified approach of detecting misleading images via tracing its instances on web and analyzing its past context for the verification of multimedia content
Deepika Varshney | Dinesh Kumar Vishwakarma