-
A privacy-preserving and traitor tracking content-based image retrieval scheme in cloud computing Multimedia Syst. (IF 1.563) Pub Date : 2021-01-09 Zhangdong Wang, Jiaohua Qin, Xuyu Xiang, Yun Tan
To ensure the image security, a large number of ciphertext image retrieval methods have been studied and applied, such as homomorphic encryption and multi-key encryption. However, most of these algorithms do not consider the protection of image copyright, user information and traitor tracking. For this reason, this paper proposes a privacy-preserving and traitor tracking content-based image retrieval
-
Multichannel speech separation using hybrid GOMF and enthalpy-based deep neural networks Multimedia Syst. (IF 1.563) Pub Date : 2021-01-06 Yannam Vasantha Koteswararao, C. B. Rama Rao
Speech signal is commonly debased by room reverberation and included noises in genuine climates. This paper focuses on disengaging objective speech signals in reverberant conditions from multichannel input signals. To overcome all the existing drawbacks, this work proposes an efficient technique like, multichannel speech signal separation using a new hybrid method that combines grasshopper optimization-based
-
Steganography in animated emoji using self-reference Multimedia Syst. (IF 1.563) Pub Date : 2021-01-06 Zhiying Zhu, Qichao Ying, Zhenxing Qian, Xinpeng Zhang
Animated emoji is a kind of GIF image, which is widely used in online social networks (OSN) for its efficiency in transmitting vivid and personalized information. Aiming at realizing covert communication in animated emoji, this paper proposes an improved steganography framework in animated emoji. We propose a self-reference algorithm to improve the steganography security. Meanwhile, the relations between
-
Myocardial infarction detection based on deep neural network on imbalanced data Multimedia Syst. (IF 1.563) Pub Date : 2021-01-06 Mohamed Hammad, Monagi H. Alkinani, B. B. Gupta, Ahmed A. Abd El-Latif
Myocardial infarction (MI) is an acute interruption of blood flow to the heart, which causes the heart to suffer from a deficiency of blood and ischemia, so the heart muscle is damaged, and cells can die and lose their function. Despite the low incidence of MI in the world, it is still a common disease-causing death. Therefore, detecting the MI signals early can reduce mortality. This paper presented
-
Secure speech coding communication using hyperchaotic key generators for AMR-WB codec Multimedia Syst. (IF 1.563) Pub Date : 2021-01-06 Messaouda Boumaraf, Fatiha Merazka
Classical cryptosystems are not suitable for all situations for they fail to meet some current security requirements due to their small keyspace, which makes them prone to brute-force attacks, as well as their inefficiency in real-time applications due to their considerable time complexity. Recently, secure applications such as hyperchaos-based speech signals became a focal topic for research. Their
-
An image classification model based on transfer learning for ulcerative proctitis Multimedia Syst. (IF 1.563) Pub Date : 2021-01-04 Feng Zeng, Xingcun Li, Xiaoheng Deng, Lan Yao, Guanghui Lian
Ulcerative colitis (UC) can be classified as proctitis, left-sided colitis or pancolitis, usually with rectal involvement at the beginning. Mucosal carcinogenesis is one of the most severe complications of UC. Persistent inflammation of the rectal mucosa may be an essential cause of mucosal cancer, thus the detection of rectal inflammation is of great significance. In this paper, we propose a transfer
-
Reversible data hiding in encrypted medical DICOM image Multimedia Syst. (IF 1.563) Pub Date : 2021-01-04 Ping Kong, Di Fu, Xinran Li, Chuan Qin
This paper proposes a novel reversible data hiding (RDH) scheme in encrypted domain for medical DICOM images. Although a lot of RDH schemes for encrypted images have been presented, however, most of them are unsuitable for medical DICOM images, as they do not utilize the features of the DICOM image format, and the recovery accuracy is low because medical images have large areas with the same pixel
-
A compressed string matching algorithm for face recognition with partial occlusion Multimedia Syst. (IF 1.563) Pub Date : 2021-01-04 Krishnaveni Bommidi, Sridhar Sundaramurthy
There has been less attention towards the research on face recognition with partial occlusion. Facial accessories such as masks, sunglasses, and caps, etc., cause partial occlusion which results in a significant performance drop of the face recognition system. In this paper, a novel compressed string matching algorithm based on run-length encoding (CSM-RL) is proposed to solve the partial occlusion
-
RDH-based dynamic weighted histogram equalization using for secure transmission and cancer prediction Multimedia Syst. (IF 1.563) Pub Date : 2021-01-04 Rashid Abbasi, Jianwen Chen, Yasser Al-Otaibi, Amjad Rehman, Asad Abbas, Weiwei Cui
Image contrast enhancement is a prerequisite and plays a very important role in many image processing field like medical imaging, face recognition, computer-vision, and satellite imaging. In this paper we proposed reversible data hiding based Limited Dynamic Weighted Histogram Equalization techniques for Abnormal Tumor regions which improve the contrast, transmit the hidden secret information, preserve
-
Reversible data hiding in adjacent zeros Multimedia Syst. (IF 1.563) Pub Date : 2021-01-04 Heba Abdel-Nabi, Ali Al-Haj
Data hiding techniques are used to embed information into digital objects for the purpose of copyright protection, covert communication, media notation, among other applications. Compared with traditional data hiding techniques, reversible data hiding (RDH) can restore the original digital object after extracting the hidden data. This makes RDH very useful for applications requiring high fidelity of
-
PAMTEL-RT: web-based multimedia platform for tele-assistance of pediatric health emergencies in real time in training centers Multimedia Syst. (IF 1.563) Pub Date : 2021-01-04 Fernando Boronat, Pascual Escrivá, Pau Salvador, Fátima Pareja, Javier Pastor
In this paper, PAMTEL-RT, a web-based platform to provide remote health professional support and guidance during the initial assistance of emergency situations is presented. In particular, this platform focuses on the pediatric population and has been specifically designed to be used by non-healthcare professionals in training centers, such as educational or sport centers. It allows pediatricians in
-
FU-Net: fast biomedical image segmentation model based on bottleneck convolution layers Multimedia Syst. (IF 1.563) Pub Date : 2021-01-04 Bekhzod Olimov, Karshiev Sanjar, Sadia Din, Awaise Ahmad, Anand Paul, Jeonghong Kim
Recently, the introduction of Convolutional Neural Network (CNNs) has advanced the way of solving image segmentation tasks. Semantic image segmentation has considerably benefited from employing various CNN models. The most widely used network in this field is U-Net and its different variations. However, these models require significant number of trainable parameters, floating-point operations per second
-
Face spoofing detection based on chromatic ED-LBP texture feature Multimedia Syst. (IF 1.563) Pub Date : 2020-11-27 Xin Shu, Hui Tang, Shucheng Huang
Face spoofing detection, also known as liveness detection, is a challenging and one of the most active research areas in computer vision. In this paper, a novel texture descriptor, namely equilibrium difference local binary pattern (ED-LBP), is proposed for the representation and recognition of face texture. First, the adjacent pixels discrepancy in a facial image is fully considered and the discrepancy
-
LMSN:a lightweight multi-scale network for single image super-resolution Multimedia Syst. (IF 1.563) Pub Date : 2020-11-24 Yiye Zou, Xiaomin Yang, Marcelo Keese Albertini, Farhan Hussain
With the development of deep learning (DL), convolutional neural networks (CNNs) have shown great reconstruction performance in single image super-resolution (SISR). However, some methods blindly deepen the networks to purchase the performance, which neglect to make full use of the multi-scale information of different receptive fields and ignore the efficiency in practice. In this paper, a lightweight
-
A TextCNN and WGAN-gp based deep learning frame for unpaired text style transfer in multimedia services Multimedia Syst. (IF 1.563) Pub Date : 2020-11-23 Mingxuan Hu, Min He, Wei Su, Abdellah Chehri
With the rapid growth of big multimedia data, multimedia processing techniques are facing some challenges, such as knowledge understanding, semantic modeling, feature representation, etc. Hence, based on TextCNN and WGAN-gp (improved training of Wasserstein GANs), a deep learning framework is suggested to improve the efficiency of discriminating the specific style features and the style-independent
-
How does context influence music preferences: a user-based study of the effects of contextual information on users’ preferred music Multimedia Syst. (IF 1.563) Pub Date : 2020-11-21 Imen Ben Sassi, Sadok Ben Yahia
To simplify effective music filtering, recommender systems (RS) have received great attention from both industry and academia area. To select which music to recommend, traditional RS uses an approximation of users’ real interests. However, while discarding users’ contexts, profiles information is not able to reflect their exact needs and to provide overpowering recommendations. One of the main issues
-
Blockchain-enabled supply chain: analysis, challenges, and future directions Multimedia Syst. (IF 1.563) Pub Date : 2020-11-20 Sohail Jabbar, Huw Lloyd, Mohammad Hammoudeh, Bamidele Adebisi, Umar Raza
Managing the integrity of products and processes in a multi-stakeholder supply chain environment is a significant challenge. Many current solutions suffer from data fragmentation, lack of reliable provenance, and diverse protocol regulations across multiple distributions and processes. Amongst other solutions, Blockchain has emerged as a leading technology, since it provides secure traceability and
-
A hybrid firefly and particle swarm optimization algorithm applied to multilevel image thresholding Multimedia Syst. (IF 1.563) Pub Date : 2020-11-19 Taymaz Rahkar Farshi, Ahad K. Ardabili
There are many techniques for conducting image analysis and pattern recognition. This papers explores a way to optimize one of these techniques—image segmentation—with the help of a novel hybrid optimization algorithm. Image segmentation is mostly used for a semantic segmentation of images, and thresholding is one the most common techniques for performing this segmentation. Otsu’s and Kapur’s thresholding
-
A novel AFNCS algorithm for super-resolution SAR in curve trajectory Multimedia Syst. (IF 1.563) Pub Date : 2020-11-18 Tan Qu, Yan Zhang, Jiaji Wu
With the improvement of SAR resolution, super-resolution SAR imaging is more and more widely used in the all-time and all-weather video surveillance and remote sensing imaging. One implementation of super-resolution SAR is that radar works in the spotlight mode. In the case with highly squinted angle and acceleration, the azimuth space variance and the coupling between range and azimuth will become
-
An efficient 2D encoding/decoding technique for optical communication system based on permutation vectors theory Multimedia Syst. (IF 1.563) Pub Date : 2020-11-16 Hassan Yousif Ahmed, Medien Zeghid, Waqas A. Imtiaz, Teena Sharma, Abdellah Chehri
One dimensional encoding in optical communications has limitations in terms of the number of supported users and large bandwidth consumption. This study presents a new algorithm to generate two-dimensional (2D) encoding utilizing permutation vectors (PV) theory for incoherent multiple access network to suppress multiple access interference (MAI) and system complexity. The proposed code design approach
-
Multi-scale skip-connection network for image super-resolution Multimedia Syst. (IF 1.563) Pub Date : 2020-11-11 Jing Liu, Jianhui Ge, Yuxin Xue, Wenjuan He, Qindong Sun, Shancang Li
A skip-connection learning framework-based convolution neural network (CNN) has recently achieved great success in image super-resolution (SR). However, most CNN models based on the skip-connection learning framework do not fully make use of potential multi-scale features of images. In this paper, we propose a multi-scale skip-connection network (MSN) to improve the visual quality of the image SR.
-
CyberBERT: BERT for cyberbullying identification Multimedia Syst. (IF 1.563) Pub Date : 2020-11-11 Sayanta Paul, Sriparna Saha
Cyberbullying can be delineated as a purposive and recurrent act, which is aggressive in nature, done via different social media platforms such as Facebook, Twitter, Instagram, and others. A state-of-the-art pre-training language model, BERT (Bidirectional Encoder Representations from Transformers), has achieved remarkable results in many language understanding tasks. In this paper, we present a novel
-
Intelligent multimodal medical image fusion with deep guided filtering Multimedia Syst. (IF 1.563) Pub Date : 2020-11-11 B. Rajalingam, Fadi Al-Turjman, R. Santhoshkumar, M. Rajesh
Medical image fusion is a synthesis of visual information present in any number of medical imaging inputs into a single fused image without any distortion or loss of detail. It enhances image quality by retaining specific features to improve the clinical applicability of medical imaging for treatment and evaluation of medical conditions. A big challenge in the processing of medical images is to incorporate
-
A novel method for ECG signal classification via one-dimensional convolutional neural network Multimedia Syst. (IF 1.563) Pub Date : 2020-11-11 Xuan Hua, Jungang Han, Chen Zhao, Haipeng Tang, Zhuo He, Qinghui Chen, Shaojie Tang, Jinshan Tang, Weihua Zhou
This paper develops an end-to-end ECG signal classification algorithm based on a novel segmentation strategy and 1D Convolutional Neural Networks (CNN) to aid the classification of ECG signals and alleviate the workload of physicians. The ECG segmentation strategy named R-R-R strategy (i.e., retaining ECG data between the R peaks just before and after the current R peak) is used for segmenting the
-
Automatic liver segmentation from abdominal CT volumes using improved convolution neural networks Multimedia Syst. (IF 1.563) Pub Date : 2020-11-09 Zhe Liu, Kai Han, Zhaohui Wang, Jing Zhang, Yuqing Song, Xu Yao, Deqi Yuan, Victor S. Sheng
Segmentation of the liver from abdominal CT images is an essential step for computer-aided diagnosis and surgery planning. The U-Net architecture is one of the most well-known CNN architectures which achieved remarkable successes in both medical and biological image segmentation domain. However, it does not perform well when the target area is small or partitioned. In this paper, we propose a novel
-
Foreground detection using motion histogram threshold algorithm in high-resolution large datasets Multimedia Syst. (IF 1.563) Pub Date : 2020-11-06 Fakhri Alam Khan, Muhammad Nawaz, Muhammad Imran, Arif Ur Rahman, Fawad Qayum
Background subtraction, being the most cited algorithm for foreground detection, encounters the major problem of proper threshold value at run time. For effective value of the threshold at run time in background subtraction algorithm, the primary component of the foreground detection process, motion is used, in the proposed algorithm. For the said purpose, the smooth histogram peaks and valley of the
-
An image watermarking technique using hybrid signals as watermarks Multimedia Syst. (IF 1.563) Pub Date : 2020-11-03 Amita Singha, Muhammad Ahsan Ullah
In digital communication, it is very necessary to prevent the unlawful intrusion of unwanted parties. Apart from that, the authenticity of a signal being sent is a key factor otherwise false data can break the commitment between the sender and the receiver of the signal. By digital watermarking, both the insurance of the authenticity and the prevention of unlawful intrusion can be maintained by inserting
-
Bayer image demosaicking and denoising based on specialized networks using deep learning Multimedia Syst. (IF 1.563) Pub Date : 2020-10-30 Alaa O. Khadidos, Adil O. Khadidos, Fazal Qudus Khan, Georgios Tsaramirsis, Awais Ahmad
Demosaicking is the way toward reproducing a full hued picture from a deficient shaded picture. The single sensor doesn't catch all hues for a single pixel. To address this, a color filter array (CFA) is utilized to get a hued picture from a single sensor. The created picture from CFA is called a mosaic picture. In this research, we utilize specialized networks to remove the noise from Bayer images
-
A novel category detection of social media reviews in the restaurant industry Multimedia Syst. (IF 1.563) Pub Date : 2020-10-24 Mohib Ullah Khan, Abdul Rehman Javed, Mansoor Ihsan, Usman Tariq
Social media platforms have enabled users to share their thoughts, ideas, and opinions on different subject matters and meanwhile generate lots of information which can be adopted to understand people’s emotion towards certain products. This information can be effectively applied for Aspect Category Detection (ACD). Similarly, people’s emotions and recommendation-based Artificial Intelligence (AI)-powered
-
Robust and fast image hashing with two-dimensional PCA Multimedia Syst. (IF 1.563) Pub Date : 2020-10-20 Xiaoping Liang, Zhenjun Tang, Xiaolan Xie, Jingli Wu, Xianquan Zhang
Image hashing is a useful technology of many multimedia systems, such as image retrieval, image copy detection, multimedia forensics and image authentication. Most of the existing hashing algorithms do not reach a good classification between robustness and discrimination and some hashing algorithms based on dimensionality reduction have high computational cost. To solve these problems, we propose a
-
Resampling parameter estimation via dual-filtering based convolutional neural network Multimedia Syst. (IF 1.563) Pub Date : 2020-10-18 Lin Peng, Xin Liao, Mingliang Chen
Resampling detection is an important problem in image forensics. Several exiting approaches have been proposed to solve it, but few of them focus on resampling parameter estimation. Especially, the estimation of downsampling scenarios is very challenging. In this paper, we propose a dual-filtering based convolutional neural network (CNN) to extract features directly from the images. First, we analyze
-
Video compression using hybrid hexagon search and teaching–learning-based optimization technique for 3D reconstruction Multimedia Syst. (IF 1.563) Pub Date : 2020-10-17 B. Veerasamy, S. Annadurai
Motion estimation from a video sequence is an interesting issue in video processing. Nowadays, research has been focused on global optimization techniques, that estimate the optical flow for pixel neighborhoods. In this paper, a hybrid statistically effective motion estimation procedure has been proposed for better effectiveness video compression. This method explores by utilizing a hexagonal search
-
Floor of log: a novel intelligent algorithm for 3D lung segmentation in computer tomography images Multimedia Syst. (IF 1.563) Pub Date : 2020-10-15 Solon Alves Peixoto, Aldísio G. Medeiros, Mohammad Mehedi Hassan, M. Ali Akber Dewan, Victor Hugo C. de Albuquerque, Pedro P. Rebouças Filho
This work presents a high-performance approach for 3D lung segmentation tasks in computer tomography images using a new intelligent machine learning algorithm called Floor of Log(FoL). The Support Vector Machine was used to learn the better parameter of the FoL algorithm using the parenchyma and its border as labels. Sensitivity, Matthews Correlation Coefficient (MCC), Hausdorff Distance (HD), Dice
-
Low-complexity reversible data hiding in encrypted image via MSB hierarchical coding and LSB compression Multimedia Syst. (IF 1.563) Pub Date : 2020-10-14 Fang Cao, Xiaokang Qian, Bowen An
Recently, reversible data hiding in encrypted image (RDHEI) has attracted extensive attention, which can be used in secure cloud computing and privacy protection effectively. In this paper, a low-complexity RDHEI scheme based on MSB hierarchical coding and LSB compression is proposed. Content owner first utilizes a specific encryption method, including pixel encrypting, block, and pixel scrambling
-
Cyberbullying detection solutions based on deep learning architectures Multimedia Syst. (IF 1.563) Pub Date : 2020-10-13 Celestine Iwendi, Gautam Srivastava, Suleman Khan, Praveen Kumar Reddy Maddikunta
Cyberbullying is disturbing and troubling online misconduct. It appears in various forms and is usually in a textual format in most social networks. Intelligent systems are necessary for automated detection of these incidents. Some of the recent experiments have tackled this issue with traditional machine learning models. Most of the models have been applied to one social network at a time. The latest
-
Image super-resolution model using an improved deep learning-based facial expression analysis Multimedia Syst. (IF 1.563) Pub Date : 2020-10-13 Pyoung Won Kim
Image upsampling and noise removal are important tasks in digital image processing. Single-image upsampling and denoising influence the quality of the resulting images. Image upsampling is known as super-resolution (SR) and referred to as the restoration of a higher-resolution image from a given low-resolution image. In facial expression analysis, the resolution of the original image directly affects
-
Image steganography based on Kirsch edge detection Multimedia Syst. (IF 1.563) Pub Date : 2020-10-12 Sudipta Kumar Ghosal, Agneet Chatterjee, Ram Sarkar
Conventional steganography methods fabricate the secret information into the cover pixels without analyzing the pixel intensities of an image. As a result, some minor pixel level manipulations may lead to huge visual distortion in the stego-image. To this end, in this paper, a novel steganographic scheme based on Kirsch edge detector is proposed. The aim of the scheme is to maximize the payload by
-
Sanskrit to universal networking language EnConverter system based on deep learning and context-free grammar Multimedia Syst. (IF 1.563) Pub Date : 2020-10-11 Sitender, Seema Bawa
Machine Translation is a mechanism of transforming text from one language to another with the help of computer technology. Earlier in 2018, a machine translation system had been developed by the authors that translate Sanskrit text to Universal Networking Language expressions and was named as SANSUNL. The work presented in this paper is an extension of SANSUNL system by enhancing POS tagging, Sanskrit
-
User segmentation via interpretable user representation and relative similarity-based segmentation method Multimedia Syst. (IF 1.563) Pub Date : 2020-10-09 Younghoon Lee, Sungzoon Cho
User segmentation is an essential element of marketing and product development that considers customers’ needs and recognizes the heterogeneity of those needs. In a key study of smartphone user segmentation, Lee et al. analyzed app usage sequencing using seq2seq architecture. However, despite achieving meaningful results, their approach could not provide a robust interpretation of user segmentation
-
Habitat mapping using deep neural networks Multimedia Syst. (IF 1.563) Pub Date : 2020-10-03 Muhammad Yasir, Arif Ur Rahman, Moneeb Gohar
Habitat mapping is an important and challenging task that helps in monitoring, managing, and preserving ecosystems. It becomes more challenging when marine habitats are mapped, as it is difficult to get quality images in an underwater environment. Moreover, achieving good location accuracy in underwater environments is an additional issue. Sonar imagery has good quality but is hard to be analyzed.
-
A hybrid algorithm for underwater image restoration based on color correction and image sharpening Multimedia Syst. (IF 1.563) Pub Date : 2020-09-27 Haiyang Meng, Yongjie Yan, Chengtao Cai, Renjie Qiao, Feng Wang
In this paper, an effective method is proposed for restoring underwater image based on color correction and image sharpening. The main purpose of this method is to improve the visibility of underwater images. Traditional methods generally adopt color balancing method to restore the images. However, we found that color balancing has a poor effectiveness on underwater image when the values of blue channel
-
The state of the art of deep learning models in medical science and their challenges Multimedia Syst. (IF 1.563) Pub Date : 2020-09-25 Chandradeep Bhatt, Indrajeet Kumar, V. Vijayakumar, Kamred Udham Singh, Abhishek Kumar
With time, AI technologies have matured well and resonated in various domains of applied sciences and engineering. The sub-domains of AI, machine learning (ML), deep learning (DL), and associated statistical tools are getting more attention. Therefore, various machine learning models are being created to take advantage of the data available and accomplish tasks, such as automatic prediction, classification
-
An improved algorithm of multi-exposure image fusion by detail enhancement Multimedia Syst. (IF 1.563) Pub Date : 2020-09-21 Zhong Qu, Xu Huang, Ling Liu
Multi-exposure image fusion is an effective method for depicting high dynamic range of the target scene in a single image. However, there are still some problems remaining: the preserving of global contrast, the preserving of the local details in saturated regions, and the existence of halo artifacts. To solve these problems, this paper proposes a new multi-exposure image fusion algorithm with detail
-
ALBERT-based fine-tuning model for cyberbullying analysis Multimedia Syst. (IF 1.563) Pub Date : 2020-09-18 Jatin Karthik Tripathy, S. Sibi Chakkaravarthy, Suresh Chandra Satapathy, Madhulika Sahoo, V. Vaidehi
With the world’s interaction moving more and more toward using online social media platforms, the advent of cyberbullying has also raised its head. Multiple forms of cyberbullying exist from the more common text based to images or even videos, and this paper will explore the context of textual comments. Even in the niche area of considering only text-based data, several approaches have already been
-
Multi-nonlinear multi-view locality-preserving projection with similarity learning for random cross-view gait recognition Multimedia Syst. (IF 1.563) Pub Date : 2020-09-14 Xiaoyun Chen, Yeyuan Kang, Zhiping Chen
View variation is one of the greatest challenges in the field of gait recognition. Subspace learning approaches are designed to solve this issue by projecting cross-view features into a common subspace before recognition. However, similarity measures are data-dependent, which results in low accuracy when cross-view gait samples are randomly arranged. Inspired by the recent developments of data-driven
-
Gaussian Hermite polynomial based lossless medical image compression Multimedia Syst. (IF 1.563) Pub Date : 2020-09-09 S. N. Kumar, A. Ahilan, Ajay Kumar Haridhas, Jins Sebastian
The role of compression is inevitable in the storage and transmission of medical images. The polynomial based image compression is proposed in this work for the compression of abdomen CT medical images. The input images are preprocessed by min–max normalization; the pixels are scanned and subjected to polynomial approximation. The polynomial approximated coefficients are subjected to llyods quantization
-
A fast CU partition algorithm based on sum of region-directional dispersion for virtual reality 360° video Multimedia Syst. (IF 1.563) Pub Date : 2020-09-02 Mengmeng Zhang, Yuhao Wang, Zhi Liu, Zhao Wang
Virtual reality 360° video has an ultra-high resolution (usually 4–8 K), which takes more coding time than traditional video. Considering the degree of horizontal stretching in a different area of the ERP projected video, a fast coding unit (CU) partition algorithm based on the sum of region-directional dispersion is proposed. The relationship between the current block and its adjacent ones is measured
-
Multimedia image and video retrieval based on an improved HMM Multimedia Syst. (IF 1.563) Pub Date : 2020-09-01 Yanbing Liu, Sanjev Dhakal, Binyao Hao
In today's information age, information is gathered from text and more complex media, such as images, audio, and video. Among these data sources, the rapid growth of video information has led to it to gradually become the main source of information in people's lives. Video information is characterized by many kinds of information, complex forms, and a low degree of structure. Therefore, effectively
-
Fast CU partition decision for H.266/VVC based on the improved DAG-SVM classifier model Multimedia Syst. (IF 1.563) Pub Date : 2020-08-27 Qiuwen Zhang, Yihan Wang, Lixun Huang, Bin Jiang, Xiao Wang
One of the biggest changes in H.266/Versatile Video Coding (VVC) is introduced quad-tree with nested multi-type tree (QTMT) coding tree architecture, where the multi-type tree (MTT) structure in H.266/VVC includes binary tree (BT) and ternary tree (TT). Compared with H.265/High Efficiency Video Coding (HEVC) which only is divided by quad-tree (QT), the QTMT architecture makes the coding unit (CU) partition
-
Integrating Gaussian mixture model and dilated residual network for action recognition in videos Multimedia Syst. (IF 1.563) Pub Date : 2020-08-20 Ming Fang, Xiaoying Bai, Jianwei Zhao, Fengqin Yang, Chih-Cheng Hung, Shuhua Liu
Action recognition in video is one of the important applications in computer vision. In recent years, the two-stream architecture has made significant progress in action recognition, but it has not systematically explored spatial–temporal features. Therefore, this paper proposes an integrated approach using Gaussian mixture model (GMM) and dilated convolution residual network (GD-RN) for action recognition
-
Hybrid optimal algorithm-based 2D discrete wavelet transform for image compression using fractional KCA Multimedia Syst. (IF 1.563) Pub Date : 2020-08-19 V. Geetha, V. Anbumani, G. Murugesan, S. Gomathi
Due to the low compression performance of traditional compression models, we have developed a new HOA based Fractional KCA with 2D-DWT for improving the multispectral image quality. In this paper, we present a novel multispectral image compression method for improving the complexity by maintaining quality reconstruction and also reducing the size of the storage of multispectral images. Initially, Karhunen–Loeve
-
Emperor Penguin optimized event recognition and summarization for cricket highlight generation Multimedia Syst. (IF 1.563) Pub Date : 2020-08-18 Hansa Shingrakhia, Hetal Patel
Cricket highlight generation is the process of summarizing a full-length video to a shortened form which should preserve the important moments present in the original video. In this paper, a new approach has been proposed for recognizing the key events and summarization. Audio features are initially used for extracting the excitement clips. Then, the important events like replay, players, umpires,
-
Color image quantization with peak-picking and color space Multimedia Syst. (IF 1.563) Pub Date : 2020-08-12 Taymaz Rahkar Farshi
Color image quantization is a significant procedure of reducing the huge range of color values of a digital color image into a limited range. In this paper, an automated clustering of pixels and color quantization algorithm is proposed. The ideal number of representative colors is unknown beforehand in most color quantization algorithms. This is an important handicap in most practical cases. The proposed
-
Semantic image segmentation algorithm in a deep learning computer network Multimedia Syst. (IF 1.563) Pub Date : 2020-08-10 Defu He, Chao Xie
Semantic image segmentation in computer networks is designed to determine the category to which each pixel in an image belongs. It is a basic computer vision task and has a very wide range of applications in practice. In recent years, semantic image segmentation algorithms in computer networks based on deep learning have attracted widespread attention due to their fast speed and high accuracy. However
-
An effective security assessment approach for Internet banking services via deep analysis of multimedia data Multimedia Syst. (IF 1.563) Pub Date : 2020-08-10 Sana Khattak, Sadeeq Jan, Iftikhar Ahmad, Zahid Wadud, Fazal Qudus Khan
With the emergence of cyber technology, the biggest evolution has been observed in the use of Internet for financial purposes, in particular for the Internet banking sector. However, with the increase in the number of Internet banking users, many security issues have been discovered. In the recent past, there have been many successful cyber-attacks on the Internet banking services (IBS) throughout
-
Online multi-object tracking using KCF-based single-object tracker with occlusion analysis Multimedia Syst. (IF 1.563) Pub Date : 2020-08-06 Honghong Yang, Sheng Gao, Xiaojun Wu, Yumei Zhang
Most state-of-the-art multiple-object tracking (MOT) methods adopt the tracking-by-detection (TBD) paradigm, which is a two-step procedure including the detection module and the tracking module. In these methods, the tracking performance heavily depends on initial detections and data association. In this paper, we present an online MOT method by introducing a single-object tracking (SOT) based on correlation
-
Deep learning-based multi-modal approach using RGB and skeleton sequences for human activity recognition Multimedia Syst. (IF 1.563) Pub Date : 2020-07-25 Pratishtha Verma, Animesh Sah, Rajeev Srivastava
The deep learning techniques have achieved great success in the application of human activity recognition (HAR). In this paper, we propose a technique for HAR that utilizes the RGB and skeleton information with the help of a convolutional neural network (Convnet) and long short-term memory (LSTM) as a recurrent neural network (RNN). The proposed method has two parts: first, motion representation images
-
Low-light-level image enhancement algorithm based on integrated networks Multimedia Syst. (IF 1.563) Pub Date : 2020-07-22 Peng Wang, Jiao Wu, Haiyan Wang, Xiaoyan Li, Yongxia Yang
In dark or poorly lit environments, it is often difficult for the naked eye to distinguish low-light-level images because of low brightness, low contrast and noise, and it is difficult to perform subsequent image processing (such as video surveillance and target detection). To solve these problems, this paper proposes a low-light-level image enhancement algorithm based on deep learning. First, the
-
Food image classification and image retrieval based on visual features and machine learning Multimedia Syst. (IF 1.563) Pub Date : 2020-07-21 Pengcheng Wei, Bo Wang
Research on image retrieval and classification in the food field has become one of the more and more concerned research topics in the field of multimedia analysis and applications. In recent years, with the rapid development of the Internet industry and multimedia technology, image classification and retrieval technology has become a research hotspot at home and abroad. Traditional keyword-based image
-
MQTT-SN, CoAP, and RTP in wireless IoT real-time communications Multimedia Syst. (IF 1.563) Pub Date : 2020-07-14 Rolando Herrero
A great number of Internet of things (IoT) applications rely on real-time communication (RTC) mechanisms for transmission of media. Essentially, applications analyze and process media to make decisions that typically affect actuation and control of embedded devices. IoT networks, however, are subjected to constrains that limit the computational and resource complexity of all entities involved. This
Contents have been reproduced by permission of the publishers.