样式: 排序: IF: - GO 导出 标记为已读
-
Scale Enhancement Network for Object Detection in Aerial Images Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-03-13 Shihan Mao, Zhi Wang, Qineng He, Zhangqing Zhu
The main challenge for object detection in aerial images is small object detection. Most existing methods use feature fusion strategies to enhance small object features in shallow layers but ignore the problem of inconsistent small object local region responses between feature layers, namely the semantic gap, which may lead to underutilization of small object information in multiple feature layers
-
DAGAN: A GAN Network for Image Denoising of Medical Images Using Deep Learning of Residual Attention Structures Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-03-13 Guoxiang Tong, Fangning Hu, Hongjun Liu
Medical images are susceptible to noise and artifacts, so denoising becomes an essential pre-processing technique for further medical image processing stages. We propose a medical image denoising method based on dual-attention mechanism for generative adversarial networks (GANs). The method is based on a GAN model with fused residual structure and introduces a global skip-layer connection structure
-
Leveraging Sampling Schemes on Skewed Class Distribution to Enhance Male Fertility Detection with Ensemble AI Learners Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-03-07 Debasmita GhoshRoy, P. A. Alvi, KC Santosh
Designing effective AI models becomes a challenge when dealing with imbalanced/skewed class distributions in datasets. Addressing this, re-sampling techniques often come into play as potential solutions. In this investigation, we delve into the male fertility dataset, exploring 14 re-sampling approaches to understand their impact on enhancing predictive model performance. The research employs conventional
-
Residual Network for Image Compression Artifact Reduction Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-03-07 Jianhua Hu, Guixiang Luo, Bo Wang, Weimei Wu, Jiahui Yang, Jianding Guo
This paper proposes an image compression algorithm based on Swin Transformer and residual network (STRN), aiming to reduce blurring and distortions in traditionally compressed images. The algorithm utilizes a dual-channel mechanism to remove artifacts from the image, which takes advantage of the complementary features of the transform and residual networks. The Swin Transformer networks address the
-
A Rate Control Scheme for VVC Intercoding Using a Linear Model Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-03-05 Heqiang Wang, Xuekai Wei, Weizhi Xian, Jun Luo, Huayan Pu, Zhigang Chu, Xin Wang, Xueyong Xu, Chang Lu, Mingliang Zhou
Versatile video coding (VVC) aims to achieve high compression but also issues like varying content/network conditions. Existing rate control (RC) methods struggle to achieve optimal quality under these complex scenarios. This paper proposes a novel RC scheme for VVC based on a linear model. The Lagrange minimization multiplier is introduced under bit budget constraints, allowing optimized bit allocation
-
Identification Method of Unmanned Aerial Vehicle Graphical Control Strategy Based on Cloud Server Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-02-29 Zhengyu Liu, Zhenbang Cheng, Yu Liu, Qing Jiang
With the rapid development of unmanned aerial vehicle (UAV) technology, UAV has been widely used in agricultural plant protection, electric power inspection, security patrols, and other fields. However, the control system of the UAV is a complex human–computer interaction system, which requires higher requirements in practical applications. Due to differences in hardware design, software development
-
Research on Multi-Source Heterogeneous Big Data Fusion Method Based on Feature Level Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-02-29 Yanyan Chen, Chenxi Wang, Yuchen Zhou, Yuhang Zuo, Zixuan Yang, Hui Li, Juan Yang
With the development of research on multi-modal data fusion and its combination with online data management, the application of multi-modal big data fusion in information management systems is more and more extensive. How to integrate multi-modal big data effectively is the key technology to building an efficient information management system. In this paper, based on the combination of a multi-support
-
Head Pose Estimation Based on Multi-Level Feature Fusion Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-02-28 Chunman Yan, Xiao Zhang
Head Pose Estimation (HPE) has a wide range of applications in computer vision, but still faces challenges: (1) Existing studies commonly use Euler angles or quaternions as pose labels, which may lead to discontinuity problems. (2) HPE does not effectively address regression via rotated matrices. (3) There is a low recognition rate in complex scenes, high computational requirements, etc. This paper
-
Optimized Ensemble Machine Learning Approach for Emotion Detection from Thermal Images Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-02-22 Jayaprakash Katual, Amit Kaul
Emotions indicate the feelings of the individual which are linked with personal experiences, moods, and affective states. Detection of emotion can be helpful in many fields like maintaining a patient’s psychological well-being, surveillance, driver monitoring, etc. In this paper, an effective machine learning approach has been put forth for emotion detection where an ensemble of three out of five best-performing
-
A Novel Multi-Data-Augmentation and Multi-Deep-Learning Framework for Counting Small Vehicles and Crowds Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-02-20 Chun-Ming Tsai, Frank Y. Shih
Counting small pixel-sized vehicles and crowds in unmanned aerial vehicles (UAV) images is crucial across diverse fields, including geographic information collection, traffic monitoring, item delivery, communication network relay stations, as well as target segmentation, detection, and tracking. This task poses significant challenges due to factors such as varying view angles, non-fixed drone cameras
-
Medical Image Segmentation Using Grey Wolf-Based U-Net with Bi-Directional Convolutional LSTM Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-02-19 G. Tamilmani, CH. Phaneendra Varma, V. Brindha Devi, G. Ramesh Babu
In recent years, deep learning-based networks have been able to achieve state-of-the-art performance in medical image segmentation. U-Net, one of the currently available networks, has proven to be effective when applied to the segmentation of medical images. A Convolutional Neural Network’s (CNN) performance is heavily dependent on the network’s architecture and associated parameters. There are many
-
Hybrid Optimized Deep Learning-Based Bacilli Segmentation and Infection-Level Identification of Tuberculosis Using Sputum Images Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-02-14 P. Sathish, Preethi D, Clara Shanthi Dominic, G. Kadiravan
Presently, one of the foremost health issues and an extremely transferrable disease is Tuberculosis which is spreading worldwide. Tuberculosis is generally produced by mycobacterium tuberculosis and can cause death if it is not detected at premature stages. Therefore, a precise and efficient approach is essential for the identification of tuberculosis. The physical analysis of sputum smears through
-
Pelican Whale Optimization Enabled Deep Learning Framework for Video Steganography Using Arnold Transform-Based Embedding Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-02-14 G Suresh, G Manikandan, G Bhuvaneswari, P Shanthakumar
Steganography refers to hiding a secret message from various sources, such as images, videos, audio and so on. The advantage of steganography is to avoid data hacking in transmission medium during the transmission of information sources. Video steganography is superior to image steganography since the videos can hide a substantial quantity of secret messages more than the image. Hence, this research
-
Saliency and Depth-Aware Full Reference 360-Degree Image Quality Assessment Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-02-09 Xuekai Wei, Qunyue Huang, Bin Fang, Lei Ouyang, Weizhi Xian, Jun Luo, Huayan Pu, Xueyong Xu, Chang Lu, Hao Nan, Xu Liu, Yachao Li, Mingliang Zhou
With the widespread adoption of virtual reality and 360-degree video, there is a pressing need for objective metrics to assess quality in this immersive panoramic format reliably. However, existing image quality assessment models developed for traditional fixed-viewpoint content do not fully consider the specific perceptual issues involved in 360-degree viewing. This paper proposes a 360-degree image
-
LCSTR: Scene Text Recognition with Large Convolutional Kernels Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-02-09 Jiale Wang, Lina Yang, Jing Wang, Haoyan Yang, Lin Bai, Patrick Shen-Pei Wang, Xichun Li, Huiwu Luo, Huafu Xu
The task of scene text recognition involves processing information from two modalities: images and text, thereby requiring models to have the ability to extract features from images and model sequences simultaneously. Although linguistic knowledge greatly aids scene text recognition tasks, the extensive use of language models in sequence modeling and model prediction stages in recent years has made
-
A Gaze Estimation Method Based on Binocular Cameras Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-02-01 Zihan Wu, Changyuan Wang, Gang Sun, Zhen Fu
In recent years, multi-stream gaze estimation methods have become mainstream, which estimate gaze point by eye picture or combine with facial appearance, have achieved considerable accuracy. However, these methods based on a single camera fail to obtain accurate eye spatial position information. To address this issue, we propose a multi-stream gaze estimation model that incorporates spatial position
-
Boosting Multi-Label Classification Performance Through Meta-Model Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-01-31 Sonia Guehria, Habiba Belleili, Nabiha Azizi, Djamel Zenakhra
Multi-label classification problem, where each instance can be associated with multiple labels, has received considerable attention from machine learning community. To address the inherent challenges of multi-label classification including data imbalance, label dependence, and high dimensionality, ensemble approaches have been developed, gaining popularity across various real-world applications. This
-
All-Day Object Detection and Recognition for Blind Zones of Vehicles Using Deep Learning Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-01-31 Tsorng-Lin Chia, Pei-June Liu, Ping-Sheng Huang
The neglect of perception ability to the surrounding traffic conditions has always been the major cause of traffic accidents and the inattention to blind spots is the most important factor during driving. Existing solutions are facing the problems of using expensive equipment, wrong classification of the target object type, not suitable for nighttime, and incorrectly determining if the target object
-
Spatial Decomposition and Aggregation for Attention in Convolutional Neural Networks Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-01-31 Meng Zhu, Weidong Min, Hongyue Xiang, Cheng Zha, Zheng Huang, Longfei Li, Qiyan Fu
Channel attention has been shown to improve the performance of deep convolutional neural networks efficiently. Channel attention adaptively recalibrates the importance of each channel, determining what to attend to. However, channel attention only encodes inter-channel information but neglects the importance of positional information. Positional information is crucial in determining where to attend
-
Neural Network-Based Algorithm for Identification of Recaptured Images Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-01-29 Changming Liu, Yanjun Sun, Lin Deng, Yan Sun
With the improvement of digital image display technology, the “secondary imaging” caused by digital cameras is also gradually popularized, and the quality of the recaptured image formed by this imaging is also getting higher and higher, and this kind of high-quality fake image has caused great threat to digital images security. We propose a neural network-based recaptured image identification algorithm
-
A Locally Weighted Linear Regression-Based Approach for Arbitrary Moving Shaky and Nonshaky Video Classification Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-01-29 Arnab Halder, Palaiahnakote Shivakumara, Umapada Pal, Michael Blumenstein, Palash Ghosal
Classification and identification of objects are complex and challenging in pattern recognition and artificial intelligence if a shaky and nonshaky camera captures the videos at different distances during the day and nighttime. This work presents a model for classifying a given video as a static, uniform, or arbitrarily moving videos so that the complexity of the problem can be reduced. To avoid the
-
An End-to-End Video Coding Method via Adaptive Vision Transformer Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-01-29 Haoyan Yang, Mingliang Zhou, Zhaowei Shang, Huayan Pu, Jun Luo, Xiaoxu Huang, Shilong Wang, Huajun Cao, Xuekai Wei, Weizhi Xian
Deep learning-based video coding methods have demonstrated superior performance compared to classical video coding standards in recent years. The vast majority of the existing deep video coding (DVC) networks are based on convolutional neural networks (CNNs), and their main drawback is that since CNNs are affected by the size of the receptive field, they cannot effectively handle long-range dependencies
-
Transformer with a Parallel Decoder for Image Captioning Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-01-29 Peilang Wei, Xu Liu, Jun Luo, Huayan Pu, Xiaoxu Huang, Shilong Wang, Huajun Cao, Shouhong Yang, Xu Zhuang, Jason Wang, Hong Yue, Cheng Ji, Mingliang Zhou
In this paper, a parallel decoder and a word group prediction module are proposed to speed up decoding and improve the effect of captions. The features of the image extracted by the encoder are linearly projected to different word groups, and then a unique relaxed mask matrix is designed to improve the decoding speed and the caption effect. First, since image captioning is composed of many words, sentences
-
Deep Residual Network with Pelican Cuckoo Search for Traffic Sign Detection Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-01-29 T. Kumaravel, P. Natesan
The timely and precise discovery of traffic signs is considered an effective part of modeling automated vehicle driving. However, the dimension of traffic signs accounted for a lower ratio of input pictures which elevated the complexity of discovery. Hence, a new model is devised using faster region-based convolution neural network (faster R-CNN) traffic for detecting traffic signs. The Region of Interest
-
M2-YOLOX: A Novel Method for Object Detection Based on an Improved YOLOX Algorithm Introducing a Global Attention Mechanism and a Feature Enhancement Module Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2024-01-29 Xiaofeng Bai, Kaijun Wu, Chenshuai Bai
Deep learning-based algorithms for detecting objects in remote sensing images have produced excellent results recently. However, the target recognition and classification process of remote sensing images has problems such as dense targets, uneven distribution, large-scale changes and complex backgrounds. In order to improve the effectiveness of existing detection methods, based on the YOLOX algorithm
-
Multi-Scale Feature Refined Network for Human Pose Estimation Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-11-09 Qiaoning Yang, Xiaodong Ji, Xiuhui Yang
Occlusive keypoints has been a challenge for human pose estimation, especially the mutual occlusion of human bodies. One possible solution to this problem is to utilize multi-scale features, where small scale features are capable of identifying keypoints, while large-scale features can capture the relationship between keypoints. Feature fusion among multi-scale features allows for the exchange of information
-
Depth-Constrained Network for Multi-Scale Object Detection Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-08-24 Guohua Liu, Yijun Li
Challenges such as complex backgrounds, drastic variations in target scales, and dense distributions exist in natural scenes. Some algorithms optimize multi-scale object detection performance by combining low-level and high-level information through feature fusion strategies. However, these methods overlook the inherent spatial properties of objects and the relationships between foreground and background
-
Drug Toxicity Prediction by Machine Learning Approaches Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-08-24 Yucong Shen, Frank Y. Shih, Hao Chen
Drug property prediction, especially toxicity, helps reduce risks in a range of real-world applications. In this paper, we aim to apply various machine-learning models for solving the drug toxicity prediction problem. Among various machine-learning approaches, we select five suitable representatives: random forest, multi-layer perceptron, logistic regression, graph convolutional neural network, and
-
Counting with Self-Weighted Multi-Scale Fusion Networks Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-08-19 Xin Xiong, Jie Shen, Ying Li, Wei He, Peng Li, Wenjie Yan
Because of the large-scale variation, counting in scenes of different densities is an extremely difficult task. In this paper, based on the attention mechanism, we propose a new self-weighted multi-scale fusion network structure named SMFNet to solve the problem of multi-scale changes and can significantly improve the effect of crowd counting in monitoring scene. The proposed SMFNet uses VGG as the
-
A Novel Thanka Image Inpainting Method with Euler’s Elastica and Iterative Denoising and Backward Projections Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-08-08 Qiaoqiao Li, Weilan Wang
This paper presents a brand-new Thanka picture inpainting technique based on Euler’s elastica, iterative denoising, and backward projections (EEIDBP). Specifically, a model of Euler’s elastica is introduced to estimate the original observation due to its lower staircasing effects and better approximation of natural images. A method for backward projection and iterative denoising is applied to achieve
-
Copy-Move Forgery Detection and Localization Using Deep Learning Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-08-04 Fatemeh Zare Mehrjardi, Ali Mohammad Latif, Mohsen Sardari Zarchi
Forgery detection is one of the challenging subjects in computer vision. Forgery is performed using image manipulation with editor tools. Image manipulation tries to change the concept of the image but preserves the integrity of the texture and structure of the image as much as possible. Images are used as evidence in some applications, so if the images are manipulated, they will not be reliable. The
-
A Framework for Personalized Human Activity Recognition Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-08-01 Hasan Ali Eri̇ş, Mehmet Ali Ertürk, Muhammed Ali Aydın
In today’s world, Human Activity Recognition (HAR) through video streams is actively used in every aspect of our life, such as automated surveillance systems and sports statistics are computed according to the videos with the help of HAR. Activity detection is not a new subject, and several methods are available. However, the most recent and most promising techniques rely on Convolutional Neural Networks
-
Deepfake Speech Recognition and Detection Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-07-21 Hung-Chang Chang
Deepfake technology, especially deep voice, which has been derived from artificial intelligence in recent years, is potentially harmful, and the public is not yet wary. However, many speech synthesis models measure the degree of true restitution by Mean Opinion Rating (MOS), a subjective assessment of naturalness and quality of speech by human subjects, but in future it will be difficult to distinguish
-
An Adaptive Ant Colony Algorithm Based on Local Information Entropy to Solve Distributed Constraint Optimization Problems Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-07-21 Meifeng Shi, Shichuan Xiao, Xin Feng
As a meta-heuristic algorithm, the ant colony algorithm has been successfully used to solve various combinatorial optimization problems. However, the existing algorithm that takes the power of ants to solve distributed constraint optimization problems (ACO_DCOP) is easy to fall into local optima. To deal with this issue, this paper presents an adaptive ant colony algorithm based on local information
-
DOMOPT: A Detection-Based Online Multi-Object Pedestrian Tracking Network for Videos Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-07-21 Ruohong Huan, Shuaishuai Zheng, Chaojie Xie, Peng Chen, Ronghua Liang
Due to the problem of low tracking accuracy and weak tracking stability of current multi-object pedestrian tracking algorithms in complex scenes for videos, a Detection-based Online Multi-Object Pedestrian Tracking (DOMOPT) network is proposed. First, a Multi-Level Feature Fusion (MLFF) pedestrian detection network is proposed based on the Center and Scale Prediction (CSP) algorithm. The pyramid convolutional
-
Intelligent Inversion of Coastal Earth Resistivity Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-07-20 Bo Tan, Zhuohong Pan, Xuefang Tong, Yan Wang, Xianghan Wang, Lei Gao
Coastal grounding electrodes are currently an important means to alleviate land grounding electrode land constraints. In order to better invert the terrestrial geodesic resistivity in the coastal region, this paper proposes a complete set of inversion technology schemes. First, this paper proposes a layered land model for the coastal region, and a composite geodetic model is modeled by the fold junction
-
A Method on Classification and Recognition of Noisy Plant Images Based on Visual Domain Perception Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-07-19 Hongbiao Xie, Mingkun Feng, Zhijie Lin, Jiyi Wu, Zhe Feng
At present, some achievements have been made in the research of plant leaf classification such as the introduction of artificial intelligence algorithm. But there are still some problems. First, the existing achievements do not consider the subjective perception mechanism and role of human visual system in leaf classification data labels. Second, the implementation of the deep learning algorithm completely
-
Customized Information Extraction and Processing Pipeline for Commercial Invoices Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-07-17 Pierce Lai, Abhishek Mohan, Seok Kim, Jung Soo Victor Chu, Samuel Lee, Prabhakar Kafle, Patrick Wang
Extracting information from scanned invoices and other commercial documents, a critical component of corporate function, typically requires significant manual processing. Much research has been conducted in the field of automated information extraction and document processing to alleviate the manual resources used for document analysis, but resultant literature and commercially available products have
-
A Novel Sentimental Analysis for Response to Natural Disaster on Twitter Data Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-07-17 Sachin Minocha, Birmohan Singh
The response to a natural disaster ultimately depends on credible and real-time information regarding impacted people and areas. Nowadays, social media platforms such as Twitter have emerged as the primary and fastest means of disseminating information. Due to the massive, imprecise, and redundant information on Twitter, efficient automatic sentiment analysis (SA) plays a crucial role in enhancing
-
Deep Active Recognition through Online Cognitive Learning Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-07-14 Jing Yang, Wencang Zhao, Minghua Lu, Jincai Huang
Deep models need a large number of labeled samples to be trained. Furthermore, in practical application settings where objects’ features are added or changed over time, it is difficult and expensive to get enough labeled samples in the beginning. Cognitive learning mechanism can actively raise the deep models’ proficiency online with a few training labels gradually. In this paper, inspired by human
-
Quality Inspection of 3D Printed Tubular Tissue Based on Machine Vision Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-07-14 Xiaoyan Wu, Shu Wang
This study investigated the three-dimensional (3D) printing of tubular tissue, especially vascular tissue, using a self-developed 3D bioprinter platform and tubular tissue support frame system based on machine vision technology. A 3D printing quality inspection scheme for tubular tissue based on machine vision was proposed by combining the current advanced image acquisition sensor device and theoretical
-
Energy-Saving Strategy Based on Image Super-Resolution for Wireless Image Sensor Networks Assisted by Cloud Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-07-07 Yalin Nie, Lei Gong, Zeyu Sun
Wireless image sensor networks (WISNs) collect surveillance images, resulting in copious quantities of data requiring processing and transmission within the network. To reduce and balance energy expenditure during in-network image data processing and transmission, this study introduces an energy-saving strategy based on image super-resolution for WISNs assisted by cloud. The strategy constructs an
-
Research on Defect Detection Method of Nonwoven Fabric Mask Based on Machine Vision Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-07-07 Jingde Huang, Zhangyu Huang, Xin Zhan
During the production, transportation and storage of nonwoven fabric mask, there are many damages caused by human or nonhuman factors. Therefore, checking the defects of nonwoven fabric mask in a timely manner to ensure the reliability and integrity, which plays a positive role in the safe use of nonwoven fabric mask. At present, the wide application of machine vision technology provides a technical
-
Emotion Recognition from Facial Expression Using Hybrid CNN–LSTM Network Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-07-07 M. Mohana, P. Subashini, M. Krishnaveni
Facial Expression Recognition (FER) is a prominent research area in Computer Vision and Artificial Intelligence that has been playing a crucial role in human–computer interaction. The existing FER system focuses on spatial features for identifying the emotion, which suffers when recognizing emotions from a dynamic sequence of facial expressions in real time. Deep learning techniques based on the fusion
-
A Hybrid Method for Enhancement of Both Contrast Distorted and Low-Light Images Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-07-06 Nurullah Ozturk, Serkan Ozturk
Many different histogram equalization (HE)-based image enhancement methods have been developed to overcome the problems of low or high image brightness, contrast sensitivity, and difficulty in revealing details of dark areas under low-light environments. In this paper, a novel image enhancement method based on HE and adaptive gamma correction with weight distribution (AGCWD) is proposed for natural
-
Cross-Modal Interaction Network for Video Moment Retrieval Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-07-06 Shen Ping, Xiao Jiang, Zean Tian, Ronghui Cao, Weiming Chi, Shenghong Yang
The video moment retrieval task aims to fetch a target moment in an untrimmed video, which best matches the semantics of a sentence query. Existing methods mainly focus on utilizing two separate modules: one learns intra-modal relations to understand video and query contents, and the other explores inter-modal interactions to build a semantic bridge between video and language. However, intra-modal
-
A Wavelet Basis ANN and 5-Class Decision Factor AI Algorithm Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-07-06 Xuepeng Liu, Dongmei Zhao, Yihang Peng, Jianping Li
The accuracy and reliability of continuous space curve estimation is the key to global exploration. An improved artificial intelligence algorithm is proposed for continuous space analysis. First, a wavelet basis ANN algorithm is proposed to determine the discretization strategy in continuous space. The hidden layer node transfer function in a BP neural network is replaced by a wavelet basis function
-
Multi-Angle Models and Lightweight Unbiased Decoding-Based Algorithm for Human Pose Estimation Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-07-05 Jianghai He, Weitong Zhang, Ronghua Shang, Jie Feng, Licheng Jiao
When a top-down method is taken to the task of human pose estimation, the accuracy of joint point localization is often limited by the accuracy of human detection. In addition, conventional algorithms commonly encode the image to generate a heat map before processing, but the systematic error in decoding the heat map back to the original image has an impact on the positioning. Therefore, to address
-
3D PET/CT Tumor Co-Segmentation Based on Background Subtraction Hybrid Active Contour Model Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-06-30 Laquan Li, Chuangbo Jiang, Patrick Shen-Pei Wang, Shenhai Zheng
Accurate tumor segmentation in medical images plays an important role in clinical diagnosis and disease analysis. However, medical images usually have great complexity, such as low contrast of computed tomography (CT) or low spatial resolution of positron emission tomography (PET). In the actual radiotherapy plan, multimodal imaging technology, such as PET/CT, is often used. PET images provide basic
-
Integration of Deep Direction Distribution Feature Extraction and Optimized Attention Based Double Hidden Layer GRNN Models for Robust Cursive Handwriting Recognition Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-06-30 D. Manibharathi, C. Vasanthanayaki
Cursive handwriting recognition (CHWR) is an interesting area of research as it has a wide range of applications but lacks an accurate approach to provide better results due to its character shapes, the non-uniform spacing between words and within a word, diverse placements of dots, and diacritics, and very low inter-class variation among individual classes. A novel CHWR model is proposed to enhance
-
Evaluation Quality of Chinese Baijiu Using GC–MS Based on SPCA and Neural Network Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-06-28 Mingju Chen, Anle Cui, Zhengxu Duan, Xingzhong Xiong
Currently, evaluating the quality of strong-flavor Baijiu (SFB) heavily relies on subjective sensory analysis, resulting in large deviations in evaluation. However, as there are no existing evaluation criteria for SFB quality, this study aimed to extract trace components and design an evaluation model using gas chromatography–mass spectrometry (GC–MS). First, the key component data was analyzed using
-
CLS-Net: An Action Recognition Algorithm Based on Channel-Temporal Information Modeling Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-06-27 Mengfan Xue, Jiannan Zheng, Tao Li, Dongliang Peng
The modeling of channel and temporal information is of crucial importance for action recognition tasks. To build a high-performance action recognition network by effectively capturing channel and temporal information, we propose CLS-Net: an action recognition algorithm based on channel-temporal information modeling. The proposed CLS-Net characterizes channel and temporal information by inserting multiple
-
Image Resizing for Object Detection: A Learnable Downsampler–Upsampler Pair with Differentiable Image Entropy Estimation Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-06-16 Chengjie Dai, Qiang Chen, Jingchao Xu, Hanshen Gong, Guanghua Song, Bowei Yang
In recent years, super-resolution neural networks have achieved good results in restoring super-resolution images from low-resolution ones. However, most subsequent tasks based on super-resolution images such as object detection are done by the computer. Considering this situation, we propose a learnable downsampler–upsampler pair, which can realize both the downscaling process and the upscaling process
-
Nondestructive Detection of Coal–Rock Interface Under Mining Environment Using Ground Penetrating Radar Image Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-06-14 Xin Wang, Duan Zhao, Yikun Wang
Shearer drum automatic height adjustment strategy under mining environment is based on the recognition of coal–rock interface and the ground penetrating radar (GPR) was used for coal–rock interface recognition in the study. First, a model was built to study the radar echo in complex coal seam and some simulations were made to study the influence of radar parameters. Second, the experiment study was
-
An Efficient and Accurate Cross-Domain Object Detection Method Using One-Level Feature and Domain Adaptation Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-06-12 Tianyuan Zhang, Xudong Song, Chen Zhu, Pan Liang, Jialiang Sun, Shuo Wang, Yunxian Cui, Changxian Li
To improve the accuracy of cross-domain object detection, the existing unsupervised domain adaptation (UDA) object detection methods mostly use Feature Pyramid Network (FPN), multiple Region Proposal Network (RPN), and multiple domain classifier, but these methods lead to complex network structures, slow model convergence, and low detection efficiency. To solve the above problems, this paper proposes
-
Prediction of Photovoltaic Output Power Based on Match Degree and Entropy Weight Method Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-06-10 Weiqiang Liao, Shixian Lin
In this study, an algorithm, which had combined the match degree and entropy weighting method, was proposed to predict the efficiency of the photovoltaic output power. First, the key characteristic quantities were selected by the correlation analyses of output power history data and meteorological data of the photovoltaic power generation system. Second, according to the Euclidean distances between
-
A Framework for Distributed Feature Selection Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-06-10 Mona Sharifnezhad, Mohsen Rahmani, Hossein Ghaffarian
Many current multivariate filter feature selection approaches consider redundancy and relevance between features and class vectors simultaneously. However, these multivariate filter algorithms calculate the suitability of features by only the intrinsic characteristics of the data. In this paper, we suggest a new distributed framework to offset the multivariate feature selection problem. We propose
-
Intelligent Control Knowledge-Based System for Cleaning Device of Rice–Wheat Combine Harvester Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-05-31 Qing Jiang, Yang Liu, Xiancun Zhou, Yadong Yang, Jing Zhang, Bin Jiang, Demei Mao, Yang Yang, Yuan Fu
In this paper, the operation process of cleaning of intelligent rice–wheat combine harvester is divided into two key steps: initial setting of cleaning device operation parameters and dynamic control of cleaning device operation parameters. Combined with the operation experience of cleaning control of agricultural machinery operators, the dynamic control knowledge-based system of cleaning device operation
-
The Lightweight Count System of Intensive Jellyfish Based on Deep Learning Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-05-29 Yun Jin, Haidong Zhang, Jiaxin Li, Weihong Bi
The number of jellyfish outbreaks is on the rise around the world, and they have been considered a serious ecological disaster. As part of the emergency response plan for jellyfish disasters, in-situ detection research that can distinguish jellyfish species and quantities is urgently required to support accurate data collection. As a typical fully supervised regression task, counting is usually regarded
-
A Robust Framework for Severity Detection of Knee Osteoarthritis Using an Efficient Deep Learning Model Int. J. Pattern Recognit. Artif. Intell. (IF 1.5) Pub Date : 2023-05-25 Rabbia Mahum, Aun Irtaza, Mohammed A. El-Meligy, Mohamed Sharaf, Iskander Tlili, Saamia Butt, Asad Mahmood, Muhammad Awais
With the changing lifestyle, a large population suffers from a bone disease known as an osteoarthritis affecting the knee, spine, and hip. Therefore, timely detection and classification of the disease are necessary to minimize the loss, however, it is a time-consuming task and requires various tests and physicians’ in-depth analysis. Thus, an accurate automated technique, timely detection and classification