-
Real-time adaptive fuzzy density clustering for multi-target data association Intell. Data Anal. (IF 0.651) Pub Date : 2021-01-26 Mousa Nazari; Saeid Pashazadeh
The problem of data association for tracking multiple targets based on using the ship-borne radar is addressed in this study. A robust fuzzy density clustering algorithm is proposed, that contains three steps. At first, a customized form of adaptive density clustering is used to determine valid measurements for each target’s state. In the second step, the degree of fuzzy membership for each valid measurement
-
A lazy feature selection method for multi-label classification Intell. Data Anal. (IF 0.651) Pub Date : 2021-01-26 Rafael B. Pereira; Alexandre Plastino; Bianca Zadrozny; Luiz H.C. Merschmann
In many important application domains, such as text categorization, biomolecular analysis, scene or video classification and medical diagnosis, instances are naturally associated with more than one class label, giving rise to multi-label classification problems. This has led, in recent years, to asubstantial amount of research in multi-label classification. More specifically, feature selection methods
-
A novel approach to fully representing the diversity in conditional dependencies for learning Bayesian network classifier Intell. Data Anal. (IF 0.651) Pub Date : 2021-01-26 Limin Wang; Peng Chen; Shenglei Chen; Minghui Sun
Bayesian network classifiers (BNCs) have proved their effectiveness and efficiency in the supervised learning framework. Numerous variations of conditional independence assumption have been proposed to address the issue of NP-hard structure learning of BNC. However, researchers focus on identifyingconditional dependence rather than conditional independence, and information-theoretic criteria cannot
-
Classification from positive and unlabeled data based on likelihood invariance for measurement Intell. Data Anal. (IF 0.651) Pub Date : 2021-01-26 Takeshi Yoshida; Takashi Washio; Takahito Ohshiro; Masateru Taniguchi
Abstract We propose novel approaches for classification from positive and unlabeled data (PUC) based on maximum likelihood principle. These are particularly suited to measurement tasks in which the class prior of the target object in each measurement is unknown and significantly different from the class prior used for training, while the likelihood function representing the observation process is invariant
-
Energy modeling of Hoeffding tree ensembles Intell. Data Anal. (IF 0.651) Pub Date : 2021-01-26 Eva García-Martín; Albert Bifet; Niklas Lavesson
Abstract Energy consumption reduction has been an increasing trend in machine learning over the past few years due to its socio-ecological importance. In new challenging areas such as edge computing, energy consumption and predictive accuracy are key variables during algorithm design and implementation. State-of-the-art ensemble stream mining algorithms are able to create highly accurate predictions
-
Comparative study on credit card fraud detection based on different support vector machines Intell. Data Anal. (IF 0.651) Pub Date : 2021-01-26 Chenglong Li; Ning Ding; Yiming Zhai; Haoyun Dong
Credit card fraud is the new financial fraud crime accompanied by the gradual development of the economy which causes billions of dollars of losses every year. Credit card fraud case not only seriously violated the cardholder benefits and financial institutions, but also undermined the credit management order. However, fraudsters keep exploring new crime strategies constantly which exacerbates the
-
Using the beta distribution technique to detect attacked items from collaborative filtering Intell. Data Anal. (IF 0.651) Pub Date : 2021-01-26 Ping-Yu Hsu; Jui-Yi Chung; Yu-Chin Liu
A recommendation system is based on the user and the items, providing appropriate items to the user and effectively helping the user to find items that may be of interest. The most commonly used recommendation method is collaborative filtering. However, in this case, the recommendation system willbe injected with false data to create false ratings to push or nuke specific items. This will affect the
-
Efficient facial expression recognition based on convolutional neural network Intell. Data Anal. (IF 0.651) Pub Date : 2021-01-26 Yongxiang Cai; Jingwen Gao; Gen Zhang; Yuangang Liu
The goal of research in Facial Expression Recognition (FER) is to build a robust and strong recognizability model. In this paper, we propose a new scheme for FER systems based on convolutional neural network. Part of the regular convolution operation is replaced by depthwise separable convolution to reduce the number of parameters and the computational workload; the self-adaption joint loss function
-
Using optimized statistical distances to confront distributed denial of service attacks in software defined networks Intell. Data Anal. (IF 0.651) Pub Date : 2021-01-26 Mozhgan Ghasabi; Mahmood Deypir
Software-defined networks (SDN) are an emerging architecture that provides promising amends to put an end to current infrastructure constraints by optimized bandwidth utilization, flexibility in network management and configuration, and pulling down operating costs in traditional network structures. Despite the advantages of this architecture, SDNs may become the victim of a distributed denial of service
-
Entropy difference and kernel-based oversampling technique for imbalanced data learning Intell. Data Anal. (IF 0.651) Pub Date : 2020-12-18 Xu Wu; Youlong Yang; Lingyu Ren
Class imbalance is often a problem in various real-world datasets, where one class contains a small number of data and the other contains a large number of data. It is notably difficult to develop an effective model using traditional data mining and machine learning algorithms without using data preprocessing techniques to balance the dataset. Oversampling is often used as a pretreatment method for
-
Sentiment analysis via dually-born-again network and sample selection Intell. Data Anal. (IF 0.651) Pub Date : 2020-12-18 Pinlong Zhao; Zefeng Han; Qing Yin; Shuxiao Li; Ou Wu
Text sentiment analysis is an important natural language processing (NLP) task and has received considerable attention in recent years. Numerous deep-learning based methods have been proposed in previous literature in terms of new deep neural networks (DNN) including new embedding strategies, new attention mechanisms, and new encoding layers. In this study, an alternative technical path is investigated
-
Multiclass spectral feature scaling method for dimensionality reduction Intell. Data Anal. (IF 0.651) Pub Date : 2020-12-18 Momo Matsuda; Keiichi Morikuni; Akira Imakura; Xiucai Ye; Tetsuya Sakurai
Irregular features disrupt the desired classification. In this paper, we consider aggressively modifying scales of features in the original space according to the label information to form well-separated clusters in low-dimensional space. The proposed method exploits spectral clustering to derive scaling factors that are used to modify the features. Specifically, we reformulate the Laplacian eigenproblem
-
Explainable and unexpectable recommendations using relational learning on multiple domains Intell. Data Anal. (IF 0.651) Pub Date : 2020-12-18 Sirawit Sopchoke; Ken-ichi Fukui; Masayuki Numao
In this research, we combine relational learning with multi-domain to develop a formal framework for a recommendation system. The design of our framework aims at: (i) constructing general rules for recommendations, (ii) providing suggested items with clear and understandable explanations, (iii) delivering a broad range of recommendations including novel and unexpected items. We use relational learning
-
Plant leaf recognition with shallow and deep learning: A comprehensive study Intell. Data Anal. (IF 0.651) Pub Date : 2020-12-18 Jozsef Suto
Nowadays there are hundreds of thousands known plant species on the Earth and many are still unknown yet. The process of plant classification can be performed using different ways but the most popular approach is based on plant leaf characteristics. Most types of plants have unique leaf characteristics such as shape, color, and texture. Since machine learning and vision considerably developed in the
-
Hybrid recommendation model based on deep learning and Stacking integration strategy Intell. Data Anal. (IF 0.651) Pub Date : 2020-12-18 Xiaolan Xie; Shantian Pang; Jili Chen
In the traditional recommendation algorithms, due to the rapid development of deep learning and Internet technology, user-item rating data is becoming increasingly sparse. The simple inner product interaction mode adopted by the collaborative filtering method has a cold start problem and cannot learn the complex nonlinear structural features between users and items, while the content-based algorithm
-
Reinforcement learning based metric filtering for evolutionary distance metric learning Intell. Data Anal. (IF 0.651) Pub Date : 2020-12-18 Bassel Ali; Koichi Moriyama; Wasin Kalintha; Masayuki Numao; Ken-Ichi Fukui
Data collection plays an important role in business agility; data can prove valuable and provide insights for important features. However, conventional data collection methods can be costly and time-consuming. This paper proposes a hybrid system R-EDML that combines a sequential feature selection performed by Reinforcement Learning (RL) with the evolutionary feature prioritization of Evolutionary Distance
-
Parameter evolution of the classifiers for disease diagnosis with offline data-driven hybrid systems Intell. Data Anal. (IF 0.651) Pub Date : 2020-12-18 Madhu Sudana Rao Nalluri; Kannan K; Xiao-Zhi Gao; Swaminathan V; Diptendu Sinha Roy
Automatic disease diagnosis is, in essence, a classification problem where the classifier has to be trained based on patients’ datasets and not entirely on doctors’ expert knowledge. In this paper, we present the design of such data-driven disease classifiers and fine-tuning classifier performanceby a multi-objective evolutionary algorithm. We have used sequential minimal optimization (SMO) classifier
-
A hybrid deep learning model for predicting and targeting the less immunized area to improve childrens vaccination rate Intell. Data Anal. (IF 0.651) Pub Date : 2020-12-18 G. Mohanraj; V. Mohanraj; J. Senthilkumar; Y. Suresh
There has been a major and rising interest in India for increasing vaccination rate among peoples to make the nation healthier and safer. In this paper, a new hybrid deep learning model is proposed to predict and target vaccination rates in the less immunized regions. The Rank-Based Multi-Layer Perceptron (R-MLP) hybrid deep learning framework uses the data collected from the recently updated District
-
Bayesian hierarchical K-means clustering Intell. Data Anal. (IF 0.651) Pub Date : 2020-09-30 Yue Liu; Bufang Li
Clustering algorithm is the foundation and important technology in data mining. In fact, in the real world, the data itself often has a hierarchical structure. Hierarchical clustering aims at constructing a cluster tree, which reveals the underlying modal structure of a complex density. Due to itsinherent complexity, most existing hierarchical clustering algorithms are usually designed heuristically
-
Canal-LASSO: A sparse noise-resilient online linear regression model Intell. Data Anal. (IF 0.651) Pub Date : 2020-09-30 Hejie Lei; Xingke Chen; Ling Jian
Least absolute shrinkage and selection operator (LASSO) is one of the most commonly used methods for shrinkage estimation and variable selection. Robust variable selection methods via penalized regression, such as least absolute deviation LASSO (LAD-LASSO), etc., have gained growing attention in works of literature. However those penalized regression procedures are still sensitive to noisy data. Furthermore
-
Boosting meta-learning with simulated data complexity measures Intell. Data Anal. (IF 0.651) Pub Date : 2020-09-30 Luís P.F. Garcia; Adriano Rivolli; Edesio Alcoba; Ana C. Lorena; André C.P.L.F. de Carvalho
Meta-Learning has been largely used over the last years to support the recommendation of the most suitable machine learning algorithm(s) and hyperparameters for new datasets. Traditionally, a meta-base is created containing meta-features extracted from several datasets along with the performance ofa pool of machine learning algorithms when applied to these datasets. The meta-features must describe
-
A novel algorithm for searching frequent gradual patterns from an ordered data set Intell. Data Anal. (IF 0.651) Pub Date : 2020-09-30 Jerry Lonlac; Engelbert Mephu Nguifo
Mining frequent simultaneous attribute co-variations in numerical databases is also called frequent gradual pattern problem. Few efficient algorithms for automatically extracting such patterns have been reported in the literature. Their main difference resides in the variation semantics used. However in applications with temporal order relations, those algorithms fail to generate correct frequent gradual
-
Could spatial features help the matching of textual data? Intell. Data Anal. (IF 0.651) Pub Date : 2020-09-30 Jacques Fize; Mathieu Roche; Maguelonne Teisseire
Textual data is available to an increasing extent through different media (social networks, companies data, data catalogues, etc.). New information extraction methods are needed since these new resources are highly heterogeneous. In this article, we propose a text matching process based on spatialfeatures and assessed through heterogeneous textual data. Besides being compatible with heterogeneous data
-
Recognition of speech emotion using custom 2D-convolution neural network deep learning algorithm Intell. Data Anal. (IF 0.651) Pub Date : 2020-09-30 Kudakwashe Zvarevashe; Oludayo O. Olugbara
Speech emotion recognition has become the heart of most human computer interaction applications in the modern world. The growing need to develop emotionally intelligent devices has opened up a lot of research opportunities. Most researchers in this field have applied the use of handcrafted featuresand machine learning techniques in recognising speech emotion. However, these techniques require extra
-
An efficient Bayesian network structure learning algorithm using the strategy of two-stage searches Intell. Data Anal. (IF 0.651) Pub Date : 2020-09-30 Huiping Guo; Hongru Li
It is important for Bayesian network (BN) structure learning, a NP-problem, to improve the accuracy and hybrid algorithms are a kind of effective structure learning algorithms at present. Most hybrid algorithms adopt the strategy of one heuristic search and can be divided into two groups: one heuristic search based on initial BN skeleton and one heuristic search based on initial solutions. The former
-
Fourier neural networks: A comparative study Intell. Data Anal. (IF 0.651) Pub Date : 2020-09-30 Malika Uteuliyeva; Abylay Zhumekenov; Rustem Takhanov; Zhenisbek Assylbekov; Alejandro J. Castro; Olzhas Kabdolov
We review neural network architectures which were motivated by Fourier series and integrals and which are referred to as Fourier neural networks. These networks are empirically evaluated in synthetic and real-world tasks. Neither of them outperforms the standard neural network with sigmoid activation function in the real-world tasks. All neural networks, both Fourier and the standard one, empirically
-
Spatial-time motifs discovery Intell. Data Anal. (IF 0.651) Pub Date : 2020-09-30 Heraldo Borges; Murillo Dutra; Amin Bazaz; Rafaelli Coutinho; Fábio Perosi; Fábio Porto; Florent Masseglia; Esther Pacitti; Eduardo Ogasawara
Discovering motifs in time series data has been widely explored. Various techniques have been developed to tackle this problem. However, when it comes to spatial-time series, a clear gap can be observed according to the literature review. This paper tackles such a gap by presenting an approach to discover and rank motifs in spatial-time series, denominated Combined Series Approach (CSA). CSA is based
-
An outlier ensemble for unsupervised anomaly detection in honeypots data Intell. Data Anal. (IF 0.651) Pub Date : 2020-07-15 Lynda Boukela; Gongxuan Zhang; Samia Bouzefrane; Junlong Zhou
Nowadays, computers, as well as smart devices, are connected through communication networks making them more vulnerable to attacks. Honeypots are proposed as deception tools but usually used as part of a proactive defense strategy. Hence, this article demonstrates how honeypots data can be analyzedin an active defense strategy. Furthermore, anomaly detection based on unsupervised machine learning techniques
-
An advanced profile hidden Markov model for malware detection Intell. Data Anal. (IF 0.651) Pub Date : 2020-07-15 Alireza Abbas Alipour; Ebrahim Ansari
The rapid growth of malicious software (malware) production in recent decades and the increasing number of threats posed by malware to network environments, such as the Internet and intelligent environments, emphasize the need for more research on the security of computer networks in information security and digital forensics. The method presented in this study identifies “species” of malware families
-
Two deterministic selection methods for the initial centers in fuzzy c-means based algorithms Intell. Data Anal. (IF 0.651) Pub Date : 2020-07-15 Liliane R. da Silva; Heloina A. Arnaldo; Huliane da Silva; Ronildo Moura; Benjamín Bedregal; Anne Magaly de P. Canuto
Fuzzy C-Means (FCM) is the most commonly used and discussed fuzzy clustering algorithm in the literature. Nevertheless, it is well known that the performance of FCM is strongly affected by the selection of the initial cluster centers. In other words, the selection of a good set of initial cluster centers plays an important role in the performance of this algorithm. The most common selection method
-
RobustRepStream: Robust stream clustering using self-controlled connectivity graph Intell. Data Anal. (IF 0.651) Pub Date : 2020-07-15 Ross Callister; Mihai Lazarescu; Duc-Son Pham
A major challenge in stream clustering is the evolution in the statistical properties of the underlying data. As clustering is inherently unsupervised, selecting suitable parameter values is often difficult. Clustering algorithms with sensitive parameters are often not robust to such changes, leading to poor clustering outputs. Algorithms using K-NN graphs face this problem, as they have a sensitive
-
An efficient algorithm for hiding sensitive-high utility itemsets Intell. Data Anal. (IF 0.651) Pub Date : 2020-07-15 Vy Huynh Trieu; Hai Le Quoc; Chau Truong Ngoc
Privacy-preserving utility itemset mining is the process of hiding sensitive-high utility itemsets (SHUIs) appearing in original database such that they will not be discovered in the sanitized database. The purpose of SHUI hiding algorithm is to conceal the set of SHUIs while minimizing the side effects which caused by data distortion process. In this paper, a novel algorithm, named EHSHUI (An Efficient
-
A hybrid node classification mechanism for influential node prediction in Social Networks Intell. Data Anal. (IF 0.651) Pub Date : 2020-07-15 M. Prakash; P. Pabitha
Social Networks is an essential phenomenon in all aspects through various perspectives. These networks contain a large number of users better termed as nodes and the connections between the users termed as edges. For efficient information processing and retrieving, accessing the influential node isessential for improving the diffusion process. To identify the influential node inside a heterogeneous
-
Feature learning for representing sparse networks based on random walks Intell. Data Anal. (IF 0.651) Pub Date : 2020-07-15 Thanh Le; Giang Tran; Bac Le
Identifying features to represent graphs such as social networks, protein graphs is increasingly common in both research and business communities, thanks to the fact that data has increased not only in quantity but also in complexity. This results in the graphs to be sparser because not all nodes are fully connected. In addition, if this whole graph is used as input data for learning algorithms e.g
-
A comparative analysis of Bayesian network structure learning algorithms applied to crime data Intell. Data Anal. (IF 0.651) Pub Date : 2020-07-15 Dalton Ieda Fazanaro; Helio Pedrini
The theories about crime and correction have their inception in the eighteenth century, highly influenced by the anthropological thoughts emerging during the age of Enlightenment. Throughout the decades, the criminological studies observed their sociological essence encompassing practices from other scientific fields to explain the more contemporary questions, becoming Criminology an inherently interdisciplinary
-
A novel adaptive k-NN classifier for handling imbalance: Application to brain MRI Intell. Data Anal. (IF 0.651) Pub Date : 2020-07-15 Ritaban Kirtania; Sushmita Mitra; B. Uma Shankar
The problem of efficiently classifying imbalanced data has become one of the most challenging tasks in machine learning. Some real world examples include medical image analysis, fraud detection, fault diagnosis, and anomaly detection. Although several data-level algorithms have been developed to address imbalance, they are typically subject to some restrictions. We propose a novel variant of the k-NN
-
Distributed frequent subgraph mining on evolving graph using SPARK Intell. Data Anal. (IF 0.651) Pub Date : 2020-05-21 N. Senthilselvan; V. Subramaniyaswamy; V. Vijayakumar; Hamid Reza Karimi; N. Aswin; Logesh Ravi
Within the graph mining context, frequent subgraph identification plays a key role in retrieving required information or patterns from the huge amount of data in a short period. The problem of finding frequent items in traditional mining changed to the innovation of subgraphs that recurrently occurs in graph datasets containing a single huge graph. Majority of the existing methods target static graphs
-
Online analytical processsing on graph data Intell. Data Anal. (IF 0.651) Pub Date : 2020-05-21 Leticia Gómez; Bart Kuijpers; Alejandro Vaisman
Online Analytical Processing (OLAP) comprises tools and algorithms that allow querying multidimensional databases. It is based on the multidimensional model, where data can be seen as a cube such that each cell contains one or more measures that can be aggregated along dimensions. In a “Big Data” scenario, traditional data warehousing and OLAP operations are clearly not sufficient to address current
-
Unsupervised learning of textual pattern based on Propagation in Bipartite Graph Intell. Data Anal. (IF 0.651) Pub Date : 2020-05-21 Thiago de Paulo Faleiros; Alan Valejo; Alneu de Andrade Lopes
Graph-based algorithms have aroused considerable interests in recent years by facilitating pattern recognition and learning via information propagation process through the graph. Here, we propose an unsupervised learning algorithm based on propagation on bipartite graph, referred to as Propagationin Bipartite Graph (PBG) algorithm. The contributions of this approach are threefold: 1) we present an
-
Estimating a one-class naive Bayes text classifier Intell. Data Anal. (IF 0.651) Pub Date : 2020-05-21 Yihong Zhang; Adam Jatowt
Nowadays more and more information extraction projects need to classify large amounts of text data. The common way to classify text is to build a supervised classifier trained on human-labeled positive and negative examples. In many cases, however, it is easy to label positive examples, but hard tolabel negative examples. In this paper, we address the problem of building a one-class classifier when
-
An efficient and robust bat algorithm with fusion of opposition-based learning and whale optimization algorithm Intell. Data Anal. (IF 0.651) Pub Date : 2020-05-21 Jinkun Luo; Fazhi He; Jiashi Yong
Bat algorithm (BA) has the advantage of fast convergence, but there is still room for improvement in accuracy and stability of solution. An efficient and robust fusion bat algorithm (ERFBA) is proposed to overcome these defects. In the population reconstruction, an effective diversity population (EDP) is reconstructed by designing a multi-strategy opposition-based learning with disturbance. In the
-
An active learning ensemble method for regression tasks Intell. Data Anal. (IF 0.651) Pub Date : 2020-05-21 Nikos Fazakis; Georgios Kostopoulos; Stamatis Karlos; Sotiris Kotsiantis; Kyriakos Sgarbas
Active learning is a typical approach for learning from both labeled and unlabeled examples aiming to build efficient and accurate predictive models at minimum expense under an expert’s guidance. Since there is a lack of labeled data in many scientific fields whilst, at the same time, the labelingcost of unlabeled data is typically high in terms of time and expenditure, active learning has grown rapidly
-
An improvement of SAX representation for time series by using complexity invariance Intell. Data Anal. (IF 0.651) Pub Date : 2020-05-21 Xuan-May Thi Le; Tuan Minh Tran; Hien T. Nguyen
In the area of time series data mining, a challenging task is to design an effectively and efficiently low-dimensional representation of high-dimensional time series data. Such an effective and efficient representation is important for dimensionality reduction of time series while preserving the core information embedded in the original one. Among popular representations of time series, Symbolic Aggregate
-
Extraction of qualitative behavior rules for industrial processes from reduced concept lattice Intell. Data Anal. (IF 0.651) Pub Date : 2020-05-21 Sérgio M. Dias; Luis E. Zárate; Mark A.J. Song; Newton J. Vieira; Ch. Aswani Kumar
Formal concept analysis (FCA) become an alternative approach to extract and represent knowledge of real world systems. That knowledge can be obtained from implications rules extracted of concept lattices formed by ordered formal concepts. However, in complex systems the number of formal concepts can be large. To deal with this complexity of the FCA, concept reduction techniques can be applied in order
-
Analyzing concept drift: A case study in the financial sector Intell. Data Anal. (IF 0.651) Pub Date : 2020-05-21 Andrés R. Masegosa; Ana M. Martínez; Darío Ramos-López; Helge Langseth; Thomas D. Nielsen; Antonio Salmerón
In this paper, we present a method for exploratory data analysis of streaming data based on probabilistic graphical models (latent variable models). This method is illustrated by concept drift tracking, using financial client data from a European regional bank. For this particular setting, the analyzed data spans the period from April 2007 to March 2014 and therefore starts before the beginning of
-
Domain sentiment dictionary construction and optimization based on multi-source information fusion Intell. Data Anal. (IF 0.651) Pub Date : 2020-03-27 Zuo Chen; Xin Li; Min Wang; Shenggang Yang
Sentiment analysis of text data, such as reviews, can help users and merchants make more favorable decisions. It is difficult to use the popular supervised learning method to complete the sentiment classification task because marking data manually is time-consuming and laborious. Unsupervised sentiment classification methods are mostly based on sentiment lexicons. The existing sentiment lexicons are
-
A cross-lingual sentiment topic model evolution over time Intell. Data Anal. (IF 0.651) Pub Date : 2020-03-27 Ibrahim Hussein Musa; Kang Xu; Feng Liu; Ibrahim Zamit; Waheed Ahmed Abro; Guilin Qi
Sentiment analysis in various languages has been a hot research topic with several applications. Most of the existing models have been reported to work well with widely used language. Were the lass directly applying these models to poor-quality corpora often leads to low results. Thus, to deal withthese shortcoming we propose a cross-lingual sentiment topic model evolution over time (CLSTOT) which
-
A comparison study on nonlinear dimension reduction methods with kernel variations: Visualization, optimization and classification Intell. Data Anal. (IF 0.651) Pub Date : 2020-03-27 Katherine C. Kempfert; Yishi Wang; Cuixian Chen; Samuel W.K. Wong
Because of high dimensionality, correlation among covariates, and noise contained in data, dimension reduction (DR) techniques are often employed to the application of machine learning algorithms. Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), and their kernel variants (KPCA, KLDA) are among the most popular DR methods. Recently, Supervised Kernel Principal Component Analysis
-
Hybridization of population-based ant colony optimization via data mining Intell. Data Anal. (IF 0.651) Pub Date : 2020-03-27 Zeynep Adak; Ayhan Demiriz
We propose a hybrid application of Population Based Ant Colony Optimization that uses a data mining procedure to wisely initialize the pheromone entries. Hybridization of metaheuristics with data mining techniques has been studied by several researchers in recent years. In this line of research, frequent patterns in a number of initial high-quality solutions are extracted to guide the subsequent iterations
-
An improved opposition based learning firefly algorithm with dragonfly algorithm for solving continuous optimization problems Intell. Data Anal. (IF 0.651) Pub Date : 2020-03-27 Mehdi Abedi; Farhad Soleimanian Gharehchopogh
Nowadays, the existence of continuous optimization problems has led researchers to come up with a variety of methods to solve continues optimization problems. The metaheuristic algorithms are one of the most popular and common ways to solve continuous optimization problems. Firefly Algorithm (FA) is a successful metaheuristic algorithm for solving continuous optimization problems; however, although
-
Mining sequences in activities for time use analysis Intell. Data Anal. (IF 0.651) Pub Date : 2020-03-27 Jorge Rosales-Salas; Sebastián Maldonado; Alex Seret
By providing a complete record of time use for a given population, time use studies enable investigators to test various hypotheses concerning that behavior. However, the large number and variety of activity combinations that are relevant in time allocation choices and, therefore, time use analysis, makes measuring or even fully identifying all of them impossible without the proper data mining tools
-
An intrusion detection method based on active transfer learning Intell. Data Anal. (IF 0.651) Pub Date : 2020-03-27 Jingmei Li; Weifei Wu; Di Xue
Intrusion detection plays a very important role in the field of network security. In order to improve the intrusion detection rate, intrusion detection algorithms based traditional machine learning are widely used in this field. These methods generally satisfy the following two assumptions: the training and the testing data must be under the condition of the independent and identical distribution;
-
Efficient heuristics for learning Bayesian network from labeled and unlabeled data Intell. Data Anal. (IF 0.651) Pub Date : 2020-03-27 Zhiyi Duan; Limin Wang; Minghui Sun
Bayesian network classifiers (BNCs) are powerful tools to mine statistical knowledge from data and infer under conditions of uncertainty. However, most of the traditional BNCs focus on mining the dependency relationships existed in labeled data while neglecting the information hidden in unlabeled data, which may result in the biased decision boundaries. To address this issue, we introduce a new order-based
-
Biased transfer matching for less overlapping degree for unsupervised domain adaptation Intell. Data Anal. (IF 0.651) Pub Date : 2020-03-27 Yiran Wen; Xiu Cao; Xueping Wang; Fangyuan Liu
Domain adaptation is an important branch of transfer learning. Previous studies have always taken efforts to minimize the optimization goal, but they neglect the relative quality of features or instances. For example, a classic work treats different instances equally in a degree and chooses these instances which minimize the optimization function value. This method will discard these instances that
Contents have been reproduced by permission of the publishers.