Abstract
The prompt enhancement of Telecom turned to be a vibrant and economical industry, which comprises an intrinsically great perspective for customer churn, requiring exact churn prediction models. In recent times, there has been phenomenal responsiveness in the development of feature selection methods for a large number of datasets. Through this research work, a High Relevancy and Low Redundancy (HRLR) approach by consuming Vague Set (VS) has proposed for selecting the subset of features from the features set. This proposed method is based on the Minimum Redundancy and Maximum Relevancy (MRMR) approach by using Vague Set. The proposed HRLR-VS method is based on the filtered approach feature selection, where the features are selected only when the measure of feature-class relevancy is maximized and a measure of feature-feature redundancy is minimized. The collaboration of similarity measures and ranking algorithms are prepared by utilizing the vital notions of Vague Sets information energies by Information Gain, Gain Ratio, and Chi-Square methods. The projected approach has been employed with the Particle Swarm Optimization for probing the best feature subset. Further, it measures the efficacy of the projected approach HRHL-VS for telecommunication dataset. The performance metrics like Accuracy, Kappa Statistics, True Positive Rate, Precision, F-Measure, Recall, MAE, RRSE, RMSE and RAE are considered in this paper for evaluating the proposed HRLR-VS method. The proposed HRRL-VS method has compared with existing literature approaches like mRMR and FCBF. From the result obtained in this paper, the proposed HRLR-VS method better results in all aspects for selecting the feature subset in telecommunication dataset.
Keywords
Introduction
The researchers in the present world face some elusive problems which clue us to inoculate the vagueness to the problems. The concept of Vague Set (VS) is proposed by Torra [1] which is the extension of fuzzy sets. VSs are exemplified by the various causes of uncertainty or various opinions of experts is the biggest advantage. By employing VS, the scholars cogitate the several characteristics of uncertainty. The telecommunication industry has gone through tremendous changes over the last few decades such as addition of new services, technological advancements and increased competition due to deregulation. Customer churn moving from individual service provider to neighboring opponent in the business is a developing problem for many server-based enterprises and especially for the telecommunication business [2]. It is an example of the vital problem for project supervisors because missing a consumer is a low-cost chance for opponents to get consumers. It has summarized that the different cost with the procurement of new consumers in ten eras added for an industry as compared to maintaining the current consumer. Maintaining present customers’ heads resulted in significant improvement in businesses and diminished purchasing cost. These factors made us finally concentrate on customer churn prediction as an essential part of imperative decision making in telecom organizations and outlining the process that is the primary aim of Customer Relationship Management (CRM) [3]. The effect of this ever increasing problem has paved way for the growth of many predictive mechanisms that help some necessary tasks in the classification process and predictive modeling.
The development of high-dimensional data that are publically available on the internet in past several ages supports diverse machine learning applications for evolution in investigation. Since, machine learning methods have inherent difficulties in dealing with the critical amount of input attributes, attract researchers for developing an alternate methods and exploring algorithms. To use machine learning methods expeditiously, preprocessing of the input is necessary.
Feature selection [4] is an example of the common prominent and frequent techniques in data preprocessing and has converted a crucial part of the machine learning method it has also distinguished as attribute selection, variable subset or variable selection in statistics and machine learning. It is the process of identifying relevant and eliminating irrelevant, redundant or noisy features. This method rushes up data mining algorithms, improves predictive accuracy and understandability. Irrelevant features are those that provide no valuable knowledge, and irrelevant features present no further information than the presently selected features. In supervised inductive learning, feature selection performs a collection of candidate features utilizing one of the three strategies 5:
The specific dimension of the subset of attributes that optimizes the measure of valuation. The little dimension of the subset that provides a critical limitation on the measures of evaluation. In general, the subset with the best confinement between the evaluation measure and size
The rest of the paper organization as follows, section 2 depicts the preliminaries of Vague Set, Feature Selection techniques and Similarity measures. Section 3 represents the related works done in the field of churn customer prediction in the telecommunication industry. Section 4 presents the High Relevant Low Redundant Vague set-based feature selection for considering only the high relevant and low redundant features in the Telecommunication dataset. Section 5 depicts the Experiments which consists of Dataset description. Section 6 represents the result and discussion of the proposed HRLR-VS and existing feature selection techniques. Section 7 highlights of this research work summary
Preliminaries of vague set, feature selection and similarity measure
The proposed HRLR-VS feature selection algorithm practices Merit value, Similarity measure, Filter algorithms, and Vague sets for finding the HRLR feature subsets.
Vague set
CC matrix is symmetrical when P
VS
(X, Y) = P
VS
(Y, X). [0, 1] is the interval for CC and it is given by 0 ≤ P
VS
(X, Y) ≤1: implies that all correlation coefficients must be in the interval of [0, 1]. X and Y are equal, then the high value of CC is received when P
VS
(X, Y) =1 if X = Y
The exploration of the ranking algorithms like Gain Ratio, Information Gain and Chi-Square feature selection approaches are practiced in the proposed methodology are represented here.
Entropy is generally worked in the information theory measure, which symbolizes the clarity of the subjective collection of examples [6]. It is in the establishment of Gain Ratio, Information Gain, Symmetrical Uncertainty (SU). The entropy criterion is considered through the measure of the system’s randomness. The entropy of Y is depicted in the Equation (2.5):
The Gain Ratio [7] is the non-symmetrical measure that is presented to pay back on the bias of the Information Gain (IG) [8]. GR is given by Equation (2.8)
Feature Selection via chi square χ2 test [7] is another, regularly used technique. Chi-squared attribute evaluation assesses the value of a feature by calculating the value of the chi-squared statistic with reverence to the class. The preliminary hypothesis H0 is the assumption that the two features are distinct, and it is tested by chi-squared formula in the equation (2.10):
The major comparison measure calculates the resemblances among the u
th
and v
th
features is known as Inverse of Euclidean Distance (IED). The Equation (2.11) is represented as:
The PCC [9] calculation for the given u
th
and v
th
features are represented by the below given by the Equation (2.12)
The CS calculation among the features is given by the below Equation (2.13)
Hall et.al [11] proposed the merit. The experts projected a strategy to find the redundancy among the features and relevancy among class labels and features are represented by the Equation (2.14):
Ding, Chris, and Hanchuan Peng [12] introduced the Minimum Redundancy – Maximum Relevance feature selection framework for the accurate classification of phenotypes in the microarray datasets. This paper presented that through the filter feature selection approach the redundancy was reduced explicitly. Yu, Lei, and Huan Liu [13] proposed the Fast Correlation-Based Filter method for identifying the relevant attributes as well as the redundancy among the relevant attributes without pairwise correlation analysis Vijaya, J., E. Sivasankar, and S. Gayathri [14] introduced a hybrid model with ensemble techniques which are used to enhance the classification. The clustering techniques like Possibility Fuzzy C Means (PFCM), Possibility C Means, and Fuzzy C Means are used for clustering the customers into groups. The ensemble models like Random Subspace, Bagging and Boosting are utilized for building themodels.
Stripling, Eugen, et al [15] used a Genetic Algorithm in the training model that maximizes the Expected Maximum Profit Measure for Customer Churn (EMPC) was developed to obtain the most profitable churn model. The author demonstrated a classifier called ProfLogit, which is similar to the lasso-regularized logistic model.
Ahmed, Ammar AQ, and D. Maheswari [16] presented an ensemble stacking integrated uplifting-based schemes for predicting the churners in the telecom. This model developed by the high correlation levels. Óskarsdóttir, María, et al [17] suggested a similarity forest-based classification method for eliciting the call networks time series data for establishing the dynamic behavior of the customer. The proposed method with one-nearest neighbor performed better for the classification of time series.
De Caigny, Arno, Kristof Coussement, and Koen W. De Bock [18] introduced a new hybrid algorithm called Logit Leaf Model (LLM) for improving the classification of the customer churn data. The proposed algorithm compared with the random forests, decision trees, logistic model and logistic regression models.
Yu, Ruiyun, et al [19] proposed the algorithm called PBCCP which is based on the Particle Swarm Optimization and Back Propagation network. It is used for optimizing the thresholds and weights of the BP NN and to improve the accuracy of the customer churn prediction.
Jamalian, Elham, and Rahim Foukerdi [20] in this paper introduced a novel hybrid Data Mining method which composed of feature selection and classification for classifying the churn customers in telecommunication. The authors used Principle Component Analysis (PCA) for the feature selection process whereas C5.0 and LOLIMOT (Linear Tree Model) algorithms are utilized for the classification.
Faris, Hossam [21] suggested an intelligent model which combines the Feed forward Neural Network (FNN) and Particle Swarm Optimization for the prediction of churn customer in the telecommunication industry. In this paper, PSO utilized to optimize the weight and structure of the NN which simultaneously improve the prediction power.
Even if some of the earlier stated techniques exhibit advancements by forecast accuracy, these developments derived at the rate of intricacy and interpretability of the method. In contrast, the most preceding functions highlight on the forecast power of their methods, without generous adequate consideration to the issue of recognizing the most instructive variables that disturb the blend ofclients.
High relevancy low redundancy vague set (HRLR-VS) based feature selection method
The proposed HRLR-VS algorithm consists of three major classifications. Firstly, it has considered the features redundancy by engendering the VSs established on proportional measures. The subsequent portion processes its dependence between the class labels and features to feature selection methods. Finally, the process follows an appropriate subset of features by a Particle Swarm Optimization searching method. The HRLR-VS has three main parts: They are By provoking the Ranking Vague Sets (R-VSs), the relation between the class labels and the features are considered. By using the Similarity Vague Sets (S-VSs), the association between the features is taken into consideration. Using PSO search method for finding the best subset of features by executing the proposed relevance and redundancy algorithms.
Through the PSO search method, S-VSs are created in every round. The step by step procedure for HRLR-VS is described in ensuing algorithm. As a result, in the first line of HRLR-VS is the empty subset for adding the features iteratively.

Proposed Framework for High Relevant and Low Redundant Vague Set (HRLR-VS) based Feature Selection.
Terms and its description in the proposed DUPP-HRLR-VS feature selection algorithm
Input: Telecommunication Churn Customer Dataset
Output: Nominated Feature Subset
Step 1: Start
Step 2: FS ← {}
Step 3: Calling the function1 FCRL for generating the R-VSs and calculating their information energies.
Step 4: Finding the feature with maximum value in FCRL and storing it in MM and finding its list and holding it in FML. [MM, FML] ← maxFCRL
Step 5: m ← 1.
Initializing the variable n
Step 6: FS ← FS ∪ FML. Assigning the feature which achieves maximum value in FCRL is added to FS.
Step 7: mer¯file (m)← MM ; Saving the value of the selected feature to mer¯ file.
Step 8: num¯arr {m} ← FS. Saving FS in the m th of num ← arr
Step 9: While (m < s). This loop is used to generate s feature subset candidates
Step 9.1: forj ← 1tos. This loop finds the best feature in order to add the subset.
Step 9.2: Call the function2 called FFRD. This function is used to generate the S-VSs among FS and the j th feature and calculate their information energies.
Step 9.3: Calculation of Merit
Step 9.4: endfor
Step 10: mer (FS)← ∞. The merit value for existed features in SF considered as a large negative value to avoid selecting redundant features.
Step 11: [MM, FML] ← max (mer). Finding the feature with maximum value in merit and storing it in MM and finding its index and holding it inFML.
Step 12: FS ← FS ∪ {FML }. Adding the above features to the feature¯subset
Step 13: m ← m + 1. Increment the value of m
Step 14: num¯arr {m } ← FS. Saving the FS in the m th row of num¯arr
Step 15: mer¯file (m) ← MM. Saving the correspond value to the selected feature to mer¯file.
Step 16: endwhile
Step 17: list ← max (mer¯file). Finding the best features subset that obtains best merit.
Step 18: FS¯fin ← num¯arr (list). Retrieving the corresponding features subset from num¯arr.
Step 19: Returns Features¯subset
Step 20: End Then in the subsequent line of the proposed HRLR-VS feature selection algorithm, the function1 FCRL is known to engender R-VSs and the IE which represents the relevancy.
Suppose a dtaset with three attributes like A1, A2, and A3, is assumed. The computation for generating R-VSs by:
Input: Telecommunication Churn Customer Dataset
Output: FCRL
Step 1: for n ← s
Step 2: hR-VS n ← {IG (n) , GR (n) , CS (n) }. Calculating the information Gain (IG), Gain Ratio (GR) and Chi-Square (CS) scores for n th features and assigning them as Vague Set Element (VSE) of R-VSs.
Step 3:
Step 4: endfor
Step 5: Return FCRL
Step 6: End
After engendering the FCRL, the selection of attribute with high FCRL takes place. This key attribute known as FML and MM represents its merit value. FML is taken to the FS and for making the concluding decision, mer¯arr stores the MM. MM with high value is stored in num¯arr is measured with the intention of keep the FS in every action. By a merit, the best attribute hasnominated.
For computing the FFRD, produce a candidate attribute (c th attribute as canF) and S-VSs between attributes in FS. And to augment the canF to FS and FS¯temp is used to hold these attributes. Formerly, it creates all the possible mixtures of the current features in FS¯temp. The IE of all attributes are computed by IED, PCC and CS for generating the S-VSs. Finally, this function gives FFRD that contains the attributes relationship.
Input: Telecommunication Churn Customer Dataset
Output: FS and a candidate feature (canF)
Step 1: FS¯temp ← FS ∪ canF. Adding the candidate feature canF to FS.
Step 2: PP ← FS¯temp. Generate all possible pairs combination of canF.
Step 3: Initialize cnt ← 1
Step 4: for each pair, pair(u,v) in PP
Step 5: Calculating the IED (S1u,v) , PCC (S2u,v) , andCS (S3u,v) measure as a Vague Set element of S-VSs.
Step 7: cnt ← cnt + 1. Increment cnt
Step 8: end for
Step 9: Return D
Step 10: end
The redundancy between attributes are given by RDFF. Consider the FS = {1, 2} by the function 2, then the computation of S-VSs among 3
rd
attribute and FS take place. The following classification is calculated by the function 2. The three S-VSs are generated by subsequent step. Therefore, A is considering by: A = {a12, a13, a23
} and S-VSs are considered are:
Particle Swarm Optimization (PSO) has investigated on the basis of social behavior connected with bird’s accumulating for the optimization problem. A social behavior model of organisms that cooperate and live within big crowds is the impetus for PSO. The PSO is handier to put into accomplishment than Genetic Algorithm. It is for the motive that PSO doesn’t have an edge or mutation operators and flow of particles has effectively by using velocity function. In PSO, each particle transforms its flying memory and its partner’s flying inclusion following in mind the top goal of flying in the search space with velocity.
Results and discussion
Dataset description
The dataset is choosen from the famous dataset repository Kaggle.com. It is the dataset composed of churn and non-churn customers data of hte telecommunication industry. But the name of the company has not mentioned in this dataset. The subsequent Table 2 depicts the description about the dataset. This dataset is poised of 21 features [20]. It is the combination of churner and non-churners customer details. Their call rates, Message rates, International call rates etc are included in this dataset.
Description of the Dataset
Description of the Dataset
Our proposed HRLR-VS based feature selection was carried out using Python 3.0 and the Classification techniques are implemented on MATLAB R2018b. Following Table 3 depicts the performance metrics. These metrics has considered for evaluating the existing techniques with proposed technique.
Performance Metrics for evaluation of proposed technique
Performance Metrics for evaluation of proposed technique
From the Table 3,
True Positive (TP) = the number of cases correctly identified as churners
True Negative (TN) = the number of cases correctly identified as Non-churners
False Positive (FS) = the number of cases incorrectly identified as churners
False Negative (FN) = the number of cases incorrectly identified as Non-churners
θ = True value of feature and
The correlation tells that how much θ and
Table 4 depicts the number of features obtained by feature selection methods like Information Gain, Gain Ratio, Chi-Square and proposed HRLR-VS feature selection method. From Table 4, it is clear that the proposed method generates a smaller number of strongly relevant and low redundant features than the existing methods like IG, GR, and CS. IG gives 18 features, GR produces 13 features, CS generates 14 features whereas the proposed method gives only 11 features. The features from the proposed method are evaluated by using the metrics in Table 3. The evaluation has done in three classification methods by existing feature selection methods and the proposed method. The classification methods like Artificial Neural Network, Naïve Bayes and Support Vector Machine are utilized in this research work for evaluation of the proposed method and existing feature selection for the classification of churn customers.
Number of features obtained by existing feature selection methods like Information Gain, Gain Ratio, Chi-Square and proposed method
Table 5 depicts the performance analysis on the accuracy of the existing and proposed feature selection method using ANN, NB, and SVM classification methods. From the Table 5, SVM gives maximum accuracy for original dataset, IG processed dataset and proposed feature selection processed dataset gives maximum accuracy when using Artificial Neural Network as the classifier, whereas GR and CS processed datasets attains maximum accuracy using SVM classification. Figure 2 depicts the graphical representation of the accuracy of the proposed feature selection method and existing methods by using classification methods like ANN, NB, and SVM.

Graphical representation of the accuracy of the proposed HRLR-VS method with existing feature selection methods using ANN, NB, and SVM.
Performance analysis on Accuracy
Table 6 depicts the performance analysis on the Kappa Statistic of the proposed HRLR-VS method and existing feature selection methods using ANN, NB, and SVM classification methods. From the Table 6, ANN gives the maximum value of Kappa Statistics for an original dataset and proposed HRLR-VS method processed a dataset, GR and CS processed datasets gives maximum Kappa statistic value using NB and SVM gives the maximum Kappa Statistic value for IG processed dataset. Figure 3 depicts the graphical representation of the Kappa Statistic of the proposed feature selection method and existing methods by using classification methods like ANN, NB, and SVM.

Graphical representation of the Kappa Statistic value of the proposed HRLR-VS method with existing feature selection methods using ANN, NB and SVM.
Performance analysis on Kappa Statistic of the proposed HRLR-VS and existing feature selection methods using ANN, NB and SVM
Table 7 depicts the performance analysis on True Positive Rate (TPR) of the proposed HRLR-VS method and existing feature selection methods using ANN, NB, and SVM classification methods. From the Table 7, ANN gives the maximum value of True Positive Rate (TPR) for IG processed dataset and proposed HRLR-VS method processed a dataset, Original dataset, GR and CS processed datasets gives maximum TPR value using SVM. Figure 4 depicts the graphical representation of the TPR value of the proposed feature selection method and existing methods by using classification methods like ANN, NB, and SVM.

Graphical representation of the True Positive Rate (TPR) value of the proposed HRLR-VS method with existing feature selection methods using ANN, NB and SVM.
Performance analysis on True Positive Rate (Recall) of the proposed HRLR-VS and existing feature selection methods using ANN, NB and SVM
Table 8 depicts the performance analysis on False Positive Rate (RPR) of the proposed HRLR-VSmethod and existing feature selection methods using ANN, NB, and SVM classification methods. From the Table 8, ANN gives minimum value of False Positive Rate (FPR) for IG, GR, CS processed datasets and proposed HRLR-VSmethod processed a dataset, original dataset gives least FPR value using NB. Figure 5 depicts the graphical representation of the FPR value of the proposed feature selection method and existing methods by using classification methods like ANN, NB, and SVM.

Graphical representation of the False Positive Rate (FPR) value of the proposed HRLR-VS method with existing feature selection methods using ANN, NB and SVM.
Performance analysis on False Positive Rate (FPR) of the proposed HRLR-VS and existing feature selection methods using ANN, NB and SVM
Table 10 depicts the performance analysis on F-Measure of the proposed HRLR-VS method and existing feature selection methods using ANN, NB, and SVM classification methods. From Table 10, ANN gives the maximum value of F-Measure only for the proposed HRLR-VS method, whereas other datasets give maximum value when using SVM as the classifier. Figure 7 depicts the graphical representation of the F-Measure value of the proposed feature selection method and existing methods by using classification methods like ANN, NB, and SVM.

Graphical representation of the F-Measure value of the proposed HRLR-VS method with existing feature selection methods using ANN, NB and SVM.
Performance analysis on F-Measure of the proposed HRLR-VS and existing feature selection methods using ANN, NB and SVM
Table 12 presents the performance analysis on Root Mean Squared Error (RMSE) of the proposed HRLR-VS method and existing feature selection methods using ANN, NB and SVM classification methods. From Table 12, ANN gives the least RMSE value for IG, GR and proposed HRLR-VS method processed a dataset, original dataset gives least RMSE value when using NB as the classifier. The other processed dataset gives the least RMSE value using SVM. Figure 9 depicts the graphical representation of the Root Mean Squared Error (RMSE) value of the proposed feature selection method and existing methods by using classification methods like ANN, NB, and SVM.

Graphical representation of the Root Mean Squared Error (RMSE) value of the proposed HRLR-VS method with existing feature selection methods using ANN, NB and SVM.
Performance analysis on Root Mean Squared Error (RMSE) of the proposed HRLR-VS and existing feature selection methods using ANN, NB and SVM
Table 9 depicts the performance analysis on Precision of the proposed HRLR-VS method and existing feature selection methods using ANN, NB, and SVM classification methods. From Table 9, ANN gives the maximum value of precision for original dataset, CS and Proposed HRLR-VS processed datasets and IG and GR processed datasets gives the least Precision value using NB. Figure 6 depicts the graphical representation of the Precision value of the proposed feature selection method and existing methods by using classification methods like ANN, NB, and SVM.

Graphical representation of the Precision value of the proposed HRLR-VS method with existing feature selection methods using ANN, NB and SVM.
Performance analysis on Precision of the proposed HRLR-VS and existing feature selection methods using ANN, NB and SVM
Table 11 presents the performance analysis on Mean Absolute Error (MAE) of the proposed HRLR-VS method and existing feature selection methods using ANN, NB, and SVM classification methods. From Table 11, ANN gives the least MAE value for the original dataset and proposed HRLR-VS method processed a dataset, whereas other feature selection processed datasets give maximum value when using SVM as the classifier. Figure 8 depicts the graphical representation of the Mean Absolute Error (MAE) value of the proposed feature selection method and existing methods by using classification methods like ANN, NB, and SVM.

Graphical representation of the Mean Absolute Error (MAE) value of the proposed HRLR-VS method with existing feature selection methods using ANN, NB and SVM.
Performance analysis on Mean Absolute Error (MAE) of the proposed HRLR-VS and existing feature selection methods using ANN, NB and SVM
Table 13 presents the performance analysis on Relative Absolute Error (RAE) of the proposed HRLR-VS method and existing feature selection methods using ANN, NB, and SVM classification methods. From Table 13, ANN gives the least RAE value for original IG and proposed HRLR-VS method processed a dataset, whereas GR and CS processed dataset give the least RAE value when using SVM as the classifier. Figure 10 depicts the graphical representation of the Root Mean Squared Error (RMSE) value of the proposed feature selection method and existing methods by using classification methods like ANN, NB, and SVM.

Graphical representation of the Relative Absolute Error (RAE) value of the proposed HRLR-VS method with existing feature selection methods using ANN, NB and SVM.
Performance analysis on Relative Absolute Error (RAE) of the proposed HRLR-VS and existing feature selection methods using ANN, NB and SVM
Table 14 presents the performance analysis on Root Relative Squared Error (RRSE) of the proposed HRLR-VS method and existing feature selection methods using ANN, NB, and SVM classification methods. From Table 14, ANN gives the least RRSE value for IG and proposed HRLR-VS method processed a dataset, whereas original, GR and CS processed dataset gives the least RAE value when using SVM as the classifier. Figure 11 depicts the graphical representation of the Root Mean Squared Error (RMSE) value of the proposed feature selection method and existing methods by using classification methods like ANN, NB, and SVM.

Graphical representation of the Root Relative Squared Error (RRSE) value of the proposed HRLR-VS method with existing feature selection methods using ANN, NB and SVM.
Performance analysis on Root Relative Squared Error (RRSE) of the proposed HRLR-VS and existing feature selection methods using ANN, NB and SVM
The two commonly used feature selection approach like Minimum Redundancy Maximum Relevancy (mRMR) [12] and Fast Correlation-Based Filter (FCBF) [13] Method are considered for evaluating the proposed HRLR-VS feature selection method for telecommunication dataset. Table 15 depicts the number of features obtained by proposed HRLR-VS method and existing literature works. Table 16 represents the performance analysis of the proposed HRLR-VS method with mRMR and FCBF approach. For evaluating the proposed HRLR-VS with existing literature, Artificial Neural Network classification technique has used in the Table 16. From the Table 16, it is clear that the proposed HRLR-VS gives more accuracy, Kappa Statistic, TPR, Precision, Recall and F-Measure than mRMR and FCBF approaches. The error rates like MAE, RMSE, RRSE, RAE also reduced for proposed HRLR-VS method. Table 17 depicts the total execution time (in seconds) for the proposed HRLR-VS, IG, GR, CS, mRMR and FCBF methods. From the Table 17, proposed HRLR-VS method took less time for executing the algorithm than the existing methods
Number of features obtained by proposed HRLR-VS method and existing literature (mRMR & FCBF)
Number of features obtained by proposed HRLR-VS method and existing literature (mRMR & FCBF)
Performance analysis of the Proposed HRLR-VS and existing Literature approaches (mRMR and FCBF) using ANN classification method
Total Execution time in seconds for Existing and Proposed Feature selection methods
There are more number of problems currently exists with encountering many features. Customers and usage data generated in telecommunication industry come across such problems. A number of feature selection techniques have been assessed to determine their efficiency for decreasing the number of dimensions in a dataset and for enhancing the precision of classifier which uses these features.
In this paper, choosing the least feature subset having low redundancy and high relevancy by blending the similarity measures and filter feature selection method with the Particle Swarm search method are described. A new hybrid algorithm and methodology are proposed in this work and an exhaustive experiment is trialed for determining the effectiveness of the proposed techniques.
Threshold is the problem that is mainly suffered by the ranking of feature selection methods. The parameters and threshold are not essential for processing data using proposed HRLR-VS. Experimental results are explained on the performance of proposed algorithm for choosing the least feature subset with more Accuracy. The performance measures such as Kappa Statistic, Precision, True Positive Rate and F-Measure, and lower error rates like False Positive Rate, MAE, RMSE, RRSE and RAE values used for evaluating the proposed method. From the results, it is concluded that the proposed HRLR-VS method performs better than the existing methods like IG, GR, and CS when using Artificial Neural Network as the classifier.
