Abstract
Many real-time applications, including some emerging ones, rely on high-dimensional feature datasets. For simplifying the high-dimensional data, the various models are available by using the different feature optimization techniques, clustering and classification techniques. Even though the high-dimensional data is not handled effectively due to the increase in the number of features and the huge volume of data availability. In particular, the high-dimensional medical data needs to be handled effectively to predict diseases quickly. For this purpose, we propose a new Internet of Things and Fuzzy-aware e-healthcare system for predicting various diseases such as heart, diabetes, and cancer diseases effectively. The proposed system uses a newly proposed Intelligent Mahalanobis distance aware Fuzzy Weighted K-Means Clustering Algorithm (IMFWKCA) for grouping the high dimensional data and also applies a newly proposed Moth-Flame Optimization Tuned Temporal Convolutional Neural Network (MFO-TCNN) for predicting the diseases effectively. The experiments have been done by using the UCI Repository Machine Learning datasets and live streaming patient records for evaluating the proposed e-healthcare system and have proved as better than others by achieving better performance in terms of precision, recall, f-measure, and prediction accuracy.
Introduction
Today, the Internet of things (IoT) is widely applied in various emerging fields, including medical, in the world. More than 25 billion smart devices will be connected by the year 2020. When connected, the huge volume of devices may produce an enormous volume of data that is extracted by using machine learning algorithms. Even though the huge volume of available data is dominating the computational models that are available in the literature and taking more time due to the data movement and the cost difference between memory and processing. For example, the clustering technique that is identified as a well-known unsupervised learning is helpful for extracting the data according to the similarity between the available data. Many solutions are available in the cloud for handling this task and also transmitting the huge volume of data with congestion in the communication paths. These kinds of congestion problems may affect the applications in terms of maintaining privacy preservation and security. To handle these issues smoothly, the feature selection and optimization techniques are incorporated to reduce the data weights.
The role of feature selection is increasing drastically in data analysis due to the rapid growth of data volume in various fields. Data mining, machine learning, and deep learning techniques are widely used for analysing data by applying and taking into account necessary features through dimensionality reduction. For identifying the most useful features, various methods are available such as feature selection, optimization, clustering, and classification. Specifically, the high-dimensional data sets are to be filtered out of the relevant and irrelevant features to reduce the dimensionality. The various feature selection methods are available under the three categories such as filter approach, wrapper approach, and embedded approach [1]. Among them, the filter approach ranks the available features by considering the scores of each attribute. The threshold for making the selection process as relevant and irrelevant features is fixed. All the filter methods are discussed in a survey conducted by Lazar et al. [2]. Next, the wrapper methods [3] are applied for preparing a subset that contains the more relevant features. The subsets are evaluated by calculating scores. Here, the recursive feature elimination process is incorporated like greedy search to identify the relevant features and also eliminate the irrelevant features for performing the feature selection process. The embedded methods include the fitting process that incorporates the Lasso regression [4] and gradient boosting [5] methods.
The clustering method plays a major role in resolving the issues by applying data mining and machine learning techniques. The objective of the clustering method is to form groups of objects into clusters that have high similarity and also neglect dissimilarities. Even with the availability of a large volume of data analysis techniques in the literature, a lot of issues are not resolved on high-dimensional data sets [6] due to the availability of more irrelevant, redundant, and noisy data. The clustering technique is used to categorize and group the relevant features together with the consideration of relevancy scores of features. While handling the different types of data parallelly, the time complexity is also increased and it is also not able to accommodate the data on the available memory spaces. It is necessary to propose a new intelligent clustering method for handling the different dimensionality of data effectively and helpful for making effective decisions. Many clustering algorithms have been proposed by various researchers under categories such as hierarchical, density, and prototype clustering. Even though, these standard clustering algorithms performed well on low dimensional data, they did not achieve enough performance on high dimensional data. For improving the performance on high dimensional data, the deep learning technique is incorporated into the clustering process. This work introduces a new deep clustering technique for resolving the high dimensionality problems and achieving better performance.
The major contributions to this work are as follows: To propose a new IoT and Fuzzy-aware e-healthcare system for predicting diseases effectively by analyzing high-dimensional medical records. To propose a new Intelligent Mahalanobis distance based Fuzzy Weighted K-Means Clustering Algorithm (IMFWKCA) algorithm for grouping the relevant features and reducing the dimensionality of the input dataset. To develop a new Moth-Flame feature optimization tuned Temporal Convolutional Neural Network (MFO-TCNN) technique for making final decisions on patient records with the help of optimized features To achieve better performance in terms of good precision, recall, f1-score and prediction accuracy with reasonable time taken for the decision making process.
The rest of this research article is organized with four more sections, including Literature Survey, System Architecture, Proposed Model, Result and Discussion, Conclusion, and Future Works. The literature survey section conducts a brief survey highlighting the benefits and drawbacks of clustering, feature selection, deep learning, medical field, and high-dimensional data. The system architecture section describes the workflow of the proposed monitoring system. The proposed model demonstrates the necessary background information and the newly proposed methodology in this work. The experimental results and the explanation are described in the results section. The work is concluded by highlighting the contributions and achievements quantitatively.
Related works
Much research work has been done in the direction of handling high-dimensional data by applying various newly proposed clustering and optimization methods [7–10]. Among them, Thanh et al. [11] developed the density and K-Nearest Neighbor with Kernel Aware Clustering method for handling high-dimensional data. The KNN with a kernel density scheme is helpful for making the possible clusters according to the various densities over the high-dimensional data. Their cluster method was tested with an image dataset and also achieved better performance than other methods. Bouveyron et al. [12] proposed and implemented a new data clustering method that is capable of handling high-dimensional data and also measures the particular subspace and the dimension of intrinsic. Finally, they have conducted experiments with live high-dimensional datasets and proved that they are better than other clustering techniques available in the literature. Jing et al. [13] presented a k-means clustering method to group the high-dimensional data into sub-spaces. The k-means clustering method is used to apply the values for calculating the weight for every dimension in an individual cluster and also apply the values for identifying the suitable subsets. They have achieved better performance than other methods in terms of prediction accuracy. Wu and Gu [14] presented a new clustering scheme to categorize the high-dimensional data. Their method applies supremum and infimum to represent the clusters according to the schema and also assigns the suitable non-sample objects for effective clustering.
Jiang et al. [15] developed a PL-Tree aware clustering method for handling high-dimensional data. Chen et al. [16] proposed a new feature grouping aware K-means clustering method to gather the relevant featured data in a high-dimensional dataset. They have incorporated two different kinds of weight assignment procedures for identifying the relevant features in every group. In addition, they have incorporated a few additional steps for calculating the subspace weights. Finally, they have proved that their clustering method is better than others in terms of prediction and relevancy accuracy.
Wang et al. [17] developed a new soft subspace-aware clustering technique that incorporates the partition index, subspace clustering, and fuzzy clustering. Moreover, they have built a convergence theorem for analyzing the viewpoints. Finally, the experiments have been conducted by using sparse and noisy text data from high dimensional datasets. Muja and Lowe [18] developed a new method to match the binary features by searching multiple clustering trees and also proposed a distributed nearest neighbor matching framework for handling the high-dimensional data.
Radhika et al. [19] developed a new document subspace clustering method for handling high-dimensional data which is capable of handling the high-dimensional data effectively by obtaining good prediction accuracy and clustering process with less processing time. Tao et al. [20] developed a new k-means clustering method that incorporates the intelligence and weights for performing the searching process globally and also finds the centre point for each cluster. The assigned weights are helpful for making effective decisions on high-dimensional data and have also proved better than other methods in terms of prediction accuracy.
Wang et al. [21] developed a new fast adaptive k-means subspace clustering method for providing a flexible clustering process for high-dimensional datasets. Moreover, it is capable of selecting the most useful features for performing an effective clustering process. Finally, they have proved that their method is better than other clustering methods in terms of prediction accuracy. Tapia et al. [22] presented a prediction methodology that was developed for evaluating the success of impatient on high dimensional data and also aimed to learn new combinations between the individual features. They evaluated various feature selection methods and classifiers in order to design the best prediction method with the highest accuracy. In their analysis, they achieved an accuracy of 82% using the best first method. Sui et al. [23] developed a new sparse representation aware DSC method called the evolutionary dynamic sparse subspace clustering approach that consists of two phases, namely static learning and a live clustering process. They have considered the average sparsity concentration index and also compared it with the sparsity concentration index. Finally, they proved that their method is better than other methods in terms of relevancy prediction accuracy and grouping process.
Guan et al. [24] developed a new optimization technique called the history-guided differential evolution method for performing effective grouping and also enhancing the feature selection process on various synthetic datasets that are available as high-dimensional data. Their technique achieved better performance than other methods that are considered to have other sets of features for forming a group. Lei et al. [25] developed a new weighted density aware clustering and analytic hierarchy process to identify a suitable method in the medical field. The hierarchical cluster analysis is incorporated with the newly proposed clustering method for enhancing the clustering performance in terms of a better service provider selection process. All the available methods are not achieving good performance in terms of prediction accuracy on patient records that are available as high dimensional data. To fulfil the current requirement, an IoT-aware e-Healthcare system is proposed in this work.
System architecture
The workflow of the proposed e-healthcare system is demonstrated in Fig. 1 that consists of nine important components such as IoT device, Sliding Window, UCI Machine Learning Repository, decision manager, feature optimizer, clustering module, classification module, rule base and temporal agent. The workflow of the proposed e-healthcare system is demonstrated in Fig. 1, which consists of nine important components such as IoT device, Sliding Window, UCI Machine Learning Repository, decision manager, feature optimizer, clustering module, classification module, rule base, and temporal agent.

Proposed system architecture.
The IoT device is used to collect the necessary live data from the patients in various locations. The collected live streaming data is to be forwarded to the decision manager through a sliding window process, and it applies a feature optimizer to optimize the patient records with the most contributed features. The decision manager analyses the necessary streaming data from the sliding window using the feature optimizer and it incorporates it with the temporal convolutional neural networks for performing hyper tuning and better decision making.
The decision manager applies the clustering algorithm to group the sub datasets that are identified as better subsets with optimized features. Finally, the decision manager decides the patient records whether they are affected by diseases or not. Moreover, it uses the rules that are stored in the rule base for categorizing the disease types and the levels according to the values contained in the important features of the datasets.
In this work, a new IoT and Fuzzy-aware e-healthcare system is proposed to predict whether the patient record is affected by diseases such as heart disease, diabetes disease, and cancer disease along with their severity levels effectively. This e-healthcare system works with the newly proposed Intelligent Mahalanobis distance aware Fuzzy Weighted K-Means Clustering Algorithm (IMFWKCA) to group the high dimensional data and a newly developed Moth Flame Optimization Tuned Temporal Convolutional Neural Network (MFO-TCNN) for predicting diseases effectively. In this work, the high-dimensional data is grouped by applying the proposed clustering algorithm according to the differences between the records and reducing the dimensionality as well. Here, the high-dimensional data is subjected to the dimensionality reduction process and grouped into different subgroups. Then, the grouped dataset is forwarded to perform the classification by using the newly proposed feature optimization tuned temporal CNN. Here, the existing MFO algorithm is used for tuning the features with adjusted weights to the Temporal CNN to make a final decision on medical records. This section explains the data clustering process, feature optimization and classification. First, the data clustering is explained by demonstrating the workflow of the proposed IMFWKCA.
Clustering
In this work, a new enhanced version of the Intelligent Mahalanobis distance-aware Weighted Fuzzy K-Means Clustering Algorithm (IMWFKMCA) is proposed for effectively grouping the relevant data in the high-dimensional dataset. Moreover, an intelligent agent is used for performing the weight assignment process.
K-Means clustering
A group of objects are used in the standard K-means clustering that represents the quantitative value of each input feature as matrix X = (x
hu
) where x
hu
indicates that the feature value at the specific entity I mean that a group of entities (M), U represents the group of features (L). This method is capable of producing a subgroup SG ={ SG1, . . . . , SG
k
} of H in the K number of non-empty and non-overlapping subsets that represent as SG
k
⊂ I, clusters, the centroid value is pointed out using the variable c
k
and the vector is represented with N-dimensional for the feature space
This subsection describes in detail the intelligent agent-based K-Means clustering method by applying a pattern used in the existing work [26] for identifying the initial cluster (SG) and also finding the centroid value (cp) by minimizing the alternatively represented as given in equation (2).
Here, the SG is identified as a subgroup dataset that is removed from the whole dataset and identified as a new subgroup in which the irrelevant records are removed. Finally, the single records are removed, and the remaining centroids are K-means initialized.
The standard K-Means clustering is enhanced further with the consideration of fuzzy logic and weights. The fuzzy clustering process works based on the membership function in the process of assigning the centroid value. The improvement of the fuzzy clustering is the incorporation of the weight assignment process. The Minkowski Weighted K-Means clustering algorithm was developed by Amorim and Mirkin [26]. They have enhanced equation (1) with the consideration of unknown weights that are to be assigned for the features of input records
Where, the weights are guessed as non-negative value and cumulative value is represented as wt u ⩾ 0, ∑u∈MWT u = 1. The variable β is represent the exponential value and finalized by the supervised method or user.
The right-side expression represented by β power Mahalanobis metric between the points that are rescaled the entity
The fuzzy clustering process is represented by the necessary fuzzy membership function sg
k
= (sg
hk
) , h ∈ H, in which sg
hk
(0 ⩽ sg
ik
⩽ 1) indicates the degree of membership of the entity H for the k number of clusters. Let assumed that the condition ∑
k
SG
hk
= 1. The extra changes are used for computing the feature weights to every cluster separately. This work is done according to the work [26] for bringing a great flexibility to the method. Thus, the variable wt
uk
is used instead of wt
u
with wt
uk
being the feature weights u (u = 1 . . . M) for the number of clusters as k (k = 1 . . . K). Therefore, the next criterion is finetuned and presented in the equation (4):
The Mahalanobis distance measurement formula is given in equation (5), which is used to calculate the distance between the records that are available in each subgroup.
Generally, the centroid values are calculated based on the differences between the records that are available in each cluster or subgroup.
Intelligent Mahalanobis distance aware Fuzzy Weighted K-Means method
The steps of the proposed Mahalanobis Fuzzy Weighted K-means Clustering Algorithm are presented as below:
1. For the given centroid values cp
k
and the weights wt
u
= (wt
uk
), update the cluster based on the formula given in equation (6) that is derived from the equation (4) with the constraints by ∑
k
SG
hk
= 1:
The distance between the points
The positive values are included to every item in the sum for avoiding the division by zero and it is also assumed as α > 1.
2. The fuzzy clusters as sub groups SG
k
= (SG
hk
) and the necessary weights wt
u
= (wt
uk
) , change the centroid point value cp
k
= (cp
hk
) of the every sub group SG
k
as its Mahalanobis center point so that, at each u, cp
ku
is a value of reducing β value using Mahalanobis distance to the groups objects that are represented as given in equation (8).
For finding the Mahalanobis center point, the steepest descent method is used for computation process according to the work [26].
3. The subgroups SG
k
= (SG
hk
) and the center point values cp
k
= (cp
hk
), change the weights based on the formula given in equation (4) constrained by
Here, the positive value is considered for every item in the sum and it also assumed as β ≠ 1 for avoiding the division by 0.
Feature optimization is an essential task for enhancing the performance of the prediction process in a short span of time. Generally, the moths are identified as insects that are similar to butterflies, and they have two milestones: as adults and as larvae. Moreover, the larvae are converted into moths. They have evolved to fly at night by applying the moonlight. They have utilized the transverse orientation that is helpful for performing the navigation process. The moths are flying with fixed angle with respect to moon and also be an effective method to travel long distance in a straight path [27]. Suppose the moths are changed by artificial light and also demonstrate their behaviors. The reason for the inefficiency of the transverse orientation process that is used to move in straight line when the light is so far away. When moths see the artificial light, they have tried to maintain a same angle with the light for flying in straight line. Since the artificial light is very closed when it is compared to the moon even though it is maintaining a same angle to the light source that causes a deadly spiral fly path for moths [27]. The existing Moth-flame Optimization Algorithm is explained in the next subsection.
MFO algorithm
The MFO method [28] assumes that the moths are candidate solutions, and the variables are also identified as the positions of the moths. Here, the moths are flying in different dimensions, such as single dimensional, two-dimensional, three-dimensional, and hyper-dimensional by changing their positions. Since the MFOA [28] is a population-aware method, it represents the group of moths in a matrix as below:
Where the variable n represents the number of moths and the variable d indicates the number of dimensions,
Here, an array is prepared for representing moths for storing the respective fitness values and is presented as a matrix as below:
Where the variable n indicates the number of moths, the fitness value is the return value of each moth. The key components of the MFOA are flames and considered as a matrix that is similar to the moth matrix as below:
Where the variable n indicates the number of moths and the variable d represents the number of dimensions, the equation (12) represents the dimension of the variable MF and OFM arrays are equal. For the moth flames, it is assumed that an array is used to store the respective fitness values as below:
Where the variable n indicates the number of moths, the MFO method consists of 3 tuples that approximate the global optimal of the optimization issues and are defined as below:
The variable J indicates the method which creates a random population of moths and their respective fitness values. The mathematical model of this function is as follows:
Here, Q is a function that moves the moths around the search area, and it receives the matrix (M) and also returns the updated value.
Here, S is a function that returns a true value if the termination criterion is fulfilled, and it returns false when the termination condition is not fulfilled.
With the variables J, Q, and S, the MFO framework, which is defined as the function J being assigned to M. Then, if the S (M) is false, then the P (M) is assigned to M. Moreover, the function J creates the starting level solutions and also computes the objective values. Any random value is also to be applied in this method that is represented as below:
For I = 1 to n do
For j = 1 to d do
MOTH(i,j)=(UBND(i)-LBND(i))
* RND()+LBND(i);
End
End
OM = FF(M);
Here, there are two arrays such as UBND and LBND, and the relevant matrices are defined by the upper bound and lower bound of the variables as below:
Where, the variable LBND i represents the lower bound of the ith variable.
After performing the initialization process, the function P is executed iteratively until the function T returns the value as true. The function P is the main method by which the moths in search areas. Now, update the moth’s position according to the flame by applying the equation (12).
Where, the variable M i represents the ith moth, the variable F j represents jth flame, and the variable S indicates the spiral method.
In this work, a logarithmic spiral is chosen for updating the moths. Even though any kind of spiral is applied by following the conditions, including the starting point of the spiral is to be started with moth, the final point of the spiral is the flame’s position, and the spiral range fluctuation will not cross the search area. Let us consider the points and define a logarithmic spiral of MFO, which is represented in the equation (13).
Where, the variable D
i
represents the ith moth distance for the jth flame, the variable b represents the constant to define the spiral shape, and the variable t represents a random number that is between -1 and 1. Moreover, the value of the variable D is computed by using the formula given in equation (22).
Where, the variable M i represent the ith moth, and the variable F j represents the jth moth and the variable D i represents the distance between the ith and jth flames.
Next, the moths’ positions are changed according to the ‘n’ locations in the search area, which degrades the exploitation process of the best solution. This work proposes an adaptive method for the various flames by using the formula given in equation (23).
Where the variable l indicates the number of iterations, the variable N represents the maximum number of flames, and the variable T represents the maximum number of iterations.
Even though the positions of moths are updated according to the best flame in the final steps of iterations. Moreover, it gradually decreases the number of flame balances that are exploiting and exploring the search area. The steps of the function P are as follows: Apply the equation (22) for changing the flame. Initialize the FF(M) into OM. If IT_No.==1 Then Perform sort operation for M and assigned to F Perform the sort operation for OM and store the result into OF. Else Perform the sorting operation for Mt - 1 and Mt, store the results into F. Perform the sorting operation for Mt - 1 and Mt, store the results into F. End For i = 1 to n do For j = 1 to d do Update the values of r and t Find the D value by applying the equation (22) according to the respective moth. Change the M(I,j) value by applying the equations (21) according to the concern moth. End End
Accordingly, the function P is executed until it returns the value true by the function T. After performing the termination process for the P function, it returns the best moth value and it is returned as an optimum.
This subsection explains the newly developed, tuned Temporal CNN that is used for predicting diseases effectively by handling the high-dimensional patient records in the form of benchmark datasets and live streaming datasets. In this paper, we propose a new Moth Flame Feature Optimization based Temporal Convolutional Neural Network (MFO-TCNN) for performing disease prediction processes. The proposed model is capable of predicting the diseases and their levels without the intervention of human beings. The proposed MFO-TCNN has a high prediction accuracy due to the MFO’s incorporation of an effective tuning process on the existing T-CNN. The structure of the T-CNN is explained in the next subsection.
T-CNN
The existing Temporal Convolutional Neural Network (T-CNN) [29] works like a standard CNN with the incorporation of the temporal constraints. Generally, the CNN consists of five layers, and each layer is responsible for performing a certain job. Among them, the convolutional layers build the feature maps and the classification layers help finalize the result of CNN. All the layers of CNN are explained one by one in detail in this section. First, the input layer preprocesses the input values of the attributes by eliminating the noisy, redundant, and meaningless data by applying the basic level filter. Second, the convolutional layers confine the features that are extracted or filtered from the input layer by applying the filters, and the feature mapping process is initiated and obtained the results for effective classification. Moreover, the values (neurons) between the two different layers are connected together by considering the necessary weightages for the concerned attributes. The convolutional layer’s output is presented in equation (24).
The third layer is a normalization layer, which is available in the convolutional and ReLU layers for enhancing the training attributes of the input dataset. Moreover, it is used to normalize the gradient process and the normalization process effectively by adjusting the training process. The fourth layer is ReLU, which is used to map the layers in the convolutional layer part by applying the non-parametric layer called the ReLU layer and also not inhibiting the weights or bias values. The fifth layer is the Max-Pooling layer that is responsible for performing the feature mapping process and is also generated by applying the down sampling process of the convolutional layer on input data. The sixth layer is a fully connected layer that is used to map the features of convolutional layers and also forward them to the fully connected layer where the final decision is made and the final result is obtained as per the given equation (25).
The various weightages used in this proposed CNN are identified by using the MFOA and performing the training process. The seventh layer is the softmax layer, which produces an output which is normalized with the help of the softmax activation function for ensuring and enabling the classification tasks. Finally, the classification layer is used for performing classification processes with the help of the activation function on all the inputs of the various disease datasets, including heart, diabetes, and cancer.
The tuning process is a process of feature selection and optimization over the input datasets. For safeguarding and maintaining human health from various deadly diseases, human health is to be monitored periodically to detect the occurrence of the symptoms. The identification of symptoms is helpful for identifying the various diseases accurately. It is an important task that detects the diseases accurately. If one fails to identify the symptoms or wrongly identifies the symptoms, it may severely affect the human being. To perform this task, the Temporal CNN is applied in this work, which is used to predict and detect diseases. The high prediction accuracy is obtained by using the optimal features along with the necessary weights of the T-CNN using the newly proposed MFO.
In this work, the newly proposed IoT and Fuzzy logic incorporated e-healthcare system is developed to predict which patients are affected by diseases. This system is used to predict heart disease, diabetes disease, and cancer disease and also to know the disease severity level. This e-healthcare system incorporates the newly proposed Intelligent Mahalanobis distance aware Fuzzy Weighted K-Means Clustering Algorithm (IMFWKCA) to group the high dimensional data and a newly developed Moth Flame Optimization Tuned Temporal Convolutional Neural Network (MFO-TCNN) to predict the diseases. Here, the IMFWKCA is used to group the high-dimensional data and also to reduce the dimensionality. Then, the reduced dataset is considered as input for the newly proposed MFO-TCNN to categorize the record as disease affected or not. In addition, the disease severity is also measured by the proposed classifier.
Results and discussion
This section provides the detail about the experimental setup and datasets, and evaluation parameters in different subsections and also demonstrated the experimental results and comparative analysis. First, the experimental setup is explained in detail.
Experimental setup and datasets
The proposed e-healthcare system has been developed and implemented using the Python programming language. The experiments have been done by applying the newly developed software to the IoT devices with Raspberry PI software. The required sensors are fixed in the new e-healthcare system to measure the heartbeat rate, blood pressure and glucose level. All this information has been collected and is also stored as patient records. In addition, the heart, diabetes, and cancer datasets are available in the standard UCI repository for performing the training and testing processes. The IoT devices are equipped with the necessary hardware devices, microcontrollers, and LoRa communication devices to transmit the patient data for the newly developed software. Moreover, the serum cholesterol and glucose levels have been collected with the help of a wearable device, and the electrocardiographic data is also measured by applying the heart monitor board (AD8232). In addition, the Raspberry PI software is used for collecting the highest heart rate, oldpeak, and slop [30].
Evaluation parameters
The evaluation parameters such as accuracy (ACC), recall (REC), specificity (SPEC), f1-score (FS) value, and Matthew’s correlation coefficient (MCC) are calculated by using the formulae given in the equations (26)–(31) for evaluating the proposed model by considering the True Positive (TP), True Negative (TN), False Positive (FP), and False Negative (FN).
This subsection demonstrates the experimental results that prove the efficiency and effectiveness of the proposed e-healthcare system and also proves it to be better than other healthcare systems that are available in the literature in terms of prediction accuracy.
The prediction analysis over the optimized featured dataset as five different clusters is demonstrated in Fig. 2. This analysis considered the precision, recall, and f1-score values for analyzing the prediction performance.

Prediction with optimized featured datasets.
The performance significance on the optimized featured dataset is demonstrated by conducting various experiments that consider the different five clusters in terms of the standard evaluation parameters. When compared to other clusters, cluster 5 achieves the best performance in all parameters, while clusters 2 and 3 achieve lower performance.
The comparative analysis between the proposed e-healthcare system, T-CNN, and T-CNN with proposed IMFWKCA is shown in Table 1. Here, five different experiments have been conducted for the standard benchmark dataset and the streaming patient records as an input dataset. Here, heart, diabetes, and cancer disease datasets are considered in both types of datasets for conducting experiments.
Comparative analysis
The proposed e-healthcare system performed better than the T-CNN and the T-CNN with the proposed IMFWKCA in all the five experiments. Here, the performance of the existing T-CNN is lower than the proposed e-healthcare system and T-CNN with IMFWKCA due to the unavailability of the proposed data clustering and feature optimization algorithm. Moreover, better prediction accuracy is obtained in all the experiments for the streaming dataset than for the standard benchmark dataset.
The training and testing time taken is analyzed and shown in Table 2, which considers two different datasets that are used in this work for evaluating the proposed e-healthcare application. The training time and testing time of two different datasets were considered in this time analysis.
Training (TR) and Testing (TST) time analysis
The training and testing time for the e-healthcare application is slightly higher than the existing T-CNN alone and also lower than the existing disease prediction system. The incorporation of the newly proposed clustering and feature optimization algorithms may consume more time than the existing Temporal CNN. However, it achieved better prediction accuracy in all the experiments and clusters that were demonstrated in the previous results.
The prediction accuracy analysis between the domain expert and the proposed e-healthcare system is demonstrated in Fig. 3. The different numbers of records from the various types of diseases such as coronary artery and vascular disease, heart rhythm disorders (arrhythmias), heart failure, structural heart disease, diabetes, and cancer disease affected patients’ records have been considered in this work.

Prediction accuracy analysis between the domain expert and proposed e-healthcare system.
The newly developed e-healthcare system outperforms the domain expert in prediction accuracy. The prediction accuracy on various patient records goes down and also stabilizes at the same percentage of prediction accuracy when the number of patient records is increased from 1000 records. The reason for the achievement is the use of an effective deep learning algorithm with feature optimization and the clustering method.
Figure 4 shows the comparative analysis between the proposed e-healthcare system and the existing disease prediction systems such as T-CNN with feature selection [29], CNN, FTCM [7], TFMM-PSO [9], and Artificial Neural Network (ANN). Here, the five different experiments, such as Experiment 1 (E1), Experiment 2 (E2), Experiment 3 (E3), Experiment 4 (E4), and Experiment 5 (E5), have been conducted with different sets of patient records with the combination of live streaming datasets and the benchmark dataset for evaluating the proposed e-Healthcare system and the existing disease prediction systems.

Comparative analysis.
In this comparative analysis, the performance of the proposed e-Healthcare system was obtained with better performance in all the five different experiments, such as E1, E2, E3, E4 and E5. The experiments have been conducted by using various combinations of patient records which are selected randomly from the live streaming dataset and the standard dataset. The reason for the performance improvement is the application of effective feature optimization tuned to Temporal CNN and the proposed Intelligent Mahalanobis distance based Fuzzy Weighted K-Means Clustering Algorithm. Here, the fuzzy logic and temporal constraints are useful for making effective decisions on patient records.
In this paper, a new IoT and Fuzzy-aware e-healthcare system has been proposed and implemented for predicting heart disease, diabetes disease, and cancer disease effectively. The proposed system works with the proposed Mahalanobis distance-aware Fuzzy Weighted K-Means Clustering Algorithm (MDFWKCA) to group the high dimensional data and a newly developed Moth Flame Optimization Tuned Temporal Convolutional Neural Network (MFO-TCNN) for predicting the diseases effectively. In this work, the high-dimensional data is grouped by applying the proposed clustering algorithm according to the differences between the records and reducing the dimensionality as well. Here, the high-dimensional data is subjected to the dimensionality reduction process and grouped into different subgroups. Then, the grouped dataset is forwarded to perform the classification by using the newly proposed feature optimization tuned temporal CNN. Here, the existing MFO algorithm is used for tuning the features with adjusted weights to the Temporal CNN to make a final decision on medical records. The proposed IoT-aware e-healthcare system is evaluated by using the standard benchmark datasets and the live streaming patient records that are generated through IoT devices and from various hospital records by conducting various experiments and also achieving better performance in terms of precision, recall, f-measure and prediction accuracy. Future work can be done in this direction with the introduction of a new optimization technique for tuning the deep learning algorithm to improve the disease prediction accuracy.
