Abstract
Coal is a primary natural resource of fuel that is efficiently used for electricity generation, steel or iron production, and household usage. Characterization is needed for industries to understand the quality of coal before shipping. Currently, industries follow chemical, microscopical, and machine-based analysis as the gold standard for coal characterization. These conventional analyses of coal are an indispensable method over the years and have tested by high skilled petrologists. Though, these types of optical or machine-dependent recognition of coal category samples are quite slow, expensive, and restricted by subjective analyses with less accuracy. The main aim of this research is to propose an accurate, time, and cost-effective machine learning-based automated characterization system by analyzing coal color and textural features. This paper comes up with a quantitative approach toward the characterization of dissimilar types of coal for better utilization in industries. The proposed ensemble learning coal characterization method provides an accuracy of around 97% and takes less computational time than conventional methods. Hence, the proposed automated coal characterization system provides support to industries in the development of computer-aided assessment of coal category samples.
Introduction
Coal is a natural fossil fuel source of energy for electricity generation, steel or iron production, and household practices. The characterization of coal quality plays a vital role in the suitable exploitation of coal. The quality of coal is usually followed by accomplishing arbitrary quality tests on the ultimate finished coal wedges before shipping to industries. Currently, industries evaluate the grade of coal using various conventional methods, namely chemical analysis, microscopical, and machine-based analysis established from prevailing expertise that is validated with ordinary samples [1]. Characterization using these conventional procedures are time-consuming, expensive, and also dependence on expert petrologists that may give subjectivity in the result. Chemical analysis results loss of concentrated coal during the process of raw ore crushing. It also causes severe air pollution because of the emission of chemical gases and the crushing of coal blocks into fine particles.
Hence, an automated coal characterization system would be preferred for industries which is much more intelligent than the traditional one and also environment-friendly. Generally, coal images can be categorized according to their structural, color, and textural characteristics. This research comes with an idea to classify the coal quality corresponding to its color and textural features utilizing image processing, pattern recognition techniques, and machine learning models.
Related works
In recent years, with the popularization of computer and high-performance imaging equipment, the method of image-based characterization is easy to be realized. It can be seen that microscopic images color, structures, and texture provides excellent differences in the analysis of coal, and these features have been frequently used to characterize them. Till date, the countless researcher has suggested the coal characterization techniques. Hong et al. [2] proposed a coal gauge sorting system using a convolutional neural network. Hao et al. [3] discussed the image-based fractal characterization of microfracture structure in coal using CT images. Qinghe et al. [4] studied pores in pulverized coal using the image analysis method. The feature engineering methods, such as the extraction of feature texture based on the gray level symbiotic matrix, [5, 6] is used to identify the coal gangue. Zhang et al. [7] suggested a genetic algorithm based on SVM for the prediction of ash content in coarse coal by image analysis. Mejia et al. [8] proposed the automated maceral characterization using histogram analysis of the color features of the images. Van et al. [9] modeled the dissemination of numerous coal constituents in the feature space utilizing a simple nearest neighbor and the hybrid model to categorize every pixel in the image. Jing et al. [10] discussed the coal analysis of topological and geometrical characteristics of the fracture system using X-ray images. Li et al. [11] suggested the quantitative characterization and visualization of coal fracture using micro CT scanning. Ren et al. [12] discussed the fractal characterization and pore structures of synclines coal. Joana et al. [13] discussed the geochemical and petrographical characterization of coal using the genetic geodynamic model. Chaves et al. [14–16] proposed different methods for char characterization using image processing. There are many other methods introduced in literature for coal characterization [17–19].
The existing schemes are satisfactory but unable to extract image features like color and textural features with better accuracy for coal quality characterization using an image processing method. Hence, in this paper, we intended at the development of a computer-based coal quality characterization system utilizing image processing and machine learning methods for subtyping the coal categories as per classes CL1, CL2, and CL3, which are further named as good, average and poor quality of coal. The main objective of the proposed work is to develop a machine learning system using an ensemble-based algorithm for automated coal characterization. The motivation behind this research is to overcome the quality examination challenges confronted by the manufacturing and mining industries by offering the computer-based technique. The proposed scheme improves the results that can be acquired by examining the textural and color features of coal samples. Such computerized systems guarantee the reliability, cost-effectiveness, exactness, consistency in outcomes, and easiness. This study demonstrates the comparative performance analysis of the proposed model (ensemble of classifiers (EoC)) and standard classifiers (i.e., decision tree, linear discriminant, and support vector machine) for coal characterization. Consistent characterization results have been accomplished utilizing the noticeable textural, color features, and the ensemble of classifiers (EoC).
Ensemble learning-based coal characterization
The ensemble learning-based characterization of coal types from the optical microscopic images is precisely presented in Fig. 1.

Ensemble coal characterization system.
These methods of automated coal characterization process have been concisely presented in the following sections.
The different types of coal sample images are collected from the CSIR-National Metallurgical Laboratory (NML), Jamshedpur, India. A Leica DM4500P optical light microscope has been used for capturing of coal images of size 1024*1024 pixels. It is assembled by monochrome camera and oil immersion lenses utilizing the green incident light. It is imitated under the polished surface. Here, the 500x magnification was utilized to capture the images. The captured image of high resolution provides good discrimination between different grades of coal. The overall one hundred twenty images have been captured from three different kinds of specified coal samples (i.e., CL1, CL2, and CL3). All images have been manually labelled, i.e., assigned to one of three categories of coal, by petrologists, according to its characteristics like carbon content, ash content etc. The microscopic pictures of different coal samples are shown in Fig. 2.

Different types of coal microscopic images.
The original coal images are comparatively bigger, so to characterize it appropriately, the method of sub-imaging is required. The chosen region of interest (ROI) should comprise repeated patterns of features. It is required because every coal sub-image sample in the original coal image has to be calculated for discriminating them into three classes of coal characteristics. The bounding-box [20] technique is used to crop the sub-image from the original sample image. The rectangular bounding-box method is used here. The rectangular area coordinates can be calculated by doing the mean of each pixel coordinates in that particular ROI area. Then, we provide the bounding box information like the top-left corner point and size of the crop window to crop the images. This method outcomes in sub-images comprising a distinctive solitary pattern with the hypothesis that there does not have any touching ranges or areas. The sub imaging stages are demonstrated in Fig. 3. A total of three hundred sixty sub-images containing a single coal texture pattern has been cropped and used as an input image for the coal characterization process.

Sub imaging of original sample image.
Some of the noises appeared during the image capturing and cropping process. So, to avoid this problem, pre-processing techniques are used that are achieved by selective filtering and contrast enhancement methods [21].
The process of feature extractions is beneficial when the image is more extensive. In this procedure, some features of the sample are extracted with the help of standard feature descriptors. Three types of features are extracted for coal image-based characterization: textural, color, and morphological features. In this paper, color and textural features have been used for automated coal characterization.
Color is measured as a significant feature descriptor for the characterization of coal categories. If the specific area is exaggerated, then the coal category area transforms the color efficiently. Color histogram technique [22] is widely used in extracting color features. The restriction in using this method is that it does not consider the spatial arrangement of color. To overcome this issue, the HSV color spaces [23] have been used here for feature extraction. HSV comprises of an H-hue, S-saturation, and V-value (brightness). Hue correlates precisely to the idea of tone, whereas saturation relates to the concept of tint, and value tells about the intensity. The hue, brightness, and saturation (HSV) are the color model considered here as a feature vector. Figure 4 shows the HSV color space representation of three different types of coal samples.

HSV color space representation of coal samples: (a) CL1, (b) CL2 and (c) CL3.
The gray-level co-occurrence matrix (GLCM) is used for describing the luminance variation and spatial correlation of images [24]. It is a method of extricating Haralick’s textural features. This dual probability takes the structure of a square array GL
d
, with two-dimensional matrix, rows, and columns, which is equivalent to several discrete gray levels (intensities) in the image being inspected. It can be represented as:
Here, n xy is occurrences number of pixels (x, y) present at a distance d in an image. The GL d co-occurrence matrix is having dimension n×n, where, n is a number of grays level in the image. In this research, five features of GLCM (i.e., correlation, contrast, energy, entropy, and homogeneity) were extracted for each type of coals and are used as feature vectors.
Altogether, these three color and five textural feature vectors are utilized in calculating the moment values (i.e., mean, skewness variance, and kurtosis) for each of the sub-imaged samples for feature extraction and then, provided as an input to the classifier to evaluate the accuracy of the machine learning models for coal quality characterization. There is a total of seventeen features that have been extracted, out of which five are textural features, and thirteen are color features.
Before feature selection and classification, the normalization of the featured data set, which has a diverse range of values and also, to evaluate the characterization ability of each feature or group of features in the labeled classes, is one of the mandatory steps for automated coal characterization. Hence, every input value has to be dispensed discretely and for every feature (X
i
) in training set, mean (α), and standard deviation (γ) are calculated. The arithmetical expression to compute the normalized features is [25]:
Classifier performance is affected by selected features, so it is essential to select appropriate features before classification. Hence, this process is known as feature selection. One of the statistical methods used for feature selection is ANOVA [26] testing. The Analysis of Variance, popularly known as the ANOVA, can be used in cases where there are more than two groups to select the appropriate features. A total of ten features has been selected using the ANOVA method.
In a pattern recognition problem, classification is primarily utilized for distributing the feature space into numerous classes or categories depending upon similar features. Depending upon a number of categories, each feature vector is assigned a category label, which is a pre-defined integer number and is based upon the classifier outcome. Every classifier must be designed in such a way that the use of a group of inputs deliver the anticipated set of outcomes. The aggregate estimated data are partitioned into testing and training data set. The training data is utilized for updating the weights, and this process of network training is known as learning. The testing data has been utilized for validating the performance of the classifier. In this research, we suggest the utilization of the ensemble of the classifier for labeling each coal sub-image as CL1, CL2, CL3 categories based on a group of estimated features. The performance of extracted color and textural features in classifications has also been examined with three other traditional classifiers, i.e., decision tree (DTC), linear discriminant (LD), and support vector machine (SVM). The appropriate parameter tuning has been done for every classifier to accomplish the optimal accuracy, and similar training and test data sets have been utilized for all while calculating their distinct classifier performance. Decision tree (DTC) [27] classification method has been used effectively for widespread varieties of the classification problem. The ‘tree’ consists of internal nodes, leaves, and the root node. Internal nodes split the dataset while leaves are called terminal nodes comprises of similar classes. The configuration of DTC includes fine tree type, with a hundred maximum splits using Gini’s diversity index. The main limitation with DTC is overfitting and also delivers less information on the connection between predictors and response. Linear Discriminant (LD) [28] classifier is a method utilized for data categorization and dimensionality deduction. The primary purpose of LD is to reserve most of the class discrimination facts without missing any information. The configuration of LD includes a full covariance structure. The small sample size issues are the main disadvantage of LD. Support vector machine (SVM) [29] used to examine patterns and data for adequate classification. SVM is a non-probabilistic classifier used to categorize new samples into corresponding similar categories using the hyperplane. The configuration of SVM includes a radial basis function as a kernel function with a five-kernel scale. The main drawback of SVM is that it has several vital constraints that need to be set appropriately to accomplish the optimal classification outcomes for any specified problems.
To overcome these issues, multi classifier methods or ensemble methods have been suggested here for better coal characterization. Multi classifier methods are more superior than single classifier equivalents because of several reasons: it diminishes the possibility of poor selection; complicated decision boundary has been learned efficiently; and beneficial with different features.
Ensemble classification for automated coal characterization
It is extremely required to sustain a high classification and less error percentage in an automated coal characterization system. Though, it is tough for a single machine learning classifier to accomplish this for a complex coal image characterization problem. To overcome these issues, an ensemble-of-classifiers (EoC) [30] based method for the classifications of extracted coal textural and color features are examined here. An ensemble classifier method, also called a multi classifier method, made by merging many miscellaneous classifiers. Miscellany may be accomplished by utilizing fully diverse groups of classifiers and training data set for every classifier. The concept behind this is that every member of the ensemble classifier will produce diverse decision boundaries and creates diverse error, and appropriate grouping of classifiers will decrease the overall error. Bagging is utilized here to promote diversity to train every member of the ensemble classifier utilizing arbitrarily depicted a subset of the training set. Class labels have been produced by distinct classifiers that are united utilizing majority votes, and class label selected by maximum classifiers is the ultimate ensemble classifier conclusion. The architecture of an ensemble of classifiers (EoC) for coal characterization is depicted in Fig. 5.

Ensemble classifier architecture for coal characterization.
Here, in the primary phase, DTC has been used as source classifier, and flexible styles of DTC gained utilizing diverse parameters setting has been used to form an ensemble. After that, in the secondary phase, groups of classifiers having diverse topologies have been utilized for constructing the ensemble. Generally, the combination or ensemble of 3 distinct classifiers, i.e., DTC, LD, and SVM, has been discovered to execute best with available data and utilized for final coal characterization. Hence, only the investigational outcomes of the secondary phase of execution of an ensemble of classifiers with diverse topologies are shown in Table 1. The algorithm for ensemble classification for an automated coal characterization has been briefly discussed in few steps.
Classification accuracy with standard classifier
Algorithm:
Choose one classifier as source classifier.
Select a group of three or five classifiers having different topologies to construct an ensemble.
Train the data.
The majority of voting gives a final class label.
The cross-validation technique [31] is used when there is not sufficient data present to generate the validation set. In this piece of research, we have used the k-fold cross-validation resampling process. In this process, the original sample set is divided via stratified split into i equivalent size disjoint subsets S1, S2, ... ... . Si, and after that i, runs of the training sample tests are executed. Taking into consideration the value of i as five, the entire sample sets are arbitrarily separated into five portions, through which all category is signified in a similar range as original sample sets. Hence, the process is executing an entire of five epochs having a different grouping of testing and training samples. Ultimately, the five operation approximations have been averaged to provide on the whole approximation of classifier recital in terms of computational time and accuracy.
Results
The projected idea has been executed using MATLAB 2015, and the investigational model has been processed utilizing an Intel(R) Core (TM) i7 with 3.20Ghz PC speed, through 64-bit, RAM of 24GB executed on the professional operating system of Windows 10. Coal quality is characterized based upon color and textural features present in the coal samples. The data are balanced. The measured feature value analysis demonstrates that the coal samples are distinguishable and appropriate classifier with better accuracy is needed for characterization. Considering this, an ensemble classifier system has been suggested for the coal quality characterization. Table 1 shows the average performance of classifiers based on accuracy utilizing five-fold cross-validation.
Here, the best overall accuracy of around 97% is achieved with the proposed ensemble classifier method for coal sub-image samples with a five-fold cross-validation method. In EoC, we can see that the accuracy is more than 90 % in each five-fold constantly, and the average of every fold was discovered to be higher than all other classifiers. The computational time (in a sec) that consists of training, as well as the testing stage, has been recorded for each of the mentioned classifiers and shown in Fig. 6.

The computational time for characterization.
The average computation time for EoC was discovered to be a little bit higher than that of DTC, LD, and SVM but still less than the manual methods of industries. Hence, the recommended system could support in real mining, steel, and manufacturing industries for the screening of coal samples.
The prediction of the model in classification problem is one of the mandatory steps to recognize the feasibility of the proposed method. This can be done in various ways. In this paper, we use scatter graphical plot analysis, confusion matrix, and ROC curve for the prediction of the proposed model.
Scatter plot
The scatter plot [32] graphical analysis observes which variables separate the class colors most clearly. Here, blue points represent class CL1, red points CL2, and yellow CL3, respectively. It has been observed that CL1 is easily separable from the other two classes with all four classifiers. The CL2 and CL3 classes are much closer together in all classifier measurements. Here,×symbol indicates misclassified points. The blue points (CL1) are all correctly classified, but some of the other two categories of coal are misclassified. Figure 7 shows the different classifier scatter plots.

Scatter plot: (a) DTC, (b) LD, (c) SVM and (d) EoC.
The CM (confusion matrix) [33] chart has been used to recognize how the currently chosen classifier executed in every class. Here, rows illustrate correct classes, while columns illustrate the predicted classes. Also, diagonal cells illustrate where correct classes and predicted classes counterpart. If compartments are green, then the classifier has been performed excellently and categorized as the interpretations of this particular true class appropriately. The true positive rates for accurately categorized points displayed in the green cell in the True Positive Rate column, while the false-negative rate for wrongly categorized points displayed in the light orange cell in the False Negative Rate column. Figure 8 shows the confusion matrix obtained by different classifiers.

Confusion Matrix: (a) DTC, (b) LD, (c) SVM and (d) EoC.
The ROC curve [34] demonstrates the false-positive rates versus true positive rates for the chosen trained classifier. An accurate outcome having no misclassified object becomes 90 degrees at the top left of the graph. A bad outcome shows a line or streak at 45 degrees. The AOC (Area Under Curve) number is a degree of the overall excellence of the classifier. Here, EoC shows the best result with AOC value 1. Figure 9 shows the ROC of different classifiers.

ROC curve: (a) DTC, (b) LD, (c) SVM and (d) EoC.
This prediction of above methods predicted that the machine learning-based automated coal characterization shows better performance concerning the accuracy and computational time than existing conventional methods. So, we can say that the proposed intelligent characterization technique of coal microscopic images using ensemble learning can be feasible and recommended to industries for coal characterization.
Coal is the frequently used fossil fuel for steel, power industries, and also for household practices. It has unique properties and behaviors, which makes it necessary to characterize for quality inspection before shipping to industries. Petrologist’s screening of coal samples is always a slow and subjective process. The underlying theme of this research is to develop an automated screening method using an ensemble classifier system for the quality characterization of coal. The proposed multi classifier system has an accuracy of around 97% and also, its computational time is faster than conventional methods. Hence, the proposed system could be used in industries for the preliminary screening of coal samples and assists in the development of an automated coal characterization system.
Footnotes
Acknowledgment
The council of scientific and industrial research (CSIR) India, supports the research as mentioned above under Direct SRF. We would also like to thanks Dr. Vivek Mishra, CSIR-NML, Jamshedpur, India, and Dr. Subrajeet Mohapatra, BIT Mesra, Ranchi, India for his valuable comments and guidance. The authors would also like to thanks to anonymous reviewers for their valuable suggestions and comments.
