Abstract
In this paper, to improve the situation of singleness of selecting results in hesitant fuzzy set decision-making and expand the range of choices for decision makers, we construct a hesitant fuzzy set clustering algorithm combined with fuzzy matroid operation. The algorithm synthesizes the r-cut set, fuzzy shrinking matroids in the fuzzy matroids and the operational properties of the fuzzy derived matroids, the r value also is used to connect the two types of fuzzy matroids to form a clustering algorithm. Finally, we apply the algorithm to the hesitant fuzzy set decision-making of job seekers choosing recruitment websites, each recruitment website as an optional scheme is divided into three categories of excellent to inferior schemes to provide job seekers with ideas and methods for favorably selecting recruitment websites.
Keywords
Introduction
Matroid was introduced by Whitney [1] in 1935 and looked as the concept of extended linear algebra and graph theory. This theory provides great help for solving the combinatorial optimization problems. Matroid has been developed in combination with other theories over the years. For example, the expansion of matroid of rough set [2], concept of lattice [3, 4], etc. Among them, The theory of fuzzy matroid, which combines matroid and fuzzy sets, was considered as an important extension of matroid. Fuzzy set theory was first proposed by Zadeh [5] in 1965. The research on the way of expending fuzzy implications by Han et al. [6]. In 1988, the theory of fuzzy matroid was first proposed by Goetschel and Voxman in “Fuzzy Matroids” [7]. In the development of fuzzy matroid, the properties of L-fuzzy matroids and generalized fuzzy matroids was discussed by Shi [8, 9]. Li et al. [10] mainly studied and proved the tree structure of G-V fuzzy matroid. Li et al. [11] studied the related definitions and operators of intuitionistic fuzzy matroids and the relations between intuitionistic fuzzy matroids, G-V fuzzy matroids and H fuzzy matroids. Similarly, fuzzy set theory was also applied to the expression of fuzzy problems. In 2009–2010, Torra and Narukawa [12, 13] proposed the concept of hesitant fuzzy sets for the first time. Since then, scholars at home and abroad have started in-depth research on hesitant fuzzy decision-making problems.
In recent years, with the deepening of informationization of social, the application of cluster analysis in hesitant fuzzy set decision-making has attracted much attention. There already had some clustering algorithms for different fuzzy numbers. For intuitionistic fuzzy numbers, there had intuitionistic fuzzy hierarchical clustering and the netting clustering analysis of Xu [14]; for type-2 fuzzy numbers, there had interval C-means fuzzy clustering and the similarity numbers of Hwang et al. [15] and Yang et al. [16]; for hesitant fuzzy numbers, there had hesitant fuzzy clustering with correlation coefficients of Chen et al. [17], hesitant fuzzy hierarchical clustering method for HFAH based on distance matrix and it used cases and hesitant fuzzy minimum spanning tree clustering algorithm combined with graph theory of Zhang et al. [18], hesitant fuzzy K-means clustering with hierarchical clustering as initial result of Chen et al. [19]. In addition, there were some literatures on the application of fuzzy clustering model, Alghamdi et al. [20] used the fuzzy clustering method of C-means to classify the academic buildings in universities, which were three categories with different contribution to the environment, so as to improve the academic buildings with low contribution. Han et al. [21] through scaling of linguistic variables and minimizing cross-entropy principle classified the tourist destinations in different regions and analyzed the robustness of the clustering results. According to the existing garbage management regulations and the different impacts of different types of garbage on the environment, Cao et al. [22] used the fuzzy clustering analysis method to reclassify the garbage, so as to reduce the damage to the environment. Guajardo et al. [23] introduced Mahalanobis distance into the traditional distance clustering method to generalize the distance in the fuzzy system. This method could also identify the distance that could not be measured by the traditional Euclidean distance. Cai et al. [24] constructed a similarity clustering method combining reconstruction coefficient and pairwise distance, which could effectively reduce the influence of outliers. D’Urso et al. [25] proposed the robust fuzzy c-medoids clustering method for imprecise data, it could improve information gain.
In the existing results of the research of hesitant fuzzy clustering algorithm, most of the algorithms focus on the calculation of the correlation matrix and the distance between the schemes and the ideal schemes, the elements of these algorithms basically include the calculation and weight of hesitant fuzzy numbers and most of clustering results were only a few scheme sets, but there was no further comparison for the clustering results. This paper combines hesitant fuzzy information with fuzzy matroid computation, then the schemes are clustered by computing and transforming the subsets of fuzzy matroids, finally, according to the clustering results, there kinds of scheme sets meaning advantages to disadvantages can be obtained for decision makers to choose.
Methods
In this paper, we establish a hesitant fuzzy clustering model through the operation of fuzzy matroids.
Firstly, by Definition 2.1, we define the sign of hesitant fuzzy evaluation value.
The hesitant fuzzy numbers in this model are all the normalized numbers that length of k, expressed as h ={ t1, t2, ⋯ , t k }.
According to Zeshui Xu’s [27] data processing method, there are two ways: pessimistic rule and optimistic rule. They apply to prudent decision makers and risky decision makers respectively. We obtain the normalized hesitant fuzzy numbers by pessimistic rule. Let all hesitant fuzzy numbers exchange to equal length hesitant fuzzy numbers and reorder the numbers in hesitant fuzzy numbers by value from small to large, expressed as:
In the hesitant fuzzy decision-making, suppose there are m kinds of alternatives, expressed as: A1,...,A i ,...,A m and n kinds of related attributes, expressed as: C1,...,C j ,...,C n .
Then the normalized matrix of the hesitant fuzzy evaluation matrix can be expressed as:
Let
and
Then we define the fuzzy matroids based on definition 2.2 and 2.3.
Support set:
r-Cut set:
R+ (μ) = { μ (x) |μ (x) > 0 } . m (μ) = min R- + (μ),
|μ| = ∑x∈Xμ (x).
(1) (Hereditary property)
if μ ∈ ℓ , υ ∈ F (E) , υ ⩽ μ, then υ∈ ℓ.
(2) (Exchange property)
if μ, υ ∈ ℓ , |suppμ| < |suppυ|, then exists ω∈ ℓ, such that μ < ω ⩽ (μ ∨ υ). m (ω)⩾ min { m (μ) , m (υ) }.
We define the mapping of the article clustering model according to Definition 2.2. The following defines the mapping of the clustering model,
Define the mapping
Then, establish the initial evaluation fuzzy matroid,
Secondly, we establish the systolic fuzzy matroid to calculate schemes of I. Systolic fuzzy matroid is contraction of original matroid according to the relation conditions of r-Cut sets and the comparison results of mappings in subset families. Systolic fuzzy matroid can help us to select a class of schemes with higher fuzzy evaluation values, so we cluster and select Systolic fuzzy matroids for the first time. Then we list the conditions we need.
Let
and the other four conditions as follows, α, β, γ, μ ∈ F, α ⩾ γ and define, α ⩾ γ calculation formula is
C
r
(α) , C
r
(γ) ⊆ C. C
r
(α) ∩ C
r
(γ) = φ. C
r
(α) ∪ C
r
(γ) = C.
Let I = { μ ∈ F|μ ⩽ α, ∃ β ∈ F,
Note: let
Thirdly, we use the induced matroid to do the second clustering. Based on the average value of the induced function, the induced matroid can select the schemes with largest number of attributes. Based on the results of the first clustering, the schemes are divided into kinds of II and III.
Related definitions of induced matroid sequences and induced functions are listed in definitions 2.4 and 2.5.
then M
r
= (E, I
r
) is the matroid about E. ∃0 = r0 < r1 < ⋯ < r
n
⩽ 1 is fundamental sequence on M, the induced matroid sequence is expressed as:
if
if
According to these definitions, we list the formulas for calculating the induced functions.
Define number sequence r1,...,r
q
, let
We establish the induced matroid sequences:
Let
In that, and
X2 ={ μ|μ ∈ I1 } is the set of schemes of I, X3 ={ μ| ∀ μ ∈ I - I1 } is the set of schemes of III.
The last step is to get the evaluation clustering results, we list the results of the second and third steps, and the clustering results are shown as follows.
The elements in set X1 are the first kind, them in set X2 are the second kind, them in set X3 are the third kind.
Now, we express the whole algorithm process through Fig. 1.

Algorithm flow chat.
In the second part of this paper, we have constructed a complete clustering model. In this part, we choose the problem of selection of the recruitment website for job seekers to illustrate the use of this method. Let’s talk about the importance of solving this problem and preliminary establishment of the model.
In the digital age, the operation of all walks of life is closely related to the Internet, and online recruitment has gradually become an extremely important recruitment method, how to choose the recruitment website has become the primary problem for online job seekers. Many scholars have put forward some evaluation methods for this problem. From the two-way perspective of enterprises and job seekers, Dineen et al. [31] pointed out that the feedback of online recruitment was effective for job seekers; Coyle et al. [32] believed that the recruitment website could attract the attention of job seekers in form and expand the scope of website users; Cober et al. [33] pointed out that the website which focused on the recruitment process and screening process was more attractive to job seekers, etc. Most of the evaluation focused on the judgment of the influencing factors or the analysis of them, but the overall evaluation of each recruitment website is slightly weak.
We establish one clustering model combined with fuzzy matroid to study the methods to solve this kind of problems, considering the overall advantages and disadvantages of recruitment websites and the fuzziness of website evaluation. The influencing factors used in the example were the five attributes which affect the satisfaction of recruitment website constructed by Noh [34].
They are:
C1: information quality perception,
C2: website resource richness,
C3: website usability,
C4: website responsiveness,
C5: safety and reliability.
The following ten representative recruitment websites are selected for the example:
A1: 51job.com,
A2: zhaopin.com,
A3: chinahr.com,
A4: 58.com,
A5: chinajob.com,
A6: liepin.com,
A7: jobcn.com,
A8:job5156.com,
A9: wutongguo.com,
A10: job168.com.
And three experts made fuzzy evaluation on these websites, the hesitant fuzzy evaluation values are shown in Table 1.
The Hesitant Fuzzy Evaluation Values
The Hesitant Fuzzy Evaluation Values
We combine the example to apply the model in this paper, including four steps.
The normalized evaluation matrix M is expressed as Table 2.
The Normalized Evaluation Matrix
Those are the ten mappings in fuzzy set, the set of mappings is expressed as:
then establish evaluation matroid,
Take the r-value,
Then we can calculate the r-Cut set of each mapping according to formula (1) and select several r-Cut sets satisfying the four conditions 1), 2), 3), 4). And we can get the projects set I through formula (2) to be filtered in the
Calculate C r (μ i ) , i = 1, ⋯ , 10,
we can get
when the four conditions are satisfied,
get the set I, get
establishing one contractive fuzzy matroid,
the schemes set of I is obtained as follows,
including the two schemes A2 and A4.
The result of calculation is
According to the value of r1, family of sets A can be obtained, and the power numbers of A would be got by formula (10). Then the mean value of the power numbers can be used as r threshold value to select the projects with no less than 3 excellent attributes.
The available set is a subset family of all attributes and Σ
M
= (0.6, 0.6, 0.6, ⋯ , 0.5, 0.5, 0.5, 0.5, 0.5)
The schemes of II are A1, A6, A10. The schemes of III are A3, A5, A7, A8, A9.
The model results show that, when choosing the recruitment website, the choices should be sorted according to the following preferences:
The selection method is more profitable method for job seekers.
In this paper, from the results calculated by the clustering model, we can see that the preferred recruitment websites for job seeker are 58.com and zhaopin.com, the second type of recruitment websites are 51job.com, liepin.com and job168.com, the third type of recruitment websites which are slightly worse choices for job seekers are chinahr.com, chinajob.com, jobcn.com, job5156.com and wutongguo.com.
In this part, we will compare the results of the algorithm case in this paper with existing ranking websites results.
Now, we compare the results with the ranking websites. According to the ranking of most recruitment websites on November 20, 2020, as shown in Table 3.
Comprehensive ranking of the most recruitment websites
Comprehensive ranking of the most recruitment websites
Data Sources: top.chinaz.com.
The ranking from top.chinaz.com in Table 3 is based on the numbers of Alexa weekly ranking, Baidu weight, PR and anti-chains, it can reflect the popularity and importance of the websites from the two aspects of website traffic and website quality up to November 20, 2020.
In Table 3, except that the position of 51job.com and 58.com, job5156.com and job168.com are contrary to the results of clustering model in this paper, the difference between the ranking and calculation results of these four websites may be due to different calculation properties and evaluation methods, the ranking of other websites basically corresponds to the calculation results of the model. This shows that the algorithm model in this paper is applicability.
This clustering model is different from the traditional ranking in that it classifies and sorts the alternatives by fuzzy matroid. As present, many scholars have studied the hesitant fuzzy decision-making problems with multiple alternatives, either single ranking results or clustering results of similar schemes, but there are few sorted clustering results. The algorithm in this paper can be applied to the selection of multiple schemes, it is easy to calculate and the clustering results are concise. But the clustering threshold r value of this algorithm is too dependent on the evaluation value, which would lead to high subjectivity of the calculation results, and it does not distinguish the weight of evaluation attributes, the work of increasing attribute weight and objectivity of evaluation can be considered in the next stage.
Based on the calculation of contractive matroid and induced matroid in fuzzy matroid theory, this paper establishes a clustering model. The model can divide the schemes into three categories, and provide decision-makers with choice ideas from the optimal arrangement. At the end of the paper, taking how job seekers choose recruitment websites as an example, we use this clustering model and make the final results. The calculation process of the model shows that the model focuses on the integrity of the attributes in each scheme, in the process of scheme screening, the r value used by the first cluster is related to the evaluation number, the second clustering is related to the number of qualified attributes in the scheme. the clustering method is more comprehensive in numerical aspect.
However, the data processing in the model, as well as the correlation between the attributes and consideration of the attribute weight, these two aspects are slightly deficient and still need to be improved, there is also a lack of data analysis in the model, we will strengthen the study of a lot kinds of algorithm analysis and the applications for different purposes. Especially in data analysis methods, now many methods of data analysis have been researched. For example, several better methods for this paper, Haghbin et al. [35] researched the asymptotic distribution of sample proportion applied to confidence interval and hypothesis. Mahmoudi et al. [36, 37] studied the comparison method of regression models of in dependent data sets and the goodness of fit test based on the multiple testing. In the future, we will improve the model by learning analysis methods suitable for hesitant fuzzy data and focus on the ranking function of hesitant fuzzy matroid and the reasonable attribute weight in the clustering method of hesitant fuzzy matroid, and improve the theoretical system of hesitant fuzzy matroid.
