Ontology geometry distance computation using deep learning technology

Abstract

The core problem in ontology mapping and in many ontology engineering applications is the calculation of similarity between concepts in an ontology. From the machine learning point of view, an optimal ontology similarity function is obtained by learning from a sample set, so that each pair of concepts is mapped to a positive real number that reflects the similarity between them. After the ontology is represented as a graph, the goal of ontology learning is to obtain a real-valued function which maps the vertices to the real axis and uses the distance between the images of two vertices to reflect the similarity between the corresponding concepts. In this paper, we present an ontology learning algorithm based on ontology geometry distance computation and deep learning techniques. The iteration procedure is designed, and the experiments show the effectiveness of the given ontology algorithm.

Keywords

1 Introduction

As a model for structured data storage, processing, analysis and computation, ontology is widely used in mainstream areas of computer science such as information retrieval, graphics and image processing, pattern recognition and granular computing. At the same time, owing to its powerful auxiliary functions, it is applied in many disciplines such as physics, chemistry, biology, pharmacy, medicine and materials science. As research deepens, new techniques are continually developed and put into practice, making ontology increasingly effective. See Swanson [1], Teymourlouie et al. [2], Viani et al. [3], Hippolyte et al. [4], Roth and Jornet [5], Cooper et al. [6], Pfaff and Krcmar [7], Travin et al. [8], Seipel et al. [9], and Blondet et al. [10] for more details.

Specifically, an ontology is a structured set of concepts that are related by some form of structure. In general, this association structure can be represented as a graph model. Each vertex in the graph corresponds to a concept in the ontology, and an edge between two vertices indicates a direct superior-subordinate relationship between the two concepts. Combined with learning theory, the goal of ontology learning is to obtain an ontology function that is used to calculate the similarity between ontology vertices. That is, by learning from ontology samples, we get $S : V \times V \to ℝ$ . The problem is that concepts in a general ontology are described individually, not in pairs. Therefore, it is not convenient to obtain an ontology sample set of paired vertex similarities without the help of domain experts. Moreover, the function S is not intuitive and cannot accurately express the meaning of the whole ontology structure. A more reasonable way is to learn from the sample set an optimal ontology function $f : V \to ℝ$ which maps each vertex to a real number, so that the similarity between two vertices v_i and v_j can be measured by |f(v_i) - f(v_j)|. The larger |f(v_i) - f(v_j)| is, the lower the similarity between v_i and v_j; conversely, the smaller |f(v_i) - f(v_j)| is, the higher the similarity between v_i and v_j. For several ontology related algorithms and applications, see Hashemi et al. [11], Abdi et al. [12], Mesmer and Olewnik [13], Pan et al. [14], Rizzo et al. [15], Maldonado et al. [16], Ansari [17], Fouda et al. [18], and Zhu et al. [19].

Several papers have contributed to ontology learning algorithms from both the theoretical and the applied points of view. Gao and Xu [20] presented the stability analysis of learning algorithms for ontology similarity computation. Gao et al. [21] studied the strong and weak stability of the k-partite ranking based ontology learning algorithm. Gao et al. [22] proposed an ontology learning algorithm for similarity measuring and ontology mapping by means of linear programming. Gao and Farahani [23] determined generalization bounds and uniform bounds for multi-dividing ontology algorithms with convex ontology loss function from a statistical point of view. Gao et al. [24] discussed distance learning techniques for ontology similarity measuring and ontology mapping.

In this paper, we focus on an ontology learning algorithm based on geometry distance computation and deep learning techniques. The rest of the paper is organized as follows: first, we introduce the setting of geometry distance computation in the ontology problem and review the ontology distance computation algorithm based on projection onto the positive cone in a reproducing kernel Hilbert space; then, we propose our main ontology geometry distance learning algorithm using deep neural networks; finally, we verify the effectiveness of the algorithm by several experiments.

2 Overview of geometry distance computation

To represent the ontology algorithm in a mathematical model, the information of each vertex of the ontology graph is represented by a p-dimensional vector, i.e., $v \in ℝ^{p}$ . In this way, the similarity between ontology vertices is transformed into the geometric distance between vectors. The most primitive choice is the Euclidean distance between the two vectors. However, this choice cannot reflect the essential relationship between ontology concepts. In this paper, the optimal geometric distance d is obtained by learning from ontology samples. We begin with the notation and the setting of geometry distance computation. In what follows, V always denotes the ontology vertex set as well as the ontology information matrix. When no confusion arises, we use v (or u) to denote an ontology vertex, its corresponding vector and its corresponding ontology concept; vectors are no longer set in bold.

Let I_n be the n × n identity matrix. The Frobenius norm of a matrix V is $∥ V ∥_{F} = \sqrt{Tr (V^{T} V)}$ , where Tr(·) denotes the trace of a matrix. The space of n × n symmetric positive definite matrices is denoted by $S_{+ +}^{n}$ . A function $d : V \times V \to ℝ^{+}$ can be regarded as an ontology distance function if, for any v₁, v₂, v₃ ∈ V, it satisfies non-negativity (d(v₁, v₂) ≥ 0), distinguishability (d(v₁, v₂) = 0 ⇔ v₁ = v₂), symmetry (d(v₁, v₂) = d(v₂, v₁)) and the triangle inequality (d(v₁, v₃) ≤ d(v₁, v₂) + d(v₂, v₃)).

In this fashion, the class of Mahalanobis distances with $M \in S_{+ +}^{p}$ , $W \in ℝ^{d \times p}$ (d ≤ p) and $M = W^{T} W$ can be defined as $\begin{aligned} d_{M} (v_{1}, v_{2}) & = \sqrt{(v_{1} - v_{2})^{T} M (v_{1} - v_{2})} \\ & = \sqrt{(v_{1} - v_{2})^{T} W^{T} W (v_{1} - v_{2})} \\ & = ∥ W v_{1} - W v_{2} ∥_{2} . \end{aligned}$ This implies that learning a Mahalanobis ontology distance metric M can be reduced to determining a linear transformation W which projects each ontology vertex $v \in ℝ^{p}$ into a low-dimensional subspace, such that the Euclidean distance between two ontology vertices in the transformed space equals their Mahalanobis distance in the original vector space.
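The equivalence above can be checked numerically. The following is a minimal sketch, assuming NumPy and a randomly drawn (not learned) projection W; the function names and the dimensions p = 5, d = 3 are hypothetical choices made only for illustration.

```python
import numpy as np

def mahalanobis_distance(v1, v2, M):
    """d_M(v1, v2) = sqrt((v1 - v2)^T M (v1 - v2))."""
    diff = v1 - v2
    return float(np.sqrt(diff @ M @ diff))

def projected_distance(v1, v2, W):
    """Euclidean distance between the projected vectors W v1 and W v2."""
    return float(np.linalg.norm(W @ v1 - W @ v2))

rng = np.random.default_rng(0)
p, d = 5, 3                      # hypothetical dimensions with d <= p
W = rng.standard_normal((d, p))  # illustrative projection, not a learned one
M = W.T @ W                      # induced Mahalanobis matrix

v1 = rng.standard_normal(p)
v2 = rng.standard_normal(p)

d_m = mahalanobis_distance(v1, v2, M)
d_w = projected_distance(v1, v2, W)
print(d_m, d_w)  # the two values agree up to floating-point error
```

In practice this means that one may store only W and compare projected vectors, which is cheaper than evaluating the quadratic form when d is much smaller than p.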

In this section, we mainly review the distance-based ontology learning algorithm connected to a reproducing kernel Hilbert space. In this setting, the training data are denoted by ${\{(u_{i}, v_{i}, l_{i})\}}_{i = 1}^{n}$ , where $u_{i}, v_{i} \in ℝ^{p}$ correspond to ontology vertices, and l_i ∈ {0, 1} with l_i = 1 if (u_i, v_i) is a pair of similar ontology vertices (concepts) and l_i = 0 if (u_i, v_i) is a pair of dissimilar ontology vertices (concepts). Let L₁ be the number of samples with l_i = 1 and L₀ be the number of samples with l_i = 0. Set $Ξ_{d} = \frac{1}{L_{0}} \sum_{i : l_{i} = 0} (u_{i} - v_{i}) (u_{i} - v_{i})^{T},$ $Ξ_{s} = \frac{1}{L_{1}} \sum_{i : l_{i} = 1} (u_{i} - v_{i}) (u_{i} - v_{i})^{T} .$ Hence, a dissimilarity hypothesis can be stated as

$ϱ (u_{i}, v_{i}) = \log \frac{\frac{1}{\sqrt{2 π | Ξ_{d} |}} \exp (- \frac{1}{2} (u_{i} - v_{i})^{T} Ξ_{d}^{- 1} (u_{i} - v_{i}))}{\frac{1}{\sqrt{2 π | Ξ_{s} |}} \exp (- \frac{1}{2} (u_{i} - v_{i})^{T} Ξ_{s}^{- 1} (u_{i} - v_{i}))} .$

A large ϱ(u_i, v_i) implies that u_i and v_i are dissimilar, and vice versa. Up to an additive constant and a factor 1/2, ϱ(u_i, v_i) equals $(u_{i} - v_{i})^{T} (Ξ_{s}^{- 1} - Ξ_{d}^{- 1}) (u_{i} - v_{i})$ , which motivates the following choice. Let Proj(·) be the projection onto the cone of positive definite matrices. Then, the Mahalanobis matrix in the ontology setting can be obtained by $M = Proj (Ξ_{s}^{- 1} - Ξ_{d}^{- 1})$ , where the projection is computed from the eigen-decomposition $Ξ_{s}^{- 1} - Ξ_{d}^{- 1} = U D U^{T}$ as $M = U D_{+} U^{T}$ with D = diag(d_i), D₊ = diag(max(d_i, ɛ)) and ɛ > 0 a very small real number.
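The construction of M can be sketched in a few lines. This is an illustrative sketch only, assuming NumPy, rows of U and V as the paired vectors, and synthetic data (similar pairs differ by small noise, dissimilar pairs by large noise); the helper names and the value of eps are hypothetical.

```python
import numpy as np

def pair_covariance(U, V):
    """Average outer product of the differences u_i - v_i (rows of U and V)."""
    D = U - V
    return D.T @ D / D.shape[0]

def project_to_pd_cone(A, eps=1e-6):
    """Clip the spectrum of a symmetric matrix: U diag(max(d_i, eps)) U^T."""
    A = (A + A.T) / 2.0                    # symmetrize against round-off
    d, Q = np.linalg.eigh(A)               # A = Q diag(d) Q^T
    return (Q * np.maximum(d, eps)) @ Q.T  # scale columns of Q, then recombine

def learn_mahalanobis(U, V, labels, eps=1e-6):
    """M = Proj(Xi_s^{-1} - Xi_d^{-1}) with labels l_i in {0, 1}."""
    labels = np.asarray(labels)
    xi_s = pair_covariance(U[labels == 1], V[labels == 1])
    xi_d = pair_covariance(U[labels == 0], V[labels == 0])
    return project_to_pd_cone(np.linalg.inv(xi_s) - np.linalg.inv(xi_d), eps)

# Synthetic pairs (an assumption for illustration, not the paper's data).
rng = np.random.default_rng(1)
n, p = 200, 3
U = rng.standard_normal((n, p))
labels = np.arange(n) % 2                       # alternate similar / dissimilar
scale = np.where(labels == 1, 0.1, 2.0)[:, None]
V = U + scale * rng.standard_normal((n, p))

M = learn_mahalanobis(U, V, labels)
clipped = project_to_pd_cone(np.diag([1.0, -2.0]))  # negative eigenvalue is clipped to eps
```

Note that the sample matrices Ξ_s and Ξ_d must be invertible for this direct form; the regularized kernel estimate discussed next addresses the case where they are not.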

Let $k : V \times V \to ℝ$ be a positive definite kernel defined on the ontology space V. By Mercer's theorem, for any positive definite kernel there exists a mapping φ : V → H into a reproducing kernel Hilbert space (RKHS) H. The purpose is then to obtain a Mahalanobis distance $d_{H} : H \times H \to ℝ^{+}$ associated with H, which can be expressed by $d_{H} (v_{1}, v_{2}) = \sqrt{(φ (v_{1}) - φ (v_{2}))^{T} M_{H} (φ (v_{1}) - φ (v_{2}))} .$ Moreover, the notations in this setting are rewritten as $Ξ_{H, d} = \frac{1}{L_{0}} \sum_{i : l_{i} = 0} (φ (u_{i}) - φ (v_{i})) (φ (u_{i}) - φ (v_{i}))^{T},$ $Ξ_{H, s} = \frac{1}{L_{1}} \sum_{i : l_{i} = 1} (φ (u_{i}) - φ (v_{i})) (φ (u_{i}) - φ (v_{i}))^{T},$ $ϱ_{H} (u_{i}, v_{i}) = \log \frac{\frac{1}{\sqrt{2 π | Ξ_{H, d} |}} \exp (- \frac{1}{2} (φ (u_{i}) - φ (v_{i}))^{T} Ξ_{H, d}^{- 1} (φ (u_{i}) - φ (v_{i})))}{\frac{1}{\sqrt{2 π | Ξ_{H, s} |}} \exp (- \frac{1}{2} (φ (u_{i}) - φ (v_{i}))^{T} Ξ_{H, s}^{- 1} (φ (u_{i}) - φ (v_{i})))} .$ The clipping of the spectrum is then given by $M_{H} = {Proj}_{H} (Ξ_{H, s}^{- 1} - Ξ_{H, d}^{- 1})$ .

Next, we show how to get $Ξ_{H, s}^{- 1}$ and $Ξ_{H, d}^{- 1}$ . Given a set of ontology vertex pairs ${\{(u_{i}, v_{i})\}}_{i = 1}^{n}$ , set Z = (u₁, ⋯, u_n, v₁, ⋯, v_n) and $J J^{T} = \frac{1}{n} \begin{bmatrix} I_{n} & - I_{n} \\ - I_{n} & I_{n} \end{bmatrix}$ ; then we infer $Ξ = \frac{1}{n} \sum_{i = 1}^{n} (u_{i} - v_{i}) (u_{i} - v_{i})^{T} = Z J J^{T} Z^{T} .$ Set Φ_Z = (φ(u₁), ⋯, φ(u_n), φ(v₁), ⋯, φ(v_n)); then the |H|-dimensional covariance matrix Ξ_H associated with the RKHS H is $Ξ_{H} = Φ_{Z} J J^{T} Φ_{Z}^{T}$ . The kernel matrix of Z is denoted by $K_{Z} = Φ_{Z}^{T} Φ_{Z} \in ℝ^{2 n \times 2 n}$ , that is, $[K_{Z}]_{ij} = k (z_{i}, z_{j})$ where z_i is the i-th column of Z, so that its blocks collect the values k(u_i, u_j), k(u_i, v_j) and k(v_i, v_j). Let the singular value decomposition of $J^{T} Φ_{Z}^{T} Φ_{Z} J = J^{T} K_{Z} J$ be $X_{Z} ϒ_{Z} X_{Z}^{T}$ . Let ι > 0 be a regularization parameter and $W_{Z} = J X_{Z} \sqrt{I_{2 n} - ι ϒ_{Z}^{- 1}}$ . Then, the regularized estimate of Ξ_H can be formulated as ${\hat{Ξ}}_{H} = Φ_{Z} W_{Z} W_{Z}^{T} Φ_{Z}^{T} + ι I_{H}$ . Hence, we obtain ${\hat{Ξ}}_{H}^{- 1} = (Φ_{Z} W_{Z} W_{Z}^{T} Φ_{Z}^{T} + ι I_{H})^{- 1} = \frac{1}{ι} (I_{H} - Φ_{Z} W_{Z} ϒ_{Z}^{- 1} W_{Z}^{T} Φ_{Z}^{T})$ and ${\hat{Ξ}}_{H, s}^{- 1} - {\hat{Ξ}}_{H, d}^{- 1} = \frac{1}{ι} (Φ_{Z_{d}} W_{Z_{d}} ϒ_{Z_{d}}^{- 1} W_{Z_{d}}^{T} Φ_{Z_{d}}^{T} - Φ_{Z_{s}} W_{Z_{s}} ϒ_{Z_{s}}^{- 1} W_{Z_{s}}^{T} Φ_{Z_{s}}^{T})$ .
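The factorization Ξ = Z J Jᵀ Zᵀ, and the reason the small matrix Jᵀ K_Z J carries the spectrum of Ξ_H, can be checked numerically. The sketch below assumes NumPy, the linear kernel k(a, b) = aᵀb (so that φ is the identity and K_Z = ZᵀZ), and one particular factor J = n^(-1/2) [I_n; -I_n]; the text only specifies the product J Jᵀ, so this choice of J is an assumption.

```python
import numpy as np

rng = np.random.default_rng(2)
p, n = 4, 6
U = rng.standard_normal((p, n))   # columns u_1, ..., u_n
V = rng.standard_normal((p, n))   # columns v_1, ..., v_n

Z = np.hstack([U, V])                                 # p x 2n
J = np.vstack([np.eye(n), -np.eye(n)]) / np.sqrt(n)   # 2n x n, an assumed factor of J J^T

# Direct definition of Xi versus the factored form Z J J^T Z^T.
xi_direct = sum(
    np.outer(U[:, i] - V[:, i], U[:, i] - V[:, i]) for i in range(n)
) / n
xi_factored = Z @ J @ J.T @ Z.T

# With the linear kernel, K_Z = Z^T Z, and J^T K_Z J = (Z J)^T (Z J)
# shares its nonzero eigenvalues with Xi = (Z J)(Z J)^T.
K_Z = Z.T @ Z
gram_small = J.T @ K_Z @ J
eig_small = np.sort(np.linalg.eigvalsh(gram_small))[-p:]
eig_xi = np.sort(np.linalg.eigvalsh(xi_direct))
```

This is why the decomposition can be carried out on a matrix built from kernel values alone, without ever forming Ξ_H in the feature space.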

However, ${\hat{Ξ}}_{H, s}^{- 1} - {\hat{Ξ}}_{H, d}^{- 1}$ cannot be computed directly, so another trick is needed. Let $C \in S_{+ +}^{n}$ , let otn denote the n ontology training vertices, and set $A_{s} = W_{Z_{s}} ϒ_{Z_{s}}^{- 1} W_{Z_{s}}^{T}$ and $A_{d} = W_{Z_{d}} ϒ_{Z_{d}}^{- 1} W_{Z_{d}}^{T}$ . We aim to solve the following projection problem onto the cone of positive definite matrices in H: $\underset{C \in S_{+ +}^{n}}{\arg\min} \; L (C) = ∥ Φ_{otn} C Φ_{otn}^{T} + Φ_{Z_{s}} A_{s} Φ_{Z_{s}}^{T} - Φ_{Z_{d}} A_{d} Φ_{Z_{d}}^{T} ∥_{F}^{2} .$

By computation, we obtain $\begin{aligned} L (C) = {} & Tr (K_{otn} C K_{otn} C) + 2 Tr (K_{Z_{s}, otn} C K_{Z_{s}, otn}^{T} A_{s}) \\ & - 2 Tr (K_{Z_{d}, otn} C K_{Z_{d}, otn}^{T} A_{d}) + c_{0}, \end{aligned}$ where $K_{otn} = Φ_{otn}^{T} Φ_{otn}$ , $K_{Z_{s}, otn} = Φ_{Z_{s}}^{T} Φ_{otn}$ and $K_{Z_{d}, otn} = Φ_{Z_{d}}^{T} Φ_{otn}$ are kernel matrices, and $c_{0}$ is a constant independent of C. By setting $\nabla_{C} L (C) = 0$ , we get

$C^{*} = K_{otn}^{- 1} (K_{Z_{d}, otn}^{T} A_{d} K_{Z_{d}, otn} - K_{Z_{s}, otn}^{T} A_{s} K_{Z_{s}, otn}) K_{otn}^{- 1} .$ Finally, writing $k_{u, otn} = φ (u)^{T} Φ_{otn}$ for the row vector of kernel values between u and the training vertices, the Mahalanobis distance in H can be described as

$\begin{aligned} d_{H} (u, v) & = \sqrt{(φ (u) - φ (v))^{T} Φ_{otn} C Φ_{otn}^{T} (φ (u) - φ (v))} \\ & = (k_{u, otn} C k_{u, otn}^{T} - 2 k_{u, otn} C k_{v, otn}^{T} + k_{v, otn} C k_{v, otn}^{T})^{\frac{1}{2}} . \end{aligned}$
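The final formula only needs kernel evaluations against the training vertices. The following sketch assumes NumPy, a Gaussian (RBF) kernel with a hypothetical width parameter gamma, and an arbitrary symmetric positive semidefinite matrix C standing in for the learned C*; none of these choices come from the paper.

```python
import numpy as np

def rbf_kernel(A, B, gamma=0.5):
    """Gaussian kernel matrix between the rows of A and the rows of B."""
    sq = (
        np.sum(A**2, axis=1)[:, None]
        + np.sum(B**2, axis=1)[None, :]
        - 2.0 * A @ B.T
    )
    return np.exp(-gamma * np.maximum(sq, 0.0))

def kernel_mahalanobis(u, v, train, C, kernel=rbf_kernel):
    """d_H(u, v) computed from the kernel rows k_{u,otn} and k_{v,otn}."""
    k_u = kernel(u[None, :], train)   # 1 x n row vector k_{u,otn}
    k_v = kernel(v[None, :], train)   # 1 x n row vector k_{v,otn}
    diff = k_u - k_v
    # For symmetric C this equals k_u C k_u^T - 2 k_u C k_v^T + k_v C k_v^T.
    val = (diff @ C @ diff.T).item()
    return float(np.sqrt(max(val, 0.0)))

rng = np.random.default_rng(3)
n, p = 5, 3
train = rng.standard_normal((n, p))   # the n training vertices (rows)
B = rng.standard_normal((n, n))
C = B @ B.T                           # placeholder for C*, symmetric PSD

u = rng.standard_normal(p)
v = rng.standard_normal(p)
d_uv = kernel_mahalanobis(u, v, train, C)
```

Evaluating a new pair therefore costs 2n kernel evaluations plus one n × n quadratic form, independent of the dimension of H.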

3 Deep learning based ontology distance calculating algorithm

Deep learning is a new research field in machine learning. In recent years, it has achieved breakthroughs in a wide range of applications such as speech recognition and computer vision. Its motivation is to build a model that simulates the neural connections of the human brain. When dealing with signals such as images, sounds and text, the data features are described hierarchically through multiple transformation stages, from which an interpretation of the data is obtained (for related studies, see Phong et al. [25], Hassan et al. [26], Oh et al. [27], Proenca and Neves [28], Biswas et al. [29], Ren et al. [30], Rad et al. [31], Lore et al. [32], Bianco et al. [33], Olmos et al. [34], and Treder et al. [35]). In this section, we introduce our main ontology distance function learning algorithm based on the deep neural network learning technique.

Assume that there are M + 1 layers in the designed network and d^(m) units in the m-th layer, where m ∈ {1, 2, ⋯, M}. Let $W^{(1)} \in ℝ^{d^{(1)} \times p}$ be the projection matrix to be learned in the first layer, $b^{(1)} \in ℝ^{d^{(1)}}$ the bias vector, and $s : ℝ \to ℝ$ a nonlinear activation function applied component-wise. For a fixed ontology vertex $v \in ℝ^{p}$ , the output of the first layer is given by $h^{(1)} = s (W^{(1)} v + b^{(1)}) \in ℝ^{d^{(1)}}$ , which also serves as the input of the second layer.

Similarly, the projection matrix, bias, and nonlinear activation function in the second layer are denoted by $W^{(2)} \in ℝ^{d^{(2)} \times d^{(1)}}$ , $b^{(2)} \in ℝ^{d^{(2)}}$ and s, respectively. Then, the output of the second layer is determined by $h^{(2)} = s (W^{(2)} h^{(1)} + b^{(2)}) \in ℝ^{d^{(2)}}$ . In general, the output of the m-th layer can be formulated as $h^{(m)} = s (W^{(m)} h^{(m - 1)} + b^{(m)}) \in ℝ^{d^{(m)}}$ and the output of the top layer is $f (v) = h^{(M)} = s (W^{(M)} h^{(M - 1)} + b^{(M)}) \in ℝ^{d^{(M)}}$ , where the parametric nonlinear function $f : ℝ^{p} \to ℝ^{d^{(M)}}$ is determined by W^(m) and b^(m) for m ∈ {1, ⋯, M}. For a pair of ontology vertices v_i and v_j, we then have $f (v_{i}) = h_{i}^{(M)}$ , $f (v_{j}) = h_{j}^{(M)}$ , and $d_{f}^{2} (v_{i}, v_{j}) = ∥ f (v_{i}) - f (v_{j}) ∥_{2}^{2}$ .
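The forward pass and the induced squared distance can be sketched as follows. This is a minimal illustration assuming NumPy, tanh as the activation s (the text does not fix a particular activation), and randomly drawn weights; the layer sizes are hypothetical.

```python
import numpy as np

def forward(v, weights, biases, s=np.tanh):
    """Compute f(v) = h^(M) via h^(m) = s(W^(m) h^(m-1) + b^(m))."""
    h = v
    for W, b in zip(weights, biases):
        h = s(W @ h + b)
    return h

def squared_distance(vi, vj, weights, biases):
    """d_f^2(v_i, v_j) = ||f(v_i) - f(v_j)||_2^2."""
    diff = forward(vi, weights, biases) - forward(vj, weights, biases)
    return float(diff @ diff)

rng = np.random.default_rng(4)
dims = [6, 4, 2]   # hypothetical sizes: p = 6, d^(1) = 4, d^(2) = 2
weights = [
    0.5 * rng.standard_normal((dims[m + 1], dims[m]))
    for m in range(len(dims) - 1)
]
biases = [np.zeros(dims[m + 1]) for m in range(len(dims) - 1)]

vi = rng.standard_normal(dims[0])
vj = rng.standard_normal(dims[0])
d2 = squared_distance(vi, vj, weights, biases)
```

Both vertices of a pair pass through the same weights and biases, so the distance is symmetric by construction.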

Define threshold parameters τ₁ and τ₂ with τ₂ > τ₁ > 0, and let τ > 1 be determined by τ = τ₁ + 1 = τ₂ - 1. We use $l_{ij} (τ - d_{f}^{2} (v_{i}, v_{j})) > 1$ to enforce the margin between τ and $d_{f}^{2} (v_{i}, v_{j})$ , where l_ij = 1 if v_i and v_j are similar and l_ij = -1 if v_i and v_j are dissimilar; that is, $d_{f}^{2} (v_{i}, v_{j}) < τ_{1}$ is required for similar pairs and $d_{f}^{2} (v_{i}, v_{j}) > τ_{2}$ for dissimilar pairs. Let λ be a balance parameter, β a sharpness parameter, and $g (z) = \frac{\log (1 + e^{β z})}{β}$ a smooth approximation of the hinge function $[z]_{+} = \max \{z, 0\}$ . Then the ontology optimization problem can be expressed as $\underset{f}{\arg\min} \; J = \frac{1}{2} \sum_{i, j} g (1 - l_{ij} (τ - d_{f}^{2} (v_{i}, v_{j}))) + \frac{λ}{2} \sum_{m = 1}^{M} (∥ W^{(m)} ∥_{F}^{2} + ∥ b^{(m)} ∥_{2}^{2}) .$ (1) Let ⊗ be the element-wise multiplication, $c = 1 - l_{ij} (τ - d_{f}^{2} (v_{i}, v_{j}))$ , and $z_{i}^{(m)} = W^{(m)} h_{i}^{(m - 1)} + b^{(m)}$ . We set $Θ_{ij}^{(M)} = g^{'} (c) l_{ij} (h_{i}^{(M)} - h_{j}^{(M)}) \otimes s^{'} (z_{i}^{(M)}),$ $Θ_{ji}^{(M)} = g^{'} (c) l_{ij} (h_{j}^{(M)} - h_{i}^{(M)}) \otimes s^{'} (z_{j}^{(M)}),$ and, for m ∈ {1, ⋯, M - 1}, $Θ_{ij}^{(m)} = ((W^{(m + 1)})^{T} Θ_{ij}^{(m + 1)}) \otimes s^{'} (z_{i}^{(m)}),$ $Θ_{ji}^{(m)} = ((W^{(m + 1)})^{T} Θ_{ji}^{(m + 1)}) \otimes s^{'} (z_{j}^{(m)}) .$ In order to solve the ontology optimization problem (1), we apply stochastic sub-gradient descent to obtain the parameters {W^(m), b^(m)} for m ∈ {1, 2, ⋯, M} with original inputs $h_{i}^{(0)} = v_{i}$ and $h_{j}^{(0)} = v_{j}$ as follows:

$\frac{\partial J}{\partial W^{(m)}} = \sum_{i, j} (Θ_{ij}^{(m)} (h_{i}^{(m - 1)})^{T} + Θ_{ji}^{(m)} (h_{j}^{(m - 1)})^{T}) + λ W^{(m)},$ (2)

$\frac{\partial J}{\partial b^{(m)}} = \sum_{i, j} (Θ_{ij}^{(m)} + Θ_{ji}^{(m)}) + λ b^{(m)} .$ (3)

Let μ be the learning rate. The whole procedure can be expressed as follows.

Algorithm: Deep learning based ontology distance function learning algorithm

Input: ontology sample data S = {(v_i, v_j, l_ij)}, the number of network layers M + 1, threshold τ, learning rate μ, iteration number T, balance parameter λ, convergence error ɛ.

Initialize: set each bias b^(m) to the zero vector; let d⁽⁰⁾ be the dimension of the input layer and, for m ∈ {1, ⋯, M}, set $W^{(m)} \sim U [- \frac{\sqrt{6}}{\sqrt{d^{(m)} + d^{(m - 1)}}}, \frac{\sqrt{6}}{\sqrt{d^{(m)} + d^{(m - 1)}}}] .$

For t = 1, ⋯, T, do

Randomly choose an ontology vertex pair (v_i, v_j, l_ij) from S;

Set $h_{i}^{(0)} = v_{i}$ and $h_{j}^{(0)} = v_{j}$ ;

For m = 1, ⋯, M, do: infer $h_{i}^{(m)}$ and $h_{j}^{(m)}$ by forward propagation. End For

For m = M, M - 1, ⋯, 1, do: compute the gradients by means of (2) and (3). End For

For m = 1, ⋯, M, do: $W^{(m)} = W^{(m)} - μ \frac{\partial J}{\partial W^{(m)}},$ $b^{(m)} = b^{(m)} - μ \frac{\partial J}{\partial b^{(m)}} .$ End For

Determine J_t in view of (1);

If t > 1 and $| J_{t} - J_{t - 1} | < ɛ$ , then break.

End For

Output: ${\{W^{(m)}, b^{(m)}\}}_{m = 1}^{M}$ .
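The procedure above can be sketched compactly. The following is an illustrative sketch, not the authors' implementation: it assumes NumPy, takes s = tanh (so that s′(z) = 1 − h²), uses the gradients (2) and (3) restricted to one sampled pair, and omits the stopping test on |J_t − J_{t−1}| for brevity. All function names and default hyperparameter values are hypothetical.

```python
import numpy as np

def g(z, beta):
    """Smooth hinge g(z) = log(1 + exp(beta z)) / beta, computed stably."""
    return np.logaddexp(0.0, beta * z) / beta

def g_prime(z, beta):
    """Derivative of g: the logistic function of beta z, computed stably."""
    return 0.5 * (1.0 + np.tanh(0.5 * beta * z))

def init_params(dims, rng):
    """Uniform initialization of W^(m) as in the algorithm; zero biases."""
    params = []
    for d_in, d_out in zip(dims[:-1], dims[1:]):
        r = np.sqrt(6.0) / np.sqrt(d_in + d_out)
        params.append([rng.uniform(-r, r, size=(d_out, d_in)), np.zeros(d_out)])
    return params

def forward(v, params):
    """Return [h^(0), ..., h^(M)] for the assumed activation s = tanh."""
    hs = [v]
    for W, b in params:
        hs.append(np.tanh(W @ hs[-1] + b))
    return hs

def pair_loss(vi, vj, l, params, tau, beta):
    """Data term of (1) for a single pair (regularizer excluded)."""
    hi, hj = forward(vi, params)[-1], forward(vj, params)[-1]
    c = 1.0 - l * (tau - np.sum((hi - hj) ** 2))
    return 0.5 * g(c, beta)

def pair_gradients(vi, vj, l, params, tau, beta, lam):
    """Gradients (2) and (3) restricted to one sampled pair."""
    hi, hj = forward(vi, params), forward(vj, params)
    diff = hi[-1] - hj[-1]
    c = 1.0 - l * (tau - diff @ diff)
    th_i = g_prime(c, beta) * l * diff * (1.0 - hi[-1] ** 2)    # Theta_ij^(M)
    th_j = -g_prime(c, beta) * l * diff * (1.0 - hj[-1] ** 2)   # Theta_ji^(M)
    grads = [None] * len(params)
    for m in range(len(params) - 1, -1, -1):   # params[m] holds layer m + 1
        W, b = params[m]
        grads[m] = (
            np.outer(th_i, hi[m]) + np.outer(th_j, hj[m]) + lam * W,
            th_i + th_j + lam * b,
        )
        if m > 0:                               # back-propagate to layer m
            th_i = (W.T @ th_i) * (1.0 - hi[m] ** 2)
            th_j = (W.T @ th_j) * (1.0 - hj[m] ** 2)
    return grads

def train(S, dims, tau=2.0, beta=1.0, mu=0.05, lam=1e-3, T=500, seed=0):
    """Stochastic sub-gradient descent over pairs (v_i, v_j, l_ij), l_ij = +1 or -1."""
    rng = np.random.default_rng(seed)
    params = init_params(dims, rng)
    for _ in range(T):
        vi, vj, l = S[rng.integers(len(S))]
        grads = pair_gradients(vi, vj, l, params, tau, beta, lam)
        for (W, b), (gW, gb) in zip(params, grads):
            W -= mu * gW   # in-place update of W^(m)
            b -= mu * gb   # in-place update of b^(m)
    return params

# Tiny synthetic demonstration (hypothetical data).
demo_rng = np.random.default_rng(5)
x1, x2, y1 = demo_rng.standard_normal((3, 4))
S = [(x1, x1 + 0.01 * x2, 1), (x1, y1, -1)]
learned = train(S, dims=[4, 3, 2])
```

Because `pair_gradients` follows (2) and (3) term by term, it can be validated against finite differences of `pair_loss` with λ = 0 before being used for training.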

4 Experiments

In this section, we present four experiments to show the effectiveness of our proposed ontology learning algorithm.

4.1 Similarity measuring experiment on gene data

In our first experiment, we use the gene ontology data from http://www.geneontology. The P@N average precision ratio is used to evaluate the results.
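The text does not spell out how the P@N average precision ratio is computed. One common reading, stated here purely as an assumption, is the following: for each vertex, the N nearest vertices under the learned distance are compared with a reference set of closest concepts supplied by experts, and the fraction of matches is averaged over all vertices. The data layout and function names below are hypothetical.

```python
def precision_at_n(ranked, relevant, n):
    """Fraction of the top-n ranked vertices that appear in the reference set."""
    top = ranked[:n]
    return sum(1 for v in top if v in relevant) / n

def average_precision_at_n(distances, expert_sets, n):
    """distances[v][u] is the learned distance; expert_sets[v] is the reference set."""
    total = 0.0
    for v, row in distances.items():
        # Rank all other vertices by increasing learned distance from v.
        ranked = sorted((u for u in row if u != v), key=lambda u: row[u])
        total += precision_at_n(ranked, expert_sets[v], n)
    return total / len(distances)

# Toy example with hypothetical vertices and distances.
distances = {
    "a": {"b": 0.1, "c": 0.5, "d": 0.9},
    "b": {"a": 0.1, "c": 0.3, "d": 0.7},
}
expert_sets = {"a": {"b", "d"}, "b": {"a", "c"}}
score = average_precision_at_n(distances, expert_sets, n=2)
```

In this toy example, vertex a contributes 1/2 and vertex b contributes 2/2, so the average is 0.75.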

Table 1 shows that the precision ratio of our newly proposed algorithm is higher than that of the algorithms in Gao et al. [36–38] for N = 5 and N = 20, while for N = 3 and N = 10 it is slightly below the best competing value (0.5649 for [38] and 0.8275 for [36], respectively). Moreover, the precision ratios increase as N increases. Overall, the experiment indicates that the proposed algorithm is competitive with, and in most cases better than, the methods in Gao et al. [36–38].

Table 1
The experiment data for gene ontology

	P@3 average precision ratio	P@5 average precision ratio	P@10 average precision ratio	P@20 average precision ratio
algorithm in our paper	0.5627	0.6841	0.8197	0.9454
algorithm in [36]	0.5201	0.6663	0.8275	0.9417
algorithm in [37]	0.5192	0.6569	0.8031	0.9192
algorithm in [38]	0.5649	0.6827	0.8124	0.9371

4.2 Similarity measuring experiment on plant data

In our second experiment, we use the plant ontology data from http://www.plantontology.org. Again, the P@N average precision ratio is used to evaluate the results.

Table 2 shows that for N = 5, 10 and 20 the precision ratio of our newly proposed algorithm is higher than that of the previously proposed algorithms in Gao et al. [36, 38] and Gao and Zhu [39]; for N = 3 it exceeds [36] and [39] and is marginally below [38] (0.5351 versus 0.5360). We also notice that the precision ratios keep increasing as N increases. Therefore, the new algorithm is at least as effective as the other algorithms on this data set, and more effective for the larger values of N.

Table 2
The experiment data for plant ontology

	P@3 average precision ratio	P@5 average precision ratio	P@10 average precision ratio	P@20 average precision ratio
algorithm in our paper	0.5351	0.6673	0.9023	0.9700
algorithm in [36]	0.5081	0.6549	0.8104	0.9317
algorithm in [38]	0.5360	0.6664	0.9004	0.9673
algorithm in [39]	0.5042	0.6216	0.7853	0.9034

4.3 Similarity measuring experiment on physical education data

In the third experiment, we use the physical education ontology data, which is widely used in ontology learning applications. The P@N average precision ratio is again used to evaluate the results.

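Table 3 shows that the precision ratio of our newly proposed algorithm is highest for N = 5, ties with the algorithm in [36] for N = 3, and for N = 1 equals that of [36] and [37] while being slightly below that of [38] (0.6774 versus 0.6913). Thus, the larger N is, the more clearly our newly proposed algorithm compares favorably with the algorithms in Gao et al. [36–38].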

Table 3
The experiment data for physical education ontology

	P@1 average precision ratio	P@3 average precision ratio	P@5 average precision ratio
algorithm in our paper	0.6774	0.7957	0.9355
algorithm in [36]	0.6774	0.7957	0.9290
algorithm in [37]	0.6774	0.7849	0.9032
algorithm in [38]	0.6913	0.7634	0.8968

4.4 Similarity measuring experiment on humanoid robotics data

In the last experiment, we use the humanoid robotics ontology data defined in Gao and Zhu [39]. The P@N average precision ratio is used to evaluate the results.

Table 4 shows that the precision ratio of our newly proposed algorithm is at least as high as that of the algorithms in Gao et al. [36, 37] and Gao and Zhu [39]: it ties with [36] and [39] for N = 1 and is the highest for N = 3 and N = 5. The larger N is, the more apparent the contrast becomes. In other words, the newly proposed algorithm turns out to be more effective than the other three algorithms on this data set.

Table 4
The experiment data for humanoid robotics ontology

	P@1 average precision ratio	P@3 average precision ratio	P@5 average precision ratio
algorithm in our paper	0.4444	0.6296	0.8444
algorithm in [36]	0.4444	0.5370	0.8222
algorithm in [37]	0.2778	0.6111	0.7889
algorithm in [39]	0.4444	0.5185	0.6111

5 Conclusion

Deep learning is closely related to feature (representation) learning, in which features are learned from the data rather than designed by hand. Deep learning is essentially a non-linear combination of representation learning methods. Here, representation learning refers to learning representations (or features) from data so as to extract information that is useful for classification and prediction. Deep learning begins with raw data and transforms each representation (or feature), layer by layer, into a higher-level and more abstract representation, thereby discovering the intricate structure of high-dimensional data. In this paper, we design an ontology distance learning algorithm by means of deep learning. It can be applied to ontology similarity measuring and ontology mapping, and hence used in various engineering applications.

Conflict of Interests

The authors hereby declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

We thank the reviewers for their constructive comments, which improved the quality of this paper. This work has been partially supported by the Postdoctoral Research Grant of China (2017M621690) and the Postdoctoral Research Grant of Jiangsu Province (1701128B).

References

1. Swanson L.W., Brain maps 4.0-Structure of the rat brain: An open access atlas with global nervous system nomenclature ontology and flatmaps, Journal of Comparative Neurology 526(6) (2018), 935–943.

2. Teymourlouie, Zaeri, Nematbakhsh, Thimm and Staab, Detecting hidden errors in an ontology using contextual knowledge, Expert Systems with Applications 95 (2018), 312–323.

3. Viani, Larizza, Tibollo, Napolitano, Priori S.G., Bellazzi and Sacchi, Information extraction from Italian medical reports: An ontology-driven approach, International Journal of Medical Informatics 111 (2018), 140–148.

4. Hippolyte J.L., Rezgui, Li, Jayan and Howell, Ontology-driven development of web services to support district energy applications, Automation in Construction 86 (2018), 210–225.

5. Roth and Jornet, From object-oriented to fluid ontology: A case study of the materiality of design work in agile software development, Computer Supported Cooperative Work-The Journal of Collaborative Computing 27(1) (2018), 37–75.

6. Cooper, Meier, Laporte M.A., Elser J.L., Mungall, Sinn B.T., Cavaliere, Carbon, Dunn N.A., Smith, Qu B.T., Preece, Zhang, Todorovic, Gkoutos, Doonan J.H., Stevenson D.W., Arnaud and Jaiswal, The Planteome database: An integrated resource for reference ontologies, plant genomics and phenomics, Nucleic Acids Research 46(D1) (2018), D1180–D1180.

7. Pfaff and Krcmar, A web-based system architecture for ontology-based data integration in the domain of IT benchmarking, Enterprise Information Systems 12(3) (2018), 236–258.

8. Travin, Popov, Guler A.T., Medvedev, van der Plas-Duivesteijn, Varela, Kolder I.C.R.M., Meijer A.H., Spaink H.P. and Palmblad, COMICS: Cartoon visualization of omics data in spatial context using anatomical ontologies, Journal of Proteome Research 17(1) (2018), 739–744.

9. Seipel, Nogatz and Abreu, Domain-specific languages in PROLOG for declarative expert knowledge in rules and ontologies, Computer Languages Systems & Structures 51 (2018), 102–117.

10. Blondet, Le Duigou, Boudaoud and Eynard, An ontology for numerical design of experiments processes, Computers in Industry 94 (2018), 26–40.

11. Hashemi, Khadivar and Shamizanjani, Developing a domain ontology for knowledge management technologies, Online Information Review 42(1) (2018), 28–44.

12. Abdi, Idris and Ahmad, QAPD: An ontology-based question answering system in the physics domain, Soft Computing 22(1) (2018), 213–230.

13. Mesmer and Olewnik, Enabling supplier discovery through a part-focused manufacturing process ontology, International Journal of Computer Integrated Manufacturing 31(1) (2018), 87–100.

14. Pan J.Z., Bobed, Guclu, Bobillo, Kollingbaum M.J., Mena and Li Y.F., Predicting reasoner performance on ABox intensive OWL 2 EL ontologies, International Journal on Semantic Web and Information Systems 14(1) (2018), 1–30.

15. Rizzo, Fanizzi, d’Amato and Esposito, Approximate classification with web ontologies through evidential terminological trees and forests, International Journal of Approximate Reasoning 92 (2018), 340–362.

16. Maldonado, Prada and Senosiain M.J., On linear operators and bases on Kothe spaces, Applied Mathematics and Nonlinear Sciences 1(2) (2016), 617–624.

17. Ansari A.A., Investigation of the effect of albedo and oblateness on the circular restricted four variable body problem, Applied Mathematics and Nonlinear Sciences 2(2) (2017), 529–542.

18. Fouda D.A., Hamdy, Nouh, Beheary, Bakrey and Saad S.M., Model atmosphere analysis of two new early-type O4 dwarfs stars, Applied Mathematics and Nonlinear Sciences 2(2) (2017), 559–564.

19. Zhu L.L., Pan, Farahani M.R. and Gao, Magnitude preserving based ontology regularization algorithm, Journal of Intelligent & Fuzzy Systems 33 (2017), 3113–3122.

20. Gao and Xu T.W., Stability analysis of learning algorithms for ontology similarity computation, Abstract and Applied Analysis (2013), Article ID 174802. 10.1155/2013-174802.

21. Gao, Gao and Zhang Y.G., Strong and weak stability of k-partite ranking algorithm, Information-An International Interdisciplinary Journal 15(11) (2012), 4585–4590.

22. Gao, Zhu L.L., Guo and Wang K.Y., Ontology learning algorithm for similarity measuring and ontology mapping using linear programming, Journal of Intelligent & Fuzzy Systems 33(5) (2017), 3153–3163.

23. Gao and Farahani M.R., Generalization bounds and uniform bounds for multi-dividing ontology algorithms with convex ontology loss function, The Computer Journal 60(9) (2017), 1289–1299.

24. Gao, Farahani M.R., Aslam and Hosamani, Distance learning techniques for ontology similarity measuring and ontology mapping, Cluster Computing-The Journal of Networks Software Tools and Applications 20(2) (2017), 959–968.

25. Phong L.T., Aono, Hayashi, Wang L.H. and Moriai, Privacy-preserving deep learning via additively homomorphic encryption, IEEE Transactions on Information Forensics and Security 13(5) (2018), 1333–1345.

26. Hassan H.M., Uddin M.Z., Mohamed and Almogren, A robust human activity recognition system using smartphone sensors and deep learning, Future Generation Computer Systems-The International Journal of Escience 81 (2018), 307–313.

27. Oh, Jung J.H., Jeon B.C. and Youn B.D., Scalable and unsupervised feature engineering using vibration-imaging and deep learning for rotor system diagnosis, IEEE Transactions on Industrial Electronics 65(4) (2018), 3539–3549.

28. Proenca and Neves J.C., Deep-PRWIS: Periocular recognition without the iris and sclera using deep learning frameworks, IEEE Transactions on Information Forensics and Security 13(4) (2018), 888–896.

29. Biswas, Kuppili, Edla D.R., Suri H.S., Saba, Marinhoe R.T., Sanches J.M. and Suri J.S., Symtosis: A liver ultrasound tissue characterization and risk stratification in optimized deep learning paradigm, Computer Methods and Programs in Biomedicine 155 (2018), 165–177.

30. Ren R.X., Hung and Tan K.C., A generic deep-learning-based approach for automated surface inspection, IEEE Transactions on Cybernetics 48(3) (2018), 929–940.

31. Rad N.M., Kia S.M., Zarbo, van Laarhoven, Jurman, Venuti, Marchiori and Furlanello, Deep learning for automatic stereotypical motor movement detection using wearable sensors in autism spectrum disorders, Signal Processing 144 (2018), 180–191.

32. Lore K.G., Stoecklein, Davies, Ganapathysubramanian and Sarkar, A deep learning framework for causal shape transformation, Neural Networks 98 (2018), 305–317.

33. Bianco, Celona, Napoletano and Schettini, On the use of deep learning for blind image quality assessment, Signal Image and Video Processing 12(2) (2018), 355–362.

34. Olmos, Tabik and Herrera, Automatic handgun detection alarm in videos using deep learning, Neurocomputing 275 (2018), 66–72.

35. Treder, Lauermann J.L. and Eter, Automated detection of exudative age-related macular degeneration in spectral domain optical coherence tomography using deep learning, Graefes Archive for Clinical and Experimental Ophthalmology 256(2) (2018), 259–265.

36. Gao, Guo and Wang K.Y., Ontology algorithm using singular value decomposition and applied in multidisciplinary, Cluster Computing-The Journal of Networks, Software Tools and Applications 19(4) (2016), 2201–2210.

37. Gao, Zhu L.L. and Wang K.Y., Ontology sparse vector learning algorithm for ontology similarity measuring and ontology mapping via ADAL technology, International Journal of Bifurcation and Chaos 25(14) (2015), 1540034. 10.1142/S0218127415400349.

38. Gao, Baig A.Q., Ali, Sajjad and Farahani M.R., Margin based ontology sparse vector learning algorithm and applied in biology science, Saudi Journal of Biological Sciences 24(1) (2017), 132–138.

39. Gao and Zhu L.L., Gradient learning algorithms for ontology computing, Computational Intelligence and Neuroscience (2014). 10.155/2014-438291.

40. Gao, Zhu L.L. and Wang K.Y., Ranking based ontology scheming using eigenpair computation, Journal of Intelligent & Fuzzy Systems 31(4) (2016), 2411–2419.
