Abstract
Data support is already driving the development of artificial intelligence. But it cannot solve the semantic problem of artificial intelligence. This requires improving the semantic understanding ability of artificial intelligence. Therefore, a question answering system based on semantic problem processing is proposed in this study. The question answering system utilizes an improved unsupervised method to extract keywords. This technology integrates the semantic feature information of text into traditional word graph model algorithms. On this basis, semantic similarity information is used to calculate and allocate the initial values and edge weights of each node in the PageRank model. And corresponding restart probability matrices and transition probability matrices are constructed for iterative calculation and keyword extraction. Simultaneously, an improved semantic dependency tree was utilized for answer extraction. The improved keyword extraction method shows a decreasing trend in
Introduction
Data support is the foundation for the development of artificial intelligence (AI). When dealing with the natural language (NL), AI cannot correctly handle the semantics of NL due to the insufficient understanding [1]. This has led to a debate on the issue of AI semantics problem (SP). However, data support cannot completely solve the SP of AI [2]. Therefore, a thorough study of AI language is also needed. In the research of it, mathematical methods such as mathematical logic can be used to solve SP [3]. In essence, AI pays more attention to the study of mathematical logic [4]. Therefore, to solve the data logic problem of AI, it is also necessary to focus on the processing of SP. Therefore, in this study, the focus was on improving the methods of Artificial Intelligence Semantic Problem (AISP). Question and Answer System (QA) is one of the main application forms of AI. The NL processing capability in QA is a key factor affecting its application. However, among the existing QA language processing methods, it is not yet possible to fully understand and master the more complex human language [5]. Therefore, it is necessary to further improve the NL processing method in QA. The correct analysis of the semantics of natural language is the main direction for improving NL processing methods. In the experiment, a QAbased text processing method was introduced to improve the processing ability of AINL. Based on this, the semantic understanding ability of intelligent QA is enhanced, thereby improving the autonomous decision-making ability of AI. The article is mainly divided into five parts. It first provides a brief introduction to the research background of the article, which leads to the following content. The literature review analyzes and summarizes the relevant methods. The method section mainly provides a detailed explanation of the methods used. The next part is the validation of the performance of the method. The conclusion is a summary of the entire text, as well as a description of the shortcomings and future prospects. The system response performance test was conducted through actual cases, and the results showed that the unsupervised keyword extraction algorithm that integrates semantic features is beneficial for improving the efficiency of keyword extraction. Effectively integrating multiple model information simultaneously can improve the progress of answer extraction. These results demonstrate that the improved algorithm proposed in this article has played a certain positive role in the dialogue effect of the human-machine question and answer model.
Literature review
Based on the growth of data and technological updates, AI has achieved continuous development and improvement [6]. QA is an important product of the development of AI. It is an important channel for communication between humans and machines. QA has been well applied in various fields through continuous development and improvement. The processing of NL is the main factor affecting the effectiveness of QA display applications [7]. In the existing QA, simple human-machine dialogue can already be carried out. However, due to the complexity of language, QA is still unable to provide very accurate answers during human-machine communication. Therefore, researchers have made different improvements to the NL processing methods in QA. Researchers used a combined NL processing method to process text vectors in QA. Through example analysis, it has been proven that NL processing methods based on knowledge graphs and other methods have high accuracy in text processing [8]. This combined NL processing method can improve the runtime while ensuring the accuracy of the method. Lauriola et al. investigated the application of deep learning technology in NLP and found that deep learning could affect various tasks in NLP. They emphasized the main applications and limitations of deep learning in NLP [9]. NL processing affects the analysis results of data, which has a significant impact on the results of medical research. Researchers have improved the NL processing method for information extraction in pathological reports [10]. This further enhances the application value of NL processing in human society. Semantic similarity is a key and challenging aspect of NL processing. Besher et al. utilized a dialogue system for contextual semantic analysis. They used encoders to deeply analyze the semantics of the text. This improves its ability to infer contextual semantics. This SP processing method can provide users with more accurate answers [11]. Kumar and other scholars have combined deep learning methods with NL processing. The combined method can measure semantic similarity. Through this method, hidden semantic relationships can be captured. This improves user retrieval efficiency and speed of problem solving [12].
The correct judgment of semantic similarity affects the accuracy of QA keyword and answer extraction. Semantic dependency tree is an important method for extracting semantic relationships in text. Researchers have improved the existing semantic dependency tree [13]. They introduced attention mechanisms into the basic model. And neural networks were combined in the experiment to extract relationships. In the validation results of the dataset, the improved method can effectively improve F1 scores. Shi et al. improved the traditional dependency tree relationship extraction strategy and introduced graph neural networks, which could adaptively obtain dependency paths related to entity relationships and reduce noise in the data. A large number of numerical experiments verified the effectiveness of the new method [14]. Wang et al. combined neural networks with semantic dependency trees. The new model can correctly handle the information of network text. This model can effectively extract structural information from text and remove excess structural information. The experimental results in the dataset confirm that the model has higher information processing ability when processing text information [15]. And the unsupervised learning shows good performance in data processing. Some scholars introduced the advantages of unsupervised learning in biological research. After summarizing and comparing the methods, they emphasized the importance of unsupervised learning in data processing. And they also pointed out some limitations of this method and the challenges it faced [16].
The above research results were summarized and analyzed in the experiment. The results show that language processing ability is the main direction to improve the effectiveness of QA applications. This requires proper handling of the semantics problem of NL. Research has shown that methods such as semantic dependency trees can solve the semantics problem of AI. Therefore, in this experiment, an improved unsupervised method was used for keyword extraction. Then, a combination of multiple models was adopted to improve the answer extraction method based on semantic dependency trees. It is hoped to further address the SP issue with artificial intelligence question answering system (AIQA).
Solution of AISP based on keyword extraction method
With the development of mathematical logic, AI has gradually derived various application forms. AIQA in machine translation is one of the application forms of AI. When applying AIQA, it can raise questions in the form of NL. Then answer the question using the same NL method through the computer system. The SP in AIQA is inevitable. The accuracy of semantics is related to the accuracy of computer answers [17]. AIQA needs to have a correct understanding and expression of NL [18]. If natural semantics cannot be understood correctly, the answers provided by QA can create obstacles in human-machine dialogue and communication. Therefore, further improvement of AIQA is needed. In the following research content, it is proposed to use keyword and answer extraction methods to solve SP in AIQA.
QA and related language processing technologies in AI
The emergence of AIQA is one of the more mature application forms of NL. The earliest people proposed using computers to establish a QA for patient psychotherapy. The continuous development of technology has made QA a hybrid multifunctional NL interaction model. It can provide users with relevant knowledge queries and problem-solving. The main steps of QA are shown in Fig. 1.
Framework diagram of the question answering system.
In the main steps of QA mentioned above, the understanding ability of the problem is first included. This section mainly analyzes various issues of user input, and conducts type judgment, grammar analysis, and keyword extraction. Then there is the information retrieval capability of QA. This section mainly uses the keywords extracted from the problem understanding step to locate candidate answers, in order to improve the accuracy of answer acquisition in the answer extraction step. The third step is answer extraction, which is an important technique in the entire QA process. Its purpose is to provide users with corresponding answers based on problem understanding and information retrieval. Due to its late start, there is still a significant gap between domestic research on QA and international level. The reason may be due to the complexity of Chinese semantics and the lack of corresponding language processing capabilities. In the existing NL processing technologies, it mainly involves the syntax, semantics, etc. of NL [19]. The NL processing methods mainly used include deep learning, structure tree, text vectorization, etc. The keyword extraction methods can include the supervised keyword extraction, the unsupervised keyword extraction, and the semisupervised keyword extraction. Supervised keyword extraction methods include genetic algorithms, support vector machines, etc. Unsupervised keyword extraction methods have strong scalability and are commonly used in keyword extraction techniques. This method can extract corresponding keywords based on word sorting, and has high convenience. Figure 2a shows the general pattern of unsupervised methods.
Unsupervised keyword extraction method and PageRank.
In the unsupervised keyword extraction method, it mainly includes the preprocessing stage of text, screening of candidate word sets, sorting of candidate words, evaluation of keywords, and final keyword acquisition. The main keyword extraction methods used in unsupervised methods include word graph models, such as the PageRank algorithm in Fig. 2b. This method can iteratively calculate word nodes, in which the random walk strategy is applied. And this method calculates the score of word nodes to represent the importance of words. The importance of words based on the calculated score ranking is determined. Due to its high scalability and convenience, the PageRank algorithm was chosen as the fundamental technology for keyword extraction in QA in this study.
The keyword extraction method is related to the performance of the entire QA semantic understanding. In this section, the PageRank algorithm mentioned above was selected as the basic method for keyword extraction. However, this method is easy to ignore the potential semantic information contained between text and words in iterative calculation. This can lead to insufficient understanding of textual information, thereby reducing the semantic comprehension ability of QA. To solve this problem, this experiment uses the improved unsupervised method and text vectorization method to extract text information. The obtained text and word order information were simultaneously input into the word graph model. Keyword extraction was performed in this model. Figure 3 shows the structure of the improved keyword extraction method.
System technical structure diagram.
Among the improved keyword extraction methods, the unsupervised keyword extraction method is the main core architecture of the improved method. The improved method is detailed in Fig. 4. In the improved unsupervised keyword extraction method, it mainly includes text preprocessing, construction of co-occurrence word graph, semantic representation of text, scoring of semantic features, and keyword extraction. The text preprocessing stage is to screen candidate words that meet the criteria. The semantic representation of text is to obtain the vector between semantics and text. The scoring method for semantic features is to score information on semantics and word order. Finally, the word with the highest score was selected as the keyword.
Flow chart of an improved unsupervised keyword extraction technology.
In the representation of text semantics, it is necessary to use the cosine similarity in Eq. (1) to calculate semantic similarity.
In Eq. (1),
In Eq. (2),
In Eq. (4),
In Eq. (5),
Vectorization is the process of associating words and vectors, by vectorizing text to obtain a low dimensional floating-point vector. Each dimension in the obtained feature vector represents the grammar or semantic feature information of the text or words. In addition, the semantic information and word order information between text and words can be reflected through feature vectors. By combining the obtained word order information, semantic information between text and words can be fully explored. There are currently many methods for obtaining vector representations, among which the main representative models include Word2vec, Doc2vec, and Send2vec. During the process of word vector training for the Word2vec model, there are two models: Continuous Bag of Words Model (CBOW) and Skip gram model. The CBOW model, as shown in Fig. 5, is divided into input layer, projection layer, and output layer. The principle of its idea is to take the intermediate word of a paragraph of text as the output and the context of the word as the input, and predict the probability of the occurrence of the intermediate word by using the context.
CBOW model architecture.
The Skip-gram framework model, as shown in Fig. 6, is also divided into input layer, projection layer, and output layer. Its model philosophy is opposite to CBOW, which predicts the probability of context based on the current word information of the sentence that appears. The co-occurrence information of words was considered in the training process of both models mentioned above. The word vectors obtained after training preserve the semantic correlation between words, and both can obtain semantic information of words.
Skip-gram model architecture.
In order to vectorize words and analyze word embedding techniques, this article selects the Word2vec public model for vector representation of words, and uses the Skip gram model during the word vector training process. The candidate set words obtained during the preprocessing stage will be mapped to the vector space through the Skip gram model in the experiment. Based on existing experimental experience, during the training, the spatial dimension of the word vector is set to 512.
The answer extraction method in AIQA is to obtain accurate answers to user questions. The main answer extraction methods include word bags, pattern matching, etc. These methods are easy to implement and understand, but there is a significant semantic difference between the obtained answer and the true answer. The answer extraction method based on semantic dependency tree has good application results in QA. This method can analyze the semantic dependency relationship between questions and answers, thereby obtaining the semantic relationship of words. Therefore, this study chose semantic dependency trees as the basis for the answer extraction method. In the experiment, it is assumed that
In Eq. (6),
In Eq. (7),
Semantic dependency trees can improve the similarity calculation, thereby mining complete semantic structure information. However, a simple semantic dependency tree disrupts the similarity of some semantic relationships when examining semantic feature information. This leads to insufficient accuracy in obtaining answers. Therefore, this study has improved the method. Figure 7 shows the system structure diagram of the improved method.
System structure diagram of improved answer extraction method.
The improved method is based on a semantic dependency tree, which integrates word position semantic feature information into the semantic dependency tree. In this fusion process, it is necessary to consider the word frequency information of semantic features. Semantic feature information and word frequency information can effectively measure the semantic similarity of keywords between sentences. In the improved answer extraction method, a combination of multiple models can be used to introduce these two pieces of information into the semantic dependency tree. This operation takes into account the fragmentation of partial semantic relationship similarity in the semantic dependency tree. This can to some extent compensate for the accuracy of similarity calculation in semantic dependency trees. This further improves the performance of answer extraction methods based on semantic dependency trees. Figure 8 shows the specific flowchart of the improvement method.
Improved answer extraction algorithm flowchart.
In the improved answer extraction method, it is necessary to conduct preliminary work on the questions raised by users. After completing the preliminary processing work, it is necessary to return to the corresponding candidate answer set. For questions and candidate answers,
In Eq. (9),
To incorporate location information into similarity calculation, the logarithmic linear model in Eq. (11) was introduced.
In Eq. (11),
The minimum error rate was selected for parameter adjustment. Equation (13) was assumed to be the answer sentence extracted from QA.
In Eq. (13),
The minimum error function in Eq. (15) can be obtained from Eqs (13) and (14).
From this, the minimum error rate function in Eq. (16) can be obtained.
Based on the above calculation process, the information of position and word frequency can be integrated into the calculation of semantic similarity in the experiment. This can improve answer extraction accuracy in QA.
In order to validate the performance of the studied method, all validation experiments were conducted in the same experimental environment. The specific experimental environment is shown as follows: the processor is Intel(R) Core(TM) i5-9300H CPU @ 2.40GHz, RAM is 8G, Backend development language is Python, the compilation platform is PyCharm, the learning framework platform is TensorFlow and TensorFlow is WeChat. The keyword dataset for the validation analysis experiment is sourced from NetEase News. This includes content from multiple fields such as politics, society, education, and the arts. During the validation experiment, the keywords annotated in the dataset were used as the control content for the validation experiment. The selected keyword is considered to be the correct extraction result only when and only when the selected keyword using the method is completely consistent with the annotated keyword.
In the validation analysis of this chapter, other different indicators were used to validate the performance of the methods used above. The comparison indicators selected include precision (
Due to the lack of suitable datasets for performance verification in the existing Chinese QA, 300 questions were selected as the test set in the network in this experiment. Table 1 shows the relevant issues in the self-made test set. It mainly includes issues such as numbers, place names, person names, and time. In the experiment, 240 questions were selected as training set, and the remaining 60 questions were selected as testing set.
Specific contents of the dataset
Specific contents of the dataset
Different evaluation index.
In the validation experiment of answer extraction method,
Comparison of 
The validation experiment also tested the overall performance of QA. Figure 11 compares the loss function value and running time of the improved method during training. This method has lower loss function value and running time. The final value of its loss function is 0.22, and the final running time is 2.54 ms. This means that the improved method can achieve better robustness after training. And this method effectively reduces the running time after training.
Loss function and running time.
As one of the evaluation indicators for machine learning and other methods, the receiver operating characteristic curve (ROC) can reflect the accuracy of the model. The area under the curve is larger, the corresponding method accuracy is higher. The PR curve is actually a curve made with precision and recall as variables, where recall is the horizontal axis and precision is the vertical axis. Similar to the ROC curve, the larger the area under the PR curve, the better the performance of the method. The ROC and PR of different methods are compared in Fig. 12. From Fig. 12a, the improved method has a larger area under the curve. And this method has smoother curve features. This represents that the improved method has higher accuracy and robustness. From Fig. 12b, it can also be seen that the method used in this study has a larger area under the PR curve, which is consistent with the results of the ROC curve. Both confirm the excellence of the research method. This is because the semantic features of word position and word frequency are integrated into the semantic similarity calculation of semantic dependency trees. This approach allows for a more comprehensive consideration of semantic feature factors, thereby improving the accuracy of answer extraction in the system and laying the foundation for optimizing the performance of the entire system.
Experimental results of each model on the STC-SeFun and Weibo datasets
ROC and PR of different methods.
In order to further validate the practical application performance of the question answering system proposed in the experiment, automated evaluation indicators BLEU1, BLEU2, and CWS were used in Table 2, as well as scores for fluency, information amount, and relevance of manual evaluation indicators. BLEU reflects the degree of overlap between the responses generated by the model and the actual responses. CWS reflects the amount of information in the conversation response by generating the number of content words in the response through the model. From the experimental results, it can be seen that the model proposed in this article is superior to other models in both automated evaluation indicators and manual indicators. The most important thing to note is that the proposed model has much higher information content in both CWS and manual evaluation indicators than other models, indicating that the model can effectively improve the information content of conversation responses. Combining the fluency index, it indicates that the model in this paper can balance the fluency and information content of reply generation. This is because the improved unsupervised keyword extraction technique proposed in the article can accurately obtain vector representation information of text and words. On this basis, the PageRank model was used in the experiment to improve the accuracy of keyword extraction. At the same time, the improved method was used in the experiment to improve the accuracy of answer extraction. These tasks can accurately locate the user’s questions in the human-machine question answering system, thus laying the foundation for smooth dialogue.
Natural language processing is a branch of artificial intelligence that involves the design and implementation of systems and algorithms that can interact through human language [20]. Due to the latest advancements in deep learning, the performance of NLP applications has been unprecedentedly improved [21, 22]. AISP is mainly embodied in mathematical logic [23]. How to correctly understand NL is an urgent issue that AIQA needs to solve [24]. To solve the SP problem, an improved keyword extraction method and answer extraction method were proposed in this experiment to improve the NL processing ability of AI. In the experiment, an improved unsupervised method was used to extract keywords, and an improved semantic dependency tree was used to extract answers. In the validation analysis comparison results, the
