Abstract
This paper proposes a diagnostic system, ESECFDS, to solve the problem of excessive smoke emission from a faulty car, based on Case-Based Reasoning (CBR) Methodology that accepts a user query as a new case, compares it with several stored cases in a Case-Base (CB) based on a similarity score using Jaccard Similarity method and provides feedback result to the user. The CBR cycle R4 Model of Aamodt and Plaza has been modified to R5 Model to improve the time complexity of the proposed algorithm of the system over the popular R4 Model.
Keywords
Introduction
Since the 20th century, the importance of the car in daily life as a means of conveyance has been discussed in [1]. The huge growth of car usage has a visible significant impact on our society. But many drivers face difficulties when their cars become faulty and out of control especially in transit. Detection of car faults is a complex process which involves a high degree of skill to identify and fix the fault within the specified time. An expert mechanic or driver, who has a lot of knowledge of finding fault, can do the same. Experts deal with various levels of difficulties of car fault; some being detected with visible effects. One of them is Excessive-Smoke-Emitting-Car. Sometimes, a car may be poorly tuned or maintained; it may emit an excess level of smoke. Smoke is a cause of concern and may suggest several problems that may happen underneath the hood of the car. Smoke colour may be black or white or blue. So, different colours of smoke coming out of the exhaust pipe may indicate different serious problems in the exhaust system, engine, or inside the fuel injection system. If expert knowledge can be documented in a computer to solve problems, dependence on the experts may be minimized.
This paper proposes the Excessive-Smoke-Emitting-Car Fault Diagnostic System (ESECFDS) that solves the problems of a faulty car that is emitting excessive smoke from the exhaust pipe. Using computational intelligence, ESECFDS determines the solution to the fault. The system, developed in this paper, is based on the principle of Case-Based reasoning. Case-Based Reasoning (CBR) is a technique for computer-aided intelligent diagnosis system. ESECFDS provides a query format to the user, so that the user may provide a new query case to the system. The case takes in a few attributes that uniquely identify the case along with the problem description & solution. The case is represented as Case
ESECFDS uses the Jaccard Similarity Method for the estimation of similarity between the new case and the stored cases. Jaccard similarity method is very simple and performs calculations between a pair of sets in minimum time. Section-9 shows the detail about the performance of the proposed methodology with the Jaccard similarity method.
The future work focuses on the possibility of designing of diagnosis system for car faults based on CBR Methodology that will deal with all types of car problems.
The remainder of the paper is organized as follows: Section 2 focuses on Case-Based Reasoning (CBR) Methodology; Section 3 on previous related works, Section 4 depicts the comparative analysis of prior works and R4, R5 model in ESECFDS. Section 5 focuses on proposed model architecture, Section 6 provides detail on Case-Base and case representation in the Case-Base, Section 7 describes our proposed methodology for similarity calculation between cases, and Section 8 describes the proposed algorithm in ESECFDS with simulation examples. Section 9 focuses on the performance analysis of proposed methodology in ESECFDS, Section 10 focuses on the limitations of ESECFDS and at last Section 11 ends up with a conclusion and future work.
Case-Base Reasoning Methodology
A general Artificial Intelligence methodology used for problem-solution, fault identification and treatment, learning, reasoning, and decision support is Case-Based reasoning or CBR. The technique of CBR is the concept of solving a new issue using old issues, so it is often called the approach of problem-solution based on experience. It is a technique for intelligent systems of computer-aided diagnosis. A case includes a description of the problem (i.e. features or symptoms) and a solution (i.e., a result of diagnosis). A Case-Base consists of cases those are previously stored. As seen in [2], many similar cases can have similar solutions. By searching for similar problems in the Case-Base, a new problem is solved. If with the same problem a similar case is found, then the solution to that problem will be adapted to be the solution to the new problem. When a new solution to the new problem or case has been found then that case with a new solution can be stored in the Case-Base in order to improve its competence, as referred to in [2]. CBR works in a cycle. Aamodt and Plaza in 1994, proposed the CBR cyclic model called the R4 model that has 4 activities; Retrieve cases, Reuse Cases, Revise cases, and Retain Cases. This paper used the R5 model of CBR. As in [3] shows, the R5 model provides repartitioning of the case and case structure before retrieval that may reduce search time for queries.
There are 5 activities of the R5 model.
Repartitioning: R5 models are partitioning the cases with the help of the content of its collection of cases as referred to in [3]. It’s an important task to build the structure of cases before retrieving the case.
Retrieve: Repartitioning provides a foundation for the retrieve case. Now in the retrieving stage, the case is retrieved from the closest similar group of cases. With the help of a similarity relationship, first is chosen as the most similar group. Then among the cases in a similar group, the closest case is chosen, as referred to in [3].
Reuse: This activity follows that the concept of a similar problem has a similar solution. R5 model solves the new problem or case re-using the solutions, contained in those stored retrieved case(s).
Revise: After the step of reuse, the R5 model verifies the applicability of the newly proposed solution in the real world.
Retain: At last, a new solved problem is updated to the Case-Base for future problem solution and finds the position in the most similar group of a case in case library as referred to in [3]. Figure 1 shows the CBR cycle of R5 Model.
CBR cycle of R5 Model.
Excessive-Smoke-Emitting-Car Fault Diagnosis System (ESECFDS) based on Case-Based Reasoning Methodology, follows the activities of the R5 Model of CBR.
Repartitioning: The Case-Base of ESECFDS stores cases which contain excessive-smoke- emission problems with solutions. As car may emit different colours of excessive smoke so cases may contain problems of smoke emission in different colours. So, according to the different smoke colours, Case-Base (CB) is repartitioned into different Sub-Case-Bases (SCB) where cases with same smoke colour are stored into the same Sub-Case-Base.
Retrieve: After the repartition stage, whenever a problem or a new case arrives in the system, the most similar case is retrieved from the closest similar Sub-Case-Base. With the help of the decision tree first is chosen as the most similar Sub-Case-Base. Then among the cases in that Sub-Case-Base, the most similar case is chosen.
Reuse: This activity follows the concept of a similar problem has a similar solution. So, at this stage, the problem of the new case is solved re-using the solutions of the retrieved case.
Revise: After the step of reuse, verification of the applicability of the newly proposed solution in the real world is done.
Retain: At last, the newly solved case is updated to the most appropriate Sub-Case-Base for future problem-solution.
In various diagnostic fields, the expert systems are used and can be well applied in the diagnosis of automobile faults.
A Vehicle Fault Diagnosis Expert System based on the fault tree analysis method was developed, as referred to in [4], to solve the complexity and difficulty of detection and analysis of vehicle faults. That designed expert system can effectively diagnose vehicle breakdowns and help users to exclude faults and maintain vehicles.
Car Failure and Malfunction Diagnosis Assistance System (CFMDAS) has been proposed in [5], using a decision tree which is extremely useful for supporting mechanics for the identification of car failure and training purposes. In addition, more specific and less time-consuming models are developed by collecting and maintaining useful information on such domains.
A distributed diagnosis agent system (DDAS) was built in [6], which is used on the basis of signal analysis and machine learning for vehicle fault diagnosis. Based on the information provided by the signal agents in DDAS, DDAS used CBR techniques to find the root cause of vehicle faults.
As proposed in [7], researchers have built a car malfunction fuzzy fault diagnostic system based on fault tree analysis that detects component faults and process disturbances. It can lead to system malfunctions by matching the process uncertainty data from the system with the pattern of IF statements stored in the computers. Surety factors for component failures and process disturbances for diagnostic sequence checking are also evaluated by that system.
A method of vehicle fault diagnosis that combines neural networks with generalization capability is proposed in the research paper [8], where a two-step ensemble strategy was proposed for the diagnosis of vehicle faults, an ensemble selection algorithm, BFES, and an analog Bayesian ensemble decision function, A-Bayesian-Entropy.
For their diagnostics, several papers have used rough set theory as in [9] to find fault in the vehicular transmission system, the rough set theory is used to deal with vehicle transmission system fault data.
Paper [10] gives an example of diesel engine valve clearance fault diagnosis and describes the algorithms and methods for extracting fault symptoms based on the principle of rough sets.
The power transformer fault diagnosis based on association rules obtained by the rough set has been seen in [11]. Here, a rough set is provided to produce rules of association that are used to diagnose power transformer faults.
In Semantic Analysis of Natural Language Queries Using Domain Ontology for Knowledge Access from the Database, the Jaccard similarity approach was used in [12], this paper describes a method for semantic analysis of natural language queries for Natural Language Interface to Database (NLIDB) using domain ontology. It needs better precision to incorporate NLIDB for extreme applications such as railway investigation, airway investigation, corporate, or government call centers.
The research paper [13] was done to help build an expert system for the diagnosis and repair of car failures under constraints such as time, location and availability of human expertise. In order to conclude the best means, which are clear and straightforward, to introduce and maintain when creating an expert system, the analysis of technologies for designing expert systems was undertaken.
Taking the improved nearest neighbor method which is based on the combination of hamming distance and Euclidian distance to solve case similarity, improve accuracy and efficiency of case matching, the research paper [14] shows the CBR retrieval method based on rough set theory.
Case-Based reasoning is used in the research paper [15] to help fault diagnosis using sensor information. This paper establishes that, based on sensor information and using Case-Based reasoning methods, computer-based diagnostic systems can be developed.
This paper [16] focuses on the application of CBR in a manufacturing environment with decision tree induction to examine the cause of defects occurring in the domain. By combining CBR with decision trees, the abstraction of domain information is made possible.
The diagnosis of automobile engine faults using the neural network was mentioned in [17]. The developed system is based on an engine fault table. Such a diagnostic module is intended to improve the system ’s usefulness.
In paper [18], a decision tree is applied for fault detection and classification in the transmission line. The DT based fault detection algorithm uses 1/4th cycle data of fault currents from fault inception, to generate the optimal decision tree for fault detection
The paper referred to in [19] provides an overview to the concept of rough sets as well as its benefit in working with uncertain knowledge in areas of defect detection. Here, a weight determination algorithm based on rough sets is proposed.
In [20], an effective random decision tree algorithm was discussed for Case-Based reasoning systems. To generate a stronger hybrid algorithm, this paper combines this algorithm with a simple similarity measure based on domain knowledge. This hybrid algorithm generates a lower average error consistently than the base algorithms
Reference was made to the development of a fault-diagnosis method (FDS) using Case-Based reasoning in [21]. Before the event of total system failure, the Fault-diagnosis system (FDS) could be helpful in determining the source of faults. Here is a study on such a diagnostic technique for minimizing maintenance activities and reducing downtime of the system.
For computer fault diagnosis, Case-Based reasoning is used in paper [22]. This system is based on the Case-Based Reasoning (CBR) methodology where the problem-solution depends on case, and the knowledge base contains certain cases. The system gives users the right to send questions and problems.
As detailed in [23], Case-Based reasoning is extended to complex medical diagnoses. This paper presents a CBR expert method that uses the algorithm of the K-nearest neighbor (KNN) to look for related cases based on the Euclidean distance calculation.
Comparative analysis
Comparative analysis of previous works with their outcomes
The previous related research papers used different methods for different purposes. Table 1 shows a comparative study of these works with their outcomes.
Comparative study of previous related works with their outcomes
Comparative study of previous related works with their outcomes
Table 1, continued
Table 1, continued
Comparative analysis of previous works those used Case-Based Reasoning Methodology in Automobile Domain is shown in Table 2.
Comparative previous work analysis of various Case-Based Reasoning Methodologies used in automobile domain
Comparative previous work analysis of various Case-Based Reasoning Methodologies used in automobile domain
Table 2, continued
Comparison analysis of the CBR R4 Model and R5 Model in ESECFDS is shown in Table 3.
The Fig. 2 graph shows the comparison of total calculation time between R4 and R5 Model in ESECFDS.
Comparison analysis of CBR R4 model and R5 model in ESECFDS
Comparison analysis of CBR R4 model and R5 model in ESECFDS
Footnotes
Acknowledgments
The authors would like to thank the Department of Computer Science and Engineering in NIT, Durgapur (WB, INDIA), for academically supporting this research work.












