Abstract
Dealing with temporal data imperfection is a crucial issue in several application domains. In fact, failure to handle these imperfections can have significant consequences and lead to incorrect analysis and decision-making. This is particularly true when handling imperfect temporal data inputs in applications for Alzheimer’s patients as a real example. In this context, there is a need for a global ontology that provides a semantic representation of temporal data imperfection. In the literature, there is a big number of ontologies that represent data. Some represent only perfect temporal data. Some others represent imperfect data but not temporal ones. To the best of our knowledge, there is no ontology that represents temporal data imperfection. In this paper, we represent “TimeOntoImperfection”, a usable global ontology that represents four types of imperfection: imprecision, uncertainty, both uncertainty and imprecision and conflict. We describe the structure of “TimeOntoImperfection”, then we conduct a case a study in which we illustrate the usefulness of our ontology. Finally, we introduce the validation part in the context of CAPTAIN MEMO - an ontology based memory prothesis dedicated to alzheimer patients- and we discuss the encouraging results derived from the evaluation step.
Keywords
Introduction
Bringing up the subject of data imperfection comes from the need to have “the right to make mistakes”, from inaccuracy, from doubt, from the desire to say “maybe” to things that we are not sure about and of speaking imprecisely. All data may be affected by imperfection. Temporal data is no exception. Indeed, the order relation established on these data can help to detect errors. Temporal data touch all life domains. For example, in the context of an application intended for people suffering from Alzheimer’s disease, specific features add complexity to the processing of data collection. Indeed, patients with Alzheimer’s disease suffer from memory discordance, changes in attention and concentration and a decrease in their abilities [1]. As a result of these changes and disabilities, the data provided by patients with Alzheimer’s disease are mostly subject to different types of imperfections. For example, an alzheimer patient can say “
In the litterature, different ontologies have already been proposed in order to describe perfect temporal data such as OWL-Time ontology 1 . These ontologies can be used or merged with other domain ontologies which need to be extent to represent temporal aspect. However, to our knowledge, there is no ontology that represents imperfect temporal data.
This paper presents and describes “TimeOntoImperfection”, a top-level ontology for temporal data imperfection. It is published and can be downloaded at the “TimeOntoImperfection home page” https://cedric.cnam.fr/isid/ontologies/files/TimeOntoImperfection.html. It reduces to four types of imperfection: imprecision, uncertainty, uncertainty and imprecision at the same time and conflict. It can be merged with other ontologies that need to be extent to dealing with imperfect temporal data. We describe all the component of our ontology: concepts and properties, as well as the reasoning and inference via the SWRL rules. The paper also presents an evaluation of “TimeOntoImperfection” in the context of CAPTAIN-MEMO [2], a memory prothesis dedicated to Alzheimer patients to palliate their mnesic problems. It is based on personLink ontology 2 [3] that describes family relationship.
The paper is structured as follows: In Section 2, a state of the art in the context of temporal data, typology of temporal data imperfection and ontology is reviewed. In section 3, we describe “TimeOntoImperfection” ontology. Section 4 describes a validation of the ontology via an implemented prototype. Section 5 presents an evaluation of the presented work in the context of CAPTAIN-MEMO memory prothesis. Finally, Section 6 summarizes the conclusions of the study as well as future lines of work.
State of the art
The present work is closely related to the three following research areas: (i) temporal data, (ii) data imperfection and (iii) ontology. In this section, we present preliminary concepts and related work regarding the mentioned areas as well as the intersection between them.
Temporal data
Temporal data has an order relation. This can help to detect errors. Temporal data can be represented using time points or time intervals. They can also be conceptualized as quantitative (precisely defined, for example using dates) or qualitative (designates qualitative temporal relationships as “before”). In the following, we explain temporal elements, temporal relations as well as temporal ontologies.
Temporal elements
According to the survey presented by [7] and the W3C working group that designed the TIME ontology, the time element is an entity used as a constituent element of a theory of time. These items could be “Time Points” and “Time Intervals”.
Temporal relations.
Temporal relations could be: Points-based relation, intervals-based relation, Point-interval relation and Interval-point relation.
In what follows, we present the two most cited qualitative temporal formalisms: Allen’s interval algebra [9] and Vilain and Kautz’s point algebra [11].
Allen’s relations between two precise time intervals
Allen’s relations between two precise time intervals
Vilain and Kautz points algebra
All these ontologies that represent the temporal aspect in a precise way and do not consider the imperfections that can affect the temporal data.
Temporal data imperfection
Data can be perfect, which means data without errors. It can also be subject to several types of imperfections. According to [19], “imperfection” is used as the most general label. The imperfection may be due to errors or semantic imperfections. We consider the word "imperfection" to be more general than the word "error". An error is a type of imperfection. [20] consider as “errors” any typing or spelling error when data entry is manual or associates values to the wrong instances due to software misbehavior (in the case of the use of applications or software). An “error” represents any syntactical error or input error in an application or software used. A "semantic imperfection" means any inaccuracy, incompleteness, confusion, uncertainty, etc. that touches the data. We classify data into data entered by users and other data from applications.
Uncertainty, according to [21] refers to the “quality” of the information reflecting the reality of its realization. Information is said to be uncertain if it is impossible to judge the binary truth (true/false) of this information. Data may contain uncertainties. This means that the exact value is (partially) unknown, however, usually some knowledge is present anyway, perhaps describing the value. Vagueness is a type of imperfection that refers to the "content" of information. Thus, when we do not have variables, characteristics, etc. to clearly describe the content of the information, the imperfection is called imprecision. The ambiguity of information is the fact that it leads to two or more interpretations, because its limits are not known. Incompleteness represents the lack of information provided by the source. It can be measured by the difference between the amount of information actually provided by the source and the amount of information the source should provide. Conflict characterizes at least two informations leading to contradictory and therefore incompatible interpretations.
Typology of temporal data imperfection
There are a large number of data imperfection typologies. Authors such as [22] and [23] propose generic typologies while others, such as [24, 25] and [26], offer typologies for specific domains. Some of these typologies correspond better to a reality than others. In addition, there is no fixed definition to qualify imperfect data, such as uncertainty and imprecision. We also note that the majority of these typologies share three common concepts which are imprecision, uncertainty and incompleteness. The types of imperfections are interdependent. According to [23], incompleteness is a source of uncertainty and imprecision can also be associated with incompleteness. According to [27], imprecision always refers to incompleteness. Imprecision can be a source of uncertainty, but not necessarily [27].
Temporal data may have more imperfections than those offered in the generic typologies due to the complexity of this type of data and the specificity it contains (i.e. it may be numerical or based on natural language). They may also depend on the general context and may be subject to several factors likely to interfere in the specification of the type of imperfection. Thus, generic typologies cannot be adapted to temporal data (i.e. they are inadequate). For example, if we have the information "I forget the last time I visited my uncle who lives in Japan", the temporal data refers to a “missing”, which is a type of imperfection that none of the existing generic typologies includes. Another example is “On the first day of the week we will have a meeting”. In this example, the temporal data indicates a "circumlocution", which is another type of imperfection that none of the generic typologies contain. Moreover, to our knowledge, there is no typology of temporal data imperfections.
In our previous work [28], we proposed a typology of temporal data imperfection. It is classified into direct imperfections of both numeric temporal data and natural language based temporal data, indirect imperfections that can be deduced from the direct ones and granularity (i.e., context - dependent temporal data) which is related to several factors such as person’s profile and multiculturalism. The direct imperfections can be deduced directly from the given data: uncertainty, typing error, conflict, imprecision, missing, circumlocution and uselessness. The Indirect imperfections are those that can be generated from the direct ones. We distinguish three types of indirect imperfections: (i) The incoherence can be generated from the uncertainty and typing error. (ii) The incompleteness can be generated from the imprecision and the missing. (iii) The redundancy can be generated from the uselessness. Granularity is the general context of the given temporal data which can determine the type of imperfection. Figure 1 depicts our typology.

Our typology of temporal data imperfection.
A classical ontology is a perfect ontology that does not model any imperfection [29]. However, the world includes inaccuracies and imperfections that cannot be represented by classical ontologies.
To enable agents to handle imperfection, an extension of ontologies that has the ability to support imperfect knowledge is mandatory. These imperfections can be uncertainty, imprecision, conflict, etc.
Several definitions of probabilistic ontologies have been proposed. [30] defines probabilistic ontology as “an explicit formal representation of disclosed knowledge of knowledge about a domain of application that includes: the types of entities existing in the domain; the properties of these entities; relationships between entities; the processes and events that occur with these entities; the statistical regularities that characterize the domain; inconclusive, ambiguous, incomplete, unreliable and dissonant knowledge; uncertainty regarding all forms of reported knowledge; where the term entity designates any concept (real, fictitious, concrete or abstract) that can be described and reasoned in the field of application”. According to [31], probabilistic ontologies provide a structured, shareable, and principled way to comprehensively describe knowledge about a domain and the uncertainty surrounding it. They also expand the possibilities of standard ontologies by introducing the requirement for a correct representation of statistical regularities and uncertain proofs about entities in an application domain. Ideally, the representation is in a format that can be read and processed by a computer. According to [32] probabilistic ontologies are used to describe in detail the knowledge of a domain and the uncertainty associated with this knowledge in a reasoned, structured and shareable way. [33] define probabilistic ontology simply as a classical ontology containing uncertain knowledge. In other words, an ontology is only a probabilistic ontology if it contains at least one probabilistic component [33].
In the literature, there are not many works defining possibilistic ontology. In the temporal domain, there is no possibilistic temporal ontology. [34] proposes a possibilistic ontology based on WordNet in the information retrieval process. According to [35], in a possibilistic ontology, the relations between the concepts are each endowed with a degree of possibility and a degree of necessity which respectively reflect to what degree is this relation possible? And how certain is this relationship [36] propose a possibilistic extension of OWL 2, called Poss-OWL 2, to deal with uncertain geographic ontology. [37] propose a possibilistic extension of the SROIQ(D) description logic called π-SROIQ(D) by incorporating a level of certainty for different elements of the SROIQ(D) DL to deal with uncertain geographic information. [38] propose a possibilistic ontology based on a case-based reasoning (CBR) approach to perform a possibilistic semantic retrieval algorithm that handles ambiguity and uncertainty problems.
Fuzzy ontology is an extension of precise ontology according to [40, 41] and [39]. It is based on the integration of fuzzy logic in the definition of precise ontology to represent and reason about imprecise data [42]. It is very useful in many areas. According to [43], a fuzzy ontology has nine components: (1) precise concepts, (2) fuzzy concepts, (3) precise semantic relations, (4) fuzzy semantic relations, (5) precise conceptual relations, (6) fuzzy conceptual relations, (7) fuzzification relations, (8) axioms and (9) instances. The precise components keep the same definitions. [53] presents a fuzzy-based ontology, called “MemoFuzzyOnto” to represent and reason about imprecise temporal data.
In the literature, very few works define an evidential ontology. [44] propose “BeliefOWL” which is a new approach to represent uncertainty in an OWL ontology. They only considered the case of the inclusion of uncertainty in the classes. This uncertainty is modeled via the Dempster-Shafer theory of evidence. They extended the classes of OWL ontologies with masses of beliefs, then they applied structural translation rules in order to obtain a DAG (Directed acyclic graph).
Discussion
Most of the cited ontologies of data imperfection do not consider the temporal dimension. Besides, most the cited temporal ontologies do not consider the imperfections that my affect temporal data. Some of the cited ontologies consider only one type of imperfection, such as imprecision, ant not all of them.
In our previous works [45, 52] and [49], we extent the 4D-fluent approach, the OWL-ontology and the Allen’s interval algebra to represent and reason about many defined types of imperfections defined in the presented typology, that may affect temporal data, using probability, possibility and evidence theories. Based on these approaches, we proposed many ontologies of respectively, imprecise, uncertain, both imprecise and uncertain, and conflicting temporal data [50].
Based on the proposed ontologies, we create an ontology, named “TimeOntoImperfection”, that encompasses all the treated types of imperfections of temporal data, as shown in Figure 2.

“TimeOntoImperfection” ontology composition
The temporal data imperfection ontology, named “TimeOntoImperfection” 3 , is an OWL 2 and a top-level ontology. It can be merged with other ontologies of domain that must be extended to represent and reason about imperfect temporal data. It is based on the extension of the 4D-fluent approach with predefined elements of the OWL-Time ontology, elements that we have defined to represent the targeted imperfections, and the extension of Allen’s interval algebra to reason about imperfect temporal data.
“ TimeOntoImperfection” ’s components
In the literature, there is not, to our knowledge, an ontology of temporal data imperfection. We create “TimeOntoImperfection” ontology using the Protégé ontology editor 4 . It contains 5 classes, 201 object properties and 44 data type properties. They represent time interval bounds, time points, dates and time clocks as well as all measures of the different types of imperfections that are the measures of certainties, measures of possibilities, measures of necessities and belief masses.
The proposed ontology is based on 4D-fluents approach, new ontological components and components based on OWL-Time ontology to represent imperfect quantitative temporal data and associated qualitative temporal relations in OWL2.
The 4D-?uents approach introduces two classes named “TimeSlice” and “TimeInterval”, and four properties named “tsTimeSliceOf”, “tsTimeIntervalOf”, “HasBeginnig”, and “HasEnd”.
We reduce to imprecision, uncertainty, both uncertainty and imprecision at the same time and conflict as kinds of temporal data imperfections. We clarify each part of the ontology in Figure 3.

“TimeOntoImperfection” representation
The proposed representation is based on extending the 4D-fluents approach, components defined in OWL-Time ontology as well as new ontology components that we define. “TimeSlice” is the class domain for entities representing temporal parts.
“time:TemporalEntity” is a subclass of the “TimeSlice” class. “time:TimeInterval” and “time:TimeInstant” are respectively the classes representing intervals and time points. “time:DateTimeDescription” is the class representing dates and time clocks. “time:TimeInterval”, “time:TimeInstant” and “time:DateTimeDescription” are subclasses of the “time:TemporalEntity” class. Figure 4 shows the concepts of the ontology in the protégé tool.

The concepts of “TimeOntoImperfection” ontology
We present the datatype properties of the ontology related to each type of imperfection. Figure 5 presents the datatype properties of the ontology in the protégé tool.

The datatype properties of “TimeOntoImperfection” ontology
The object property “TsTimeSliceInstance”, defined in “TimeOntoImperfection, links the class “TimeSlice” and the class “time:TemporalEntity” classes.
”Is_a” connects the “time:TemporalEntity” class with the “time:TimeInterval”, “time:TimeInstant” class and the “time:DateTimeDescription” class. In addition, we define object properties that represent qualitative relationships between two intervals (Interval-Interval), a time interval and a time point (Interval-Point), a time point and a time interval (Point -Interval) and two time points (Point-Point).
We instantiate object properties based on our extensions of Allen’s algebra for all the types of the treated imperfections. For example, “RelationIntervals” can be one of Allen’s relations. In other words, 13 object properties are associated: “BeforeIntervals”, “MeetsIntervals”, “OverlapsIntervals”, “StartsIntervals”, “DuringIntervals”, “EndsInterva-ls”, “AfterIntervals”, “MetByIntervals”, “Overla-ppedByIntervals”, “StartedbyIntervals”, “Contai-nsIntervals”, “EndedbyIntervals”, and “EqualsIntervals”. Figure 6 presents the object properties of the ontology in the protégé tool.

The object properties of “TimeOntoImperfection” ontology
We extend the Allen’s algebra to:(1) reason about imperfect quantitative temporal data to infer qualitative temporal relations and (2) to reason about the qualitative temporal relations to infer new ones.
We extend the Allen’s interval algebra to reason about imperfect time intervals. When considering perfect time intervals, our approach reduces to Allen’s interval algebra. We redefine the 13 Allen’s relations to propose temporal relations between imperfect time intervals.
For example, for uncertain time intervals, let A = [A-_ca-, A+_ca+]and B = [B-_cb-, B+_cb+]be two uncertain time intervals. For instance, we redefine the relation “Before (A, B)” as: “Before_ c (A, B)”; where “c” is the certainty degree associated to the relation “Before” between A and B. This means that the uncertain ending bound of the interval A is less than the uncertain beginning bound of B. Table 3 presents Allen’s relations between uncertain intervals.
Temporal relations between two uncertain time intervals A and B
The certainty degree “c” is inferred from the certainty degrees “c_a+” and “c_b-” using a Bayesian Network [51, 52].
All the other tables presenting Allen’s relation between imperfect time intervals of the related to imprecision, simultaneously uncertainty and imprecision, and conflict are detailed in our previous works (we can not cite it in the anonymous manuscript).
We infer via a set of SWRL rules our extension of Allen’s algebra. We use Pellet reasoner. For each qualitative temporal relation, we associate an SWRL rule. For example, the SWRL rule for inferring the temporal relation“Meet (I, J)” is as follows:
Figure 7 presents an example of a SWRL rule. It deduces the “ConflictingBeforeIntervals” relation. This one means the relation "Before" that may exist between two conflicting time intervals I and J

An example of a SWRL rule
We propose a prototype based on our “TimeOntoImperfection” ontology. This prototype has been implemented in Java. The main interface of the prototype allows the user to enter perfect and/or imperfect time intervals and time points and to calculate degrees of certainty, possibility, necessity and degrees of credibility and masses of belief based on our proposed approaches as shown in Figure 8.

User Interface of our prototype
After each new temporal data entry, the “Add Qualitative Temporal Relation” component is automatically executed to deduce the missing data, in particular the associated qualitative relations and the associated measures based on the SWRL rules.
Our prototype also makes it possible to perform a search on the data entered and saved by using a filter integrated in a choice bar. This is implemented with SPARQL queries.
In this section, we present the evaluations carried out within the framework of CAPTAIN MEMO. CAPTAIN MEMO [2] is an "intelligent" memory prosthesis intended for Alzheimer’s patients at the early stage of the disease to alleviate their memory problems. It was developed in collaboration between the CEDRIC laboratory (CNAM, France) and the MIRACL laboratory (University of Sfax, Tunisia), in particular the framework of the thesis of [1]. This prosthesis is based on a fuzzy temporal OWL 2 ontology, called MemoFuzzyOnto [53], and a multilingual and multicultural ontology to represent family relationships, called PersonLink [3], which allows to model and reason about the interpersonal relationships (for example, mother, neighbour). CAPTAIN MEMO aims to offer “intelligent” user interfaces based on semantic data processing, and adapting to the patient. It offers several services.
We integrate “TimeOntoImperfection ontology into CAPTAIN MEMO and we evaluate our approaches dealing with the different types of imperfection. This evaluation involves people with Alzheimer’s disease who stay in the Sfax retirement home in Tunisia.
This section aims to evaluate the effectiveness of our approaches to the imperfections of the temporal data processed. This evaluation is carried out within the framework of CAPTAIN MEMO. We use Precision measures.
A total of 20 patients with Alzheimer’s disease P=P_1...P_20and their associated caregivers C=C_1...C_20were recruited to participate in this study. All caregivers are first-degree relatives, such as a son or wife. All patients with Alzheimer’s disease are at an early stage of the disease. They were between 68 and 75 years old. We asked each patient’s legal sponsor for the consent letter. We excluded participants with overt behavioral disturbances, severe aphasia, and severe hearing and/or visual loss. Each patient was asked to answer the same 10 questions relating to their privacy and background at each testing session. The knowledge bases associated with the participants are structured using the “PersonLink ontology.
Two scenarios are proposed:
We compare the generated knowledge base of the second scenario to the golden standard of the first scenario. We use the Precision evaluation metric: >Pi@1 (|KBi/sc1 ∩ KBi/ GS1| / |KBi/sc1|): represents the Precision associated with the patient Pi according to the first scenario. Pi@2 (|KBi/sc2 ∩ KBi/ GS1| / |KBi/sc2|): represents the Accuracy associated with the patient Pi according to the second scenario.
Table 4 and Figure 9 show that the global average accuracy associated with the second scenario (0.850) is better than the global average accuracy associated with the first scenario (0.775). These results prove the relevance of the proposed approach.

Evaluation Results
Precision of the results of the evaluation of our approach
In this paper, we propose our ontology of temporal data imperfection named “TimeOntoImperfection”. The crucial point is that there is no ontology of temporal data imperfection. As already presented in the state of the art, the ontologies which represent the temporal aspect do not deal with imperfections. We implemented a prototype based on “TimeOntoImperfection” which we then integrated into the CAPTAIN MEMO memory prosthesis.
We evaluated our work on people with Alzheimer’s disease who stay in the retirement home of Sfax in Tunisia. In this evaluation, we proposed two scenarios: the first without integration of the prototype and the second by integrating it. The precision measurements obtained show the contribution of our proposed ontology. It proves the usefulness of the approach in the small amount of the entered data as all the inferences are well established and the precision results are promising.
As for continuity of this work, we plan to work on handling larger data and more Alzheimer patients to improve the experiments process and to make evaluations from a user perspective. Besides, we plan to handle other types of imperfection such as redundancy.
