EEPSA as a core ontology for energy efficiency and thermal comfort in buildings

Abstract

Achieving a comfortable thermal situation within buildings with an efficient use of energy remains still an open challenge for most buildings. In this regard, IoT (Internet of Things) and KDD (Knowledge Discovery in Databases) processes may be combined to address these problems, even though data analysts may feel overwhelmed by heterogeneity and volume of the data to be considered. Data analysts could benefit from an application assistant that supports them throughout the KDD process and aids them to discover which are the relevant variables for the matter at hand, or informing about relationships among relevant data. In this article, the EEPSA (Energy Efficiency Prediction Semantic Assistant) ontology which supports such an assistant is presented. The ontology is developed on the basis that a proper axiomatization shapes the set of admitted models better, and therefore, establishes the ground for a better interoperability. On the contrary, underspecification facilitates the admission of non-isomorphic models to represent the same state which hampers interoperability. This ontology is developed on top of three ODPs (Ontology Design Patterns) which include proper axioms in order to improve precedent proposals to represent features of interest and their respective qualities, as well as observations and actuations, the sensors and actuators that generate them, and the procedures used. Moreover, the ontology introduces six domain ontology modules integrated with the ODPs in such a manner that a methodical customization is facilitated.

Keywords

Ontology Ontology Design Patterns buildings energy efficiency thermal comfort

1. Introduction

In the early 2000s, Klepeis et al. (2001) estimated that people used to spend around 90% of their time indoors, and this is a situation that may still hold nowadays. Thence, feeling comfortable while staying indoors is a must. Building users’ comfort can be influenced by the visual, acoustic or thermal conditions to which they may be exposed, the latter being an aspect that may end up having a significant effect. According to the ANSI/ASHRAE Standard 55-2017,1

¹
https://www.ashrae.org/technical-resources/bookstore/standard-55-thermal-environmental-conditions-for-human-occupancy

thermal comfort can be defined as: “that condition of mind that expresses satisfaction with the thermal environment and is assessed by subjective evaluation”. Being a subjective perception, under the same thermal conditions people can experience different levels of comfort. Therefore, ensuring a thermally comfortable environment for all the users of a building is not a straightforward task.

Although many times being an overlooked factor, extensive research has been conducted proving the impact of thermal comfort on humans. Haynes (2008) and Hedge and Gaygen (2010) showed the relation between indoor environment conditions and working efficiency or productivity, which have a direct effect on company revenues. Mulville et al. (2016) demonstrated that indoor environment conditions can have a significant impact on occupants comfort, morale, health and wellbeing in commercial office buildings. It is also proved by Parsons (2014) that having an uncomfortable thermal situation involves many risks including clinical diseases, health impairments, and reduced human performance and work capacity. Therefore, all this evidence reinforces the need of ensuring comfortable thermal conditions in buildings.

However, occupants’ thermal comfort is not the only concern related to buildings. According to Abergel et al. (2019), the building sector consumes more than 35% of global energy and it is responsible for nearly 40% of energy-related CO₂ emissions in the EU. This is why, with a view to meeting the energy sustainability and minimize the climate change, this sector is addressed by the set of binding legislations agreed by the European Commission in the EU 2020 climate and energy package.2

https://ec.europa.eu/clima/policies/strategies/2020_en

Therefore, the efficient management of building energy is becoming the trend for the future generation of buildings.

Fulfilling occupants’ comfort whilst reducing energy consumption is still an unsolved problem in most buildings. Furthermore, it is important to note that certain type of buildings have specific features which may further hinder this problem. For example, tertiary buildings normally contain spaces with big dimensions which are prone to have bigger thermal inertia (Verbeke and Audenaert, 2018). This results in longer periods of time to heat up or cool down spaces and consequently, they cannot be effectively climatised with rather simple solutions like thermostat-based reactive systems. Instead, heating or cooling systems need to be activated in a specific mode in advance, in order to ensure having a comfortable thermal condition in a given time. The expansion of the Internet of Things (IoT) and Knowledge Discovery in Databases (KDD; Fayyad et al., 1996) techniques may allow to improve matters in this regard. According to Gubbi et al. (2013), the IoT facilitates the monitoring of real-world qualities and events thanks to physical things equipped with electronic components and ubiquitous intelligence that allow them to connect, interact and exchange data. This led to the massive amount of data available nowadays, which has the potential to enable new discoveries and improve decision making processes. However, this data tends to be diverse and heterogeneous. Devices from different vendors may represent data in different formats, and even when a common format is used, the internal data model schema typically varies. Moreover, data may come from disparate external sources (often referred to as exogenous data), which further aggravates the data heterogeneity situation. Furthermore, the great variety of technical data hinders the human comprehension with regards to assessing which data is relevant for the matter at hand. These circumstances definitely pose a challenge for data analysts in charge of a KDD process, which is a process that enables the extraction of useful knowledge from raw data by means of five steps: data selection, preprocessing, transformation, data mining and interpretation.

In this context, data analysts have to deal with data related to the building where energy efficiency and user thermal comfort is aimed at. Not only building structural elements need to be described, but also information about sensors and actuators deployed in the building, their location, features and certainly their measurements and actuations, among others. Under such circumstances where a deep energy efficiency, thermal comfort and building domain knowledge is required to efficiently handle all this information, having insufficient domain expertise could make data analysts feel overwhelmed. Consequently, they typically resort to a trial-and-error approach searching for variables and tasks that could be confidently used to make accurate predictions. Moreover, due to the plethora of possible combination of algorithms in each KDD phase, according to Bernstein et al. (2005), even expert data analysts may turn to this trial-and-error approach.

This is definitely an undesirable approach and it would be much more profitable to count with a KDD process assistant supported by technologies that enable the management of data semantics, data interrelationships, and knowledge representation. This means it is necessary to represent features of interest and their respective qualities, as well as observations and actuations, the sensors and actuators that generate them, and the procedures used. Furthermore, observations and actuations have to be described with respect to their values, in addition to their spatial and temporal context. Additionally, the specialization of these concepts for the domain in which the assistant is framed needs to be conveyed. In this paper, the development of a core ontology that supports such a KDD assistant is described.

The rest of this article is structured as follows. Section 2 reviews ontologies related to the domain of discourse. Section 3 presents the development of the proposed core ontology. Section 4 shows the usefulness of the ontology by applying it in a real-world use case. Finally, the conclusions of this work are shown in Section 5.

2. Related work

Linking or mapping raw data to existing ontologies or vocabularies, allows a better representation of the data, structuring it and setting formal types, relations, properties, and restrictions that hold among them. In addition, as stated by Noy (2004), it allows representing data coming from multiple sources in a unified way, thereby supporting data integration. Another benefit of the semantic annotation lies in the additional background knowledge about a domain that can be added to the dataset (Paulheim and Fümkranz, 2012; Liao et al., 2015). This leads to the enrichment of the dataset, as well as enabling the application of indexing techniques, which are based on resource URIs and ensure the retrieval and navigation through related resources (Andrews et al., 2012). Last but not least, after a semantic annotation process, data is more domain-oriented than the original source and allows more application-independent solutions. Consequently, there is no need for the user to be aware of raw data’s underlying structure.

Due to the aforementioned benefits, Yoon et al. (1999), Kopanas et al. (2002) and Pinto and Santos (2009) state that annotating data semantically can contribute to improving the KDD process. However, the semantic annotation process relies on adequate ontologies to unlock the aforementioned benefits. A comprehensive comparative review of ontologies involved in conceptualizations of observations and actuations with a special focus of the building domain is described in Esnaola-Gonzalez et al. (2020). Next, a brief overview of the most relevant ontologies is presented.

The initial Semantic Sensor Network (SSNO) ontology3

³
http://www.w3.org/ns/ssn/

presented by Compton et al. (2012) was developed by the W3C Semantic Sensor Networks Incubator Group (SSN-XG) and it proposed a conceptual schema for describing sensors, accuracy and capabilities of such sensors, their observations and methods used for sensing. Concepts for operating and survival ranges were also included, as well as sensors’ performance within those ranges. Finally, a structure for field deployment was defined to describe deployment lifetime and sensing purposes. The initial SSNO ontology was aligned with DOLCE ultra-lite (DUL) ontology4

⁴

http://www.ontologydesignpatterns.org/ont/dul/DUL.owl

and built on top of the Stimulus-Sensor-Observation (SSO) ODP presented by Janowicz and Compton (2010), which describes the relationships between sensors, stimulus, and observations. The SSNO ontology has been reused by other domain ontologies including the IoT-O ontology5

⁵

https://www.irit.fr/recherches/MELODI/ontologies/IoT-O

presented by Seydoux et al. (2016), the IoT-Lite ontology6

⁶

http://www.w3.org/Submission/iot-lite/

presented by Bermudez-Edo et al. (2017) and the FIESTA-IoT ontology7

⁷

http://ontology.fiesta-iot.eu/ontologyDocs/fiesta-iot/doc

presented by Agarwal et al. (2016) to name a few. There are ontologies like the Actuation-Actuator-Effect (AAE) ODP,8

⁸

http://ontologydesignpatterns.org/wiki/Submissions:Actuation-Actuator-Effect

which adapts the SSNO ontology’s SSO ODP for actuators, by modelling the relationship between an actuator and the effect it has on its environment through actuations.

The W3C Spatial Data on the Web Working Group (SDWWG9

⁹

https://www.w3.org/2015/spatial

) proposed an update of the SSNO ontology called SOSA/SSN ontology that became a W3C recommendation and was presented by Haller et al. (2019). This new ontology follows a horizontal and vertical modularization architecture by including a lightweight but self-contained core ontology called SOSA10

¹⁰

http://www.w3.org/ns/sosa/

(Sensor, Observation, Sample, and Actuator) for its elementary classes and properties. Furthermore, the SOSA/SSN ontology’s scope is not limited to observations, but it is extended to cover actuations and samplings. In line with the changes implemented in the SOSA/SSN ontology, SOSA drops the direct DUL alignment although it can still be optionally achieved via the SSN-DUL alignment module.11

¹¹

https://www.w3.org/ns/ssn/dul

Moreover, similar to the original SSO pattern, SOSA acts as a central building block for the new SOSA/SSN ontology but puts more emphasis on its lightweight expressivity and the ability to be used standalone. It is worth mentioning that there are ontologies like the Semantic Smart Sensor Network (S3N) ontology12

¹²

https://github.com/s3n-ontology/s3n/blob/master/s3n.ttl

presented by Sagar et al. (2018) which extend the SOSA/SSN ontology and others that reuse it such as the SmartEnv ontology13

¹³

https://w3id.org/smartenvironment/smartenv.owl

Alirezaie et al. (2018).

The SEAS Ontology14

¹⁴

https://w3id.org/seas/

presented by Lefrançois (2017) is an ontology designed as a set of simple core ODPs (Ontology Design Patterns) that can be instantiated for multiple engineering related verticals. The SEAS ontology modules are developed based on the following three core modules: the SEAS Feature of Interest ontology15

¹⁵

https://w3id.org/seas/FeatureOfInterestOntology

which defines features of interest (seas:FeatureOfInterest) and their qualities (seas:Property), the SEAS Evaluation ontology16

¹⁶

https://w3id.org/seas/EvaluationOntology

describing evaluation of these qualities, and the SEAS System ontology17

¹⁷

https://w3id.org/seas/SystemOntology

representing virtually isolated systems connected with other systems. On top of these core modules, several vertical SEAS ontology modules are defined, which are dependent of a specific domain. The Procedure Execution (PEP) ontology18

¹⁸

https://w3id.org/pep/

defines procedure executors that implement procedure methods, and generate procedure execution activities. Furthermore, PEP defines an ODP as a generalization of SOSA’s sensor-procedure-observation and actuator-procedure-actuation models. Additionally, the SEAS ontology offers a set of alignments to related ontologies including the SOSA/SSN.

The Observation ODP19

¹⁹

http://ontologydesignpatterns.org/wiki/Submissions:Observation

aims at representing observations of things, under a set of parameters. This set of parameters may include the place where the observation was made, the time when it was made, and any other feature concerning the specific thing being observed.

The IoT Application Profile (IoT-AP) ontology20

²⁰

http://stlab.istc.cnr.it/IoT-AP/IoT-AP.rdf, not available at the moment of writing this article.

presented by Gangemi et al. (2017), is an ontology for representing and modelling the knowledge within the domain of the IoT. The ontology is designed re-using ODPs such as the aforementioned Observation and the time indexed situation.21

²¹

http://ontologydesignpatterns.org/wiki/Submissions:TimeIndexedSituation

It focuses on observations, but it also covers sensors that generate those observations, their values and observation collections.

The ifcOWL ontology22

²²

http://ifcowl.openbimstandards.org/IFC4_ADD2.owl

Pauwels and Terkaj (2016) provides an OWL representation of the EXPRESS schemas of the ISO 16739:201323

²³

https://www.iso.org/standard/51622.html

IFC (Industry Foundation Classes), which is the open standard developed by buildingSMART24

²⁴

https://www.buildingsmart.org/

for representing building and construction data. The ifcOWL ontology defines a faithful mapping of the IFC EXPRESS schema replicating its conceptualization, which has been found inconvenient for some practical engineering use cases Pauwels and Roxin (2017). For example, the ifcOWL ontology’s conceptualization of some relationships and properties as instances of classes (i.e., ifc:IfcRelationship and ifc:IfcProperty) is counterintuitive to the Semantic Web modelling principles that would expect OWL properties to represent them. These instances are unnecessary in most of the applications and services that may use or query this information, therefore, their presence in the RDF graph raises its complexity unnecessarily.

The Building Topology Ontology25

²⁵

https://w3id.org/bot

Rasmussen et al. (2021) (BOT) is a minimal OWL DL ontology developed by the W3C LBD (Linked Building Data) Community Group26

²⁶

https://www.w3.org/community/lbd/

for covering core concepts of a building and for defining the relationships between their subcomponents. These basic concepts and properties make the schema light and no more complex than necessary, thus making the BOT a baseline extensible with concepts and properties from more domain specific ontologies. Therefore, the BOT serves as an ontology that could promote its reuse as a central ontology in the AEC (Architecture, Engineering and Construction) domain.

The DogOnt ontology27

²⁷

http://elite.polito.it/ontologies/dogont.owl

Bonino and Corno (2008) aims at formally representing the interoperation between domotic systems. Although it primarily models devices and their states and functionalities, the DogOnt ontology also supports the description of the residential buildings where devices may be deployed. The DogOnt authors claim that the ontology could be reused as a foundation towards a shared and unified schema for the AEC ontologies interoperability. However, the latest DogOnt version available at the moment of writing this article (version 4.0.2) counts with over 1,000 classes and over 70 properties, which may be rather large to reuse in many cases.

Each of the aforementioned ontologies focuses on a limited domain interest, either observations, actuations or buildings. However, these domains cover only a portion of the scope of a KDD process assistant. To the extent of the authors’ knowledge, at the moment of writing this article there is no ontology covering all the subjects of knowledge required to sustain such an assistant.

3. The EEPSA ontology

Towards the incorporation of the semantic technologies in the assistant that supports data analysts through the KDD process, it is of utmost importance to rely on proper ontologies and vocabularies that codify the required knowledge and enables a proper annotation of the data. As a matter of fact, such an ontology or network of ontologies may be the cornerstone of this assistant (Esnaola-Gonzalez et al., 2018a).

This section describes the EEPSA (Energy Efficiency Prediction Semantic Assistant) ontology28

²⁸
https://w3id.org/eepsa

which focuses on energy efficiency and thermal comfort in buildings, but it is aimed at being reusable and easily customizable for other use cases in similar domains.

3.1. Motivation

Data analysts dealing with energy efficiency and thermal comfort problems in buildings need to handle data from various domains and with different granularity levels. This data may describe building structural element properties including materials, heat transfer coefficients, and orientation of their boundaries. For example, a room located in the second floor of a building having a skylight with 2 m² of surface, and a door with a U-factor of 2.61 that is opened by swinging to the left, and connects the hall with the southern outside part of the building. Furthermore, data analysts need to consider information about sensors and actuators deployed in the building, their location, features and certainly their measurements and actuations. For example, a temperature sensor located in the meeting room 042 that measured 22°C on 13th February 2020 at 09:40, and a blind actuator that lowered blinds of window 23 on 21st June 2019 at 15:30. Likewise, data about weather conditions and weather forecasts for the building location are relevant, such as a forecast for Madrid made by the Spanish meteorology agency on 12th May 2020 at 13:00 forecasting a relative humidity of 62% on 10th May 2020 at 15:00, or a weather report that described cloudy skies during the morning of 7th November 2019 in Amsterdam. Additionally, energy consumption and generation data needs to be addressed, such as the domestic hot water consumption of the apartment 7A on 15th October 2019, or the energy generated by the PV panels installed in the roof of an office during the 25th June 2020. Even more, thermal knowledge may be of utmost importance, for example, that the temperature of a given room which is poorly insulated will be heavily affected by the outdoor temperature, indoor humidity and the solar radiation received. Other sources of information may also be pertinent including the space occupancy, work schedule or human related organization. For example, the 30th January 2020 is a reduced working hours day or the occupancy value of the meeting room 07 at 12:00 is of 6 people.

Data analysts may also benefit from resources that help them identifying the most relevant variables of the KDD problems they are trying to solve. For example, knowing the variables that influence a given waiting room of a hospital such as its volume, the number of seats available and its comfort. Furthermore, it would definitely be helpful to know that the comfort level of a hospital’s waiting room is affected by, on the one hand, it’s sound level which is affected by the walls soundproofing and by the external noise levels, which in turn will be affected by the nearby traffic density, and on the other, its temperature affected by the indoor humidity levels, the solar radiation and the occupancy level. In addition, data analysts could take advantage of resources that facilitate the exploration of information related to the devices deployed in the waiting room and the past events as well as the scheduled events.

All this evidence reinforces the need of a core ontology that covers all the aspects involved in energy efficiency and thermal comfort problems in buildings. To the extent of the authors’ knowledge, at the moment of writing this article there is no ontology covering all the aforementioned subjects of knowledge, therefore, the development of the EEPSA ontology is a prime task, not only for semantic representation purposes, but more importantly, for supporting data analysts in KDD processes.

3.2. Ontology development methodology

Ontologies must be carefully designed and implemented, as these tasks have a direct impact on their final quality. Therefore, the use of well-founded ontology development methodologies (e.g. On-To-Knowledge presented by Sure et al. (2004) and DILIGENT presented by Pinto et al. (2004)) is advised. For the development of the EEPSA ontology, the NeOn Methodology proposed by Suárez-Figueroa et al. (2012) was followed mainly because it does not prescribe a rigid workflow but instead it suggests a variety of paths. The NeOn Methodology comprises 9 scenarios supporting different aspects of the ontology development process, and the following scenarios inspired the development of the EEPSA ontology:

Scenario 1: From specification to implementation. In this scenario, the ORSD (Ontology Requirement Specification Document (Suárez-Figueroa and Gómez-Pérez, 2012)) was created, which collected the ontology purpose, its intended users, and the set of ontology requirements in the form of Competency Questions (CQ). For building and maintaining the ontology, the Protégé29

²⁹
https://protege.stanford.edu/

(Musen, 2015) tool version 5.1.0 was used and for managing the different versions of the ontology, the Git version-control system.

Scenario 7: Reusing ontology design patterns. In this scenario, existing ODPs were reviewed (e.g. from the ODP repository OntologyDesignPatterns.org30

³⁰

http://ontologydesignpatterns.org

). Since existing ODPs could not fully satisfy the requirements captured in the ORSD, a set of basic ODPs were developed. These ODPs were used as building blocks on top of whom the rest of the EEPSA ontology was developed.

Scenario 3: Reusing ontological resources. According to Simperl (2009) the reuse of ontological resources built by others that have already reached some degree of consensus is a good practice in ontology development processes. In this scenario, a set of ontologies were reviewed, assessed and compared to decide whether they were suitable for reuse.

Scenario 4: Reusing and re-engineering ontological resources. Following with the review of ontological resources made for the Scenario 3, in this scenario, the ones that were unsuitable to be reused as-they-were, were re-engineered prior to their reuse to satisfy the requirements identified.

3.3. Ontology requirements

One of the main components of the ORSD is the identification of functional requirements. These are content specific requirements that the ontology should fulfil, or in other words, the particular knowledge to be represented by the ontology. Towards such a goal, a group of data analysts (who are the primary final users of the ontology) were interviewed in order to identify the initial set of ontology requirements. Furthermore, thermal and energy domain experts were interviewed to elicit and formalize their knowledge and satisfy the requirements identified. The acquisition of these CQs was approached with a top-down strategy, identifying complex questions first that were later on decomposed in simpler ones. This approach derived in the identification of the following set of recurrent CQs that summarize basic requirements for assisting data analysts through a KDD process:

CQ01: Which are the qualities that influence a feature of interest?

CQ02: Which are the qualities that affect a given quality of a feature of interest?

CQ03: Which feature of interest does a given quality belong to?

CQ04: Which are the observations/actuations performed by a given procedure?

CQ05: Which are the observations/actuations performed by a given sensor/actuator?

CQ06: Which are the procedures implemented by a given sensor/actuator?

CQ07: Which are the features of interest on a given observation/actuation?

CQ08: Which are the qualities sensed/actuated by a given observations/actuations?

CQ09: Which are the features of interest of a given sensor/actuator?

CQ10: Which are the qualities sensed/actuated by a given sensor/actuator?

CQ11: Which is the value of an observation?

CQ12: When was an actuation generated?

CQ13: For what time interval or instant is valid an observation?

CQ14: For what spatial location is valid an observation?

For each competency question CQn, a twin competency question CQnⁱ can be considered, which consists in rephrasing the question in the opposite direction. For example, CQ01ⁱ would be defined as “Which are the features of interest influenced by a given quality?”. In terms of a SPARQL query, it means that the query variable is moved from the subject position to the object position, or the other way round, in the triple pattern.

Apart from these 14 basic CQs, 53 additional CQs were identified. The final 67 CQs were validated by a review performed by final users (data analysts) and domain experts, as suggested by Wieringa (1996). Although it was confirmed that this group of users was not aware of additional requirements, the EEPSA ontology is designed to be extended with additional knowledge from different experts to satisfy new use cases that may appear.

3.4. Developing the EEPSA ontology on top of ODPs

In ontology development processes, recurrent design problems may arise. Indeed, these problems may happen during the ontology conceptualization activity, the ontology formalization activity, or during the ontology implementation activity. Gangemi and Presutti (2009) defines an ODP as a modelling solution to solve this kind of problems. Furthermore, according to Hitzler et al. (2016), ODPs should ideally be extensible but self-contained, minimize ontological commitments to foster reuse, address one or more explicit requirements (such as use cases or competency questions), be associatable to an ontology unit test (Vrandečić and Gangemi, 2006), be the representation of a core notion in a domain of expertise (Gangemi and Presutti, 2010), be alignable to other patterns, span more than one application area or domain, address a single invariant instead of targeting multiple reoccurring issues at the same time, follow established modelling best practices, and so forth.

Developing the EEPSA ontology on top of ODPs was found convenient due to the great flexibility provided by this modelling solution, which allows a proper segmentation of the intended conceptualization. In this case, the considered recurrent CQs that summarize the basic requirements were divided in three subsets according to the subject of knowledge they covered: {CQ01, CQ02, CQ03}, {CQ04, CQ05, CQ06, CQ07, CQ08, CQ09, CQ10} and {CQ11, CQ12, CQ13, CQ14}.31

³¹
The rest of the CQs considered were satisfied by developing ontology modules as explained in Section 3.5.

In order to solve each of those subsets, an ODP was defined. The proposed ODPs are inspired by existing ontologies and ODPs which address similar CQs but they do not fully satisfy those previously enumerated.

Even though these ODPs are motivated by energy efficiency and thermal comfort problems in buildings, they are designed to be applicable to similar problems in different use cases. Therefore, for each ODP a set of alignments or mappings are developed. These alignments target domain ontologies as well as upper-level ontologies, as setting mappings to a common upper ontology alleviates integration problems (Noy, 2004), helps to ensure clarity in modelling and avoids errors that have unintended reasoning implications (Cox, 2016). These alignments are kept in separate files and are available online in each ODP’s documentation page. Furthermore, an instantiation example of each ODP is shown in the Appendix, along with SPARQL queries that solve some CQs.

Next, the three proposed ODPs are presented: the AffectedBy ODP,32

³²

https://w3id.org/affectedBy

the EEP (Execution-Executor-Procedure) ODP33

³³

https://w3id.org/eep

and the RC (Result-Context) ODP.34

³⁴

https://w3id.org/rc

3.4.1. The AffectedBy ODP

Data analysts dealing with energy efficiency problems in buildings would benefit from a resource that supports the discovery of relevant variables that affect the environment of a given space or another feature of interest. Any of these variables will be represented as qualities of a feature of interest. Specifically, the competency questions CQ01, CQ02 and CQ03 must be considered. Therefore, the conceptualization must include classes representing features of interest (aff:FeatureOfInterest) and their qualities (aff:Quality).

In the context of this article, the notion of a feature of interest can be understood as anything of a domain of discourse which is relevant to be modelled for analysing it. It may be a physical or an abstract thing, real or imaginary, a social convention or an invented concept. Typically, the analysis of a feature of interest is based on the qualities that describe aspects of its form or state, its content or context, its flow or dynamics. The appropriate representation and modelling of these qualities is relevant for supporting the analysis of features of interest. Furthermore, it is interesting to represent not only the association of qualities with features of interest, but also the relationships among qualities, possibly from different features of interest, or with pertinent contextual information.

As for the qualities, they can be considered basic entities which are qualifiable, quantifiable, observable, or operable. According to Masolo et al. (2003) where the DOLCE documentation was presented: “Qualities inhere to entities: every entity (including qualities themselves) comes with certain qualities, which exist as long as the entity exists. Within a certain ontology, we assume that these qualities belong to a finite set of quality types (like color, size, smell, etc., […]), and are characteristic for (inhere in) specific individuals: no two particulars can have the same quality, and each quality is specifically constantly dependent on the entity it inheres in: at any time, a quality can’t be present unless the entity it inheres in is also present”.

The SOSA/SSN ontology contains a building block that may be useful for this matter. As a matter of fact, the ssn:Property class is textually defined as “a quality of an entity. An aspect of an entity that is intrinsic to and cannot exist without the entity”, which comes directly from its predecessor the SSNO ontology that inherited it from DOLCE. Furthermore, the ssn:Property class is linked to the sosa:FeatureOfInterest class with the ssn:isPropertyOf object property. However, an inconvenience was spotted. This object property is not declared functional, so the following triples (graphically shown in Fig. 1) can be found in a triple set annotated with SOSA/SSN terms:

Fig. 1.

A SOSA/SSN annotated set of triples where the sensor observing temperature of room03 cannot be precisely found.

According to the aforementioned ssn:Property’s class textual definition, individual :temperature is intrinsic to and cannot exist without the existence of individual :room03. However, the triples shown contradict such definition because the individual :temperature represents a generic quality, that is, it is a quality of different entities. It is not possible to answer the query “Which sensor observes temperature of :room03?” precisely. It can be argued that more triples should be added concerning the observations that relate sensors and rooms. However, that could not be enough, as will be shown later in Section 3.4.2. As a matter of fact, García-Castro et al. (2020) verified that many datasets adopted the use of generic qualities. Moreover, it should be noted that such a representation may conceal conceptual disagreements if proper attention to conceptualization is neglected. For example, a cooling system may have three qualities of different nature: inlet air temperature, outlet air temperature and coolant temperature. The faithfulness to the DOLCE conceptualization requires that there should exist three different individuals representing the three different specific qualities, respectively. A recent publication about the SOSA/SSN ontology (Haller et al., 2019) is aware of this duality (generic, specific) and explicitly expresses that “multiple observations across different features of interest or by different sensors or both can measure the same generic property”. The publication also recognizes the choice to represent observable properties as inherent characteristics specific to a feature of interest. Therefore, the SOSA/SSN ontology allows different ways of modelling observable properties and it is expected that “communities and applications to develop their own approaches to building catalogues of observable properties and choosing appropriate levels of specificity”. However, the fact that different stakeholders adopt different modelling options may derive in interoperability problems. Frequently, the option of using generic qualities is promoted due to its simplicity that encourages reusability, but underspecification worsens the interoperability. According to Guarino et al. (2009), widening the set of admitted models goes against precisely approximate the set of intended models. In this case, the SOSA/SSN ontology admits different models to represent the same state of affairs but these models are not isomorphic, since one of them can be transformed into the other but the inverse transformation is not always possible. As a consequence, some questions can be correctly answered with one model but not with the other (as will be shown later in Section 3.4.2).

The AffectedBy ODP defines the aff:belongsTo object property as functional to support the notion that a quality is intrinsic to the feature of interest to which it belongs (inheres in). It is defined with aff:Quality as domain and aff:FeatureOfInterest as range, and it solves CQ03. Furthermore, the following axiom formalizes that every quality belongs to a feature of interest:

aff:Quality ⊑ ∃aff:belongsTo.aff:FeatureOfInterest .

In order to tackle this specific issue and to solve CQ02, the aff:affectedBy object property is introduced. This property has class aff:Quality both as its domain and its range, and it is declared to be transitive. Notice that no similar object property exists in the SOSA/SSN ontology.

Then, in order to solve CQ01, the aff:influencedBy35

³⁵

In the previous version of the AffectedBy ODP published in Esnaola-Gonzalez et al. (2018b), this object property was named aff:hasQuality. However, it was renamed after aff:influencedBy to avoid misleading interpretations.

object property with aff:FeatureOfInterest as its domain and aff:Quality as its range is introduced. The notion to be captured here is that an individual of aff:FeatureOfInterest is influenced by the qualities belonging (inhere) to it and also by the qualities affecting the qualities of the individual. Therefore, the following axioms are included in the AffectedBy ODP:

aff:belongsTo⁻¹ ⊑ aff:influencedBy, and

aff:influencedBy ∘ aff:affectedBy ⊑ aff:influencedBy .

It is worth noticing that, with the presented axiomatization of the AffectedBy ODP, only basic assertions using aff:belongsTo and aff:affectedBy are initially needed since the rest of true facts are inferred by OWL2 DL reasoners.

Fig. 2.

The AffectedBy ODP. (F) represents functional and (T) transitive properties.

A diagram of the AffectedBy ODP is shown in Fig. 2.

The SEAS Feature of Interest ontology presents a similar design to AffectedBy ODP. The seas:isPropertyOf object property links a seas:Property to a seas:FeatureOfInterest and it is declared functional. However, the property seas:hasProperty (inverse of seas:isPropertyOf) does not play the role of aff:influencedBy. Moreover, it also defines a symmetric object property seas:derivesFrom which links a seas:Property to another seas:Property it derives from, whose conceptualization is different from the aff:affectedBy object property. As a matter fact, the AffectedBy ODP and the SEAS Feature of Interest ontology could be defined as extensions of each other.

AffectedBy ODP alignments. The AffectedBy ODP is aligned with the SOSA/SSN ontology and the SEAS Feature of Interest ontology. Furthermore, it is mapped with the upper-level DUL ontology. These alignments are kept in separate files and are available online in the AffectedBy ODP’s documentation page https://w3id.org/affectedBy.

3.4.2. The EEP ODP

Another interesting information for data analysts working on energy efficiency and thermal comfort problems in buildings is addressed by CQ04, CQ05, CQ06, CQ07, CQ08, CQ09 and CQ10. These CQs are the requirements considered for the EEP ODP.

Fig. 3.

A SOSA/SSN annotated set of triples.

It may be questionable why competency questions related to results of observations or actuations are disregarded in this ODP, specially because it is common to include this information as parameters of observations or actuations. However, there are some modelling alternatives such as the SEAS Evaluation ontology, where the qualification of the value of a seas:Property is preferred. Moreover, different conceptualizations of the result and their spatio-temporal context may be conceived depending on the application. This is the rationale behind designing a separate ODP (presented in the next subsection) to represent result-related matters. Such a design intends to improve the reusability of the proposal, allowing users to easily replace such ODP if they are not satisfied with its modelling decision.

The aforementioned CQs (CQ04 to CQ10) have been tackled by the SOSA/SSN ontology. However, the following (typical) set of triples annotated with SOSA/SSN (graphically shown in Fig. 3) cannot properly solve a question like CQ10ⁱ: which is the sensor that observes the temperature of :room07?

The following query could try to answer the previous question, but the received answer is imprecise.

The rationale behind this issue is that the property path linking sensors to features of interest (through the sosa:Observation class) is not sufficiently constrained and therefore, there are admitted models which are not useful to answer some queries. Underspecification impedes enough discrimination of situations.

The proposed Execution-Executor-Procedure (EEP) ODP is an adaptation of the PEP ontology to fully satisfy the required competency questions, overcoming the indicated weaknesses about SOSA/SSN. Again, a careful axiomatization has been provided that permits to infer desirable property values from a minimum set of triple assertions, in contrast to other proposals where the lack of inferencing possibilities oblige to assert the complete set of needed triples.

The EEP ODP imports the AffectedBy ODP alongside with its notion that a quality is intrinsic to the feature of interest it belongs to. Apart from the two classes imported from the AffectedBy ODP (i.e. aff:FeatureOfInterest and aff:Quality), the EEP ODP consists of three more classes: eep:Execution, eep:Executor, and eep:Procedure (see Fig. 4). An individual of eep:Execution is an event upon a quality of a feature of interest, produced by an agent by performing a procedure. As for an individual of eep:Executor, it is an agent capable of performing tasks by following procedures. Lastly, an individual of eep:Procedure is a description of some actions to be executed by agents.

Fig. 4.

The Execution-Executor-Procedure (EEP) ODP. (F) represents functional and (T) transitive properties.

Note that individuals of class eep:Execution can be abstractly represented by a ternary relationship of its executor, the procedure used to produce the execution, and the quality of the feature of interest being considered. Accordingly, the class eep:Execution is the domain of the three functional object properties: eep:madeBy, eep:usedProcedure, and eep:onQuality. Moreover the following axioms are introduced:

eep:Execution ⊑ ∃eep:madeBy.epp:Executor,

eep:Execution ⊑ ∃eep:onQuality.eep:Quality, and

eep:Execution ⊑ ∃eep:usedProcedure.eep:Procedure

The object property eep:madeBy links an execution to the agent that performs the action; the object property eep:usedProcedure links an execution to the procedure that describes the task to be performed; and the object property eep:onQuality links an execution to the quality concerned by the execution. These three functional object properties jointly with the functional aff:belongsTo form the backbone of the EEP ODP.

The remaining object properties are: eep:implements, linking executors to procedures; eep:hasFeatureOfInterest, linking executions to features of interest; eep:forQuality, linking executors to qualities; and eep:forFeatureOfInterest, linking executors to features of interest. The values of all of them are inferred by the values of the four functional properties that form the backbone, due to the corresponding property chain axioms included in the EEP ODP:

eep:madeBy⁻¹ ∘ eep:usedProcedure ⊑ eep:implements,

eep:onQuality ∘ eep:belongsTo ⊑ eep:hasFeatureOfInterest,

eep:madeBy⁻¹ ∘ eep:onQuality ⊑ eep:forQuality, and

eep:forQuality ∘ eep:belongsTo ⊑ eep:forFeatureOfInterest .

EEP ODP alignments. The EEP ODP is aligned with the SOSA/SSN ontology, the PEP ontology and PROV-O. Furthermore, it is mapped to the upper-level DUL ontology. These alignments are kept in separate files and are available online in the EEP ODP’s documentation page https://w3id.org/eep.

3.4.3. The RC ODP

Although the AffectedBy and EEP ODPs alleviate much of the data analysts’ information needs, they may still require from data representing the results of the executions and their contexts. For example: which is the value of an observation? Or when was an actuation performed? This information may be collected answering the competency questions CQ11, CQ12, CQ13 and CQ14.

Every ontology or ontology network covering observations or actuations need to take into account the representation of these actions’ results. For example, the SOSA/SSN ontology uses the sosa:hasResult object property, the IoT Application Profile (IoT-AP) ontology36

³⁶
http://stlab.istc.cnr.it/IoT-AP/IoT-AP.rdf, not available at the moment of writing this article.

published by Gangemi et al. (2017) uses iotap:hasObservationValue and om-lite uses om-lite:result object property. Values of these properties can be complex objects that usually include units of measurement, the measurement value, and some other optional parameters. However, sometimes a simple representation with a literal type value may suffice. In order to tackle these situations SOSA/SSN proposes the sosa:hasSimpleResult datatype property. Furthermore, properties representing results are typically associated to observations and actuations, even though there are alternative modelling options. For example, in the SEAS ontology network, the SEAS Evaluation ontology associates seas:value and seas:simpleValue properties to the seas:Property class.

With respect to the proposed Result-Context (RC) ODP (shown in Fig. 5), the representation of both complex and simple results is modelled with the object property rc:hasResult and the datatype property rc:hasSimpleResult respectively. This way, CQ11 is solved.

Fig. 5.

The Result-Context (RC) ODP.

There are occasions in which parameters referring to temporal and spatial aspects may be necessary to qualify a result. Regarding the representation of temporal aspects, the SOSA/SSN ontology distinguishes two notions: the time when the result of an observation, actuation, or sampling applies to the feature of interest (with the object property sosa:phenomenonTime) and the instant of time when such an observation, actuation or sampling was completed (with the datatype property sosa:resultTime). The phenomenon time is specified with an individual of OWL-Time ontology’s time:TemporalEntity class as it may be either an instant, an interval of time, or even a temporal complex. Meanwhile, the result time describes an instant represented with xsd:dateTime. As for the SEAS Evaluation ontology, the temporal context is modelled with the seas:hasTemporalContext object property that links an evaluation with its temporal entity modelled as an individual of time:TemporalEntity. Furthermore, PROV-O also enables the representation of temporal context. Specifically, the prov:generatedAtTime datatype property allows representing the completion of production of a new entity, which would be similar to the sosa:resultTime datatype property.

With respect to the RC ODP, it defines two properties inspired by the design of the SOSA/SSN ontology: rc:hasGenerationTime which is equivalent to sosa:resultTime, and rc:hasTemporalContext which is equivalent to sosa:phenomenonTime. These definitions solve CQ12 and CQ13 respectively.

When using the SOSA/SSN ontology, spatial aspects of an observation/actuation/sampling are expected to be associated with the feature of interest, the sensor/actuator/sampler or the platform on which they are mounted. However, the representation of this association is not covered by the ontology itself and has to be made by deferring to external ontologies. By contrast, the SEAS Evaluation ontology leans towards a modelling option which is similar to the temporal aspect. Namely, it defines the seas:hasSpatialContext that links an evaluation to its spatial validity context represented as an individual of geo:SpatialThing class.

In the RC ODP, the rc:hasSpatialContext object property has been defined. It plays seas:hasSpatialContext property’s same role, but it has eep:Execution class as domain, and geo:SpatialThing as range. This object property solves CQ14.

The RC ODP opted to define its own URIs for these properties, in order to be self-sufficient and completely integrated with the EEP ODP, but following the good practices of the ontology engineering, the equivalences with the corresponding SOSA/SSN ontology terms are explicitly represented in the alignment files.

RC ODP alignments. The RC ODP is aligned with the SOSA/SSN and PROV-O ontologies. These alignments are kept in separate files and are available online in the RC ODP’s documentation page https://w3id.org/rc.

The RC ODP is designed as a horizontal extension of the EEP ODP. But there are cases where data analysts may require from both ODPs so they need to be used jointly. For example:

Which is the temperature value of room 03 on 2018-11-20 at 16:00?

These three ODPs are the cornerstone of the EEPSA ontology. As a matter of fact, the classes defined by the AffectedBy and EEP ODPs act as stub classes, and for each of them an ontology module is developed. The EEPSA ontology is the integration of the following ontological resources: the three ODPs presented (AffectedBy, EEP and RC), five ontology modules specializing the stub classes defined by these ODPs (FoI4EEPSA, Q4EEPSA, P4EEPSA, EXR4EEPSA and EXN4EEPSA), and an ontology module containing expert knowledge (EK4EEPSA). Figure 6 shows an overview of the EEPSA ontology.

Fig. 6.

Overview of the EEPSA ontology.

3.5. Ontology modules

The ontology modularization consists in partitioning them into independent self-contained knowledge components. Such a modular approach brings benefits, including the flexibility for component reuse (Grau et al., 2008), the support for more efficient query answering (Stuckenschmidt and Klein, 2007), and the enhancement of component changes and evolution (Ensan and Du, 2013).

When an already existing ontology is large and monolithic, it needs to be split up in order to benefit from the mentioned advantages. There are different techniques that perform ontology partitioning by dividing an ontology into a set of significant modules that together form the original ontology. However, according to d’Aquin et al. (2009) there is no universal way to modularize an ontology, and the choice of a particular technique or approach should be guided by the requirements of the application or use case.

The implementation of ontology modularization techniques is advised in early ontology development stages because, otherwise, it could end up being a complex task. Following this advice, the EEPSA ontology is modularized by design in order to address the remaining CQs that were not addressed by the three ODPs. The EEPSA ontology modules are presented below.

3.5.1. FoI4EEPSA (Feature of Interest for EEPSA) ontology module

This ontology module covers the knowledge specializing the aff:FeatureOfInterest class for the EEPSA Ontology. In the context of this article, a feature of interest is understood as an abstraction of a phenomenon (object, event, etc). A feature of interest is then described in terms of its qualities, which are qualifiable, quantifiable, observable or operable.

In particular, the FoI4EEPSA ontology module37

³⁷
https://w3id.org/eepsa/foi4eepsa

tries to tackle CQs such as the following:

Which building does a given space belong to?

How many spaces does a building have?

Which storey is a given space located on?

Different ontologies that cover the representation of the building domain were reviewed (a survey can be found in Esnaola-Gonzalez et al. (2020)), and finally BOT38

³⁸

https://w3id.org/bot

was considered to be reused for basic building topology descriptions due to its conciseness, well-presented documentation, careful metadata with explanatory descriptions of the intended meanings of their terms, and alignments to other domain ontologies.

As for representing building elements, which are also an important part of the domain at hand, the FoI4EEPSA ontology module needs to solve the following CQs:

Which space does a given door belong to?

How many walls does a given space have?

Is a given window adjacent-to-outdoors?

To this end, the Building Product Ontology (PRODUCT39

³⁹

https://github.com/w3c-lbd-cg/product

) proposed by the W3C LBD community group was considered. PRODUCT (which at the moment of writing this article is still under development) has a much wider coverage scope than needed, so its importation would result in increasing EEPSA ontology’s size with unnecessary concepts. The Building Element Ontology,40

⁴⁰

https://pi.pauwel.be/voc/buildingelement

which has been proved to be valid semantically tagging or classifying bot:Element instances in van Berlo et al. (2019), was also considered. However, it has several concepts that may result too technical for the end users of the EEPSA ontology. Therefore, following the simplicity goal of the EEPSA ontology, importing PRODUCT or the Building Element Ontology was discarded. Instead, a set of building elements identified in the EEPSA ontology requirements were defined, such as doors (foi4eepsa:Door) and windows (foi4eepsa:Window). Furthermore, a class foi4eepsa:ExternalBuildingElement was defined to represent building elements that face outdoors. This representation mimics the approach followed by Díaz et al. (2013) in EEOnt (Energy Efficiency Ontology), and allows the representation of doors and windows that face outdoors (via foi4eepsa:ExternalDoor and foi4eepsa:ExternalWindow classes), as well as external walls (foi4eepsa:ExternalWall). These new terms defined in FoI4EEPSA are mapped to the related PRODUCT ontology terms and PRODUCT is in turn aligned with the IFC4 Addendum 2 standard,41

⁴¹

https://standards.buildingsmart.org/IFC/RELEASE/IFC4/ADD2_TC1/HTML

making the FoI4EEPSA ontology module interoperable. Likewise, FoI4EEPSA terms are mapped also to the Building Element Ontology.

Last but not least, information related to the building context is also an important aspect. Namely, FoI4EEPSA has to solve the following CQs:

Which is the intended use of the building?

When was the building built?

Which is the gross floor area of the building?

IFC presents a comprehensive collection of property sets (known as PSETs) for describing different aspects of building and building-related context. However, the conceptualization of these properties in the ifcOWL ontology42

⁴²

http://ifcowl.openbimstandards.org/IFC4_ADD2.owl

(Pauwels and Terkaj, 2016) as instances of classes (e.g. ifc:IfcIdentifier or ifc:IfcLabel) is counterintuitive to Semantic Web principles that would expect OWL properties to represent them. Therefore, inspired by the semantic transformations proposed by de Farias et al. (2015), FoI4EEPSA defines a re-engineering of the relevant properties contained in the IFC PSET Building Common and the IFC PSET Building collections. Namely, concepts such as foi4eepsa:hasYearOfConstruction are used to represent the construction year of a building, and foi4eepsa:hasMarketCategory to define the final use of the building (e.g. residential or commercial).

It is worth noting that, although the FoI4EEPSA ontology module is mainly focused on building-related elements, it may be extended with other features of interest such as events that may be of interest to represent. Examples of such events include meetings that may be scheduled for a given room of a building, or even the Demand Response events where customers are asked to increase or decrease their energy consumption to help electric utilities. These events should ideally be modelled by the EEPSA ontology user or, preferably, imported from other existing ontologies such as the OpenADR ontology43

⁴³

https://w3id.org/def/openadr

presented by Fernández-Izquierdo et al. (2020) for Demand Response events.

3.5.2. Q4EEPSA (quality for EEPSA) ontology module

This ontology module covers the knowledge specializing the aff:Quality class, which refers to qualities or aspects of a feature of interest that are intrinsic to and cannot exist without the feature of interest.

In particular, the Q4EEPSA ontology module44

⁴⁴
https://w3id.org/eepsa/q4eepsa

tries to tackle CQs such as the following:

Which are the actuatable qualities?

Which are the predictable qualities?

Which are the thermal comfort qualities?

In Q4EEPSA, two categories of qualities are differentiated. On the one hand, observable qualities of a feature of interest defined by the class q4eepsa:ObservableQuality. Bearing in mind the conceptualization of observation proposed by the O&M (Observations and Measurements) model described in ISO 19156:201145

⁴⁵

https://www.iso.org/standard/32574.html

and followed by the EEPSA ontology, this class comprises qualities that can be observed, estimated and even forecast. These qualities include the cloud coverage of a given location or the occupancy of a given room. On the other hand, qualities of a feature of interest that can be acted on, such as the indoor temperature of a living room, are defined by the class q4eepsa:ActuatableQuality. Qualities that are relevant for the EEPSA’s domain of discourse are classified into at least one of the aforementioned classes. Likewise, qualities that belong to these categories are also classified into orthogonal groups according to dimensions like their area of interest.

Meteorological qualities such as solar radiation (q4eepsa:SolarRadiation) or cloud coverage (q4eepsa:CloudCover) are defined as subclasses of q4eepsa:MeteorologicalQuality, which are observable but not actuatable, as defined with the following axiom:

q4eepsa:MeteorologicalQuality ⊑

q4eepsa:ObservableQuality ⊓ ¬q4eepsa:ActuatableQuality .

Qualities that may have an influence on the comfort of building users such as indoor temperature (q4eepsa:IndoorTemperature) and indoor humidity (q4eepsa:IndoorHumidity) are represented as subclasses of q4eepsa:ComfortQuality. Since comfort is assessed by subjective evaluation, the same objective measurement of a given quality may be perceived in different ways by different people. For example, although a given room may have an illuminance level of 100 lux, a person may consider it too bright and another one too dark. These qualities can be observed and acted on. Furthermore, qualities related to the resource consumption derived from the building operation such as water consumption (q4eepsa:WaterConsumption) or electric generation (q4eepsa:ElectricGeneration) are also defined. These concepts are described as subclasses of q4eepsa:ResourceConsumptionQuality, which is observable. However, even though it can be indirectly acted on (for example with consumption restriction strategies), a consumption is not directly actuatable, so that it is not categorised as a subclass of q4eepsa:ActuatableQuality.

Some of the mentioned classes are reengineered and reused from the M3-lite taxonomy46

⁴⁶

http://purl.org/iot/vocab/m3-lite

(which is a light version of the M3 ontology presented by Datta et al. (2015)), because it contains a great set of well-organized quality classes. Other additional ontologies and vocabularies have also been considered for the same purpose, such as the QUDT Quantity Kinds vocabulary,47

⁴⁷

http://qudt.org/2.1/vocab/quantitykind

although the quality concepts are defined as individuals instead of classes.

Fig. 7.

Overview of the classes defined in Q4EEPSA (visualized in Protégé).

The Q4EEPSA ontology module is aligned with related ontologies such as SAREF, the SEAS Generic Property ontology48

⁴⁸

https://w3id.org/seas/GenericPropertyOntology

and the QUDT Quantity Kinds vocabulary.

Figure 7 shows an overview of the main Q4EEPSA classes.

3.5.3. P4EEPSA (procedure for EEPSA) ontology module

This ontology module covers the knowledge specializing the eep:Procedure class, which represents workflows, protocols, plans, algorithms, or computational methods specifying how to produce an event.

In particular, the P4EEPSA ontology module49

⁴⁹
https://w3id.org/eepsa/p4eepsa

tries to tackle CQs such as the following:

Which are the actuating procedures?

Which are the predictive procedures?

Which are the imputation procedures?

P4EEPSA represents four types of procedures: actuating procedures (p4eepsa:ActuatingProcedure), specifying how to act on an event; sensing procedures (p4eepsa:SensingProcedure), specifying how to sense an event; imputation procedures (p4eepsa:ImputationProcedure), specifying how to impute an event; and predictive procedures (p4eepsa:PredictiveProcedure), specifying how to predict an event. Such a procedure classification is a requirement of the data analyst assistant.

The fine-grained description of the procedures themselves are not a requisite of the EEPSA ontology, so this knowledge has to be modelled by the user, or preferably imported from other existing ontologies. With a view to easing the extension of the P4EEPSA ontology in that direction, it is aligned with the ML Schema Core Specification.50

⁵⁰

http://www.w3.org/ns/mls

The core vocabulary of ML Schema was developed by the W3C’s Machine Learning Schema Community Group,51

⁵¹

https://www.w3.org/community/ml-schema/

and it can be used to represent the algorithms, the machine learning tasks they address, their implementations and executions, as well as the inputs (e.g., data) and outputs (e.g., models) they specify. Another ontology that could be considered is the WiLD ontology52

⁵²

http://purl.org/wild/vocab

proposed by Käfer and Harth (2018), aimed at describing workflows in Linked Data.

3.5.4. EXR4EEPSA (executor for EEPSA) ontology module

This ontology module covers the knowledge specializing the eep:Executor class, which represents agents that produce an event by implementing a procedure.

The EXR4EEPSA ontology module 53

⁵³
https://w3id.org/eepsa/exr4eepsa

tries to tackle CQs such as the following:

Which type of sensor is a given sensor?

Is a given executor a window actuator?

Is a given executor a predictive model?

EXR4EEPSA concepts are categorised in four different classes: sensors, actuators, predictive models and imputation methods. The class exr4eepsa:Sensor represents agents that implement a procedure to sense a value change in a real-world quality. Following the SOSA/SSN ontology’s conceptualization, a sensor is not necessarily a physical device; it can also be virtual, even a human being. Sensors are classified in two main classes: meters and environment sensors. On the one hand, the class exr4eepsa:UtilityMeter defines a set of meters measuring water, heat, gas or electricity consumption, as well as meters to observe the energy generated by a photovoltaic panel. On the other hand, sensors observing environment conditions (exr4eepsa:EnvironmentSensor) include anenometers (exr4eepsa:Anenometer), for sensing wind speed and humidity sensors (exr4eepsa:HumiditySensor). Furthermore, these environment sensors include the exr4eepsa:AirQualitySensor subclass comprising agents sensing air pollution and gases in the surrounding area (e.g. exr4eepsa:CO2Sensor).

The class exr4eepsa:Actuator represents agents that implement a procedure to act on a real-world quality. This concept is more general than the seas:Actuator, iot-lite:ActuatingDevice or sosa:Actuator classes since, similarly to sensors, the agent does not necessarily need to be a device or a physical element. It can be, for example, a software that switches a light bulb on or off. This class includes a set of common actuators for an energy efficiency problem in tertiary buildings, such as door actuators (exr4eepsa:DoorActuator) and window actuators (exr4eepsa:WindowActuator).

The EXR4EEPSA ontology module is not aimed at making an exhaustive representation of different types of sensors and actuators. Instead, it focuses on describing sensors and actuators that are recurrent to energy efficiency and thermal comfort problems in buildings. Furthermore, two additional high-level class of executors are defined in the EXR4EEPSA ontology module. The first one is the class exr:PredictiveModel, representing agents that implement a predictive modelling procedure to forecast unknown or future outcomes. The second one, the class exr:ImputationMethod, describes agents that implement a procedure to compute an estimation of missing values.

Some of the classes created to satisfy the identified CQs are inspired by the M3-lite taxonomy. However, they are not reused because they do not represent the same concept of sensors/actuators (e.g. M3-lite represents only physical sensors, while in the context of EXR4EEPSA sensors are not necessarily physical objects). Some other classes are reengineered and reused from the SEAS Smart Meter ontology.54

⁵⁴

https://w3id.org/seas/SmartMeterOntology

Consequently, the EXR4EEPSA ontology module is aligned with these two related domain ontologies.

3.5.5. EXN4EEPSA (execution for EEPSA) ontology module

This ontology module covers the knowledge specializing the eep:Execution class. This class represents events or actions made by an agent executing a task implemented by a procedure with respect to a quality of a feature of interest.

In particular, the EXN4EEPSA ontology module55

⁵⁵
https://w3id.org/eepsa/exn4eepsa

tries to tackle CQs such as the following:

Is a given execution an actuation?

Is a given execution an observation?

Which are the imputed observations?

To that end, this ontology module defines three main concepts: an observation (exn4eepsa:Observation), which is an execution made by an executor to estimate or calculate a value of a quality of a feature of interest; an actuation (exn4eepsa:Actuation), which is an execution made by an executor to act upon a quality of a feature of interest; and a missing value (exn4eepsa:MissingValue), which happens when executions are empty or null in attributes where a value should have been recorded. Likewise, an observation can be predicted or forecast (exn4eepsa:Forecast), obtained after using an imputation method (exn4eepsa:Imputation), or it can even be an outlier (exn4eepsa:Outlier) when it does not conform to the expected behaviour.

In addition, multiple executions can be grouped in collections, such as a sequence of missing values, or the collection of observations forecast by a predictive model. Therefore, the following CQs need to be answered:

Which are the executions of a given collection?

Which collection’s member is a given execution?

The EXN4EEPSA ontology module defines the class exn4eepsa:CollectionOfExecutions for representing a set of executions. Furthermore, object properties exn4eepsa:hasMember and its inverse exn4eepsa:isMemberOf are defined to associate individuals of class eep:Execution that belong to a collection of executions, and vice versa.

Such a detailed hierarchy of concepts is motivated by the relevance these concepts may have in data analysis problems. Furthermore, the EXN4EEPSA ontology module is aligned with a set of domain ontologies such as the SOSA/SSN ontology, the SEAS Device ontology,56

⁵⁶

https://w3id.org/seas/DeviceOntology

SAREF and om-lite ontology. It is important to note that other ontologies such as SmartEnv and S3N can be indirectly aligned with EXN4EEPSA since they are based on the SOSA/SSN ontology.

3.5.6. EK4EEPSA (expert knowledge for EEPSA) ontology module

This ontology module covers the necessary expert knowledge to provide inferencing capabilities that can be exploited by the data analyst assistant. This module is defined under the supervision of experts in the domain at hand in order to capture task-based knowledge.

In particular, the EK4EEPSA57

⁵⁷
https://w3id.org/eepsa/ek4eepsa

ontology module tries to tackle CQs such as the following:

What is a naturally enlightened space?

Which types of spaces are in a building?

Which are the qualities affecting the temperature of a badly insulated space?

On the one hand, the EK4EEPSA ontology module defines a classification of types of spaces in buildings. These space definitions are based on their structural features, such as spaces in contact with outdoor environment (ek4eepsa:AdjacentToOutdoorSpace) or spaces located below the ground floor (ek4eepsa:BelowGroundLevelSpace). However, other space definitions may be incorporated, such as the ones proposed by the HBC (Human Comfort in Building) ontology58

⁵⁸

https://w3id.org/ibp/hbc

published by Qiua et al. (2018), where spaces are mainly characterized by the equipment contained or not within themselves (e.g. hbc:SpaceWithHeater or hbc:SpaceWithoutHeater). Note that in the scenario tackled in this article, it may be convenient to make heavy usage of axioms expressing sufficient conditions to infer the recognition of individuals in appropriate classes. That is, it may be suitable to use equivalent class axioms with appropriate right hand class expressions, rather than being dependent on explicit assertions only. For example, the class ek4eepsa:AdjacentToOutdoorSpace is defined as follows:

ek4eepsa:AdjacentToOutdoorSpace ≡

bot:Space ⊓

∃bot:hasElement.foi4eepsa:ExternalBuildingElement

On the other hand, for each space type, qualities that affect their indoor temperature are captured. Such modelling relies on qualities represented in the Q4EEPSA ontology module and the axioms defined in the AffectedBy ODP. It is worth noting that this is the only EEPSA ontology module that has dependencies with other EEPSA ontology modules. However, the data analyst assistant has a requisite that needs the ability to ask for interrelationships of entities coming from any other modules. For example, the temperature of an adjacent-to-outdoor space may be affected by qualities such as the indoor humidity, and the occupancy of the room, as represented in the following axiom:

ek4eepsa:AdjacentToOutdoorSpaceIndoorTemperature ⊑

∃aff:affectedBy.q4eepsa:IndoorHumidity ⊓ ∃ aff:affectedBy.q4eepsa:Occupancy

⊓ ∃aff:affectedBy.q4eepsa:SolarRadiation ⊓ ∃aff:affectedBy.q4eepsa:WindSpeed .

This knowledge modelling can be exploited by application programs and to support data analysts in a proper manner. After knowing which is the type of space at hand, data analysts get to know which are the qualities that are relevant to solve the energy efficiency or thermal comfort problem.

At the moment of writing this article, the EK4EEPSA ontology module solves the presented CQs. However, being an ontology module containing expert knowledge, it is extendible as more requisites are demanded.

The adequate integration of the six EEPSA modules with the three ODPs enables satisfying a set of more complex CQs such as the following:

Which are the noise qualities influencing the rooms in the first floor?

Which are the qualities affecting the indoor temperature of the tribology laboratory?

Which sensor measured 22°C in the meeting room 042 on 13th February 2020 at 09:40?

3.6. Documentation

According to Peroni et al. (2013), a good ontology documentation increases its understandability and potential usability, both by experts in semantics and by people who are not necessarily experts. The documentation of the EEPSA ontology and its ontology modules is generated with WIDOCO (a WIzard for DOCumenting Ontologies) developed by Garijo (2017) which creates a set of linked enriched HTML pages. These HTML pages are extended with hand-made sections such as the alignments to other ontologies or with ontology usage examples.

W3C’s Data on the Web Best Practices (Calegari et al., 2017) states that providing metadata is a fundamental requirement that helps human users and computer applications to understand the data as well as other important aspects that describe a dataset. All the ontological resources presented in this article are annotated following guidelines described by Garijo and Poveda-Villalón59

⁵⁹
https://w3id.org/widoco/bestPractices

as authors consider the most complete guideline among the ones reviewed. As a matter of fact, for each EEPSA ontology module or ODP, both the ontology itself and the classes and properties are annotated with all the recommended terms as well as some additional optional terms.

Next, the canonical URIs for the different ontology modules documentation are shown. All URIs are provided by the https://w3id.org re-direction service.

EEPSA ontology: https://w3id.org/eepsa

AffectedBy ODP: https://w3id.org/affectedBy

EEP ODP: https://w3id.org/eep

RC ODP: https://w3id.org/rc

FoI4EEPSA ontology module: https://w3id.org/eepsa/foi4eepsa

Q4EEPSA ontology module: https://w3id.org/eepsa/q4eepsa

P4EEPSA ontology module: https://w3id.org/eepsa/p4eepsa

EXR4EEPSA ontology module: https://w3id.org/eepsa/exr4eepsa

EXN4EEPSA ontology module: https://w3id.org/eepsa/exn4eepsa

EK4EEPSA ontology module: https://w3id.org/eepsa/ek4eepsa

The EEPSA ontology is published in the LOV60

⁶⁰

http://lov.linkeddata.es

(Vandenbussche et al., 2017) (Linked Open Vocabularies) and LOV4IoT61

⁶¹

https://lov4iot.appspot.com

(Gyrard et al., 2016) ontology catalogues. With regards to the ODPs, they are available in the ODP repository,62

⁶²

http://ontologydesignpatterns.org

which collects and makes ODPs available on the web, allowing users to download, propose, and discuss them.

3.7. Evaluation and validation

There are many evaluation metrics for assessing ontologies in existing literature such as Brank et al. (2005) and Obrst et al. (2007). Most of them focus on structural notions without taking into account the semantics, leading to incomparable measurement results (Vrandečić and Sure, 2007). And even though these are valid metrics, they may not be enough to determine the quality of an ontology. In order to avoid a biased evaluation, next, the EEPSA ontology and the modules that comprise it are assessed from three perspectives: design correctness, structural metrics, and modularity quality. Additionally, a testing process has been designed to verify that the EEPSA ontology and its modules satisfy their functional requirements, which were collected in the form of CQs.

3.7.1. Design correctness metrics

The design correctness is evaluated using OOPS! (OntOlogy Pitfall Scanner) developed in Poveda-Villalón et al. (2014), which detects some of the most common pitfalls appearing within ontology developments. OOPS! is available online63

⁶³
http://oops.linkeddata.es/

and evaluates an ontology against a catalogue of 41 potential pitfalls classified into three levels according to their severity: minor, important and critical. This tool was used during the ontology modules development phase, contributing to an early detection of pitfalls, and complementing the manual review of the ontology’s correctness. Table 1 summarizes the number of pitfalls detected in the EEPSA ontology and its components.

Table 1

Summary of ontology design correctness evaluation by OOPS!

Ontology	Minor	Important	Critical
EEP	13	2	0
AffectedBy	4	1	0
RC	8	3	0
FoI4EEPSA	7	1	0
Q4EEPSA	4	1	0
P4EEPSA	4	1	0
EXR4EEPSA	4	1	0
EXN4EEPSA	3	1	0
EK4EEPSA	5	1	0

Overall, most ontology modules share the same minor pitfalls “P04: Creating unconnected elements” and “P08: Missing annotations”. These pitfalls appear mainly in the stub classes that ontology modules extend (e.g. the class aff:FeatureOfInterest for the case of the FoI4EEPSA ontology module) as well for the voaf:Vocabulary class used to describe the ontology itself. These concepts are adequately annotated and connected in their source ontology module, so annotating them again would derive in having duplicated metadata when all ontology modules are imported by the EEPSA ontology. Therefore, these pitfalls are ignored.

Regarding the important pitfalls, the “P10: Missing disjointness” is repeated in all the ontology modules and ODPs. This pitfall arises when an ontology lacks from disjointness axioms between classes or between properties that should be defined as disjoint. However, in the EEPSA ontology modules case, those suggested disjointness axioms are an inconvenient conceptualization constraint, so it was decided not to add those constraints.

3.7.2. Structural metrics

Structural metrics by themselves may not be enough to assess the quality of an ontology or an ontology module, but they may still be relevant to describe an ontology. Protégé has an Ontology Metrics tab64

⁶⁴
http://protegeproject.github.io/protege/views/ontology-metrics

that displays entity and axiom counts for the active ontology. Table 2 summarizes the structural metrics for the different EEPSA ontology modules, ODPs and the EEPSA ontology itself.

Results show that ODPs are richer from a DL expressivity point of view. They define more constraints, while the rest of the ontology modules are more light weighted. As for the size, most of the EEPSA ontology modules are rather small (less than 17 classes). The only exceptions are the Q4EEPSA, EXR4EEPSA and EK4EEPSA ontology modules, which represent over 25 classes. The first two are in charge of representing qualities, sensors and actuators that are typical in problems addressed in the article, so it is understandable to contain a bigger number of classes. The latter, in turn, actually defines only 8 new classes. The rest of the classes are defined in other modules but are reused to describe the expert knowledge contained in the module.

Table 2

Summary of ontology structural metrics by Protégé’s Ontology Metrics tab (OP = Object Properties, DP = Datatype Properties, * = Imported axioms are not considered)

Ontology	Axioms	Class	OP	DP	Annotation	DL Expressivity
EEP(*)	80	6	8	0	40	ALERIF
AffectedBy	62	3	3	0	31	ALERIF+
RC	40	4	3	2	20	AL(D)
FoI4EEPSA(*)	130	17	0	5	64	AL(D)
Q4EEPSA	197	30	0	0	124	AL
P4EEPSA	40	6	0	0	16	AL
EXR4EEPSA	207	33	0	0	127	AL
EXN4EEPSA	72	9	2	0	36	ALI
EK4EEPSA	82	25	4	0	32	ALC

3.7.3. Modularity quality

The EEPSA ontology’s module quality is also assessed based on the guidelines proposed by Khan and Keet (2016). This work creates a comprehensive list of module evaluation metrics as well as a definition of 14 ontology modules types. For each ontology module type, it is described which metrics need to be measured and the expected values for a high quality ontology module. In the case of the EEPSA ontology, modules of type T1 (ODP modules: AffectedBy, EEP and RC) and T2 (Subject domain modules: FoI4EEPSA, Q4EEPSA, P4EEPSA, EXR4EEPSA, EXN4EEPSA and EK4EEPSA) are identified. The evaluation is performed with TOMM65

⁶⁵
http://www.thezfiles.co.za/Modularity/TOMM.zip

(Tool for Ontology Module Metrics) and results are available online.66

⁶⁶

https://github.com/iesnaola/eepsa/tree/master/Evaluation/TOMM

Regarding the ODPs, the guidelines suggest that a good quality module should have a small size compared to the original ontology size (i.e. relative size), a small cohesion (i.e. the extent to which entities in a module are related to each other), and be complete. The proposed three EEPSA ODPs satisfy the small relative size and cohesion requirements. However, EEP and RC are not logically complete, as they do not describe terms defined in other ontologies (e.g. the aff:affectedBy object property in the EEP ODP and the class eep:Execution in the RC ODP) to avoid duplicated metadata in the final EEPSA ontology.

With regards to the rest of the ontology modules, which can be classified as of type “T2-subject domain modules”, they are required to fulfil these criteria to be considered good quality modules: small cohesion, large encapsulation (i.e. “swappability” or ease to exchange a module for another without side effects), small coupling (i.e. the degree of interdependence of a module) and small redundancy (i.e. the duplication of axioms within a set of ontology modules). All the EEPSA ontology modules satisfy these criteria.

3.7.4. Ontology requirement validation

In order to verify that the EEPSA ontology satisfies the ontology requirements identified in the ORSD, a validation process has been implemented. This validation has been performed with Themis,67

⁶⁷
http://themis.linkeddata.es/

a web-based tool presented in Fernández-Izquierdo and García-Castro (2019) which provides a set of test expressions based on lexico-syntactic patterns to check whether ontology requirements are satisfied.

For each of the EEPSA ontology modules and the ODPs, a set of tests have been designed, implemented and run to verify that the targeted CQs are adequately addressed and the desired knowledge is modelled. These tests have been represented with the Verification Test Case (VTC) ontology68

⁶⁸

https://w3id.org/def/vtc

and exported in RDF files, with a view to running them in the future when the EEPSA ontology may be modified. An excerpt of a validation test designed for the AffectedBy ODP is shown below:

3.8. Ontology customization by module replacement

Although the EEPSA ontology is aimed at supporting data analysts in energy efficiency and thermal comfort problems in buildings, it is designed to enable its customization to support data analysts in similar problems in different types of buildings. Being modularized by design, the EEPSA ontology is expected to be easily modified. Furthermore, as it has been demonstrated that the EEPSA ontology modules are loosely coupled and have few dependencies between them, this ontology customization can be methodically approached.

The customization of the EEPSA ontology is recommended to be performed via ontology module replacement. That is, existing ontology modules should be replaced with other ontology modules, which can be new modules or extensions of existing ones. This way, the development of customized EEPSA ontologies is expected to be of bounded complexity. This ontology customization process is illustrated with a real-world poultry farm use case in Esnaola et al. (2019).

4. EEPSA ontology in use

The incorporation of the semantic technologies in the assistant that supports data analysts through the KDD process for energy efficiency and thermal comfort problems is presented in Esnaola-Gonzalez et al. (2018a). This assistant is based on the presented EEPSA ontology, and this section aims to show the role that the EEPSA ontology plays in this assistance. Namely, it proves that the proposed ontology is not a mere collection of classes and properties to semantically annotate data, but it is aimed at providing assistance to data analysts through the KDD process.

The capabilities of the EEPSA ontology are demonstrated in Tekniker’s headquarters, a building located in Eibar (Spain) which hosts the activities of the technological centre. Within this building, the electronics laboratory (from now on referred to as ELE lab) has been targeted, where different electronic components, systems and equipment are designed, developed, and tested. Due to the intermittent usage of the laboratory, ensuring occupants’ thermal comfort while achieving an efficient use of energy is currently an unsolved problem. Towards finding a solution to this problem, a system that proposes the optimal HVAC (Heat, Ventilation and Air Conditioning) activation strategies is sought. Such a system could rely on the assistant that supports data analysts through the KDD processes.

The first step to exploit the EEPSA ontology capabilities consists in annotating the target space, its structural elements, deployed equipment, and the rest of the relevant elements with adequate ontological terms. This semantic annotation process was manually performed by the facility manager of Tekniker, who received the guidance and assistance of an ontologist. As pointed out by Esnaola-Gonzalez et al. (2018a), in the future, this data annotation task should be facilitated with a GUI (graphical user interface) where the user could add building elements and features to the space in an intuitive and easy manner.

The building topology was represented using the FoI4EEPSA ontology module. The Tekniker building (:tekniker) was represented as an instance of the bot:Building class, its minus 2 storey (:minus2Storey) as an instance of the bot:Storey class and the ELE lab (:eleLab) located in such storey as an instance of the bot:Space class. Likewise, the structural elements including the room’s door (:door01), were modelled using the same FoI4EEPSA ontology module. As for the representation of the equipment deployed within the laboratory such as temperature sensors (:tempSen01) or submetering systems (:plug01), the EXR4EEPSA ontology module was used. Regarding the measurements registered by the different devices, the EXN4EEPSA ontology module was used, and more specifically, the exn4eepsa:Observation class. It is worth noting that observations registered by devices are stored in a PostgreSQL database, and that only a few of them were modelled for the purpose of showing the capabilities of the EEPSA ontology. As a matter of fact, as suggested by Petrova et al. (2019), sensor data should be kept out of semantic graphs for computational performance reasons.

The resulting simplified representation of the ELE lab has an ABox size of 57 triples and an excerpt is available below:

Once this semantic annotation process has been finished, a reasoner can be used to infer relevant implicit knowledge. In this case particular case, a HermiT69

⁶⁹
http://www.hermit-reasoner.com/

version 1.3.8 OWL 2 DL reasoner was used. Based on this data representation, the reasoner inferred that the ELE lab was an instance of class ek4eepsa:BelowGroundLevelSpace. As a matter of fact, this is how a below ground level space is defined in EK4EEPSA ontology module.

ek4eepsa:BelowGroundLevelSpace ≡

bot:Space ⊓ ∃bot:hasSpace⁻¹.foi4eepsa:UndergroundStorey .

Furthermore, since ELE lab is inferred to be an instance of class ek4eepsa:BelowGroundLevelSpace, it is also inferred to be influenced by some below ground level space indoor temperature, due to the axioms:

ek4eepsa:BelowGroundLevelSpace ⊑

∃aff:influencedBy.ek4eepsa:BelowGroundLevelSpaceTemperature .

Consequently, due to the definition of ek4eepsa:BelowGroundLevelSpaceTemperature, it is inferred that the indoor temperature of ELE lab is affected by the atmospheric pressure (q4eepsa:AtmosphericPressure), indoor humidity (q4eepsa:IndoorHumidity) and the occupancy of the room (q4eepsa:Occupancy).

ek4eepsa:BelowGroundLevelSpaceTemperature ⊑

∃aff:affectedBy.q4eepsa:AtmosphericPressure ⊓

∃aff:affectedBy.q4eepsa:IndoorHumidity ⊓

∃aff:affectedBy.q4eepsa:Occupancy .

Therefore, once the semantic representation of the ELE laboratory is performed, thanks to the expert knowledge captured in the EEPSA ontology and the execution of a reasoner, it is inferred that the ELE laboratory is a space located below ground level, and consequently, its temperature is affected by atmospheric pressure, indoor humidity, and laboratory occupancy.

The information inferred thanks to the knowledge formalized within the EEPSA ontology, makes up for the data analyst’s probable lack of knowledge for energy efficiency and thermal comfort problems in buildings. Consequently, they will no longer need to resort to the typical trial-and-error approach searching for variables and tasks that could be confidently used in a KDD process.

5. Conclusions

In this work, first of all, and following a well-known ontology development methodology, the requirements of the EEPSA ontology have been collected. Then, the backbone of the EEPSA ontology has been discussed and defined as a combination of three ODPs which try to, on the one hand, be minimal in the number of classes and properties offered but complete with respect to the considered CQs and, on the other, include appropriate ontology axioms that allow proper inferences. Moreover, the careful design of property axioms overcome some weaknesses found in existing ontologies. On top of these ODPs, a set of ontology modules were developed, each of them specializing knowledge in the scope of the stub classes defined in the ODPs and reusing existing resources as much as possible. Furthermore, in order to contribute to the interoperability of the solution, the EEPSA ODPs and ontology modules are aligned with other related ontologies as well as upper-level ontologies. All these developments are properly documented, made available online, validated and evaluated from three different viewpoints. Moreover, thanks to the modular design of the EEPSA ontology and the high encapsulation of its modules, the customization of the ontology to address similar problems in different use cases can be methodically approached.

The resulting EEPSA ontology is a core ontology developed on the basis that a proper axiomatization shapes the set of admitted models better and, therefore, establishes the ground for a better interoperability. On the contrary, underspecification facilitates the admission of non-isomorphic models to represent the same state, hampering interoperability. Furthermore, the EEPSA ontology’s objective goes beyond having a mere collection of classes and properties to semantically annotate data, as it aims at supporting a data analyst assistant in energy efficiency and thermal comfort problems in buildings. To do so, the necessary domain and expert knowledge related to data analysis procedures and tasks is adequately captured and formalized in the ontology. With a view to demonstrating this support to data analysts, the ontology has been instantiated in a real-world laboratory.

Footnotes

Acknowledgements

This work is supported by the REACT project which has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement no. 824395. This work is also supported by PRE-2016-1-0303, IT1041-16-GBV and FEDER/TIN2016-78011-C4-2-R.

This work was conducted using the Protégé resource, which is supported by grant GM10331601 from the National Institute of General Medical Sciences of the United States National Institutes of Health.

Application examples of the ODPs

This appendix shows an example of the presented three ODPs.

References

Abergel, T., Dean, B. & Dulac, J. (2019). Towards a zero-emission, efficient, and resilient buildings and construction sector: Global Status Report. UN Environment and International Energy Agency, 2019.

Agarwal, R., Fernandez, D.G., Elsaleh, T., Gyrard, A., Lanza, J., Sanchez, L., Georgantas, N. & Issarny, V. (2016). Unified IoT ontology to enable interoperability and federation of testbeds. In 3rd IEEE World Forum on Internet of Things.

Alirezaie, M., Hammar, K. & Blomqvist, E. (2018). SmartEnv as a network of ontology patterns. Semantic Web, 9(6), 903–918. doi:10.3233/SW-180303.

Andrews, P., Zaihrayeu, I. & Pane, J. (2012). A classification of semantic annotation systems. Semantic Web, 3(3), 223–248. doi:10.3233/SW-2011-0056.

Bermudez-Edo, M., Elsaleh, T., Barnaghi, P. & Taylor, K. (2017). IoT-lite: A lightweight semantic model for the Internet of things and its use with dynamic semantics. Personal and Ubiquitous Computing, 21(3), 475–487. doi:10.1007/s00779-017-1010-8.

Bernstein, A., Provost, F. & Hill, S. (2005). Toward intelligent assistance for a data mining process: An ontology-based approach for cost-sensitive classification. IEEE Transactions on Knowledge and Data Engineering, 17(4), 503–518. doi:10.1109/TKDE.2005.67.

Bonino, D. & Corno, F. (2008). Dogont-ontology modeling for intelligent domotic environments. In International Semantic Web Conference (pp. 790–803). Springer.

Brank, J., Grobelnik, M. & Mladenić, D. (2005). A survey of ontology evaluation techniques. In Proc. of 8th Int. Multi-Conf. Information Society (pp. 166–169).

Calegari, N., Burle, C. & Loscio, B.F. (2017). Data on the web best practices. W3C recommendation. W3C, https://www.w3.org/TR/2017/REC-dwbp-20170131/.

10.

Compton, M., Barnaghi, P., Bermudez, L., García-Castro, R., Corcho, O., Cox, S., Graybeal, J., Hauswirth, M., Henson, C. & Herzog, A. (2012). The SSN ontology of the W3C semantic sensor network incubator group. Web Semantics: Science, Services and Agents on the World Wide Web, 17, 25–32. doi:10.1016/j.websem.2012.05.003.

11.

Cox, S. (2016). Ontology for observations and sampling features, with alignments to existing models. Semantic Web, 8(3), 453–470. doi:10.3233/SW-160214.

12.

d’Aquin, M., Schlicht, A., Stuckenschmidt, H. & Sabou, M. (2009). Criteria and Evaluation for Ontology Modularization Techniques (pp. 67–89). Berlin, Heidelberg: Springer.

13.

Datta, A.G.S.K., Bonnet, C. & Boudaoud, K. (2015). Cross-domain Internet of things application development: M3 framework and evaluation. In 2015 3rd International Conference on Future Internet of Things and Cloud (pp. 9–16). IEEE. doi:10.1109/FiCloud.2015.10.

14.

de Farias, T.M., Roxin, A. & Nicolle, C. (2015). Ifcwod, semantically adapting ifc model relations into owl properties. In Proceedings of the 32nd CIB W78 Conference on Information Technology in Construction.

15.

Díaz, J.J.V., Wilby, M.R., González, A.B.R. & Mun¯oz, J.G.M. (2013). EEOnt: An ontological model for a unified representation of energy efficiency in buildings. Energy and Buildings, 60, 20–27. doi:10.1016/j.enbuild.2013.01.012.

16.

Ensan, F. & Du, W. (2013). A semantic metrics suite for evaluating modular ontologies. Inf. Syst., 38(5), 745–770. doi:10.1016/j.is.2012.11.012.

17.

Esnaola, I., Fernandez, I., García, E., Ferreiro, S., Gomez, M., Lázaro, I. & García, A. (2019). Towards animal welfare in poultry farms through semantic technologies. In IoT Connected World & Semantic Interoperability Workshop (IoT-CWSI) 2019.

18.

Esnaola-Gonzalez, I., Bermúdez, J., Fernandez, I. & Arnaiz, A. (2018a). Semantic prediction assistant approach applied to energy efficiency in tertiary buildings. Semantic Web, 9(6), 735–762. doi:10.3233/SW-180296.

19.

Esnaola-Gonzalez, I., Bermúdez, J., Fernandez, I. & Arnaiz, A. (2018b). Two ontology design patterns toward energy efficiency in buildings. In Proceedings of the 9th Workshop on Ontology Design and Patterns (WOP 2018) co-located with 17th International Semantic Web Conference (ISWC 2018) (Vol. 2195, pp. 14–28). CEUR.

20.

Esnaola-Gonzalez, I., Bermúdez, J., Fernandez, I. & Arnaiz, A. (2020). Ontologies for observations and actuations in buildings: A survey. Semantic Web, 11(4), 593–621. doi:10.3233/SW-200378.

21.

Fayyad, U., Piatetsky-Shapiro, G. & Smyth, P. (1996). From data mining to knowledge discovery in databases. AI magazine, 17(3), 37.

22.

Fernández-Izquierdo, A., Cimmino, A., Patsonakis, C., Tsolakis, A.C., García-Castro, R., Ioannidis, D. & Tzovaras, D. (2020). Openadr ontology: Semantic enrichment of demand response strategies in smart grids. In 2020 International Conference on Smart Energy Systems and Technologies (SEST) (pp. 1–6).

23.

Fernández-Izquierdo, A. & García-Castro, R. (2019). Themis: A tool for validating ontologies through requirements. In SEKE (pp. 573–753). doi:10.18293/SEKE2019-117.

24.

Gangemi, A., Lillo, R., Lodi, G. & Nuzzolese, A.G. (2017). A pattern-based ontology for the Internet of things. In Proceedings of the 8th Workshop on Ontology Design and Patterns (WOP 2017) (p. 2043).

25.

Gangemi, A. & Presutti, V. (2009). Ontology Design Patterns (pp. 221–243). Berlin, Heidelberg: Springer.

26.

Gangemi, A. & Presutti, V. (2010). Towards a pattern science for the semantic web. Semantic Web, 1(1, 2), 61–68. doi:10.3233/SW-2010-0020.

27.

García-Castro, R., Haller, A. & Mihindukulasooriya, N. (2020). On the usage of the SSN ontology. W3C note. World Wide Web Consortium, November.

28.

Garijo, D. (2017). Widoco: A wizard for documenting ontologies. In

d’Amato ,

Fernandez ,

Tamma ,

Lecue ,

Cudré-Mauroux ,

Sequeda ,

Lange and

Heflin (Eds.), The Semantic Web – ISWC 2017 (pp. 94–102). Cham: Springer. doi:10.1007/978-3-319-68204-4_9.

29.

Grau, B.C., Horrocks, I., Kazakov, Y. & Sattler, U. (2008). Modular reuse of ontologies: Theory and practice. Journal of Artificial Intelligence Research, 31, 273–318. doi:10.1613/jair.2375.

30.

Guarino, N., Oberle, D. & Staab, S. (2009). What is an ontology? In Handbook on Ontologies (pp. 1–17). Springer.

31.

Gubbi, J., Buyya, R., Marusic, S. & Palaniswami, M. (2013). Internet of things (iot): A vision, architectural elements, and future directions. Future generation computer systems, 29(7), 1645–1660. doi:10.1016/j.future.2013.01.010.

32.

Gyrard, A., Bonnet, C., Boudaoud, K. & Serrano, M. (2016). Lov4iot: A second life for ontology-based domain knowledge to build semantic web of things applications. In 2016 IEEE 4th International Conference on Future Internet of Things and Cloud (FiCloud) (pp. 254–261). IEEE. doi:10.1109/FiCloud.2016.44.

33.

Haller, A., Janowicz, K., Cox, S., Lefrançois, M., Taylor, K., Phuoc, D.L., Lieberman, J., García-Castro, R., Atkinson, R. & Stadler, C. (2019). The modular ssn ontology: A joint w3c and ogc standard specifying the semantics of sensors, observations, sampling, and actuation. Semantic Web, 10(1), 9–32. doi:10.3233/SW-180320.

34.

Haynes, B.P. (2008). The impact of office comfort on productivity. Journal of Facilities Management, 6(1), 37–51. doi:10.1108/14725960810847459.

35.

Hedge, A. & Gaygen, D.E. (2010). Indoor environment conditions and computer work in an office. Hvac&R Research, 16(2), 123–138. doi:10.1080/10789669.2010.10390897.

36.

Hitzler, P., Gangemi, A., Janowicz, K., Krisnadhi, A. & Presutti, V. (Eds.) (2016). Ontology Engineering with Ontology Design Patterns: Foundations and Applications (Vol. 25). IOS Press.

37.

Janowicz, K. & Compton, M. (2010). The stimulus-sensor-observation ontology design pattern and its integration into the semantic sensor network ontology Vol. 668. CEUR.

38.

Käfer, T. & Harth, A. (2018). Specifying, monitoring, and executing workflows in linked data environments. In International Semantic Web Conference (pp. 424–440). Springer.

39.

Khan, Z.C. & Keet, C.M. (2016). Dependencies between modularity metrics towards improved modules. In

Blomqvist ,

Ciancarini ,

Poggi and

Vitali (Eds.), Knowledge Engineering and Knowledge Management (pp. 400–415). Cham: Springer. doi:10.1007/978-3-319-49004-5_26.

40.

Klepeis, N.E., Nelson, W.C., Ott, W.R., Robinson, J.P., Tsang, A.M., Switzer, P., Behar, J.V., Hern, S.C. & Engelmann, W.H. (2001). The national human activity pattern survey (nhaps): A resource for assessing exposure to environmental pollutants. Journal of Exposure Science and Environmental Epidemiology, 11(3), 231–252. doi:10.1038/sj.jea.7500165.

41.

Kopanas, I., Avouris, N.M. & Daskalaki, S. (2002). The role of domain knowledge in a large scale data mining project. In

I.P.

Vlahavas and

C.D.

Spyropoulos (Eds.), Methods and Applications of Artificial Intelligence, Berlin, Heidelberg (pp. 288–299), Springer, Berlin Heidelberg. doi:10.1007/3-540-46014-4_26.

42.

Lefrançois, M. (2017). Planned etsi saref extensions based on the w3c&ogc sosa/ssn-compatible seas ontology patterns. In Proceedings of Workshop on Semantic Interoperability and Standardization in the IoT (SIS-IoT) (Vol. 2063, pp. 1–15). CEUR.

43.

Liao, Y., Lezoche, M., Panetto, H., Boudjlida, N. & Loures, E.R. (2015). Semantic annotation for knowledge explicitation in a product lifecycle management context: A survey. Computers in Industry, 71, 24–34. doi:10.1016/j.compind.2015.03.005.

44.

Masolo, C., Borgo, S., Gangemi, A., Guarino, N. & Oltramari, A. (2003). Wonderweb deliverable d18 ontology library (final). Technical report.

45.

Mulville, M., Callaghan, N. & Isaac, D. (2016). The impact of the ambient environment and building configuration on occupant productivity in open-plan commercial offices. Journal of Corporate Real Estate, 18(3), 180–193. doi:10.1108/JCRE-11-2015-0038.

46.

Musen, M.A. (2015). The protégé project: A look back and a look forward. AI matters, 1(4), 4–12. doi:10.1145/2757001.2757003.

47.

Noy, N.F. (2004). Semantic integration: A survey of ontology-based approaches. ACM Sigmod Record, 33(4), 65–70. doi:10.1145/1041410.1041421.

48.

Obrst, L., Ceusters, W., Mani, I., Ray, S. & Smith, B. (2007). The Evaluation of Ontologies (pp. 139–158). Boston, MA: Springer.

49.

Parsons, K. (2014). Human Thermal Environments: The Effects of Hot, Moderate, and Cold Environments on Human Health, Comfort, and Performance (3rd ed.). Boca Raton, FL, USA: CRC Press, Inc.

50.

Paulheim, H. & Fümkranz, J. (2012). Unsupervised generation of data mining features from linked open data. In Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics (p. 31).

51.

Pauwels, P. & Roxin, A. (2017). Simplebim: From full ifcowl graphs to simplified building graphs. In

Christodoulou and

Scherer (Eds.), eWork and eBusiness in Architecture, Engineering and Construction: ECPPM 2016: Proceedings of the 11th European Conference on Product and Process Modelling (ECPPM 2016), Limassol, Cyprus, 7–9 September 2016 (pp. 11–18). CRC Press.

52.

Pauwels, P. & Terkaj, W. (2016). Express to owl for construction industry: Towards a recommendable and usable ifcowl ontology. Automation in Construction, 63, 100–133. doi:10.1016/j.autcon.2015.12.003.

53.

Peroni, S., Shotton, D. & Vitali, F. (2013). Tools for the automatic generation of ontology documentation: A task-based evaluation. Int. J. Semant. Web Inf. Syst., 9(1), 21–44. doi:10.4018/jswis.2013010102.

54.

Petrova, E., Pauwels, P., Svidt, K. & Jensen, R.L. (2019). In search of sustainable design patterns: Combining data mining and semantic data modelling on disparate building data. In Advances in Informatics and Computing in Civil and Construction Engineering (pp. 19–26). Springer. doi:10.1007/978-3-030-00220-6_3.

55.

Pinto, F.M. & Santos, M.F. (2009). Considering application domain ontologies for data mining. WSEAS Trans. Info. Sci. and App., 6(9), 1478–1492.

56.

Pinto, H.S., Staab, S. & Tempich, C. (2004). Diligent: Towards a fine-grained methodology for distributed, loosely-controlled and evolving engineering of ontologies. In Proceedings of the 16th European Conference on Artificial Intelligence (ECAI (pp. 393–397). IOS Press.

57.

Poveda-Villalón, M., Gómez-Pérez, A. & Suárez-Figueroa, M.C. (2014). Oops! (ontology pitfall scanner!): An on-line tool for ontology evaluation. Int. J. Semant. Web Inf. Syst., 10(2), 7–34. doi:10.4018/ijswis.2014040102.

58.

Qiua, H., Schneider, G., Kauppinen, T., Rudolph, S. & Steigerd, S. (2018). Reasoning on human experiences of indoor environments using semantic web technologies. In Proceedings of the 35th International Symposium on Automation and Robotics in Construction (ISARC 2018), Berlin, Germany (pp. 95–102).

59.

Rasmussen, M.H., Lefrançois, M., Schneider, G. & Pauwels, P. (2021). Bot: The building topology ontology of the w3c linked building data group. Semantic Web, 12, 143–161. doi:10.3233/SW-200385.

60.

Sagar, S., Lefrançois, M., Rebaï, I., Khemaja, M., Garlatti, S., Feki, J. & Médini, L. (2018). Modeling smart sensors on top of sosa/ssn and wot td with the semantic smart sensor network (s3n) modular ontology. In 9th International Semantic Sensor Networks Workshop (Vol. 2213, pp. 1–15). CEUR.

61.

Seydoux, N., Drira, K., Hernandez, N. & Monteil, T. (2016). Iot-o, a core-domain iot ontology to represent connected devices networks. In Knowledge Engineering and Knowledge Management: 20th International Conference, EKAW 2016, Proceedings 20, Bologna, Italy, November 19–23, 2016, (Vol. 10024, pp. 561–576. Springer. doi:10.1007/978-3-319-49004-5_36.

62.

Simperl, E. (2009). Reusing ontologies on the semantic web: A feasibility study. Data & Knowledge Engineering, 68(10), 905–925. doi:10.1016/j.datak.2009.02.002.

63.

Stuckenschmidt, H. & Klein, M. (2007). Reasoning and change management in modular ontologies. Data & Knowledge Engineering, 63(2), 200–223. doi:10.1016/j.datak.2007.02.001.

64.

Suárez-Figueroa, M.C. & Gómez-Pérez, A. (2012). Ontology Requirements Specification (pp. 93–106). Berlin, Heidelberg: Springer.

65.

Suárez-Figueroa, M.C., Gómez-Pérez, A. & Fernández-López, M. (2012). The NeOn Methodology for Ontology Engineering (pp. 9–34). Berlin, Heidelberg: Springer.

66.

Sure, Y., Staab, S. & Studer, R. (2004). On-To-Knowledge Methodology (OTKM) (pp. 117–132). Berlin, Heidelberg: Springer.

67.

van Berlo, L., Willems, P. & Pauwels, P. (2019). Creating information delivery specifications using linked data. In 36th CIB W78 2019 Conference (pp. 647–660).

68.

Vandenbussche, P.-Y., Atemezing, G.A., Poveda-Villalón, M. & Vatant, B. (2017). Linked open vocabularies (lov): A gateway to reusable semantic vocabularies on the web. Semantic Web, 8(3), 437–452. doi:10.3233/SW-160213.

69.

Verbeke, S. & Audenaert, A. (2018). Thermal inertia in buildings: A review of impacts across climate and building use. Renewable and Sustainable Energy Reviews, 82, 2300–2318. doi:10.1016/j.rser.2017.08.083.

70.

Vrandečić, D. & Gangemi, A. (2006). Unit tests for ontologies. In OTM Confederated International Conferences “On the Move to Meaningful Internet Systems” (pp. 1012–1020). Springer.

71.

Vrandečić, D. & Sure, Y. (2007). How to design better ontology metrics. In

Franconi ,

Kifer and

May (Eds.), The Semantic Web: Research and Applications, Berlin, Heidelberg (pp. 311–325). Springer, Berlin Heidelberg. doi:10.1007/978-3-540-72667-8_23.

72.

Wieringa, R.J. (1996). Requirements Engineering: Frameworks for Understanding. Wiley.

73.

Yoon, S.-C., Henschen, L.J., Park, E.K. & Makki, S. (1999). Using domain knowledge in knowledge discovery. In Proceedings of the Eighth International Conference on Information and Knowledge Management, CIKM ‘99 (pp. 243–250). New York, NY, USA: ACM. doi:10.1145/319950.320008.

EEPSA as a core ontology for energy efficiency and thermal comfort in buildings

Abstract

Keywords

1. Introduction

1 https://www.ashrae.org/technical-resources/bookstore/standard-55-thermal-environmental-conditions-for-human-occupancy

3 http://www.w3.org/ns/ssn/

28 https://w3id.org/eepsa

3.2. Ontology development methodology

29 https://protege.stanford.edu/

3.4. Developing the EEPSA ontology on top of ODPs

31 The rest of the CQs considered were satisfied by developing ontology modules as explained in Section 3.5.

36 http://stlab.istc.cnr.it/IoT-AP/IoT-AP.rdf, not available at the moment of writing this article.

3.5.1. FoI4EEPSA (Feature of Interest for EEPSA) ontology module

37 https://w3id.org/eepsa/foi4eepsa

44 https://w3id.org/eepsa/q4eepsa

49 https://w3id.org/eepsa/p4eepsa

53 https://w3id.org/eepsa/exr4eepsa

55 https://w3id.org/eepsa/exn4eepsa

57 https://w3id.org/eepsa/ek4eepsa

59 https://w3id.org/widoco/bestPractices

3.7.1. Design correctness metrics

63 http://oops.linkeddata.es/

64 http://protegeproject.github.io/protege/views/ontology-metrics

65 http://www.thezfiles.co.za/Modularity/TOMM.zip

67 http://themis.linkeddata.es/

4. EEPSA ontology in use

69 http://www.hermit-reasoner.com/

Footnotes

Acknowledgements

Application examples of the ODPs

References

¹
https://www.ashrae.org/technical-resources/bookstore/standard-55-thermal-environmental-conditions-for-human-occupancy

³
http://www.w3.org/ns/ssn/

²⁸
https://w3id.org/eepsa

²⁹
https://protege.stanford.edu/

³¹
The rest of the CQs considered were satisfied by developing ontology modules as explained in Section 3.5.

³⁶
http://stlab.istc.cnr.it/IoT-AP/IoT-AP.rdf, not available at the moment of writing this article.

³⁷
https://w3id.org/eepsa/foi4eepsa

⁴⁴
https://w3id.org/eepsa/q4eepsa

⁴⁹
https://w3id.org/eepsa/p4eepsa

⁵³
https://w3id.org/eepsa/exr4eepsa

⁵⁵
https://w3id.org/eepsa/exn4eepsa

⁵⁷
https://w3id.org/eepsa/ek4eepsa

⁵⁹
https://w3id.org/widoco/bestPractices

⁶³
http://oops.linkeddata.es/

⁶⁴
http://protegeproject.github.io/protege/views/ontology-metrics

⁶⁵
http://www.thezfiles.co.za/Modularity/TOMM.zip

⁶⁷
http://themis.linkeddata.es/

⁶⁹
http://www.hermit-reasoner.com/