Abstract
Station–city integration cyberspace behaves as an interdisciplinary field of intelligent transportation and smart city, also a representative scenario in the Architecture, Engineering and Construction (AEC) sector. Due to the growing demands in data integration research, the intelligent operation and maintenance (O&M) of the station–city integration cyberspace needs to implement semantic ontology, which is suitable for semantic web construction. To achieve semantic information fusion of multi-source heterogeneous data, and clarify the decision-making role of various types of data on specific operational goals, this article proposed a framework for semantic ontology model construction, based on the deployed sensor network. Specifically, an ontology model for station–city integration cyberspace O&M was constructed, incorporating sensor data mainly from five categories, named structure, environment, crowd flow, emergency events, and energy consumption, respectively. Subsequently, the
Keywords
Introduction
In recent years, research on intelligent operation and maintenance (O&M) across various industrial scenarios has garnered significant attention, particularly in Architecture, Engineering and Construction (AEC) sectors such as manufacturing, energy, and transportation. The adoption of digital transformation in the AEC sector has grown rapidly. This kind of digital transformation involves some advanced information and communication technologies (ICT), such as digital twin, virtual reality, text mining, and block chain. The core objective is to improve operational efficiency and system reliability through data-driven technologies. For example, most intelligent O&M research focuses on equipment failure prediction, intelligent diagnostics, resource optimization, adaptive decision-making, real-time monitoring and analysis (Dinh, 2024; Montero Jimenez, 2020; van Dinter et al., 2022).
Station–city integration is one of the important research directions in the field of AEC. Its highlight is to achieve an integrated layout of transportation and urban functions, which is not merely a simple superposition of transportation facilities and urban space, but rather an optimization of resource allocation and enhancement of functional efficiency through the close integration of stations, cities, and transportation networks. Specifically, intelligent O&M in the station–city integration cyberspace involves a comprehensive evaluation of the operational status based on a developed performance evaluation index system for various O&M scenarios. As the O&M system in station–city integration cyberspace utilizes advanced information technologies, particularly the Internet of Things (IoT), big data, and artificial intelligence (AI), to enhance sensing capabilities, intelligent analysis, and decision-making support, the development of an intelligent O&M platform based on multi-source information fusion becomes essential. However, as the station–city integration cyberspace becomes increasingly complex, traditional O&M management systems face significant challenges, due to the diversity of multi-source heterogeneous sensor data, high overlapping functional O&M tasks, and so on (Chen et al., 2023; Diao, 2019; Ke et al., 2021).
To address this challenge, semantic web technologies have been progressively applied in the field of intelligent O&M. The semantic web is an extension of web technologies that allows information to be accessible to human users and, at the same time, understandable and processable by machines. By adopting standardized knowledge representation languages (such as the Resource Description Framework (RDF) and Web Ontology Language (OWL)) and reasoning mechanisms, the semantic web facilitates cross-platform and cross-domain data integration and sharing, enhancing data interoperability (Canito et al., 2022; Gouda Mohamed et al., 2024). Currently, in the AEC domain, semantic research mainly focuses on how to connect transportation data with urban information effectively, improving the intelligent decision-making capabilities of transportation hub systems during O&M processes (Shi et al., 2023; Tan et al., 2021). Notably, sensor networks integrate data collection, processing, and transmission (Duobiene et al., 2022; Li, 2022) while utilizing numerous low-power, low-cost sensor nodes for data perception, fusion, and analysis, thus providing efficient monitoring and decision support (Navabian et al., 2020). Intelligent O&M often involves various types of sensor devices, such as those for environmental monitoring and energy consumption management. Achieving cross-platform sensor data fusion remains a key challenge for sensor network technologies.
In order to achieve semantic information fusion of multi-source heterogeneous data, and clarify the decision-making role of various types of data on specific operational goals, this article proposes a framework of semantic ontology model construction for the station–city integration cyberspace. This framework involves static sensor deployment information and dynamic monitoring data within the context of station–city integration, aiming to achieve intelligent O&M processes. The goal of the constructed ontology framework is to achieve semantic fusion of heterogeneous data from multiple sources, as well as dynamic O&M management.
The main contributions of this article are as follows: Constructing an ontology model for station–city integration cyberspace, which combines both sensor deployment information and monitoring data. This ontology model encompasses five key O&M scenarios: structure, environment, human flow, emergency events, and energy consumption. Additionally, it incorporates knowledge from various station–city integration standards and sensor manuals, referencing the real-world sensor deployment at Shenzhen North Station of China. Presenting a dynamic reasoning method for the semantic web of the station–city integration cyberspace. This reasoning method integrates the ontology model with the knowledge graph and applies predefined rules to facilitate risk warnings and the maintenance of sensor networks within the station–city integration cyberspace.
The remainder of this article is organized as follows: Section 2 reviews related research on the semantic frameworks for intelligent O&M. Section 3 presents the methodology of the semantic ontology architecture for sensor network-based intelligent O&M, and Section 4 provides a detailed description of the process of constructing the semantic model. Section 5 evaluates the proposed semantic framework in terms of ontology consistency and model validity. Finally, Section 6 concludes the article and presents potential future directions.
Related Work
To leverage the semantic web for implementing the intelligent O&M architecture of sensor networks, an essential element is the construction of a semantic ontology for sensor networks. This section provides an overview of the research progress on the ontology concept within the context of intelligent O&M.
In the context of AI, an ontology is a formal, explicit specification of a shared conceptualization, taking the form of a set of classes, relationships, and axiomatic constraints. ‘‘Formal” refers to the fact that it must be machine-readable. ‘‘Explicit” means that the type of concepts used and the constraints on their use are explicit. ‘‘Shared” describes consensual knowledge that is accepted by a group (Gruber, 1995).
In recent years, research on ontologies has expanded into multiple domains, with significant advancements in the field of smart cities (De Nicola & Villani, 2021). These advancements cover a wide range of dimensions, including communities (Periñán Pascual, 2023), risk management (Cui et al., 2023), eLearning (Tsoutsa et al., 2022), energy (Sayah et al., 2021), environment (Xu et al., 2022), sustainable development (Santos et al., 2023), and urban planning (Sobral et al., 2021). Several public resources for smart city ontologies have been released, such as Smart City Ontology (https://urenio.org/smart-city-ontology/) and Smart City Ontology Standards (https://iec.ch/basecamp/ontology-standards-smart-cities). These studies aim to effectively integrate data from diverse sources and achieve intelligent management. Against this backdrop, with the help of sensor network technology, many ontologies have been developed and successfully applied in scenarios such as data integration, knowledge sharing, and intelligent decision-making (Chen et al., 2020). These ontologies can effectively integrate data from different sources and achieve intelligent management in various fields. Table 1 lists the semantic models highly relevant to this article along with their basic information, including names, knowledge sources, and reused ontologies. Based on these studies, existing ontology applications can be divided into two categories according to their research focus, application scenarios and system architecture: intelligent environments and activity recognition, as well as sensor data integration and intelligent decision support.
Ontologies Related to Sensor Nnetworks.
Ontologies Related to Sensor Nnetworks.
Research on ontology applications in intelligent environments and activity recognition focuses on perceiving the environment and recognizing activities through various sensors and intelligent technologies. For instance, as the recognition of complex daily activities (ADLs) in smart homes was increasingly applied to disease diagnosis, leading to the development of POLARIS (Civitarese et al., 2021), which uses unsupervised segmentation algorithms. However, the data sources for POLARIS’s ontology construction are limited to sensor monitoring information, which is relatively narrow. Similarly, an indoor knowledge graph framework has been developed, which combines smartphone sensor data (such as inertial sensors, WiFi strength, and magnetic field strength) with indoor spatial information to achieve precise and efficient pedestrian positioning in indoor environments (Guo et al., 2021). Although the ontology constructed in this study integrates multi-source information, it does not mention the main ontological backbone, which is crucial for ontology interoperability.
In the domain of smart buildings, with the evolution of multimedia sensor network (MSN) technology, M2SSN-Onto (Cardinale et al., 2022) offers an integrated solution by merging multimedia and mobile sensor data based on the MSSN-Onto framework (Angsuchotmetee et al., 2020), to meet the detection needs of emergency events within smart buildings. Additionally, in the application of construction safety, a novel environmental pollutant identification mechanism has been proposed. This approach utilizes association rule mining and ontology reasoning to combine sensor data with multidisciplinary knowledge, enabling real-time monitoring and intelligent identification of pollutants at construction sites (Xu et al., 2022). Meanwhile, SCO (Li et al., 2022) integrates multi-source safety risk factors with textual rule knowledge, with a focus on enhancing safety during subway construction. However, these ontologies, which are modeled around specific areas within the smart building domain, are not openly accessible, thereby impeding their extensive evaluation and reuse by smart building practitioners.
Sensor Data Integration and Intelligent Decision Support
Research on the application of ontologies in data integration and intelligent decision-making primarily focuses on sensor networks. These methods emphasize the integration of multi-source sensor data through ontology models, offering intelligent decision support and optimized management across diverse domains. For instance, a decision support system for forest fire management was developed, which relies on a semantic sensor network ontology. This system integrates meteorological data and sensor information to prevent and control various types and stages of fires (Chandra et al., 2022). However, the credibility of the proposed ontology’s structure is significantly undermined due to the lack of evaluation. In the context of Industry 4.0, a novel approach was proposed that links sensor data with process activities during data generation, leading to the creation of a process-aware industrial IoT knowledge graph. This graph bridges the gap between raw sensor data and higher-level knowledge within organizations (Diamantini et al., 2023). Although the study provides a detailed description of the ontology’s structure, it fails to adequately justify the selection of entities, thereby weakening the persuasiveness of the ontology design. Meanwhile, to fully represent industrial production workflows, InPro (Yang et al., 2023) described sensor monitoring data by sharing and reusing domain knowledge, achieving conceptualization and formalization of production processes. However, InPro has not been publicly released.
In urban transportation, the Connected Traffic Data Ontology (CTDO) (Viktorovic et al., 2020) improved the spatial structure of existing domain ontologies, enhancing the query efficiency of semi-autonomous vehicle sensor platforms. Within urban management, a knowledge modeling and heterogeneous sensor data integration method for a bridge health monitoring system was proposed. Based on ontology, this method enables fine-grained semantic modeling of bridge structures, sensors, and their observation attributes (Li et al., 2021). Additionally, the Urban District Sustainability Assessment (UDSA) ontology (Kuster et al., 2020) integrated existing urban sustainability assessment frameworks and domain ontologies, addressing data heterogeneity and information exchange issues through semantic web technology. Compared to CTDO, the ontology designs of the bridge health ontology and UDSA are more transparent and rational, yet neither is openly accessible. SmashHitCore (Kurteva et al., 2023) provides a comprehensive model for GDPR-compliant data sharing and processing, with a particular focus on consent and contract compliance for vehicle sensor data, thereby facilitating legal data exchange and management for smart city services. Although the use of SPARQL queries for evaluation was proposed, the details and results of these queries were not explicitly presented. With the development of smart cities, cross-domain resource integration has gradually become a research hotspot. One approach integrates urban models and open standards of the Internet of Things (such as SensorThings API, IndoorGML, and CityGML), which supports various smart city applications (Huang et al., 2022). Furthermore, the construction of a City Information Model (CIM) through the integration of BIM, GIS, and IoT data provides a systematic framework for data integration, contributing to the development of digital twin cities (Shi et al., 2023). However, the modeling of these ontologies typically focuses on BIM-GIS aspects, with relatively simplistic ontology designs for IoT, failing to fully capture the complexity and diversity of IoT.
Summary of Related Work
In summary, the existing general research methods for ontology modeling are achieved by adhering to a series of standards set by the World Wide Web Consortium (W3C), aimed at ensuring the interoperability of ontologies between different systems, enhancing maintainability and providing consistency and connectivity for applications. These standards include RDF, OWL, SPARQL, and Semantic Web Rule Language (SWRL). Ontology modeling based on specific application scenarios involves the following three aspects:
(1)
(2)
(3)
Methodology
The station–city integration cyberspace involves a complex structure with a large number of sensors, which generate data that exhibit multi-source heterogeneous characteristics. Therefore, to systematically present the intelligent O&M framework of the network space, it is necessary to construct the corresponding maintenance framework at the granularity of the ontology. The maintenance ontology framework for sensor data in the station–city integration cyberspace proposed in this article is shown in Figure 1. Specifically, this framework centers around the operational process of the sensor network system. It generates the ontology model and maintenance framework for sensor data through two core steps: ontology construction from static information and dynamic information-driven knowledge graph.

Sensor semantic model construction framework for the station–city integration cyberspace.
The specification phase aims to clearly define the scope and objectives of the ontology, and identify the intended users and requirements for the ontology. In this section, the article demonstrates the specification of the OMSC-Onto ontology through the following points:
Knowledge Acquisition and Conceptualization
To enhance the intelligent management level of urban infrastructure and transportation systems, and to achieve real-time monitoring of stations and their surrounding environments as well as the daily maintenance of sensor network systems, this article focuses on the operation of sensor networks in the context of the station–city integration at Shenzhen North Station of China. It extracts knowledge from both the static deployment information and dynamic monitoring data of sensors to construct and enrich the semantic model.
List of Core Competency Questions (CQs).
List of Core Competency Questions (CQs).
According to the Railway Passenger Station Integration Development Planning and Design Guidelines (T/CSOTE, 2024), sensor groups are deployed in different functional areas based on the classification of station–city functions. Each group selects appropriate sensors based on the key parameters specified in the Integrated Engineering Planning and Design Standards (DB11/T 2129-2023), involving a total of 14 types across five categories of sensors. The static information of the sensors denotes the ontological attributes fixed at deployment and invariant over the sensor’s lifetime, modifiable only during scheduled maintenance. As shown in Table 9 of Appendix A.1, it encompasses sensor type, measurand, indicator class, transduction principle, range, accuracy, resolution and spatial location, providing the persistent identity required for data interpretation, calibration and asset management in integrated station–city monitoring systems.
Dynamic Monitoring Information of Sensors
Dynamic sensor information refers to the time-stamped occurrent data streams continuously emitted after deployment. Accumulated as
Dynamic Monitoring Information of Sensors.
Dynamic Monitoring Information of Sensors.
The purpose of ontology implementation is to transform the ontology model into a machine-readable model using ontology representation languages. OWL 2 (Web Ontology Language 2) is a language recommended by the W3C (World Wide Web Consortium) for representing ontologies on the Web. It is an extended version of OWL, offering richer expressiveness and more efficient reasoning support. To construct an OWL 2-based ontology model, this paper uses the Protégé ontology management system to model, edit, and reason. A classical approach in the ontology modeling methods is the Stanford Seven-Step Method (Noy & McGuinness, 2002). This method belongs to manual ontology modeling methods and consists of seven steps: (1) determine the domain and scope of the ontology; (2) consider reusing existing ontologies; (3) list important terms; (4) define classes and class hierarchies; (5) define properties of classes; (6) define classifications of properties; and (7) create instances. This method has been widely applied in various fields (Ancione et al., 2024; Guyo et al., 2023; Xi et al., 2023). Specifically, this article utilizes the rdflib library in Python to develop an application that batch-converts CSV table data into instances based on predefined classes and properties, and maps them into RDF format.
RDF is a semantic network model based on triples, consisting of ‘‘subject-predicate-object,” which is used to represent relationships between different resources (Ma et al., 2023). This approach allows sensor data to be standardized into structured semantic data, facilitating data interoperability and reasoning across systems and domains. The converted RDF files, as a form of interrelated graph representation, are stored in GraphDB to form a knowledge graph. GraphDB is a graph database developed by Ontotext, built on RDF standards, and supports semantic data storage and querying, making it suitable for handling large-scale graph data and complex relational data. It has been widely applied in fields such as knowledge graphs, the semantic web, data integration, and recommendation systems (Ancione et al., 2024).
Once the dynamic data storage is completed, it is essential to effectively associate relevant entities and data related to risk warning in the ontology with predefined reasoning rules, thereby extending the existing knowledge graph. This process not only enhances the semantic expressiveness of the knowledge graph but also further improves the risk warning functionality within the graph, providing a more precise informational foundation for subsequent data analysis and decision support. In the context of semantic web reasoning rules, SPARQL (SPARQL Protocol and RDF Query Language) combines efficient data querying capabilities, allowing for flexible expansion on large-scale RDF datasets, thereby better supporting cross-platform compatibility and seamless integration with ontologies and rule engines. Consequently, the knowledge required to identify risk conditions is represented in the form of SPARQL rules.
Evaluation
Ontology evaluation aids in identifying potential issues, enhancing the accuracy and consistency of knowledge representation, and facilitating knowledge sharing and reuse. Evaluation methods encompass formal validation approaches, such as leveraging description logic reasoning to detect ontology consistency and completeness; structural assessment methods that focus on the complexity, coverage, and modularity of the ontology; and application-based effectiveness evaluation, which measures the ontology’s performance through practical use cases.
In this study, the criteria for evaluating OMSC-Onto are consistency, coverage, and adaptability. To maintain consistency, the HermiT reasoner in Protégé is employed for automatic consistency checks. To ensure coverage, responses to CQs are conducted. Adaptability is verified through application-based methods using a real-world scenario, where SPARQL queries are formulated to retrieve relevant information on sensor observations.
Semantic Fusion Framework for Sensor Data
Ontology Development for Smart O&M in Station–City Integration Cyberspace
This section provides a detailed description of the proposed OMSC-Onto ontology, including the core logical model representing knowledge of sensor network O&M, as well as the specifics of the OMSC-Onto modules.
The semantic web for intelligent O&M of the station–city integration at Shenzhen North Station collects and organizes static information through sensors, utilizing the general terminology for sensors (GB/T 7665-2005) to extract concepts closely related to intelligent operation and maintenance. This process ultimately identifies the types of sensors, their deployment locations, measurement parameters, and monitoring targets as the main entities of this research.
Based on this foundation, the semantic web for intelligent O&M is systematically constructed using a top-down design. Specifically, the first step involves defining the categories of the system based on the extracted core concepts, which provides a clear framework structure for model construction. Next, for each category, the attributes of each object and their related data properties are further clarified to ensure the comprehensiveness and accuracy of the model. The defined model is then semantically aligned with existing domain ontologies, namely SSN and SOSA. Based on the alignment results, classes and properties in SSN and SOSA are filtered, adjusted, and expanded. The final ontology structure is shown in Figure 2.

Classes (yellow) and properties (blue) of OMSC-Onto.
Table 4 illustrates the core ontology model of OMSC-Onto, which is built upon the SSN (Semantic Sensor Network) ontology model. Mapping the ontology to a high-level abstract framework ensures that the terminology used in the model is comprehensible to end-users and explicit to ontology developers. Given that the DOLCE UltraLite ontology (DUL) is the core dependency of the previous version of SSN and that corresponding alignments already exist (https://www.w3.org/TR/vocab-ssn/), OMSC-Onto leverages DUL as its upper ontology. Specifically, the following alignments are made: sosa:Observation, which denotes the act of executing an observation procedure to estimate or compute a value of a FeatureOfInterest, is aligned with dul:Event. sosa:ObservableProperty, existing as a property dependent on an entity, is aligned with dul:Quality. sosa:Sensor, the entity that carries out the observation, is aligned with dul:Object. sosa:FeatureOfInterest, the thing whose property is estimated or calculated during observation, is treated as a joint subclass of dul:Event, dul:Object, and dul:InformationEntity. In this work, it is specialized into three categories: architectural structures, spatial areas, and equipment. Architectural structures and equipment are further categorized as dul:PhysicalArtifact under dul:Object, while spatial areas are categorized as dul:PhysicalPlace under dul:Object. Consequently, FeatureOfInterest is aligned with dul:Object to more accurately reflect its classification and attributes. MR:Location, representing the deployment site of a sensor, is aligned with dul:PhysicalPlace. MR:Alarm and MR:AbnormalStatus, representing distinct alarm states, are aligned with dul:Situation (Stavropoulos et al., 2021). The following sections will detail the specific information of these modules.
The Alignments With DUL Ontology.
(a) AreaFOI is observed by CrowdFlowSensors, EnvironmentalSensors and EventSensors; for example, an air quality sensor measuring the CO concentration of the waiting hall.
(b) EquipmentFOI is monitored by EnergyConsumptionSensors; for example, a wireless meter recording the power of an air conditioning unit.
(c) StructureFOI is assessed by StructuralSensors; for example, a digital crack meter tracking the crack opening displacement of a Pillar.
The relational constraints of FeatureOfInterest with other entities are expressed as follows:
This indicates that an instance of an FeatureOfInterest is associated with at least one observable property.
This axiom shows that an instance of an Alarm must be associated with at least one observation.
Additionally, the storage of inherent sensor attributes (such as range and resolution) can be divided into two categories:
(a) Different sensors have different focus attributes. For example, an ultrasonic level gauge measuring water depth emphasizes accuracy, range, and resolution, while a machine vision sensor measuring pedestrian flow direction focuses on recognition type, installation height, and recognition time;
(b) For the same type of sensor, the attribute values may change when measuring different parameters. For instance, the resolution of an AI ToF people-counting sensor is
Since these attributes are only for display purposes in this article and are intended for reference in daily management without computation or comparison, this type of information is represented using class annotation properties, as shown in Figure 3. For case (b), a

A case study of annotation property representation.
SPARQL-Based Smart Operation and Maintenance Rules
The objective of the proposed framework is to detect relevant issues. To achieve risk management for the integration of stations and cities, this section defines the O&M rules for the semantic web of station–city integration based on the SPARQL query language, as shown in Table 5. These rules are categorized into five types, which are used to determine the upper and lower limits of risk control, thereby enabling the identification of potential risk scenarios. These rules include, but are not limited to, the occurrence of various abnormal events, the operational status of the sensor network, and maintenance conditions. By monitoring and managing these rules, we can proactively identify potential risk points, thus ensuring the operational safety of the station–city integration cyberspace.
Operation and Maintenance Rules for the Semantic Web of Station–City Integration.
Operation and Maintenance Rules for the Semantic Web of Station–City Integration.
In the semantic web environment, the rules in Table 5 can be expressed and executed using the SPARQL query language. SPARQL is a query language designed explicitly for querying RDF data models, allowing for efficient retrieval of relevant information from knowledge graphs. In practical applications, SPARQL not only supports standard selection queries but also enables complex construction queries to generate new RDF triples and expand existing data structures (Giannios et al., 2024). Therefore, utilizing the SPARQL query language to automate the execution of these rules and risk detection can significantly enhance efficiency and reduce the need for manual intervention.
Specifically, we utilize the management web application GraphDB Workbench interface provided by GraphDB to design and execute a series of SPARQL CONSTRUCT queries to generate and derive relevant graph patterns. These graph patterns can represent problem scenarios through RDF triples, further enriching and expanding the existing knowledge graph. For example, when executing queries, the system can identify potential nodes, relationships, and events related to risks in real-time, forming a complete chain of risk scenarios and storing this information in the graph for subsequent analysis and decision-making.
Figure 4 illustrates an example of the SPARQL rule structure, which is responsible for extracting problematic situations during the operation of the sensor network. The Prefix section declares two namespaces, with xsd used to reference XML Schema data types, such as

Rule structure.
To achieve intelligent O&M, the sensor monitoring data in the station–city integration cyberspace needs to be first integrated with the corresponding sensor information. Based on the RDF mapping principles illustrated in Figure 5, this article develops an application using the

Principles of resource description framework (RDF) mapping.
To verify the correctness and applicability of the semantic network model for the station–city integration cyberspace proposed in this article, this section conducts an ontology consistency evaluation and a model validity evaluation of the constructed semantic network model.
Automated Consistency Checking
Ontology consistency typically refers to the state in which all concepts, relationships, rules, and their semantic and logical structures within an ontology remain consistent and conflict-free (Yang et al., 2023). HermiT is an efficient OWL 2 DL reasoner used for reasoning and verification of descriptive logic ontologies (Glimm et al., 2014). Disjointness axioms are essential for enabling non-trivial entailments and detecting inconsistencies (Völker et al., 2015). Therefore, to obtain a non-trivial result, we explicitly introduced 27 class-disjointness axioms (Table 10 in Appendix A.3) based on (i) the inherent mutual exclusivity arising from metrological and sensor-physics principles and (ii) the mandatory distinctions defined in international standards such as ISO/OGC and urban-rail-transport industry specifications. These axioms capture the semantic mutual exclusiveness among OMSC-Onto classes. After integrating these potentially conflicting constraints, the debug function of Protégé5.5, using HermiT 1.4.3, confirmed that OMSC-Onto remains logically consistent.
Answering CQs
Referencing a set of CQs provided in Table 2, this study designed a targeted set of CQs (see Table 6) to conduct application-based evaluation using the SPARQL query language. The purpose of this evaluation is to verify the performance of the OMSC-Onto ontology in practical applications, particularly its capability to retrieve precise information to answer specific query requirements. The query results indicate that the OMSC-Onto ontology can effectively retrieve accurate information to meet the designed CQs.
Specified Competency Questions (CQs) and Answers Based on the Passenger Flow Case.
Specified Competency Questions (CQs) and Answers Based on the Passenger Flow Case.
Due to the typically large volume of data in real-life scenarios, this study selected the entry passenger flow records (Wang et al., 2017) from the Zhujiang Road subway station between April 9 and May 6, 2012, to validate the model’s effectiveness on a large-scale dataset. The dataset includes sensor information (sensor type, maintenance cycle, and last maintenance time) as well as passenger flow data. Among them, the passenger flow data comprises a total of 5,712 observations over 28 days, with the Labor Day holiday occurring from April 29 (Sunday) to May 1 (Tuesday), while April 28 (Saturday) is considered a working day. Figure 6 illustrates the general trend of this dataset.

Original data of inbound transaction records at Zhujiang Road subway station from April 9 to May 6, 2012.
Firstly, based on the static information of the sensor, an ontology is constructed to formally represent various entities related to sensors and their interrelationships. Subsequently, the developed application utilizes this ontology model to transform real-time passenger flow data into specific instances, saved in RDF format and stored in a GraphDB for subsequent querying and analysis. Figure 7 illustrates the transformation process from the ontology layer to the instance layer. At the ontology layer, core concepts such as Sensor, Observation, FeatureOfInterest, and their related subclasses pertinent to this scenario are defined. The transformed instance layer then concretely embodies the application of these concepts in actual data, including specific instances of passenger flow observations, such as observation times, values, and associated sensor information.

Ontology mapping process of the passenger flow case.
After processing the data using the methods described in this study, the results were stored in GraphDB, and corresponding queries and reasoning were conducted based on this data. To evaluate the capability of the ontology in answering questions, this study designed a series of CQs targeting the key areas of Observation and Sensor, along with their corresponding expected answers (see Table 6). Table 7 illustrates two example queries devised to address the CQs listed in Table 6. The final query results are presented in Figure 8. By comparing the query results with the expected answers, it is evident that the ontology is highly effective for dynamic passenger flow monitoring.

Querying information for Competency Questions (CQs) 1–10.

A knowledge graph for large passenger flow alarms (part).

The class hierarchy structure of observation and sensor.

The class hierarchy structure of alarm and FeatureOfInterest.
SPARQL Statements for Competency Questions (CQs) 1–10.
For the passenger flow data, the main focus of the reasoning included the identification of large passenger flows and the maintenance of the sensor network system (rules 15 and 17 in Table 5). Table 8 illustrates a series of queries and reasoning processes executed on the entry passenger flow records. In query (a), a total of 5,712 records of the observation type were identified, which is consistent with the number of observations in the original data, indicating that no data loss or damage occurred throughout the data processing workflow. In reasoning (b), it was identified that 595 records of passenger flow data exceeded the set threshold, with these records typically appearing during peak hours (i.e. 17:00–19:00), providing a basis for further analysis of large passenger flow phenomena. Figure 9 displays the visualized graph of these warning records. In reasoning (c), the system also detected 3,030 records that triggered maintenance alarms, indicating potential issues within the sensor network. These unexpected records first appeared on April 22, 2012, suggesting that inspections or maintenance of the relevant sensors or equipment may be necessary.
SPARQL Statements for Alarm and AbnormalStatus.
Through these queries and reasoning, the proposed framework demonstrates its capability to accurately identify large passenger flow events and proactively predict and alert potential system failures or maintenance needs, thereby providing robust support for the system’s stable operation.
The research on intelligent O&M in the station–city integration cyberspace is an intersection of intelligent transportation and smart city domains, addressing the semantic fusion of multi-source heterogeneous data. It encompasses risk analysis and decision-making, as well as the maintenance and management of sensor network systems. Currently, the unstructured and heterogeneous nature of data within sensor networks hinders knowledge sharing and semantic interoperability among various stakeholders and communities, making it challenging to establish mechanisms for data communication and utilization. This situation, in turn, complicates the identification of safety risks in the context of integrated station–city intelligent O&M. Therefore, constructing a semantic web model for the integrated station–city sensor network is essential.
This article develops a semantic model for sensor networks within the station–city integration cyberspace, aimed at achieving intelligent O&M of sensor networks based on a ‘‘data-ontology-maintenance” framework with data feedback characteristics. The main innovation lies in the proposed framework, which integrated various semantic web technologies (i.e. ontology, RDF, and SPARQL) to process five categories of sensor data—structure, environment, crowd flow, event, and energy consumption—across static and dynamic states, converting them into a unified RDF semantic representation format, thereby forming a knowledge graph for sensor intelligent O&M. Within the constructed sensor semantic web framework, potential risk identification during the integrated O&M process was achieved by applying predefined rules and the operational knowledge graph. To validate this framework, a comprehensive evaluation method for the ontology consistency and model validity of the constructed sensor semantic web was conducted. On the one hand, the consistency of the ontology was verified using the HermiT plugin provided by Protégé, ensuring that the defined concepts, classes, and attributes are logically conflict-free. On the other hand, the validity of the semantic web’s dynamic reasoning rules was validated using subway sensor data. This dual-layer evaluation standard significantly demonstrated the broad application prospects of the framework.
It is noteworthy that the proposed framework encompasses the primary risks encountered in the daily O&M processes of the integrated station–city, meeting general risk identification requirements. However, there are still some limitations in the specific implementation process that can be addressed in future research. First, this study primarily focuses on the identification of various risks; supplementary work regarding emergency response measures following risk identification will be conducted in the future. Second, the risk identification of abnormal behaviors such as running and falling cannot be determined by the rules presented in this article and requires further investigation in conjunction with deep learning methods, such as convolutional neural networks. Additionally, the rules in this article are based on manually extracted data; future work will involve the development of a rule extraction engine utilizing large language models.
Footnotes
Acknowledegments
This research was funded by the National Key Research and Development Program of China (Grant No. 2023YFC3807501). The authors gratefully acknowledge this support.
Author Contributions
XP and HF: conceptualization; XP: methodology; XP: software; XP: validation; XP: formal analysis; XP: investigation; HF: resources; XP: data curation; XP: writing–original draft preparation; XP and HF: writing–review and editing; XP: visualization; HF: supervision; XP: project administration; HF: funding acquisition. All authors have read and agreed to the published version of the manuscript.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was funded by the National Key Research and Development Program of China (Grant No. 2023YFC3807501).
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data Availability Statement
The data presented in this study are available on request from the corresponding author. The OMSC-Onto ontology can be found at https://llamla.github.io/station-city-integration/ontology/omsco, and the code is available at
.
A.1 Static Deployment Information of Sensors
Static Deployment Information of Sensors.
| Measurement | Indicator | Sensor | Measured | ||
|---|---|---|---|---|---|
| Type | Parameter | Type | Technology | Parameters | Location |
| Structure | Stress | Quantitative | Strain gauge (resistive) | Accuracy: 0.2% Range: 0–20,000 Resolution: Micrometric | Pillars and connection nodes at the station hall and platform layer |
| Quantitative | Piezoelectric | Accuracy: |
|||
| Strain | Quantitative | Resistive straingauge | Accuracy: |
||
| Quantitative | Fiber optic | Accuracy: |
|||
| Displacement | Quantitative |
Resistive | Accuracy: Micrometric Range: 0–50 mm Resolution: Micrometric | ||
| Quantitative |
Fiber optic | Accuracy: Micrometric Range: 0–50 mm Resolution: Micrometric | |||
| Vibration (Angular velocity) (XYZ velocity) (Acceleration) | Quantitative | Accelerometer | Accuracy: |
||
| Quantitative | Piezoelectric sensor | Accuracy: |
|||
| Quantitative | Fiber optic | Accuracy: |
|||
| Water leakage | Quantitative | Vertical graphenesensor | – | ||
| Tilt angle | Quantitative | MEMS tilt sensor | Accuracy: |
||
| Crack | Quantitative | Digitalcrack gauge andphotogrammetry | Accuracy: Millimeter level Range: 0–50 mm Resolution: 100 pixels | ||
| Environment | Air quality | Quantitative | Optical sensor | Accuracy: |
Around the station hall and resting areas of passengers |
| Quantitative | Electronic sensor | Accuracy: |
|||
| Quantitative | Laser sensor | Accuracy: |
|||
| Temperature | Quantitative | Thermocouplesensor | Accuracy: |
||
| Quantitative | RTD (resistivetemperaturedetector) | Accuracy: |
|||
| Quantitative | Infraredtemperature sensor | Accuracy: |
|||
| Humidity | Quantitative | Lithium chloridehumidity sensor | Accuracy: |
||
| Noise level | Quantitative | Thermal noisesensor | Accuracy: |
||
| Flametemperature | Quantitative | Infraredthermographiccamera | Accuracy: |
Waiting hall and transfer corridor | |
| Fire sourcelocation | Quantitative | Infraredthermographiccamera | Accuracy: |
||
| Smokeconcentration | Quantitative | Smoke sensor | Accuracy: |
||
| Hazardous gascomponents | Quantitative | Electrochemicalsensor | Accuracy: |
||
| Water flow speed | Quantitative | Ultrasonic flowspeed sensor | Accuracy: |
Taxi pick-up area and ride-hailing pick-up area | |
| Water depth | Quantitative | Ultrasonic liquidlevel meter | Accuracy: 0.5%–1% Range: 0.3–30 m Resolution: 3 mm | ||
| Energy consumption | Voltage | Quantitative | Hall voltage sensor | Accuracy: |
Air conditioning units and pump units |
| Current | |||||
| Power | |||||
| Energy | |||||
| Water flow | Quantitative | Water flow sensor | Accuracy: |
||
| Crowd flow | Passenger flow | Quantitative | AI ToF peoplecounting sensor | Accuracy: |
Station hall entrances and exit areas, and external distribution areas |
| Quantitative | Machine vision | Accuracy: |
|||
| Density | Quantitative | AI ToF peoplecounting sensor | Accuracy: |
||
| Quantitative | Machine vision | Accuracy: |
|||
| Queue Number | Quantitative | AI ToF peoplecounting sensor | Accuracy: |
||
| Quantitative | Machine vision | Accuracy: |
|||
| Trajectory | Quantitative |
AI ToF peoplecounting sensor | Accuracy: |
||
| Quantitative + Qualitative | Machine vision | Accuracy: |
|||
| Speed | Quantitative | AI ToF peoplecounting sensor | Accuracy: |
||
| Quantitative | Machine vision | Accuracy: |
|||
| Direction | Quantitative |
AI ToF peoplecounting sensor | Bidirectional recognition | ||
| Quantitative + Qualitative | Machine vision | Bidirectional recognition installation height: 5 m Recognition time: 25 fps | |||
| Large passenger flow | Quantitative | AI ToF peoplecounting sensor | Accuracy: |
||
| Quantitative | Machine vision | Accuracy: |
|||
| Event | Fall | Quantitative | Panoramic camera | Resolution: 8K (7,680 x 4,320 pixels) Range: 360 | |
| Running | Quantitative | Panoramic camera | Resolution: 8K (7,680 x 4,320 pixels) Range: 360 |
A.2 The Class Hierarchy Structure
Feature 10 shows the class hierarchy structure of observation and sensor. Feature
shows the class hierarchy structure of alarm and FeatureOfInterest.
A.3 The Class Disjointness Axioms
The Class Disjointness Axioms.
| ID | Axioms |
|---|---|
| 1 | DisjointClasses: AbnormalStatus, Alarm, FeatureOfInterest, Location, ObservableProperty, Observation, Sensor |
| 2 | DisjointClasses: CrowdFlowObservation, EnergyConsumptionObservation, EnvironmentalObservation, EventObservation, StructuralObservation |
| 3 | DisjointClasses: CrowdFlowProperty, EnergyConsumptionProperty, EnvironmentalProperty, EventProperty, StructuralProperty |
| 4 | DisiointClasses: CrowdFlowSensors, EnergyConsumptionSensors, EnvironmentalSensors, EventSensors, StructuralSensors |
| 5 | DisjointClasses: CrowdFlowWarning, EnergyConsumptionWarning, EnvironmentalWarning, EventWarning, StructuralWarning |
| 6 | DisjointClasses: AreaFOI, EquipmentFOI, StructureFOI |
| 7 | DisjointClasses: AccelerationMonitoring, AngularVelocityMonitoring, CrackOpeningDisplacementMonitoring, DisplacementMonitoring, StrainMonitoring, StressMonitoring, TripleAxialTiltAngleMonitoring, WaterleakageMonitoring, XYZVelocityMonitoring |
| 8 | DisjointClasses: CO2Monitoring, FireSourceLocationMonitoring, FlameTemperatureMonitoring, HarmfulGasComponentsMonitoring, HumidityMonitoring, NoiseMonitoring, PM10Monitoring, PM2.5Monitoring, SmokeConcentrationMonitoring, TemperatureMonitoring, WaterDepthMonitoring, WaterflowVelocityMonitoring |
| 9 | DisjointClasses: CrowdFlowDirectionMonitoring, CrowdFlowRateMonitoring, IndividualSpeedMonitoring, PersonnelDensityMonitoring, QueueSizeMonitoring, TotalPopulationinTheAreaMonitoring |
| 10 | DisjointClasses: CurrentMonitoring, ElectricEnergyMonitoring, PipeTemperatureMonitoring, PowerMonitoring, VoltageMonitoring |
| 11 | FallingMonitoring DisjointWith RunningMonitoring |
| 12 | DisjointClasses: Acceleration, Angular Velocity, CrackOpeningDisplacement, Displacement, strain, stress, TripleAxialTiltAngle, WaterLeakage, XYZVelocity |
| 13 | DisiointClasses: CO2concentration, FireSourceLocation, FlameTemperature, HarmfulGasComponents, Humidity, NoiseDecibels, PM10Concentration, PM2.5Concentration, SmokeConcentration, Temperature, WaterDepth, WaterFlowVelocity |
| 14 | DisjointClasses: CrowdFlowDirection, CrowdflowRate, IndividualSpeed, PersonnelDensity, QueueSize, TotalPopulationinTheArea |
| 15 | DisjointClasses: Current, ElectricEnergy, PipeTemperature, Power, Voltage |
| 16 | Falling Disiointwith Running |
| 17 | DisjointClasses: DigitalCrackMeter, InclinationGauge, LowPowerDisplacementSensor, LowPowerStrain sensor, LowPowerStressSensor, UprightGrapheneSensor, VibrationSensor |
| 18 | DisjointClasses: AirQualitySensor, HarmfulGasDetector, NoiseSensor, smokeSensor, TemperatureAndHumditysensor, ThermalImagingCamera, UltrasonicFlowRatesensor, UltrasonicLevelGauge |
| 19 | MachineVisionSensor DisjointWith PeopleCountingSensor |
| 20 | DisjointClasses: MatchingCurrentTransformer, TemperatureSensor, WirelessMeteringMeter |
| 21 | DisjointClasses: AbnormalCrack, AbnormalVibration, StructuralCorrosion, StructuralDeformation, StructuralDisplacement, StructuralOverload, StructuralTilt |
| 22 | DisjointClasses: AbnormalHumidity, AbnormalNoise, AbnormalTemperature, AirQualityWarning, FireWarning, FloodWarning |
| 23 | FallingEvent DisjointVith RunningEvent |
| 24 | DisjointClasses: PlatformLevel, StationConcourseLevel, stationForecourt, TransferCorridor, WaitingHall |
| 25 | DisjointClasses: AirConditioningUnit, DistributionBox, PumpAssembly |
| 26 | DisjointClasses: CanopyColumn, ConnectionNode, GlassFacadeCable, Pillar, StructuralColumn, Truss |
| 27 | MaintenanceOverdue DisjointWith NotWorking |
