Abstract
This study explores enhancing spatial analysis by integrating Space Syntax with deep learning methods for images and graphs. While Space Syntax traditionally focuses on geometry and topology, it overlooks visual data like texture and color. As image content varies by location, capturing spatial context is essential, especially in irregular or open spaces where identifying central points is challenging. To address this, the study introduces the concept of an “isovist graph” - a spatial model that minimally covers a closed plane with isovists while preserving centrality and connectivity. A method is proposed to compute this model rigorously in discrete settings. Experiments show that exact solutions can be efficiently derived for many spatial cases, aligning with the intended objectives.
Introduction
Basic space unitization methods such as convex spaces, axial maps, and isovists 1 have been widely applied in Space Syntax 2 to analyze spatial configurations at the scale of buildings and city blocks. However, image information such as texture, light, and color, which are visually more primitive, is outside the scope of Space Syntax. Even if the spatial composition, size, and shape are identical, the impression of a space can change significantly depending on the materials and lighting. Therefore, image-based information is essential for various types of spatial decision-making.
From the 1980s to the 2000s, when the Space Syntax method was developed and applied, the difficulty of obtaining image information as data and analyzing it seemed to be in the background. In recent years, omnidirectional cameras have become widespread, and the number of stock photos captured from various streets as street views has increased. With the recent development of deep learning technology, it has become easier to extract meaningful features from high-dimensional data in the form of images and to transform and compress images. In the field of urban analysis, the semantic segmentation of street-view images into basic spatial components, such as sky, buildings, and trees, is widely used to analyze their areas and spatial phenomena (e.g., walkability). 3 In addition to streets, the stock of captured indoor images is also increasing, 4 providing services such as virtual tours, and spreading to areas such as real estate.5,6 In addition, a deep learning technique called a graph convolutional network has been developed to perform tasks such as regression and classification on the graph itself, which has multidimensional features such as nodes. Research has applied this technology to the analysis of houses7,8 and streets, 9 and the groundwork has been laid for analyzing complex graph structures in different ways.
An omnidirectional image is similar to a 3D isovist image; however, instead of depth information, it can be considered to be accompanied by general image information. 10 Since the information in an image changes depending on where the image is taken, it is important to know where to take the image. Usually, images are captured at the center of a space with clear partitions, such as rooms; however, in a space with few partitions or an irregular system, the boundaries of the space may become unclear, making it difficult to define the center of the space. In addition, because a virtual tour enables the continuous navigation of a space by connecting the points where images are captured, it is necessary to consider connectivity. However, when creating a 3D model using camera position estimation and photogrammetry, it is easy to specify the shooting points relatively closely within the same space. However, because the main purpose of this research is to describe the structure of a space, it is preferable to have a minimum or close-to-minimum number of shooting points at a location (center) that is as representative of the space as possible.
Based on this background and awareness of the problem, this study defines an isovist graph as “a spatial model that covers a closed plane with a minimum or near-minimum number of isovist while ensuring centrality and connectivity,” and proposes a rigorous method to obtain it in a discrete problem setting. We propose a method for obtaining this rigorously in a discrete problem setting. This approach addresses limitations in prior isovist-based and axial map models, which often rely on approximate solutions and do not explicitly account for isovist centrality. A more detailed review of related studies and the positioning of this study is provided in Section 2. The proposed method was applied to multiple floor plans ranging from the architectural scale to the city block scale, and the solution time and obtained graphs were verified.
In the following sections, Section 2 describes the previous studies and the position of this research, and Section 3 presents the problem framework. Section 4 reports the concrete flow and formulation of the method. Section 5 verifies the method, Section 6 discusses the results, and Section 7 summarizes the study.
Related studies and positioning of this study
To clarify the novelty and methodological contribution of this study, this section reviews existing research in two main aspects: (1) spatial models used to represent visibility structures such as isovists, axial lines, and visibility graphs; and (2) solution methods for constructing or optimizing these spatial graphs. By organizing prior studies from these two perspectives, we highlight the theoretical and computational gaps that this study aims to address. Our method is then positioned accordingly, with a particular emphasis on its discrete treatment of isovists and its rigorous approach to solving the minimum coverage problem while considering centrality.
Spatial visibility models and their characteristics
Spatial visibility has been modeled in various ways depending on the target of analysis. This subsection outlines the key characteristics of isovist-based models and axial maps, highlighting differences in their spatial units, assumptions, and analytic purposes.
The concept of our isovist graph shares similarity with the function finding minimal set of isovists implemented in the Isovist_App. 11 While it can be said that the method is pioneering approach, it does not consider the centrality of isovist unlike our study. The definition of our isovist graph has many essential overlaps with the definition of an axial map in [2, pp. 91-92]. Hillier and Hanson 2 defined an axial map as the minimal set of straight lines that pass through all convex spaces and capture all axial links in the open space structure of a settlement. The idea of covering a plane with a finite number of connected unit spaces is common. However, axial maps are somewhat confusing concepts as entities. As highlighted by Batty and Rana, 12 axes are interpreted as coherent straight lines that move unimpeded in space; however, these are spatial units similar to aggregated road edges, making naive interpretation difficult. Therefore, when analyzing an axial map as a graph, the axes given as edges must be dual-transformed into nodes. In ordinary spatial analysis, various states at arbitrary observation points are observed and weighed. However, such a correspondence cannot be obtained for nodes given by a dual transformation, and measurement at the center of the line segment or an averaging process is required. However, an isovist is a simpler spatial unit than an axial map because it covers the entire area visible from a single point in all directions. On the other hand, our study differs from the present study 12 in that it uses isovists to generate the axial map described below and does not consider the centrality of isovists or the connection relationship between vantage points. Visibility graph analysis (VGA) 13 is a well-known graph model based on the connection between an isovist’s vantage points. This method generates isovists on a grid, creates a visibility graph of them, calculates various graph features such as a clustered coefficient, and visualizes the features as an isovist field on the original raster. VGA is suitable for expressing the differences in detailed features in a space because it expresses the space as a detailed field. To simplify the visibility graph using VGA, a method of thinning by maximizing the modularity of the graph 14 has been proposed, but it is difficult to obtain a complete coverage of the space by the thinned representative points. There are studies that generate graphs to move as a method of indoor navigation.15–17 In order to create a navigation graph, it is necessary to standardize the spatial units in the preprocessing stage. In this case, it is often the case that the entire space divided by walls and doors is considered to be one spatial unit,18,19 and there are cases where spatial units that are not necessarily visually integrated, such as L-shaped rooms, are included. On the other hand, in complex spaces, a method based on a triangular division has been proposed, 20 but this leads to the creation of visually redundant spatial units. Another similar field is the study of determining the location of the station points for 3D scan of buildings.21,22 In these studies, the problem of finding the minimum number of points and their positions necessary to cover the entire target space is similar to that of this study, but since the main purpose is to scan buildings, the scope of an isovist is often limited and the centrality of the space is not taken into account.
Solution methods for constructing or optimizing these spatial graphs
This subsection reviews prior methods for constructing and optimizing spatial graphs based on visibility.
First, the function of Isovist_App uses a greedy algorithm to find the minimum isovist graph. However, the number of vantage points obtained is not guaranteed to be theoretically minimal. Next, we focus on the automatic generation of axial maps. As mentioned earlier, it was claimed that an axial map can be derived by leaving axial lines until all convex spaces are traversed and all axial lines that can be connected to other axial lines are connected without repetition. The method for deriving a linear representation of a spatial configuration relies on the prior definition of a convex partition of space. The problem of partitioning the minimum convex polygon of a polygon involves efficient algorithms for simple polygons without holes (e.g. Ref. 23). However, it is NP-hard in the case of holes, which are common in space. 24 Moreover, even given the optimal convex partitions, the problem of covering these convex polygons with the minimum number of axial lines is NP-hard because it can be attributed to the set cover problem. 25 In addition, from the viewpoint of mathematical programming, the axial map must be connected as a graph that adds cut and flow constraints, thereby making the solution more difficult. Despite these mathematical difficulties, there is a growing need for an objective method for generating axial maps, and Peponis et al., 26 Batty and Rana, 12 and Turner 27 studied its automatic generation. First, Peponis et al. divided space by s-partition, 28 which divides space by wall extensions instead of convex regions, and then enumerated m-lines, which represent the diagonals of other wall surfaces, and ordered m-lines, in the order in which m-lines intersect more s-partitions. Tuner et al. improved Peponis’ method and proposed a heuristic called subset elimination to retain the necessary lines of sight. Batty and Rana divided the space into grids, found isovists at each point, and used some of them to cover the space. They then generate axial lines in these isovists according to criteria such as maximum length and repeat the process until there are no uncovered areas. This method gives the user the freedom to switch between multiple criteria and change the selection of isovists and axial lines; however, it does not guarantee the connectivity of the axial lines.
All the above algorithms were heuristic and did not guarantee the minimum number of necessary axial links in their problem setting. Jung and Kim
29
formulated Peponis’ problem of finding an axial map using integer programming (without considering cyclicity) and showed that an exact solution can be found. In their formulation, they defined the constraint that axial lines are connected to each other; however, the only connection constraint is that an axial line is connected to one or more other axial lines, such that, for example, if there is a set of four axial lines
In summary, compared to previous studies, this study proposes a new model and problem setting, namely the isovist cover problem, considering centrality and connectivity. Furthermore, the solution method differs from related studies in that it devises an efficient and exact solution method that can be applied to axial map-solving, which is highly novel.
Problem framework
In this study, we assume that the class of polygons representing a connected plane is either simple or multiple connected polygons. We consider the problem of visually covering a plane using isovists generated from a finite number of vantage points. As it would be somewhat difficult to implement a geometrically rigorous calculation of the coverage, we generated observation points discretely and evaluated the coverage as an approximation by assuming that they were all visible from the isovist. In this case, in order to ensure the representation of the structure of the space, it is a constraint that the observation points are visually connected to each other. Considering a situation such as that used in filming, we aim to position the isovist’s vantage point at what we consider the center of the space. Let
Definition of our isovist
Let us take as an example a plane with a T-shaped hole in a square region as shown in Figure 1(a). The polygon formed by connecting the visible observation points in sequence from the vantage point Target space and isovist.
Coverage
Next, the isovist coverage is explained. When the observation points are placed only at the endpoints of each line segment, as shown in Figure 2(a), the two monitoring points at the positions shown in the figure are sufficient to cover all the observation points. However, it can be seen that the upper part of the space is not covered by the isovist. Therefore, an observation point was added at the midpoint of each line segment, as shown in Figure 2(b). In this case, three observation points are required to cover all the observation points, and the corresponding isovist also shows that the space is completely covered. In this example, to fill the gaps in space, the observation points must be placed at the required density. Explanation of covering.
Connectivity
Connectivity has been introduced as a fundamental constraint for axial lines in the automatic generation of axial maps and is also introduced in this study because it is a prerequisite for analyzing space graphically. In this study, isovists were considered connected if their vantage points were directly visible to each other. In addition to visual connectivity, linear connectivity also exists. For example, if the connectivity of an axial map is the former, the connectivity of a segmentation map divided among road segments corresponds to the latter. 31 It can be said that dynamic connectivity is a graphical representation of the topological property of space that indicates whether or not one spatial unit is continuously accessible from another, regardless of whether it is visually visible or not. If so, it is conceivable to express the isovist connectivity in terms of the isovist overlap, which is relatively easy to formulate and solve. However, we did not adopt this method because, combined with the omnidirectional nature of isovists, it connects to distant isovists that cannot be accessed without passing through other spatial elements. The limitations of the connectivity set used in this study and future directions are discussed later.
Centrality
The last key point is centrality, which is the goal of this research: to cover space with an isovist and structure the space in terms of its connectivity. To capture an omnidirectional image, it is desirable to capture the image at the center of each individual “space” as much as possible. The problem is how to define these “spaces” and their “centers.” Because isovists are a class of star-shaped regions that radiate from a viewpoint, the coherence of the entire region as a space is generally weak. Conversely, convex regions are a class of regions in which any two points within them are visible and can easily be a unit of a coherent space. An isovist is a broader region class than convex regions; therefore, we evaluated spatiality in terms of convex regions contained within the isovist. For our limited isovist, the convex hull is the maximal clique in the visibility graph formed by connecting the observation points (Figure 3(a)). However, because there are generally multiple maximal cliques, it is necessary to select the one that is representative of the isovist from among their names. Figure 3(b) shows an example of the three maximal cliques. First, cliques Examples of derived process maximum cliques up to representative clique.
Finally, as shown in Figure 3(c), let
Specific flow and formulation
Symbols used in variable names.
The main flow is as follows. The proposed method consists of steps 1-4. These are described in the following order.
Step 1: Placement of observation points and candidate vantage points
After loading the floor plan data, observation points were placed on the wall to evaluate the visibility of the entire space. The total number of observation points to be placed was assumed to be constant and expressed as Arrangement of observation and vantage points.
Next, the isovist candidate vantage points are placed in the space. Similarly, the total number of points was assumed to be constant and denoted as
Step 2: Selection of isovist and maximum convex region
An isovist is generated by extracting the visible observation points for each candidate vantage point Example of a maximum clique with a hall.

In the above equation,
Step 3: Finding the minimum number of vantage points (Opt. 1)
The first is the number of isovists that can cover the plane while satisfying the isovist linkage constraints. For this purpose, we define the following mixed-integer quadratic constraint programming problem.
Equation (4) guarantees that any observation point is visible from one of the selected vantage points. Equation (5) is a lazy constraint that guarantees that the isovist graph is connected. Lazy constraints are a bottom-up method in which after finding a tentative solution, the tentative solution is checked to see if it is disconnected. If so, the necessary constraints are added successively to gradually strengthen the constraints. First, the set of vantage points for Difference between connected components and neighborhoods.
Step 4: Select the isovist that occupies the center of the space (Opt. 2)
The following is a mathematical programming problem in which the number of vantage points is set to a value greater than or equal to the minimum n obtained in Opt. 1, and the isovist is chosen such that the vantage points are placed as much as possible in the center of the space while satisfying the connectivity constraints. The constraints are the same as in Opt. 1. In Opt. 2,
Verification
Verification was performed using the following settings.
Floor plans and parameters
In this study, six floor plans of architectural and street scales were used, focusing on familiar spaces in this type of analysis (see Figure 7). Buildings 1,2,3
26
and the National Gallery
27
were used as well-formed spaces, and a short-term treatment facility for children with emotional disorders (Treatment Facility)
34
and Gassin
2
were used as irregular spaces. Table 2 lists the number of vantage and observation points in each space. The number of observation points in each space was approximately 1000 and they were placed as evenly as possible on the wall. The vantage point was generated at the intersection of the m-lines and one of the vantage points within 1/3 of the minimum side length of each space model was selected. Floor plans used in the verification. Number of vantage points and observation points placed in each floor plan.
Implementation of the method
The proposed system was implemented in Python 3.12 and integrated with Grasshopper, operating within Rhinoceros 3D version 8.0. Graph-theoretic operations, including clique extraction, were performed using the igraph library. Mathematical programming components were formulated through the Python API of Gurobi Optimizer (version 12.0.1). For the optimization process, the solver parameters were set with Params. MIPFocus = 2 to prioritize the identification of optimal integer solutions. A time limit of 10 min was imposed on each solution run to balance precision with computational efficiency. All computations were conducted on a Windows 11 Pro 24H2 system equipped with an Intel Core Ultra 285K processor, 192 GB of DDR5-5200 MHz memory.
Results
Summary of optimization results.
For comparison, Appendix A1 shows the connection constraints with network flows and their computation results, which show that they are less efficient than the lazy constraints.
Figures 8 and 9 show the results of the placement of each plan view using Opt. 1 and Opt. 2 for each plane. In addition to the vantage points and edges connecting them, the visualization of the Opt. 1 results shows that the areas covered by isovists were colored to confirm that there was no coverage leakage. In the Opt. 2 visualization, the main clique for each isovist is shown. The following sections describe the floor plans. Optimization result 1 (upper row: Opt. 1, lower row: Opt. 2). Optimization result 2 (upper row: Opt. 1, lower row: Opt. 2).

Building one consists of a series of rectangular rooms, and the center of the space and the number of points required can be easily assumed. Fifteen points were the minimum number of points required by Opt. 1, but this did not place the rooms in the center of the space, but at the boundaries. Therefore, Opt. 2 set the number of points to 17 to match the number of rooms. As the vantage points must be visible, the vantage points in the lower-left room are placed at the boundary. The other rooms were placed in the center of the room.
Building two consisted of a relatively finely segmented space with ambiguous boundaries. Both examples show the minimum number of vantages’ points. Because they have the same number of vantage points, they are similar, except for the finer space on the right side.
Building three is loosely segmented from a wide space to a narrow space with few boundaries. Compared to Opt. 1, the narrow space had a point placed in the center.
The National Gallery is a segmented space, as shown in Building 1. There was a difference between the minimum number of vantage points and the appropriate number of vantage points. However, there are still places where the vantage points are not placed, such as the circular empty space on the right-hand side.
Treatment Facility has irregular systems and unclear boundaries. Because the vantage points tended to be located near the walls with the minimum number of units, we increased the number by one and performed Opt. 2. The impression is that the vantage points are arranged in a coherent spatial unit.
Gassin ran Opt. 2 with the minimum number of vantage points. The arrangement of the interior vantage points tends to be slightly different from Opt. 1.
The above results show that the minimum number of vantage points by Opt. 1 tended to be placed at the boundary, but by Opt. 2, the number of vantage points was placed at the center of the space by increasing the number of vantage points as necessary. However, the connection relation of the vantage points is often not that of the neighbors but that of the distant vantage points, which will be discussed in the next chapter.
Discussion
The following are discussed.
Difference in α
By setting α in Opt. 2, the convex hull representing the isovist can be set in consideration of the balance from a spatially closed convex hull to a convex hull with a large area. The previous results were obtained for Differences in Opt. 2 results according to 
Connectivity
This section discusses the connectivity. The results showed a tendency to connect to distant vantage points. This result is somewhat unnatural from the viewpoint of spatial connectivity as perceived by humans. People may perceive spatial connectivity in a kinematic sense. In this study, the visual connectivity of the vantage points was used as a constraint. According to Turner et al. (year): However, as indicated by Batty and Rana, 12 in isovist coverings, the isovist region itself is always connected or adjacent according to the definition of the problem. This connectivity guarantees second-order visibility. As mentioned in Section 3, isovist regions themselves often intersect with distant isovist regions; therefore, expressing connectivity in this manner would result in an excessive degree of connectivity, as the isovist region itself would be connected to the distant isovist region. Alternatively, if orthodox convex partitioning is applied, for example, coverage and linear connectivity are satisfied. However, the results of Carranza and Koch 35 show that there is a tendency for excessive partitioning, such as the 114 partitions in Gassin. Because the main objective of this study was to propose a method for solving connectivity in a rigorous and efficient manner, the issue of connectivity remains. Future work is required to improve the proposed method to consider dynamic isovist connectivity in applications such as graph analysis.
Other limitations and future issues
In this study, monitoring points were placed discretely on the wall surface to simplify the implementation of the spatial analysis. isovist is geometrically determined by the location of the vantage point; therefore, if rigor is important, it is desirable to solve the covering problem under continuous conditions. To simplify the computation time, the vantage points are placed only at the intersection points on the m-line as a rule of thumb. The effect of this limitation on the results has not been fully verified and will need to be verified in conjunction with improvements to the model. In addition, the exact solution method has some computational limitations, and the development of an approximate solution method is necessary.
Conclusion
In this study, we proposed an isovist graph as a new concept of a spatial model that covers a plane with an isovist under the condition of connectivity, mainly for architectural and street-scale spaces, and proposed a fast exact solution method for the automatic generation of axial maps by devising the connectivity constraint, which has been difficult so far. We proposed a new method to solve the problem of how to generate an axial map automatically by devising linkage constraints and verified the method using six architectural and urban area models. Several plans can be solved in a short time by sampling data, devising constraint expressions for coupling conditions, such as delay constraints, and using the latest mathematical programming solvers. Although complex plans require more computation time, and in some cases, the optimal solution was not obtained, the optimality gap was demonstrated and could be used as a guide when creating heuristics in the future. The center of space is important for this method because it determines the point at which an omnidirectional image is captured. The vantage point could be placed close to the center of the space by setting α = 0.5. This study aimed to develop an isovist graphical model to analyze spatial configurations. However, the model cannot model local connectivity relations, such as lines of flow, because the connection relation of the vantage points is often such that distant vantage points are connected. In addition, the space was sampled discretely. Improvements in the representation of local connectivity and the discrete nature of spatial sampling are necessary for practical use in the future.
Footnotes
Acknowledgements
The authors would like to thank Associate Professor Sam McElhinney at UCA Canterbury for sharing information about Isovist_App.
Ethical approval
This study did not involve human participants or animals; therefore, ethical approval and informed consent were not required.
Funding
This work was supported by a JSPS Grant-in-Aid for Scientific Research (C) (grant number: 23K04161).
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data Availability Statement
Data are available from the corresponding author upon reasonable request.
