Abstract
The remarkable lateral oscillatory occurrences observed in renowned footbridges, such as the Solferino Bridge in France, the Millennium Bridge in the UK, and the Oda Bridge in Japan, have garnered widespread attention. The intricate nature of pedestrian-induced lateral vibrations necessitates a comprehensive investigation into their underlying mechanisms, with the crux lying in the identification of the lateral excitation model. Presently, most prevailing detection approaches predominantly rely on contact-based equipment, such as force sensors and displacement sensors. However, these approaches suffer from two main drawbacks. Firstly, contact measurement necessitate a substantial number of sensors and entail higher experimental costs. Secondly, when extending contact-based measurements to accommodate multiple individuals, complications arise, including cumbersome installation, heightened technical complexity, and escalated experimental costs. Therefore, we present a novel approach, based on visual technology, to expeditiously recognize the lateral excitation induced by pedestrians. Termed the Dual Perspective of Mediapipe (DPM), this method harnesses the power of Mediapipe, coupled with dual camera models, to precisely unravel pedestrian gait particulars and lateral forces. The efficacy of the proposed approach is validated through a comparative analysis with existing pedestrian gait data. It is noteworthy that this approach offers significant advantages: non-contact with the test pedestrians, thereby ensuring the accuracy of gait information; convenient and straightforward arrangement of measurement equipment; and low experimental costs.
Keywords
Introduction
Over the last thirty years, several footbridges worldwide have encountered large lateral vibration events, such as the Millennium Bridge in London, Toda Park Bridge in Japan and the Solferino Bridge in Paris. When the number of pedestrians on the footbridge reaches a critical threshold, even a slight increase in number of pedestrians triggers an abrupt oscillation of the bridge(Dallard et al., 2001; Fujino et al., 1993; Newland, 2003). Since then, extensive research has been conducted to investigate the pedestrian-induced lateral vibration of footbridge. Several intricate mechanisms underlying these events have been unveiled, encompassing synchronization, human-structure interaction, and crowd dynamics.
Over two decades of extensive research, although significant advancements have been made in this field, the underlying mechanism of pedestrian-induced lateral vibration remains unclear. Initially, it was believed that the synchronous resonance between pedestrians and footbridges was the main cause of pedestrian-induced vibration. Fujino and Siringoringo (2016) proposed a model for the lateral force of pedestrian, representing it as a simple harmonic force with a random phase. This model was applied to the Toda Bridge and successfully predicted the amplitude of footbridge vibrations, aligning well with actual observations. The experiments conducted by Arup on the Millennium Bridge revealed a direct proportionality between the lateral force exerted by pedestrians and the local velocity of the footbridge. Building upon this finding, Dallard et al. (2001) introduced the concept of equivalent damping coeffecient
The aforementioned experiment aims to acquire pedestrian gait through contact measurement. However, it is evident that certain limitations exist in this approach: Firstly, the direct sensor-to-body contact unavoidably influences the subjects and fails to fully replicate natural walking, thereby compromising data accuracy. Secondly, the extensive number of sensors result in high experimental costs and subsequent maintenance requirements. On the other hand, deep learning has made significant advancements currently, and convolutional neural network has a strong feature extraction ability, which made it widely used in human posture recognition through vision technology. Previously, Toshev and Szegedy (2014) integrated deep learning with human gesture recognition and proposed Deep Pose, which is a deep neural network (DNN)-based method for recognizing gestures that employs DNN regression to predict the coordinates of key points on the human body. Zhou and Zhang (2020) proposed a multi-model-guided regression algorithm, which integrates multivariate human body information including visual features, skeletal poses, audio signals, etc., and combines it with a multi-class SVM to recognize and predict human body action. Fu et al. (2020) proposed a two-step Kalman filter and a fuzzy logic-based adaptive tuning method, where the environmental information is used as an input for adaptive tuning by the sensors. This method ultimately captures the human posture and motion. Ma et al. (2020) proposed an improved GaitSet network for human gait recognition, which utilizes human silhouette and posture as key information to avoid the influence of clothing differences on recognition. Ding et al. (2020) proposed a human pose recognition algorithm based on multi-feature and rule learning. They defined a 219-dimensional feature vector containing angles and distances. They also used bagging and random subspace methods to classify human poses, achieving the pose recognition. Bourahmoune et al. (2022) proposed an intelligent sitting posture training scheme based on the LifeChair IoT cushion, which achieves real-time monitoring of sitting posture as well as highly accurate recognition. Xu et al. (2023) proposed a robust Abnormal Human-Posture recognition, and OpenPose is used for human skeletal data extraction. Van Hauwermeiren et al. (2020) used a static camera and a UAV camera to simultaneously observe the crowd trajectories on a footbridge and used color segmentation to detect them, in which the resulting detection accuracy reaches 2∼3 cm. While vision technology has demonstrated remarkable performance in human posture recognition, its application to pedestrian lateral force detection remains relatively underexplored. Therefore, we aim to bridge this gap by applying vision technology for pedestrian lateral force recognition, offering novel perspectives and methodologies. This paper proposes a novel visual recognition approach, namely Dual Perspective of Mediapipe (DPM), for precise identification of pedestrians' gait information and lateral forces. The approach is compared with experiments conducted by Carroll et al. (2013) to verify its effectiveness and carries out a preliminary application - validation of the IPM. DPM employed a vision-based, non-contact gait recognition technique that enhances accuracy in data collection and pedestrian walking restoration. This approach improves the quality of test data, thereby enabling us to further validate the IPM. Compared to traditional contact measurement, DPM also offers simplified equipment requirements, reduced experimental costs, and only utilizes two cameras as the key equipment. The cameras only need to meet basic recording specifications for successful identification. Besides, the setup of measurement equipment is simple and the utilization of neural networks ensures fast processing speed. Consequently, DPM exhibits remarkable attributes including high accuracy and cost-effectiveness and serves as a reliable research tool for recognizing pedestrian gait information and lateral forces under constrained conditions.
Dual-perspective visual recognition method
Convolutional neural networks (CNNs) are widely used in the field of visual recognition, and their powerful feature extraction ability has established them as a key tool in human pose recognition. For single person pose recognition, the most common approach involves inputting RGB images into a neural network to predict the positions of various joint points in the human body. Subsequently, the surrounding scene features are analyzed, and the corresponding joint points are connected. With the continuous development of single person pose recognition algorithms, their efficiency and accuracy have been consistently improving. A paradigm shift from traditional numerical coordinate-based methodologies to the innovative heat map estimation techniques has endowed single person pose recognition with invaluable utility across diverse scenarios and conditions.
In this study, we introduce a pioneering approach to visual recognition, employing a dual-perspective methodology built upon the lightweight convolutional neural network, Mediapipe(Lugaresi et al., 2019). Our method capitalizes on the swift and precise attributes inherent to Mediapipe in the realm of single-person pose recognition, while synergistically integrating the capabilities of the dual-perspective model for measuring detection points’ distances. It is noteworthy that the binocular camera exhibits superior accuracy and 3D reconstruction capabilities in comparison to its monocular counterpart. This facilitates a more precise capture of pedestrians' lateral gait characteristics, thereby enabling a more reliable and accurate pedestrian gait recognition process.
Visual recognition network of mediapipe
This study employed a lightweight convolutional neural network, namely Mediapipe, to accurately recognize posture of pedestrian. Mediapipe encompasses a suite of modules that integrate various models, including human pose recognition, gesture recognition, and facial recognition. For the purpose of human posture detection, Mediapipe leverages the BlazePose lightweight convolutional neural network. This network combines heat map and regression methods to accurately identify the coordinates of key points of the human posture. During the inference stage, BlazePose incorporates a detector-tracker architecture. The detector component identifies the presence of a complete human body in the current frame, while the tracker predicts the coordinates of key points. In cases where the tracker fails to predict the posture information for the next frame, it reruns the detector to recognize the human body and initialize the detector once again. Given that the human body exhibits a wide range of movements and certain movements may be obstructed by limbs, BlazePose enhances detection efficiency and accuracy by utilizing facial detection instead of limb detection. This choice is motivated by the fact that the face undergoes relatively smaller movement changes compared to the torso, enabling face detection to handle various complex environments and changes in human posture. However, it is important to note that this approach has a limitation: the prediction process relies on the visibility of the human face.
BlazePose’s framework is structured around the integration of heat map, offset, and regression methods, which are divided into two main components: the key point detection network and the key point regression network. During the training phase, a supervised lightweight embedding approach is employed, utilizing heat map and offset losses to refine the network’s output features. These refined features are then utilized by the regression networks. During the inference phase, the corresponding output layer is removed, resulting in a streamlined network structure. Through careful verification, it has been observed that this modified network architecture significantly accelerates the operation speed without compromising accuracy. The specific details of this network structure can be found in Figure 1b, which provides a visual representation of its components and their connections. (a)Human body’s 33 marker points identified by Mediapipe, (b)Framework of Blazepose.
In the estimation of human posture, Mediapipe demonstrates remarkable precision in identifying 33 key marker points distributed across the human body, as illustrated in Figure 1(a). These marker points encompass crucial information, including the coordinates and confidence levels associated with facial features, arms, trunk, legs, and other body parts. Thanks to its exceptional prediction speed and accuracy, Mediapipe is particularly well-suited for indoor single-person treadmill walking experiments. This visual recognition framework can be effectively utilized in the study of pedestrian-induced lateral vibrations on footbridges. By harnessing Mediapipe for efficient and high-precision recognition of human poses, in conjunction with the proposed algorithm for information extraction, the integration and extraction of pedestrian gait information can be achieved, offering a reliable solution within the constraints of limited research resources and conditions.
Camera model based on dual-perspectives
A dual-perspective camera setup is used to measure human posture, providing the following advantages: 1) The human posture information in the two 2D images is used to convert the actual recognition point displacement. This makes the displacement data sufficiently accurate. 2) The effect of lens distortion is reduced. During a person’s walking process, two cameras simultaneously record the motion in their respective viewpoints. Gait information, such as step frequency, step width, CoM displacement, velocity, and acceleration, is extracted based on the Center of Mass (CoM) and Center of Pressure (CoP) recognition point coordinates using Mediapipe. At the same time, the camera lens may have certain deviations in production accuracy and assembly process, resulting in stretching near the edge of the portrait. To reduce measurement errors caused by image stretching, two cameras should be aligned with the CoM and CoP so that they are directly imaged in the center.
The dual-perspective camera setup used in DPM is illustrated in Figure 2. The two cameras are positioned along the extension line of the treadmill’s central axis. The longitudinal plane, represented by the yellow plane in Figure 2, is perpendicular to the ground and passes through the central axis. Camera 1 is horizontally aligned with the position of the human’s CoM and provides the CoM perspective. Camera 2 is horizontally aligned with the position of the human’s foothold points, known as the CoP, and provides the CoP perspective. In this setup, the human’s CoM is captured at the image center of Camera 1, while the human’s CoP is captured at the image center of Camera 2. Schematic diagram of visual capture of human posture from dual perspective.
The horizontal distance
As depicted in Figure 3, a specific dual-perspective camera model is presented in detail. In Figure 3(a), the main view of the dual-perspective camera model is shown along the x-axis direction. Two rectangular boxes represent the imaging frames of the upper and lower cameras. Figure 3(b) presents a cross-sectional view of the dual-perspective camera model along the longitudinal plane. In Figure 3(a), Schematic diagram of dual camera model:(a) Main-view; (b) Cross-sectional view; (c) Offset of the imaging point; (d) Top-down view.
The lateral distance
Extraction of CoM displacement and gait information
The visual recognition network, implemented through the utilization of Mediapipe, demonstrates the capacity to discern and identify 33 pivotal landmarks within the human physique. However, it regrettably falls short in capturing the indispensable information pertaining to the CoM of pedestrian. Carroll’s seminal research illuminates the intrinsic mechanics of human locomotion, elucidating the preponderant role played by the Head, Torso, and Pelvis (HTP) in generating lateral forces(Carroll et al., 2013). Astonishingly, these HTP forces account for a substantial majority, encompassing 50.4% of the total body mass. This empirical evidence effectively corroborates the fundamental tenets of the simplified model theory, specifically the IPM, which posits the simulation of the human body as a mass entity, bolstered by a rigid limb devoid of mass. Carroll ascertained the location of CoM in the corporeal domain to reside superior to the pelvis through meticulous investigation. thereby substantiating the reliability of this theoretical framework. Drawing inspiration from Carroll’s pioneering research, this study similarly situates the center of mass position above the pelvis as the definitive location within the human physique. The process of ascertaining the CoM’s precise coordinates necessitates the utilization of specific identification points, namely 11, 12, 23, and 24, as illustrated in Figure 4. Notably, the midpoint “a,” derived from the intersection of identification points 11 and 12, as well as the midpoint “b,” formed by the points 23 and 24, serve as cardinal indicators of the torso’s central axis. The CoM’s spatial coordinates manifest themselves at a distance corresponding to 0.8 times the length of “ab” from the aforementioned central axis of the torso. Thus: Schematic diagram of marker points of human’s Centre of Mass(CoM).
The position of the CoP is determined by identifying points 27 and 28 in the Figure 1. In the IPM, the CoP’s position is defined as the point where the massless rigid rod supporting the body makes contact with the ground. It is assumed that the CoP’s position remains fixed during each step’s support phase. Among the 33 marker points provided by Mediapipe, there are six points associated with the foot region, respectively points 27∼32. However, points 29∼32, which correspond to the heel and toes, tend to exhibit significant fluctuations in position due to the rotational movement of the ankle during walking. In other words, when visually recognizing adjacent frames, these points may deviate considerably from their actual positions. On the contrary, points 27 and 28, representing the ankle, are relatively more stable. Consequently, only points 27 and 28 are considered for extracting step frequency information and determining the CoP’s position.
When pedestrians walk, the coordinates of the CoP change over time. A step taken with the same foot can be divided into three periods: the pre-foot-lift period, step fall period, post-foot-lift period, and forward step period. • Pre foot lift period: In this period, pedestrians lift one foot and move it forward until it touches the ground. • Step fall period: This period represents the pedestrian’s contact with the ground, supporting the entire body. Meanwhile, the other foot enters the later stage of foot lift. • Post foot lift period: After the step fall period, the foot that was supporting the body is lifted, preparing for the forward step. • Forward step period: This period involves the forward movement of the foot that just landed, while the other foot enters the pre-foot-lift period.
This process alternates between the two feet, creating a steady forward gait for the human body. Throughout these four stages, the CoP’s coordinates during the descent period remain relatively stable. However, during the Pre foot lift Period, Step fall period, and Forward step period, the CoP is separated from the ground. In the Pre and Post foot lift period, the CoP reaches its highest position above the ground, resulting in a local peak in the vertical coordinate of the CoP (
Based on the changes in CoP’s coordinates described earlier, this study has developed a peak recognition algorithm. The purpose of this algorithm is to identify the CoP’s peak area for each step cycle during the entire walking duration of pedestrians. Additionally, it separates the forward step period, which exhibits certain fluctuations in CoP’s coordinates. The algorithm ultimately outputs the stable period of the CoP, which corresponds to the time point when the pedestrian takes each step. Furthermore, the study utilizes the actual distance exchange algorithm of the dual-perspective model to calculate the actual distance between the foothold point of each step and the vertical plane. This calculation provides the coordinates of the foothold’s points. The separation process involved in this algorithm is illustrated in Figure 5, which likely provides a visual representation of the steps and stages involved in the algorithm. The vertical coordinate timescale of CoP during the walk is denoted by Process of peak recognition algorithm.
Verification and testing
Experimental campaign
In this study, a single-person indoor walking experiment was conducted on a stationary treadmill. The primary objective of this experiment was to assess the reliability and accuracy of the DPM in capturing pedestrian gait information. To achieve this, we compared the gait data obtained through DPM with the experimental data(Carroll et al., 2013). By comparing the results obtained from DPM with Carroll’s experimental data, the scholars aimed to evaluate the performance and accuracy of DPM in capturing and analyzing pedestrian gait information.
In the experiment, a treadmill model SHUA-T5170P was used, which is depicted in Figure 6a. Like most treadmills, it allows for arbitrary adjustment of the treadmill track speed. To simulate pedestrian walking under real conditions as closely as possible, a rhythmic audio with a fixed frequency was recorded for each participant prior to the experiment. During the experiment, the participant wore a pair of headphones and gradually adjusted their stride frequency to match the rhythm audio being played. Simultaneously, the speed of the treadmill was adjusted until the participant’s stride frequency fully matched the rhythm audio. This process aimed to replicate the natural walking pattern of the participant. The experiment and simulation were carried out under two different lateral step frequency conditions: 0.85 Hz and 0.93 Hz. These specific frequencies were likely chosen to investigate the effects of different step frequencies on the reliability and accuracy of DPM in capturing pedestrian gait information. Indoor treadmill experiment: (a) The treadmill (SHUA-T5170P) used; (b) Schematic diagram of sensor placement for the treadmill; (c) Indoor single pedestrian experiment site.
Physical data of subject and experimental conditions.
Information on experimental equipment.
We conducted a comprehensive validation of the recognition of human CoM motion and gait information by DPM. In Section 3.2, we compared the data obtained from sensors installed on the treadmill with the data derived from DPM. Specifically, we examined the temporal evolution of step points and step widths to assess the accuracy of DPM in identifying pedestrian step point and step width information. In Section 3.3, we compared the CoM lateral force determined by DPM with the human ground reaction force (GRF) measured by Carroll using visual target technology. This comparison was performed in both the time and frequency domains, providing further evidence to support the accuracy of the gait information obtained through the methodology employed in this study. Finally, in Section 4, we conducted numerical simulations of the IPM and compared the displacement, velocity, and acceleration time histories of the human centroid CoM between the two datasets. This analysis served to validate the rationality of the IPM.
Step width recognition and verification
One of the pedestrian gait parameters that can be recognized by DPM are the foothold points position and stride width of each step taken by the pedestrian. As discussed in Chapter 1.2, within the framework of the dual-perspective model, the distance between each step point and the longitudinal plane, represented by the centerline of the treadmill, can be determined. Prior to commencing the experiment, the participant adjusts their walking pace according to rhythmic audio cues, while the off-field personnel adapt the treadmill speed to match the participant’s stride frequency. Once the adjustments are finalized, the participant begins walking in a smooth and natural manner. Simultaneously, the two cameras commence capturing the position of the CoP of the subject’s body, while a thin film sensor located beneath the treadmill track records the position of each foothold made by the subject.
As depicted in Figure 7(a) presents the variations in step width and foothold points’ position, as measured by the DPM and the thin film sensor, respectively, for subject 1 at a step frequency of 0.85 Hz. Notably, the foothold points’ position obtained via the thin film sensor corresponds to the collective coordinates of multiple sensors, each encompassing the area where a step is taken. These coordinates represent the geometric center of the pressure-bearing sensors. Consequently, we can reasonably approximate the coordinate position of the pressure sensor as the actual location of each subject’s step. From Figure 7(a) and 7(b), it is evident that the foothold points’ coordinates derived from the DPM exhibit a remarkable concordance with the data directly measured by the thin film sensor. The DPM proves highly accurate in capturing the position of the CoP, as well as documenting the timing and location of each ground contact event. Figure 7(c) and 7(d) illustrates the comparative analysis of temporal variations in step width. Regarding step width, the trend observed in the DPM’s data aligns closely with the measurements obtained from the sensor. The relative error between the visual measurements of the two subjects and the sensor data falls within the range of 0 to 19%, signifying that the step width values calculated via the DPM closely approximate the actual values. (a) Comparison of foothold points of Subject 1 (step frequency of 0.85 Hz) (a) Comparison of foothold points of Subject 2 (step frequency of 0.93 Hz) (c) Comparison of step width of Subject 1 (step frequency of 0.85 Hz); (d) Comparison of step width of Subject 2 (step frequency of 0.93 Hz).
Identification and verification of lateral force
Relative mass of human limb segments.
To capture the lateral forces exerted during pedestrian locomotion, Carroll introduced a visual target technology employing inverse dynamics analysis to comprehensively examine the intricate kinematic patterns exhibited by pedestrians. In his study, Carroll devised a modified treadmill capable of lateral vibrations, effectively emulating the walking conditions experienced on a vibrating bridge. Through this innovative approach, the proposed methodology successfully reproduced the lateral force, commonly referred to as the ground reaction force (GRF), during walking across various amplitudes and frequencies.
Carroll’s experiment encompasses the acquisition of subject lateral force from two distinct perspectives. Firstly, the load imposed is directly obtained by 4 accelerometers installed on the treadmill. The resulting ground reaction force (GRF) derived from this approach is referred to as the direct GRF and is denoted as
In addition to the aforementioned approach, Carroll employs visual targeting technology and sensors positioned on different anatomical regions of the human body to capture the acceleration of various limb segments. Subsequently, the corresponding forces are calculated utilizing Newton’s second law of motion. This reproduced ground reaction force is denoted as the reproduced GRF(
Carroll conducted experiments on a non-vibrating treadmill and compared the forces obtained through two distinct measurement methods. The results revealed a substantial level of agreement between the two methods, both in the time and frequency domains. This finding underscores the reliability and accuracy of the visual targeting technology employed to acquire pedestrian lateral forces. However, due to the limited number of experiments conducted by Carroll on a stationary treadmill, only the time history information and corresponding power spectral density of the lateral force were obtained. Consequently, this study further conducted a comparative analysis based on these two aspects of data to effectively showcase the capability of visual technology in discerning individual lateral forces. This research endeavor establishes a technical foundation for future investigations to extend beyond individual experiments and encompass crowd experiments.
Figure 8(a) illustrates the lateral acceleration of each limb of subject 2 as identified through the DPM. Conversely, Figure 8(b) showcases the lateral forces exerted by various combinations of body segments of subject 2, also identified using DPM, within the 5∼10 second timeframe. It is evident that the predominant source of pedestrian lateral forces stems from the HTP combination, accounting for over 80% of the total lateral forces. This outcome aligns with the findings of Carroll’s experiment. Despite the legs and arms contributing to a portion of the overall body mass, employing the HTP combined motion as a representation of the entire human body during walking is deemed reasonable. This result supports the simplified theory of the IPM, while also indicating the feasibility of utilizing a single-point measurement method (such as measuring the CoM acceleration) for conducting crowd experiments. (a)Lateral acceleration of each limb segment of subject 2 obtained by DPM (b) Lateral force of each limb segment of subject 2 recognized by DPM.
Figure 9(a) depicts the variation curve of subject acceleration between the DPM and Carroll’s measurements within the 0∼10 second time interval. Upon comparing the two datasets, it was observed that the lateral force obtained by Carroll exhibited a distribution approximately around 5% of the body weight, with a peak value slightly below 10%. Conversely, the lateral force identified by DPM predominantly exhibited a distribution around 4.5%, with a peak value of approximately 8%. When considering other subjects within DPM and Carroll, the data exhibited a significant level of agreement within this magnitude range. Although slight variations in the shape and numerical values of the curve changes were observed, these deviations can be attributed to differences in the subjects' individual body data and walking habits. Comparison between the data of DPM and Carroll’s experiment: (a) Comparison between recognition data of DPM and Carroll’s experiment (b) Discrete harmonics of lateral forces from Carroll’s experiment; (c)Discrete harmonics of lateral forces obtained by DPM.
In terms of the frequency domain, as illustrated in Figure 9(b) and 9(c), the distribution of the two main frequencies in DPM exhibited a general similarity to Carroll’s direct Ground Reaction Force (GRF) data at lower frequencies. The amplitude at the primary frequency of 0.93 Hz was slightly lower compared to Carroll’s data, and the amplitude of harmonic waves within the frequency band around 3 Hz was slightly smaller than the data obtained in Carroll’s experiment. DPM did not detect the high frequency components (above 4 Hz) of lateral force, which is related to the frame rate of the camera equipment used in this experiment. For higher frequency components, it is possible to capture them using a camera device with a higher frame rate (e.g. 60 fps). As shown in Figure 10, the detection results of the DPM at 30fps and 60fps are compared. Under the same experimental conditions, DPM is able to detect the third frequency component of pedestrian’s lateral force at 60fps. The scholar can select filming equipment with an appropriate frame rate to capture the lateral force of pedestrian based on their research needs. Typically, high frequency components above 4 Hz have low amplitudes and are difficult to identify, while the first two orders of components at low frequencies are adequate for most pedestrian lateral force studies. Recognition results of DPM at different frame rates: (a) 30fps (b) 60fps.
Based on the aforementioned verification, DPM has demonstrated effectiveness in recognizing pedestrian gait on stationary structures. In contrast to conventional contact measurement approaches, DPM leverages visual technology to acquire precise pedestrian gait information which minimizes the disturbances to pedestrians, thereby it ensures that the gait data is accurate and reliable. Furthermore, this method can be employed for further investigations, such as recognizing the gait of a crowd and deriving crowd loading models. The subsequent section presents a preliminary application of DPM - validation of the IPM.
Application of DPM: Verification of IPM
The pedestrian-induced lateral vibration of footbridge constitutes a complex nonlinear problem involving biomechanics. Among various models proposed, the IPM has gained relatively high recognition. However, there still exists some uncertainties, as well as some doubts persisting in synchronous model rather than the IPM being more suitable for explaining pedestrian-induced lateral vibrations. The IPM is not fully accepted by the public. Therefore, more experimental data are needed to support the validity of IPM. Moreover, most current experiments validating the IPM rely on contact gait measurements which tend to cause interference during experimentation, thereby compromising the credibility of test data. In view of this, this section validates the IPM using the DPM, which can supplement the current validation basis. The experimental comparisons in Section 2 demonstrate the reliable and excellent performance of the DPM in pedestrian gait recognition. The notable advantage of the proposed method lies in its non-contact measurement capability, enabling pedestrians to maintain a natural walking state while accurately representing their walking conditions and ensuring data accuracy.
The IPM, initially proposed by Macdonald (2008), incorporates various biomechanical theories and serves as a simulation framework for representing pedestrians as inverted pendulums. In this model, CoM of the human body is sustained by two massless rigid legs, while the point of contact between the rigid legs and the ground is referred to as the CoP, as illustrated in Figure 11. The IPM primarily focuses on the frontal surface motion of pedestrians, which is perpendicular to their forward direction. During each step, the CoP remains fixed, and the motion of the CoM is derived through force balance considerations. Inverted pendulum model (IPM) detail.
As shown in Figure 11, when the step is supported by the right foot, there is the following force balance equation:
In the absence of horizontal structural vibrations, the displacement solution for the CoM as presented in equation (11) is as follows:
Assuming a fixed stride cycle with a smooth transition from one foot to the other, and considering a lateral stride frequency denoted as
Verification of IPM based on DPM
The initial set of experimental data was obtained from subject 1, and the comparison was conducted based on the principles of the IPM. In the absence of structural vibrations, we select a set of parameters closest to those of subject 1 for the simulation of the IPM. The actual measurement indicated a step width of Comparison between data of DPM and IPM (
In the case of another subject, subject 2, the testing procedure involved a different lateral step frequency of Comparison between data from DPM and IPM ( Cross correlation coefficients between DPM and IPM.
Figure 14 depicts the temporal profiles of lateral forces obtained through DPM identification and IPM 's simulation. The results of subject 1’s identification, as presented in Figure 14(a), illustrate that, excluding the initial step adjusting phase, the simulated values from the IPM exhibit a high degree of consistency with those identified by DPM, displaying a similar trend of change. Moreover, the peak forces observed in each step align closely with the measured values obtained through DPM. Comparison of identification data from DPM and lateral force of IPM’s imitation: (a) Subject 1 (
In conclusion, the IPM yields plausible simulation outcomes for pedestrian locomotion on a stationary structure, demonstrating satisfactory agreement in terms of both pedestrian motion and lateral forces.
Conclusion
In this study, we introduce a novel dual-perspective visual recognition technology based on Mediapipe, named DPM, to accurately recognize and capture gait information and lateral forces exerted by an individual pedestrian walking on a treadmill. The walking experiments demonstrate that the HTP combination generates over 80% of the lateral force exerted by pedestrians, confirming the plausibility of its simulation of pedestrian walking and validating the feasibility of single-point measurements in crowded experiments. The DPM identification is highly consistent with Carroll’s experimental results, with some differences primarily arising from individual variations. The frequency domain analysis reveals that the DPM successfully detects the low-frequency components of the lateral force, while failing to fully capture the high-frequency harmonics due to limitations imposed by the frame rate (fps = 30) of the recording device. This limitation can be overcome by employing a camera with a higher frame rate. However, for the pedestrian force, the low frequency components (i.e., first and third order) are dominant, so the low frequency data is sufficient to describe the pedestrian excitation.
Subsequently, after comparative analysis of the experimental data, we further validated the IPM. We select specific parameters of the IPM based on the subject’s body for simulation. It is found that the simulated values of the IPM are highly consistent with the measured data of the subjects, with the same trend of change. And the peak value of the lateral forces is also similar, except for a certain deviation at the beginning of the pace adjustment. The consistent outcomes across various experimental subjects provide compelling evidence that IPM effectively replicates the kinematic characteristics of pedestrians and has wide individual applicability. This implies that IPM can serve not only as a tool for analyzing the motion of individual pedestrians, but also for simulating and predicting the behavior of large-scale crowds, enabling us to anticipate the evolution of crowd dynamics and devise appropriate preventive and reactive measures against lateral vibrations on footbridges. The proposed method has exhibited commendable proficiency in recognizing pedestrian gait information. It is believed that it can also be applied to the recognition of crowd gaits in the future. Nonetheless, the present study has certain limitations: 1) The current implementation of DPM lacks the capability to recognize pedestrian vertical forces. 2) The diversity of experimental conditions should be expanded, for instance, by incorporating a wider range of prescribed step frequencies and walking speeds.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Research on the shear resistance mechanism of a new prefabricated UHPC composite tenon steel composite structure, Guangdong Natural Science Foundation project (general project) (2022A1515011023) and Deepening research on pedestrian induced lateral vibration of pedestrian bridges based on Kuramoto synchronous model, Guangdong Natural Science Foundation project (general project) (2022A1515011703).
