Abstract
Abstract
Background:
Accurate limb volume measurement is key in the assessment of outcomes in lymphedema microsurgery. There are two commonly used methods as follows: manual circumferential measurement (tape) or Perometer measurement. There are no data on the intra- and interclass correlation of either method, making it difficult to establish a gold standard of limb volume measurement. We aim to assess the intra- and interclass correlation of each method to establish the most appropriate method for clinical practice and future research studies, aiming to compare the accuracy and reliability of tape measurement as assessed against Perometer measurement.
Methods and Results:
Student volunteers and experts (lymphedema practitioners) were each asked to perform repeat tape and Perometer measurements on the upper or lower limb of one healthy volunteer. Perometer measurements were globally more accurate than tape (average SE [Perometer]: 23.23 vs. 77.21 [tape]). For intraobserver reliability, experts outperformed students in all domains tested, with little difference in intraobserver reliability using tape or Perometer (average Cronbach's alpha 0.9597 [expert)] vs. 0.6033 [student]).
Conclusions:
We recommend that, for increased interobserver reliability, the Perometer provides a more reliable standard of limb volume measurement.
Introduction
L
The increasing use of microsurgical techniques, such as lymphaticovenular anastomosis 9 and vascularized lymph node transfer, 10 to treat lymphedema has highlighted the importance of accurate limb volume measurement. Without an accurate assessment of volume, it is difficult to assess the impact of these microsurgical treatments and to monitor their longevity. The lack of clarity on this topic is becoming an increasing problem due to the increasing prevalence of lymphedema. 11 Establishing a gold standard of limb volume measurement is key for assessing the outcome of lymphedema treatment. 12
The difficulties in measuring limb volume are demonstrated by the number of techniques available: circumferential measurement (at predetermined short intervals along the limb, with these measurements being used to calculate volume), optoelectric limb volumetry (Perometer), computer tomography, 3D laser scanning, and various methods of water displacement. 13 It is difficult to know, at present, which measurement is the most accurate. Early work on this subject has validated the Perometer, 14 suggesting that it is a reliable and convenient tool for the measurement of limb volume.
Discrepancies in measurement methods are frequently cited as a significant limitation when comparing data from multiple studies,15–17 hindering evidence based progression in lymphedema management. Although various methods of measuring limb volume have been described which correlate with self-reported symptoms, 18 the lack of agreement between the methods establishes that they are not interchangeable. 19 This study aims to compare the accuracy of tape measurement as assessed against Perometer measurement, as the two most widely used and widely available methods of measuring limb volume. We aim to assess the intra- and interclass correlation of both methods.
Materials and Methods
Ethical approval for this study was gained from the University of Oxford Central University Research Ethics Committee (approval number MS-IDREC-C1-2015-124).
Tape measurements
We use circumferential tape measurements to calculate limb volume. Modeling the limb as a cylinder, we used the equation πr2h, where the radius was measured every 4 cm to take into account the variation in limb radius, and the final calculation was a composite of these measurements. Both limbs were in a constant position (horizontal for arm measurements and vertical for leg measurements) for all measurements.
The Perometer
The Perometer takes measurements every 4.7 mm and uses these to calculate volume.
Student measurements
Videos were made showing how to use the Perometer and how to perform circumferential tape measurements every 4 cm from the ankle to the upper thigh. Medical student volunteers watched these videos and then carried out both Perometer and circumferential measurements on either the right arm or right leg of one healthy subject. The subject was seated between measurements, and all measurements were carried out in the same position (horizontally for arms and vertically for legs). All measurements were repeated once to assess intraobserver reliability.
Expert measurements
To compare the data from lymphedema practitioners (experts), we repeated the above set up at the British Lymphoedema Society (BLS) conference in October 2015. To participate in this study, all “experts” had to be experienced lymphedema practitioners who were members of the BLS. Rather than showing a video, practitioners used the Perometer and performed circumferential tape measurements using their usual method. All practitioners were experienced in both methods. We asked all experts to return to perform a second measurement, to compare the expert measurements with the student measurements and to assess differences in intraobserver reliability between experts and students.
Data analysis
To standardize both data sets, the minimum limb length measurement recorded was taken as a base, and all further measurements above the minimum were excluded so that each measurement started and ended at the same point. To do this, the minimum limb length recorded on the Perometer for both the experts and students was found, and all other Perometer data were standardized to this measurement, using the proprietary PeroPlus software (Pero-System Messgeräte GmbH, Wuppertal, Germany). The minimum length of arm or leg for circumferential measurements was also found and volume calculations did not include measurements exceeding this minimum length. This allowed direct comparison of data between all measurers.
For interobserver reliability, we calculated the mean and 95% confidence intervals (CIs). For intraobserver reliability we used Cronbach's alpha, which measures correlation between measurements. This is reported with two statistical measures: first, the average inter-item coefficient (IC) and second the scale reliability coefficient, reported as alpha. This was calculated after excluding missing repeat values. Generally speaking, an alpha value of 0.7–0.8 is regarded as showing acceptable internal consistency. Good consistency is 0.8–0.9 and >0.9 is excellent.
Results
For the student data, 23 repeated arm measurements and 22 repeated leg measurements were collected, for both tape and Perometer. For the expert tape data, there were 12 arm measurements, 2 of which were repeated, and 14 leg measurements, 6 of which were repeated. For the expert Perometer data, there were 12 arm measurements, 2 of which were repeated, and 13 leg measurements, 4 of which were repeated.
Tape versus Perometer accuracy
Tables 1 and 2 compare the accuracy of measurements using the tape and Perometer, respectively. Graphic representation is shown in Figure 1.

Box plots showing volume measurements for students and practitioners. In students the second measurement is shown for comparison; with practitioners the number of repeats was too low to allow comparison.
Note for practitioner measurements of arm with tape there were only two repeat readings so statistical analysis has not been performed.
CI, confidence interval; SE, standard error.
Note for practitioner measurements of arm with Perometer there were only two repeat readings so statistical analysis has not been performed.
Considering tape measurement of arms, the standard error (SE) was lower in the expert measurements than in the students (average SE 25 [expert] vs. 29.55 [student]), and the expert's CI was narrower. With leg measurements, the SE and CIs were much wider in general, with the students having significantly lower SEs (average SE 63.75 [student] vs. 164.43 [expert]). This is most likely because we asked the practitioners to use their usual method of measurement with tape, rather than teaching a set method as we did with the students.
Compared to tape measurements, the Perometer measurements had, on average, lower SEs and narrower CIs when used by both students and experts (average SE arm measurements [experts and students] 24.88 [Perometer] vs. 28.04 [tape]) average SE leg measurements [experts and students] 22 [Perometer] vs. 114.10 [tape]). The only measurement not to follow this statement was expert measurements of arms, in which the SE of tape measurement was lower than the Perometer reading (SE 25 vs. 38.99). Using the Perometer, experts had higher standard error than students in both arms and legs, which is likely an anomaly due to the significantly lower numbers of expert readings due to time constraints at the BLS conference.
Intraobserver reliability
Table 3 reports intraobserver reliability. We asked experts to use their usual method of tape measurement, rather than teaching a set method as we did with the students, to preserve their expertise; by not using their usual technique they would likely have lost a degree of expertise and likely would have erroneously decreased their intraobserver reliability. As such, the expert's intrarater reliability is significantly better than students.
As previously mentioned, there were no missing values for the student data, hence the unusually low alpha value for the student's leg measurements. This suggests that there is acceptable internal consistency with manual measurements of legs. The expert data show excellent internal consistency when using the Perometer and tape to measure legs (Cronbach's alpha of 0.9009 and 0.9442, respectively), suggesting that expertise with either equipment may be the key to consistency.
As Cronbach's alpha is unreliable with missing data, we have excluded all measurements that were not repeated. Using only the repeated data, there is excellent intraobserver reliability in experts using both tape and Perometer measurements. However, the number of repeat expert readings was low, limiting the interpretation of intraobserver reliability data for experts.
Discussion
In terms of accuracy, on average the SE was smaller and CIs were narrower with the Perometer measurements, suggesting that it is broadly more accurate than tape measurement.
For intraobserver reliability, Cronbach's alpha was >0.9 in all “expert” domains, showing excellent intraobserver reliability. This is as one might expect for trained practitioners. However, a key limitation of this study is the low number of repeats in the expert group. With student data, the numbers are higher and the results much more variable. The student Perometer measurement of legs showed very low internal consistency, which needs further validation.
For interobserver reliability, the experts outperformed the students in all domains tested except tape measurement of legs. This is likely because we invited the experts to use their normal methods of measurement; by preserving their expertise (and therefore their intraobserver reliability) we adversely affected their interobserver reliability in measurements in which margins of error were larger (i.e., circumferential measurement of legs) as they tended to use different starting points to measure from and different techniques for measuring.
Overall, for interobserver reliability, the Perometer appears consistently better than tape measurement. This is a relevant finding, as patients visiting a lymphedema clinic may have their affected limb(s) measured by a different lymphedema practitioner each time. Increasing interobserver reliability using a Perometer would allow more accurate monitoring of the effect of treatment. In addition, the Perometer is universally faster than tape measurement, thus could save a significant amount of clinic time. Although we did not aim to time measurements, we estimate that the Perometer took 2 minutes per limb and tape measurement ∼10 minutes. However, there is significant monetary cost attached to the Perometer, and while it has a function to automatically prescribe garments from the measurements it takes, thus saving further time and increasing the utility of the device, it is not universally available. As intraobserver reliability was high in the expert group for tape and Perometer measurements alike, if a Perometer was not available, using the same trained practitioner for each patient's measurements would also be a reproducible way of assessing treatment outcomes. However, as human factors make this difficult to achieve in a clinic environment, a more accurate and reliable way to monitor clinical response would be the consistent use of the Perometer.
Conclusion
The ability to accurately measure limb volume is increasingly relevant given the increasing prevalence of lymphedema and the variety of microsurgical reconstructive techniques available. The gold standard treatment of lymphedema is controversial. To assess the merits of each treatment in a reliable manner, a reliable way of measuring limb volume must be established.
It is beyond the scope of this study to state a definitive gold standard technique for limb volume measurement, as it compares only the two most widely used and available methods. However, the Perometer gives more accurate results overall and is much quicker to use. Further work is required, with increased numbers of participants and comparison with other commonly used methods of limb volume measurement.
Footnotes
Author Disclosure Statement
No competing financial interests exist.
