Abstract
Objective:
To develop scanning electron microscopy-based Trainable Weka (Waikato Environment for Knowledge Analysis) Intelligent Segmentation Technology (SEMTWIST), an open-source software tool, for structural detection and rigorous quantification of wound biofilm aggregates in complex human wound tissue matrix.
Approach:
SEMTWIST model was standardized to quantify biofilm infection (BFI) abundance in 240 distinct SEM images from 60 human chronic wound-edge biospecimens (four technical replicates of each specimen). Results from SEMTWIST were compared against human expert assessments and the gold standard for molecular BFI detection, that is, peptide nucleic acid fluorescence in situ hybridization (PNA-FISH).
Results:
Correlation and Bland–Altman plot demonstrated a robust correlation (r = 0.82, p < 0.01), with a mean bias of 1.25, and 95% limit of agreement ranging from −43.40 to 47.11, between SEMTWIST result and the average scores assigned by trained human experts. While interexpert variability highlighted potential bias in manual assessments, SEMTWIST provided consistent results. Bacterial culture detected infection but not biofilm aggregates. Whereas the wheat germ agglutinin staining exhibited nonspecific staining of host tissue components and failed to provide a specific identification of BFI. The molecular identification of biofilm aggregates using PNA-FISH was comparable with SEMTWIST, highlighting the robustness of the developed approach.
Innovation:
This study introduces a novel approach “SEMTWIST” for in-depth analysis and precise differentiation of biofilm aggregates from host tissue elements, enabling accurate quantification of BFI in chronic wound SEM images.
Conclusion:
Open-source SEMTWIST offers a reliable and robust framework for standardized quantification of BFI burden in human chronic wound-edge tissues, supporting clinical diagnosis and guiding treatment.
Sashwati Roy, PhD
INTRODUCTION
Biofilms in chronic wounds act as hostile microbial aggregates as they remain encased within their polymeric matrix cocoons, making them resistant to both antimicrobial therapy and host immune defenses. 1 Biofilms create a persistent inflammatory state that disrupts normal wound healing processes by impairing reepithelialization and angiogenesis. 2 Chronic wounds with biofilms are more likely to develop recalcitrant severe infections, leading to complications such as amputation and tissue necrosis. 3 Without a proper diagnosis, clinicians may rely on systemic antibiotics, which are often ineffective against biofilm-associated bacteria due to their protective extracellular polymeric substance (EPS) matrix. 4 This can lead to prolonged antibiotic use, contributing to antimicrobial resistance without effectively resolving the infection. 5 Biofilms exhibit up to 1,000-fold increased resistance to antibiotics compared with planktonic bacteria, making treatment challenging. 6
Traditional culture-based methods often fail to detect biofilm infection (BFI) in wound tissues, as the biofilm-associated bacteria enter a dormant state, rendering them undetected using conventional microbiological diagnostic techniques. 7 Diagnosing BFI in chronic wounds remains challenging, with no standardized clinical protocol currently established for clinical research practice. 8 –11 Reliable and robust clinical assessment of BFI burden in wounds is critical to determining treatment strategy. 12,13 The extent to which the BFI burden negatively impacts wound healing outcomes depends on the balance between BFI threat and countering defenses, including host factors and therapeutic intervention. 1,3,14,15 The conventional methods, such as colony forming unit (CFU) assays and bacterial culture, can only identify planktonic bacterial growth and may not be reliable in detecting BFI in wounds because of the slow and fastidious characteristics of the growth of biofilm bacteria. 12,16 In the fastidious state, bacteria within biofilm exhibit metabolic dormancy, which slows down their growth. As a result, biofilm bacteria may not reach detectable levels within the set incubation period of 24–48 h of the standard culture method, leading to false-negative results. 17 –19 Of various methods available for qualitative structural BFI detection on abiotic or biotic surfaces, scanning electron microscopy (SEM)-based detection of BFI structure is considered the gold standard. 20 –22 The European Society of Clinical Microbiology and Infectious Diseases recommends using SEM as one of the most reliable diagnostic techniques for detecting biofilms in wounds. 23,24 The high resolution of SEM allows detailed analysis of biofilm structures, allowing direct visualization by achieving a resolution of 50 to 100 nm. 25 This resolution allows for in-depth examination of fine bacterial surface features, biofilm architecture, and nanoscale bacterial interactions. 21 However, in the present study, our primary focus was on identifying biofilm aggregates within human chronic wound-edge tissue. Therefore, the images were captured with consideration of the size of host tissue features observed, which were crucial for training the algorithm for biofilm quantification.
One of the challenges of SEM-based BFI detection in wound-edge tissue is the lack of availability of robust and objective quantification methods. Although the visual system of humans, with expert knowledge, can interpret and provide relative BFI abundance in SEM images from wound-edge tissue, 26 such assessments are primarily subjective, time-intensive, limited by the availability of expert raters, and prone to interobserver variability. 27 –30 This variability often stems from differences in expertise, image noise, and the complex anatomical architecture of tissues. Quantitative methods, such as machine learning (ML)- and artificial intelligence (AI)-based models 31 –33 for image analysis and processing, may help minimize bias and improve reproducibility in biofilm quantification of wound SEM images. Weka, an open-source software tool that enables users to perform ML-based image segmentation, object classification, and feature extraction with minimal programming experience, 33 –35 has been widely used in biological and clinical research to quantitatively analyze various types of microorganisms, cells, and tissues. 36 –40
This study aimed to develop and standardize an objective ML-based approach using the trainable Weka intelligent segmentation technology (TWIST) tool capable of grading BFI burden in the chronic wound-edge tissues into the following clinically relevant categories: none, low, intermediary, or high BFI abundances.
Various ML algorithms within the TWIST framework have been used for several pixel-based binary segmentation tasks across different image modalities, including bacterial counting, growth assessment, and cell segmentation. 41
This gradation of BFI abundance is essential to diagnose BFI promptly, allowing for the implementation of effective treatment strategies and significantly reducing the risk of poor outcomes. A reliable quantification approach is also essential for clinical research studies evaluating antibiofilm treatments, where BFI reduction is a primary outcome. The article has been designed as per the Strengthening the Reporting of Observational Studies in Epidemiology guidelines (Fig. 1).

Schematic illustrating a total of 112 patients enrolled in an ongoing, unreported clinical trial (IRB# 10011, Indiana University). From this primary cohort, a blinded subset of 240 SEM images (four technical replicates from each participant) were obtained from n = 60 (randomly selected) participants. A set of 160 SEM images (130 unique) were used to train SEMTWIST for identifying biofilm and nonbiofilm features within SEM images. Following the training phase, the deidentified dataset of 240 images (n = 60) was used to validate SEMTWIST for quantifying biofilm infection abundance in the chronic wound tissue. A random subset (n = 15 of the n = 60) was selected to optimize SEMTWIST’s performance by comparing its results with those from human rater assessments, WGA staining, culture pathology, and PNA-FISH techniques (bottom right), as specified in subsequent figures. PNA-FISH, peptide nucleic acid fluorescence in situ hybridization; SEM, scanning electron microscopy; SEMTWIST, scanning electron microscopy-based Trainable Weka (Waikato Environment for Knowledge Analysis) Intelligent Segmentation Technology; WGA, wheat germ agglutinin.
INNOVATION
SEM analysis reveals that wound biofilm, shaped by an iterative exchange between microbial and host-defense processes, is a polymicrobial aggregate including host cells. In the present work, we standardized the TWIST algorithm to robustly quantify BFI in SEM images from intricate human chronic wounds, characterizing biofilm distribution within the wound tissue in relation to the surrounding host tissue microenvironment and landscape features.
Clinical problem addressed
Wound biofilm aggregates in chronic human wounds exhibit unique characteristics, and their quantification in clinical samples has long been challenging due to limited understanding of the complex structural interactions between biofilm and the wound tissue. Fluorescence-based techniques such as wheat germ agglutinin (WGA) staining have shown promising results for in vitro biofilm experiments with limited efficacy for detecting biofilms within infected wound tissue samples. 42,43 Advanced methods such as peptide nucleic acid fluorescence in situ hybridization (PNA-FISH) 44 provide insights into the microbial composition and distribution but do not offer details about the structural and topographic interactions between biofilms and tissues that SEM provides. Similarly, reverse transcription-polymerase chain reaction evaluates biofilm-related gene expression but does not offer insights into the spatial organization of the EPS matrix and biofilm morphology. 45 –47 Understanding these complex structural interactions is crucial for accurate and reliable biofilm quantification, ultimately supporting improved diagnostics and treatment strategies. 48
MATERIALS AND METHODS
Methods
Human subjects and tissue collection from chronic wounds
Study design
A blinded subset of 240 SEM images (n = 60 subjects) from an ongoing, unreported clinical trial (IRB# 10011, Indiana University) were used for SEMTWIST optimization and validation. These images were analyzed without the knowledge of subject identity or clinical outcome. This dataset was solely used for SEMTWIST development and is distinct from the primary analysis of this study, which will be reported separately upon completion of trial outcomes. All participants provided written informed consent before inclusion in the primary study (IRB# 10011).
Participants
Subjects in the study were aged 18 years and older with infected chronic wounds from a prospective clinical study. Demographic characteristics of the 60 subjects whose SEM image data were used for SEMTWIST optimization and validation are summarized in Table 1.
Demographic characteristics and wound type of the human participants
SEM sample processing and imaging for biofilm detection
Wound biopsies were collected in a 2.5% glutaraldehyde fixation buffer from patients at the Indiana University wound clinics. For processing, the samples were washed with phosphate-buffered saline and resuspended in ethanol, followed by dehydration using a graded ethanol series (50%, 70%, 90%, and 100%). Subsequently, the samples were treated with hexamethyldisilazane (HMDS) (Ted Pella, Inc., Redding, CA) in combination with ethanol at varying ratios: 100% ethanol: HMDS—3:1, 100% ethanol: HMDS—1:1, and 100% ethanol: HMDS—1:3. Finally, the samples were treated with pure HMDS and left to dry overnight under a hood.
After drying, the samples were desiccated in a vacuum chamber for at least 12 h before imaging. The dried samples were then gold-coated using a Cressington turbo-pumped sputter coater. For biofilm detection and tissue morphology examination, SEM images were obtained using a field emission scanning electron microscope (FESEM) (JEOL 7800F, JEOL Japan) at a beam energy of 5 kV and 5,000× magnification. Images were captured from four different regions/locations within the same tissue sample to ensure comprehensive coverage of the tissue regions. For selection of four regions of the tissue, a low-magnification (60×) image was obtained. The image was digitally segmented into four quadrants, and one representative high-magnification (5,000×) image from each quadrant was used.
Quantitative biofilm assessment in SEM images using TWIST
SEM images obtained from patients’ wound biopsies were analyzed using TWIST software, a plug-in ImageJ-based Fiji software. Image analysis included three major steps as follows:
a. Image preprocessing
All SEM images were saved in .tiff form at 5,000× magnification to have a broader field of view. In each image, the scale bar was removed using scripts with boundary detection function as part of the preprocessing. All the SEM images’ pixel dimensions (2,048 × 2,560) and bit depth (8 bits) were kept constant to enable batch processing. No background subtraction or filtering techniques were applied before segmentation, ensuring that the integrity of the tissue features was preserved for our study.
b. Image classification and segmentation by TWIST training
The TWIST training was conducted using two primary types of SEM images: (i) SEM images without BFI (visible host tissue features only): Before biofilm segmentation, the TWIST software was utilized to highlight and elucidate the architectural features of host tissue in SEM images of wound-edge tissue obtained from patients.
These images served as a baseline for understanding the structural characteristics of the tissue without biofilm interference. (ii) SEM images with BFI (visible host tissue features and biofilm): The biofilm in the image was segmented using TWIST, which uses a suite of ML algorithms for effective segmentation. A suite consisted of a collection of supervised ML algorithms integrated with ImageJ, utilizing user-labeled training data to train a supervised ML classifier (specifically a fast random forest classifier in this article) for complex image segmentation. The algorithm integrated feature extraction functions that enhanced the detection of variations in important SEM image characteristics (such as edges, textures, and intensity). These extracted features served as inputs for classifiers, which learned to differentiate between biofilm and nonbiofilm surface patterns. The trained model was then applied to a new SEM image, enabling automated segmentation with improved accuracy. Relevant pixels within the image were selected and manually drawn using the drawing tool, dividing the image into biofilm and surface classes. The surface class encompassed complex tissue features with varying morphologies and characteristics. The size of the pixel regions was chosen to include all tissue and biofilm key features. The classifier was trained in a semisupervised manner using these samples, with the default classifier (fast random forest) used for all analyses in this study. The pixels could be adjusted or repositioned to retrain the classifier for improved segmentation accuracy. The following feature filters were selected to create a classifier for most images: Gaussian blur, Sobel filter, Hessian, difference of Gaussians, membrane projections, variance, mean, minimum, maximum, and median. 38 This combination provided the most effective segmentation within the shortest time frame. Other Weka settings were kept at default (membrane thickness 1, membrane patch size 19, minimum sigma 1.0, and maximum sigma 16.0). These filters helped in identifying irregular surface features of bacterial biofilms, characterized by their clustered microbial cells and thick EPS matrix by emphasizing the subtle variations in topography, texture, intensity, and contrast within the image. This resulted in a reliable distinction between biofilm aggregates and surrounding host tissue features with enhanced efficiency.
Batch processing was feasible for some images, allowing a single trained classifier to be applied across multiple images for automatic segmentation. For example, one classifier could be created for images without biofilm, focusing solely on tissue features, and another for images with ≥70% biofilm coverage. Due to the complexity of tissue architecture and the variable locations of embedded biofilm, some images were processed using a customized training dataset that was tailored to include distinct features within the images. By selecting relevant pixels, the classifier learns to recognize and differentiate these patterns. This process was verified by carefully zooming in and out of the images to ensure proper feature recognition. A total of 160 SEM images were used to train the software for recognizing both with and without BFI features. Of the 240 images from which data have been provided in Fig. 1D, a set of 30 images, and an additional 130 unique images were used.
c. Image postprocessing
The histogram of each image was analyzed to determine the number of white pixels representing the biofilm area. Using TWIST’s “get probability” feature, images were generated with black and white regions corresponding to non-BFI and BFI classes, respectively. Thresholding was then applied to assign distinct colors to each class in the histogram, yielding the percentage of biofilm coverage in each image. In addition, k-means clustering algorithm was used for postprocessing where the pixels were classified into a predefined number of color clusters. Each pixel was assigned to a cluster, and binary masks were generated for each color class. The percentage of biofilm (a predetermined color class in TWIST segmentation) was calculated by determining the proportion of pixels in that class relative to the total number of pixels in the image.
Quantitative biofilm assessment in SEM images by human raters
SEM images were coded and provided to four biofilm experts (3–4 years of experience with wound BFI and SEM images) for blinded evaluation. A total of 60 SEM images from 15 subjects (four images per subject) were selected for rating. These images were taken from different regions of each sample (left, right, and center) to minimize bias. The experts assessed the biofilm abundance based on the percentage coverage in each image. The categories for biofilm abundance were as follows: ≤10% (no biofilm, with prominent host tissue features visible), 10% to <40% (low biofilm, characterized by few bacteria and associated EPS matrix observed), 40% to <70% (intermediate biofilm, with biofilm structures covering multiple tissue regions), and ≥70% (dense biofilm, where most tissue features are obscured by biofilm structures).
Tissue preparation for PNA-FISH analysis
The PNA-FISH was performed to detect biofilm in formalin-fixed and paraffin-embedded (FFPE) tissue sections. Briefly, the FFPE tissue sections were deparaffinized through xylene and ethanol to water before probe hybridization. Then, the tissue sections were fixed with 4% PFA for 10 min at RT, followed by incubation of 10 min with 50% ethanol. Afterward, the Staphylococcus aureus- and Pseudomonas aeruginosa-specific hybridization solution was added to the section, and hybridization was performed at 57°C for 90 min. Afterward, the tissue sections were washed in a wash solution for 30 min at 57°C. Finally, the nuclei were stained with DAPI solution for 5 min at room temperature, and the slides were mounted with ProLongTM gold antifade mount medium (Invitrogen: P36930). The images were captured using a confocal laser scanning microscopy (LSM 880, Zeiss) and AxioScan Z1 (Zeiss).
Biofilm matrix staining
Biofilm staining using the WGA-Alexa Fluor 488 conjugate was conducted to detect biofilm in FFPE human wound-edge tissue sections, following the previously published protocol. 43
Culture pathology
Wound tissues were obtained from each patient after cleaning the wound surface with saline. The sample was kept in a sterile container and transported to a clinical pathology laboratory for standard bacterial culture-based analysis by inoculating the samples on tryptic soy agar to detect bacterial load.
Image adjustment analysis
We performed sensitivity analysis for TWIST performance on five randomly selected SEM images. In each image, the annotation bar was removed using scripts with boundary detection function as part of the preprocessing using MATLAB script. Using the bicubic interpolation method, 49 the images were downscaled or upscaled to 320 × 256, 480 × 384, 640 × 512, 960 × 768, 1,280 × 1,024, and 2,560 × 2,048 pixels while maintaining the original aspect ratio. The TWIST segmentation was trained on the same instance locations for two classes (BFI and non-BFI) to obtain classification results. The output classified figure from TWIST was processed using k-means clustering.
Statistical analysis
The raw data (n = 15, 60 SEM images) were compiled and categorized into two groups—human raters and TWIST—to evaluate correlation and agreement for BFI abundance data. Regression analysis and correlation methods in GraphPad Prism and Microsoft Excel were used to assess the correlation between groups. Bland–Altman plot 50 was made in R version 4.3.2 to determine agreement levels between individual human rater’s BFI data and the rater average and TWIST. Limits of agreements (LoAs) were calculated as the mean difference ± (1.96 × standard deviation of the difference). In the Bland–Altman plot, the y-axis represents the differences between two measurements, while the x-axis shows their mean value.
RESULTS
SEM-ML tool for BFI assessment
To quantify biofilm infections in chronic wound-edge tissues using SEM images, we used an ML-based approach and validated this method using a known set of biofilm infected wound-edge tissues. SEM images were obtained and analyzed by experts alongside our SEM-ML system (Fig. 2A). To enhance the clinical relevance of the assessment for treatment strategy, the % BFI abundance data output from ML-based quantification was classified into four grades: (i) no BFI, (ii) low (BFIlo), (iii) intermediate (BFIint), or (iv) high (BFIhi) (Fig. 2B). Representative raw SEM images and the corresponding ML-processed images of the wound BFI gradations are shown in Fig. 2C. We observed that SEM-ML analysis was able to reliably detect all the abovementioned grades of BFI by accurately segmenting BFI and non-BFI regions. To establish the feasibility of the BFI gradation approach, we performed an SEM-ML assessment of BFI abundance in 240 unique SEM images obtained from human (n = 60) chronic wound-edge tissues that included four technical replicates for each biospecimen (Fig. 2D). Recognizing that human expert raters are the standard for assessing BFI in SEM images, 26 we conducted a comparison between the results of SEM-ML analysis and those from human assessments using data from randomly selecting 60 images from a total set of 240 images.

Quantification assessment of biofilm infection (BFI) abundance in human chronic wound edge using scanning electron microscopy (SEM)-machine learning (ML).
As shown in Fig. 2E, F, a strong correlation (r = 0.82, p < 0.01) was observed between the results from human experts and those produced by SEM-ML analysis, which was further validated by the Bland–Altman plot (Fig. 2G). In the plot, the mean bias and 95% LoA was 1.25 with −43.40 to 47.11, between the average BFI abundance for expert human raters versus the data from SEM-ML analysis. Nonetheless, correlation analysis revealed interrater variability in the human analysis, highlighting the subjective nature of human assessment (Fig. 2H). Substantial variability between the trained human raters was also noted in the Bland–Altman plot as shown by mean bias and the wide LoAs 17.8 (−33.34 to 68.93) (Supplementary Fig. S1). These findings suggest low reliability and reproducibility of BFI assessment of SEM image datasets by human experts. Interestingly, a strong correlation was noted between the human expert rater averages and the SEM-ML approach, demonstrating the validity and robustness of the SEM-ML-based assessment of clinical BFI abundance in human wound specimens.
These findings also indicate that human expert rating is only valid when performed by >3 raters. In this study, a stepwise analysis procedure of SEMTWIST has been described (Fig. 3). All segmentation results were saved as MACRO test files with MATLAB scripts, which as needed can be reopened and used for future reference. The detailed outline of each step used for processing is described in Supplementary Fig. S2.

Stepwise implementation of biofilm infection (BFI) detection and quantification using trainable Weka intelligent segmentation technology (TWIST), an ML-based approach. A stepwise procedure of BFI quantitative assessment using TWIST (ML) approach, including (
Analysis of image sensitivity in SEMTWIST segmentation
High-resolution SEM images are essential for improving image segmentation accuracy. 51,52 To optimize the resolution of SEM images for input to TWIST analysis, we performed sensitivity analyses (Fig. 4 and Supplementary Fig. S3). Representative raw SEM images and their corresponding TWIST-segmented images at resolutions of 320 × 256, 960 × 768, and 2,560 × 2,048 pixels showed BFI abundances of 42.6%, 26.4%, and 25.8%, respectively. These results demonstrate how changes in pixel density can impact segmentation accuracy and BFI detection within each image, thus influencing quantitative outcomes in BFI analysis (Fig. 4A). In each figure, TWIST was trained on the same instance locations with only variation in resolution to obtain classification results. TWIST performance sensitivity was assessed under conditions of different SEM image resolution. In general, the performance of the ML algorithm decreased at lower pixel densities (Fig. 4B). For high-resolution SEM images (pixel density of 2,560 × 2,048), TWIST-based detection has high accuracy regardless of the extent of biofilm abundance in the image. In cases of low-resolution SEM images, TWIST-based detection was less susceptible to errors when biofilm abundance in the image is high, that is, when >75% of the SEM image is covered by biofilm structures (image 2, Supplementary Fig. S3). However, under conditions of low-resolution SEM images and the rare (<50%) presence of biofilm, TWIST-based detection may not be reliable (Fig. 4C) (image 1, Supplementary Fig. S3). Upscaling images to 2,560 × 2,048 will improve the diagnostic accuracy when comparing images of different pixel densities, particularly when the training data lack sufficient spatial resolution. 51,52

The image pixel density is critical for SEM image biofilm quantification using TWIST.
Training SEMTWIST for specificity
For efficient performance of the TWIST, it is critical to train the software to distinguish between BFI and non-BFI structures in wound-edge tissue by specifically distinguishing host tissue structures. To conduct this, the images were classified into two categories: BFI and non-BFI (tissue features), resulting in binary pixel classification. The constituents of wound-edge tissue, including red blood cells (RBC), white blood cells (WBC), fibrin, activated platelets, and extracellular matrix (ECM), were distinguishable alongside the biofilm (Fig. 5A, B). Given that the biofilm may be embedded within, or interspersed between these structures, accurate segmenting for these structures is necessary. To achieve this, the training of TWIST is crucial to reliably identify and distinguish these tissue elements from the BFI40. Pixel-level training was conducted to ensure precise differentiation between BFI and non-BFI regions, thereby achieving high accuracy in region selection and classification. An example of the accuracy of the segmentation of tissue RBC structure and its discrimination from BFI is illustrated in Fig. 3. Given that the images are derived from chronic human wound-edge tissue samples, notable morphological alterations in RBC shape and ECM topography were observed, as depicted in Fig. 5A (Supplementary Fig. S4). RBCs typically appear as stomatocytes (concave-shaped) but may transform into echinocytes (shell-like with pointed projections), as observed in multiple patient samples. 53,54 The ECM architecture is highly intricate and can sometimes be misinterpreted as biofilm due to protuberances or abnormal topographical modifications on its membrane that occur during ECM remodeling. 55,56 Furthermore, other features observed included activated platelets that appeared as elongated spheres with spicules, neutrophils with a ruffled morphology, and monocytes with pseudopods such as protrusions. In addition, fibrins were identified as thread-like structures. Recognizing these morphological variations is critical for training the software to accurately distinguish biofilm dispersed within these tissue components in an SEM image.

Training the TWIST to recognize the complex and diverse structure of host tissue in a chronic wound environment.
Validation using traditional biofilm methodologies
Microbial biofilm research has relied on CFU counts, culture-based assays, and biofilm staining as standard approaches for quantifying bacterial biofilm in both in vitro and clinical contexts. 16,57 In a cohort of 15 patient samples analyzed in this study, eight showed negative results for the presence of any bacterial species in culture reports. Notably, five of these eight culture-negative biospecimens displayed dense biofilm structures upon SEM examination (Fig. 6A). PNA-FISH specifically identifies bacterial species and displays aggregation characteristics thus helping identify and characterize BFI. 44 The biospecimens with negative culture reports showed the presence of polymicrobial BFI-like structures when examined using PNA-FISH (Fig. 6A), thus validating SEMTWIST findings and indicating that CFU-based culture pathology may not be reliable for the detection of BFI in wound-edge tissues. We tested a widely used staining WGA-Alexa Fluor 488 conjugate for the detection of N-acetylglucosamine in biofilms 42,43,58 –60 and compared it with SEMTWIST BFI assessment (Fig. 6B). WGA staining was not necessarily specific for biofilm in human chronic wound-edge samples (Supplementary Fig. S5). BFI quantification using WGA or TWIST showed a weak correlation (r = 0.24).

Standard wound culture-based analysis is not reliable to detect fastidious and slow-growing biofilm bacteria in human chronic wounds.
DISCUSSION
Conventional SEM and FESEM are the two rigorous methods for high magnification and high-resolution visualization of biofilm morphology. 21 Reading such images requires trained human experts limiting the significance of these analytical approaches as it relates to clinical research use. In recognition of the complex nature of tissue structures in the chronic wound SEM images, we used TWIST, an advanced segmentation, semisupervised ML technique. 40 Deep-learning procedures based on convolutional neural networks have already been utilized to automate the SEM image analysis. However, such solutions have been practiced mostly in the context of material sciences. 61 –64 A few studies have utilized TWIST-based biofilm segmentation using SEM images but primarily for the abiotic surfaces. 38,65 Chronic wound-edge SEM images are complex and consist of diverse features. 53 –56 This work represents a maiden effort seeking in-depth characterization using the SEM-ML approach to identify typical structures of BFI-infected or noninfected features in chronic wound SEM images. Tissue samples were sourced from chronic wound environments (Table 1), where various morphological modifications can be observed in micrographs captured using SEM. The training data construction for ML models accounted for how variability affects model performance. 66 This study included a diverse set of tissue samples with different biofilm types and varying levels of complexity. Such an approach helped develop a robust model with generalizable application.
Human assessments are subject to bias and may overlook certain data types that otherwise can be effectively identified by ML algorithms, especially when performing pixel-level analysis. SEM images often contain noise, artifacts, or irregularities in tissue architecture that can obscure biofilm structures. Evaluators might interpret these visual distortions differently, leading to variability in biofilm quantification. For example, membrane projections or ECM features can be mistaken for biofilm, especially in samples with low biofilm abundance. 55,56 Therefore, while human scores are valuable as a gold standard, it is essential to account for potential biases and discrepancies that may arise from human evaluations. 67 –69 By integrating software capabilities with human assessments and establishing rigorous gradation criteria, the BFI-assessing TWIST-model sets a new benchmark for biofilm quantification. To further strengthen our findings on the concordance between human evaluations and SEM data, this work relied on the highly specific PNA-FISH molecular technique widely recognized for its application to BFI detection. 43 PNA-FISH specifically detected S. aureus and P. aeruginosa involvement in wound BFI.
WGA, a widely used lectin-based stain, is often used for the detection of EPS-encasing biofilm aggregates. 42,43,70 It is important to recognize that WGA has a high affinity for N-acetylglucosamine residues and glycoproteins, which are present in abundance in both bacterial biofilm EPS and in human wound tissue. 70 It is reported that WGA can bind ECM and epidermal cell membranes in tissue sections. 71,72
Fluorescently tagged WGA lectin binds to connective tissues, including muscle, bone, ligament, and tendon. 73,74 The specificity of WGA to specifically detect biofilm has therefore been questioned. 75 While WGA may be useful to detect biofilm on abiotic structures, in a tissue environment it can cross-react with tissue components. In contrast, PNA probes using FISH have been designed and optimized for the detection of several bacteria. When PNA probes are attached to a fluorochrome dye, they can be detected by epifluorescence microscopy or flow cytometry using the FISH method. PNA-FISH has been established as a productive approach to detect and characterize wound biofilm in human tissues. 76 –78
The culture-based technique was used to assess biofilm presence; however, the primary issue in accurately determining biofilm presence using the culture technique can be attributed to the inherent complexity and heterogeneity of biofilm structures. 79 Bacteria within biofilm aggregates are encased in a dense EPS, which poses challenges for effective bacterial separation and quantification. Even with mechanical disruption techniques such as vortexing or ultrasonication, biofilm aggregates often remain intact, leading to skewed results where clumps of bacteria may get erroneously counted as single colonies. Moreover, biofilm bacteria are adapted to anoxic conditions within the EPS matrix. 12 When exposed to aerobic environments on nutrient-rich agar plates, these bacteria may undergo a rapid transition from anaerobic to aerobic conditions. 12,80 This abrupt environmental shift can lead to bacterial stress, reduced viability, or even cell death. 80 In some cases, bacteria may be viable but being fastidious strains, they are nonculturable under conventional laboratory conditions. Consequently, culture results, which typically require 1–3 days to obtain, can lead to an underestimation of biofilm abundance and inaccurate data interpretation, making this method unsuitable for early and precise diagnosis of biofilm presence in the clinical samples.
This work assigns importance to SEM image resolution as it relates to reliable TWIST-based detection of BFI. In this context, it is important to recognize that best results may be obtained from high-resolution images only when an appropriate classification model and interpolation methods are used. 49 In certain medical applications, such as pathological imaging and digital mammography, high-resolution images often need to be downsampled due to the limitations of storage capacity. Therefore, selecting the appropriate image resolution for TWIST involves balancing the desired analysis with the available computational resources, as higher resolutions require significant processing time and power. 81 Our investigation revealed that standardizing image resolution to 2,560 × 2,048 provided optimal results in the given context.
Generalizability
As demonstrated in the current study, where TWIST was trained to differentiate between biofilm and nonbiofilm features in wound SEM images, the same approach can be applied to specialized datasets for studies focused on other tissues such as bone and muscle. As an example for infected bone tissue sample, the TWIST ML algorithm needs to be trained for distinguishing specific features in the SEM images from bone tissue. The model must be trained to accurately identify key structural components, including the layered bone matrix (lamellae), honeycomb-like network (cancellous bone), osteoblasts, osteoclasts, osteocytes, mineralized bone matrix, and canaliculi. 82 By applying the training strategy similar to the approach used that was used for classifying RBCs, WBCs, fibrin, and platelets in wound tissue SEM images, the infected bone model will be able to differentiate distinct features of that tissue, ensuring precise classification and segmentation of bone microstructures from BFI.
CONCLUSION
Complementing these findings, this work provided an ML-based quantitative tool for the determination of the extent of BFI burden in chronic wound-edge tissues leveraging the optimized SEM image resolution and segmentation technique that can deliver reliable quantitative insights into biofilm prevalence within intricate tissue environments.
Footnotes
ACKNOWLEDGMENTS AND FUNDING SOURCES
This work was funded by the
AUTHOR DISCLOSURE AND GHOST WRITING
The authors declare no financial interests or conflicts of interest to disclose.
DISCLAIMER
Views expressed are those of the authors and do not reflect the official policy or position of the Department of Defense/U.S. Government. Identification of specific products or scientific instrumentation does not constitute an endorsement or implied endorsement on the part of the authors, Department of Defense, or any component agency.
ABOUT THE AUTHORS
SUPPLEMENTARY MATERIAL
Supplementary Figure S1
Supplementary Figure S2
Supplementary Figure S3
Supplementary Figure S4
Supplementary Figure S5
Supplementary Table S1
Abbreviations and Acronyms
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
