Abstract
With the emergence of multiresistant bacteria, the use of bacteriophages is gaining renewed interest as potential antimicrobial agents. The aim of this study was to analyze the structure of three lytic bacteriophages infecting Escherichia coli (SD1, SD2, and SD3) using a gel-based proteomics approach and the cellular response of this bacterium to phage SD1 infection at the proteome level. The combination of the results of 1-DE and 2-DE followed by mass spectrometry led to the identification of 3, 14, and 9 structure proteins for SD1, SD2, and SD3 phages, respectively. Different protein profiles with common proteins were noticed. We also analyzed phage-induced effects by comparing samples from infected cells to those of noninfected cells. We verified important changes in E. coli proteins expression during phage SD1 infection, where there was an overexpression of proteins involved in stress response. Our results indicated that viral infection caused bacterial oxidative stress and bacterial cells response to stress was orchestrated by antioxidant defense mechanisms. This article makes an empirical scientific contribution toward the concept of bacteriophages as potential antimicrobial agents. With converging ecological threats in the 21st century, novel approaches to address the innovation gaps in antimicrobial development are more essential than ever. Further research on bacteriophages is called for in this broader context of planetary health and integrative biology.
Introduction
Enterobacteriaceae are a large family of gram-negative Bacilli, being Escherichia coli one of the most representative species of this family. These bacteria can be found in many ecosystems (soil, water, vegetation, and is part of the normal intestinal flora of humans and animals) (Ben Yahia et al., 2020). Some E. coli strains have developed mechanisms of pathogenicity; they are considered to be the most important opportunistic pathogens, which means that they can cause human and animal diseases (Ben Said et al., 2016). These diseases can be intestinal (diarrhea) or extraintestinal (septicemia, pneumonia, urinary tract infection, and meningitis) (Bolocan et al., 2016). β-Lactams are considered as the most powerful antimicrobial agents both in human and veterinary medicine.
However, resistance to this class of antibiotics has been reported to increase over time, especially in bacteria (Poeta et al., 2009). Indeed, antimicrobials resistance (such as extended spectrum β-lactamases [ESBL]) is due to several factors, including the excessive use and uncontrolled abuse of antibiotics in human health, in veterinary medicine, for the needs of livestock and agriculture over time leading to an increase in bacterial resistance, in particular gram-negative bacteria such as E. coli (Odenthal et al., 2016; Törneke et al., 2015). Antibiotic resistance is considered today a worrying and evolving phenomenon and even a major global public health problem. The importance of the current situation urgently requires another bactericidal alternative of special and collective interest to tackle multiresistant bacteria (Dalmasso et al., 2016).
In this context, scientific research is geared more and more toward phage therapy, focusing on the discovery of new bacteriophages to fight bacterial diseases or to stop the dissemination of multiresistant bacteria (Hamdi et al., 2016). Bacteriophages are viruses that infect only bacteria and multiply within their specific hosts (Blasdel et al., 2017). They are considered to be the most abundant life form, it is estimated that there are around 1031 phage particles worldwide (Savalia et al., 2008; Wommack and Colwell, 2000). Nevertheless, before the investigation and the exploitation of bacteriophages in therapy, understanding the mechanisms of interactions between bacteriophages and their hosts is a preliminary and fundamental step to evaluate its safeness, effectiveness, and the potential to be applied.
Over the past decades, proteomics has become an important tool for the study of complex biochemical processes, the discovery of new proteins and the investigation of protein–protein interactions (Brötz-Oesterhelt et al., 2005). In this sense, phage proteomics can be an asset to the preclinical study of bacteriophages, delivering data about the interdependencies between viral proteins and host bacterial proteins during the infectious cycle. Also, the proteome is more dynamic and more complex than the genome, which prompts us to do a proteomic analysis to identify the protein composition of a given cell under a specific set of conditions (Brötz-Oesterhelt et al., 2005).
In this study, a gel-based proteomics approach was performed to study the structure of three novel lytic bacteriophages infecting ESBL-producing E. coli and the cellular response of this bacterium to phage infection at the proteome level.
Materials and Methods
Bacterial strain and phage isolation
The ESBL-producing E. coli C3570 was used as a host for three lytic bacteriophages. The host bacterial strain used for the phages isolation in this current study has been described and was preliminarily characterized by conventional biochemical methods and by specific polymerase chain reaction (PCR) (amplification of the uidA gene), and screened for the ESBL phenotype by double-disk synergy test in a previous publication (Ben Sallem et al., 2015). Phages SD1, SD2, and SD3 were isolated from wastewater samples from Tunisia, using a method previously described (Hamdi et al., 2016), and the phage purification step was repeated three times.
Bacterial proteins extraction
ESBL-producing E. coli C3570 cells were streaked on brain-heart infusion (BHI) plates and grown at 37°C. Single colonies of this strain were transferred into an Erlenmeyer flask in 300 mL of stirred liquid BHI for 15 to 16 h to reach the highest point of the growth curve. Cells were harvested from the exponential phase in all experiments (An optical density [OD]540 = 6 corresponds to about 2–3 × 109 CFU/mL).
The samples were handled in this way: only the bacteria and the bacteria infected with filtered phage. The cells were pelleted down at 10,000 g for 3 min at 4°C. The pellet was resuspended in 4 mL of prewarmed phosphate-buffered saline pH 7.4. After new centrifugation (at 10,000 g for 3 min at 4°C), the pellet was then resuspended in about 0.5 mL of 2% (w/v) sodium dodecyl sulfate (SDS), 250 mmol/L Tris at pH 9.5. The mixture was sonicated with an ultrasonic homogenizer (Vibra-Cell™ 75186) (four times for 10 sec at 30%). The disrupted cells were centrifuged (14,000 g) for 30 min at 4°C. Then, 500 μL of the supernatant were precipitated in ice-cold 1.5 mL of 20% (w/v) trichloroacetic acid (TCA) in acetone during 1 h at −20°C. After centrifugation (13,000 g, 20 min, 4°C), the pellet was resuspended in 200 μL of ice-cold acetone and centrifuged again. This step was repeated at least three more times discarding the supernatant.
The pellet was left to dry overnight at room temperature. Finally, the dried pellet was solubilized in 200 μL of solubilization buffer containing 4% (w/v) 3-[(3-cholamidopropyl)dimethylammonio]-1-propanesulfonate (CHAPS), 7 mol/L urea, 2 mol/L thiourea, 1% (v/v) immobilized pH gradient (IPG) buffer, and 20 mmol/L 1, 4-dithiothreitol (DTT). Total solubilization was achieved using sonication (four times for 10 sec at 30 sec on ice). Protein concentration was assayed using the 2-D Quant Kit (GE Healthcare, Buckinghamshire, United Kingdom) following the manufacturer's instructions (Ribeiro et al., 2015).
Phage concentration
The phage concentration was performed as described elsewhere (Boulanger, 2009) with some modifications. In brief, to the bacteriophage filtrate was added sodium chloride (NaCl, 0.5 mol/L) and the mixture was incubated at 4°C for 1 h. This step promotes dissociation of phage particles from bacterial debris and is required for the next step of precipitation with polyethylene glycol (PEG). After incubation, the suspension was centrifuged at 10,000 g for 10 min at 4°C. PEG 8000 (Sigma-Aldrich) was added to the supernatant to a final concentration of 10% (w/v) and incubated overnight at 4°C to precipitate phage particles. Finally, phage particles were sedimented by centrifugation at 10,000 g for 20 min at 4°C and the supernatant was carefully discarded.
Phage proteins extraction
The pellet recovered from the concentration step was suspended in a solubilization buffer [1% (w/v) SDS and 25 mM Tris-HCl]. Total solubilization was achieved using sonication (four times for 10 sec at 30%) on ice. The sample was heated at 100°C for 3 min. After centrifugation (10,000 g, 10 min, 4°C), the supernatant was recovered. Protein was then precipitated using 2-D Clean-Up Kit (GE Healthcare Life Sciences) following the manufacturer's instructions and solubilized in 4% (w/v) CHAPS, 7 mol/L urea, 2 mol/L thiourea, 1% (v/v) IPG buffer, and 20 mmol/L DTT. The mixture was sonicated and centrifuged at 12,000 g for 5 min at 4°C. Protein concentration was assayed using the 2-D Quant Kit (GE Healthcare) following the manufacturer's instructions (Ribeiro et al., 2015).
One-dimensional gel electrophoresis (1-DE)
In brief, protein samples from ESBL-producing E. coli C3570 host strain, phages, and host bacterial cells mixed with the filtrated phage SD1 obtained as above-mentioned were solubilized in a buffer containing 2% (w/v) SDS, 40% glycerol, 10 mmol/L Tris-HCl at pH 8.0, and 0.02% (w/v) bromophenol blue and separated in a resolving gel using 12.52% T (total monomer percentage) and 0.97% C (weight percentage of crosslinker). The 1-DE were stained with Coomassie Blue R-250 for 24 h and then washed in water overnight. Coomassie-stained gels were scanned with a flatbed scanner (Umax PowerLook 1100, Fremont, CA, USA) (Ribeiro et al., 2015).
Two-dimensional gel electrophoresis (2-DE)
Isoelectric focusing (IEF) of ∼300 μg of protein was performed using an IPG strip of pH 4–7 (13 cm) on an Ettan™ IPGphor II™ system (Amersham Biosciences) using the conditions described elsewhere (Chibani-Chennoufi et al., 2004) and summarized as follows: a first step of active rehydration was performed at 50 V for 12 h, followed by IEF at 500 V for 1 h, a gradient up to 1000 V for 1 h, a gradient up to 8000 V for 2 h 30 min, and finally, 8000 V for 30 min. Focused IPG strips were equilibrated twice for 15 min in equilibration buffer [(6 mol/L urea, 30% (w/v) glycerol, 2% (w/v) SDS in 0.05 mol/L Tris–HCl buffer pH 8.8)]. In the first equilibration step, 1% DTT was added to the original equilibration buffer and 4% iodoacetamide in the second step. Bromophenol blue was also added to both solutions.
The equilibrated IPG strips were gently rinsed with SDS electrophoresis buffer, blotted to remove excessive buffer, and then applied to SDS-polyacrylamide gels (T = 12.52%, C = 0.97%). After SDS-polyacrylamide gel electrophoresis (SDS-PAGE), the 2-DE gels were fixed in 40% (v/v) methanol/10% (v/v) acetic acid solution for 1 h and then stained overnight in Coomassie Brilliant Blue G-250.
Excess stain was removed by rinsing the gels with 40% (v/v) methanol solution. Coomassie-stained gels were scanned with a flatbed scanner (Umax PowerLook 1100, Fremont, CA, USA) and the digitized images were analyzed using Lab Scanner Image Master 5.0 software (Amersham Biosciences; GE Healthcare) and Progenesis SameSpots v4.5 (Nonlinear Dynamics Limited, Newcastle, United Kingdom). Protein patterns were the result of triplicate protein extractions and three 2-DE replicates. The reference gels are shown (Ramos et al., 2016).
Protein identification by mass spectrometry
Selected 2-DE gel pieces and 1-DE bands were excised and prepared for tryptic digestion. In-gel protein digestion was performed as previously described (Ribeiro et al., 2020). 2-DE spots were reduced with 10 mM DTT and then they were alkylated with 55 mM iodoacetamide. They were distilled with 25 mM ammonium bicarbonate and 5% acetonitrile for 15 min, followed by another distillation with 25 mM ammonium bicarbonate, 50% acetonitrile for 30 min two times. Spots were dehydrated with 100% Acetonitrile for 10 min. After dehydration, the acetonitrile was removed, and the spots were dried in a SpeedVac for about 10 min.
Then, 100 ng of Trypsin (Promega) in 25 mM Ammonium was added to each spot and incubated at 37°C overnight. Peptides were further extracted from the gel pieces by adding of Acetonitrile 100% to the digestion solution (final concentration of acetonitrile around 45–50%) followed by sonication for 10 min. The peptides containing solution was transferred to new tubes. Before nanoscale liquid chromatography coupled to tandem mass spectrometry (nanoLC-MS/MS) analysis, the samples were dried in a SpeedVac for around 30–45 min. They were sonicated in 12 μL of trifluoroacetic acid (TFA) 0.05% for 10 min and transferred into vials. Mass spectrometry scans were performed using a LTQ-Orbitrap Velos, from Thermo Scientific, with resolution of 15,000. MS was operated in positive ion. Spectra were recorded with m/z range of 375–1400. MS/MS scans were acquired in Fourier transform mass spectrometry (FTMS) with resolution of 7500. So, 5 μL of sample were injected in precolumn (C18 PepMap300, 5 μm, 300 Å, 300 μm × 5 mm).
Then, the sample was separated in a separation Column Acclaim PepMap 100, C18, and 75 μm × 25 cm nanoViper (Thermo scientific) using the following conditions: 30 min run, a linear gradient from 4% to 40% B over 15 min and from 40% to 90% B through 1 min (A: H2O, 0.1% formic acid, 5% dimethylsulfoxide [DMSO]; B: acetonitrile, 0.1% formic acid, 5% DMSO; C: H2O, 0.05% TFA). Proteins were identified using Mascot Daemon (v. 2.5.1) software and Mascot search algorithm. MS and MS/MS spectra were searched against a database containing 206,156 protein sequences retrieved from UniProt (date: February 26, 2020). The protein sequences were from E. coli, Escherichia phages as well as other bacteriophages.
The following MASCOT parameters were applied: precursor mass tolerance of 20 ppm, fragment tolerance of 0.5 Da, trypsin specificity with two missed cleavages, carbamidomethylation of cysteine, oxidation of methionine, and deamidation (NQ) were set as variable modification. False discovery rate (FDR) was adjusted to 1%, and at least two peptides were necessary to consider protein identification.
Statistical and image analysis
To study the cellular response of the ESBL-producing E. coli C3570 host strain to phage SD1 infection at the proteome level, the Coomassie-stained gels were scanned and the digitized images were analyzed using Progenesis SameSpots v4.5 (Nonlinear Dynamics Limited) to identify bacterial proteins differentially expressed after phage infection stress. The two conditions were studied with triplicate protein extractions and 2-DE for each condition; Condition 1, control protein samples from noninfected E. coli C3570 host strain, and Condition 2, bacterial cells subjected to phage SD1 infection, were compared with each other, and through a statistical analysis using the SameSpots software, those spots with an analysis of variance (ANOVA) p-value ≤0.05 and a FDR q-value ≤0.05 were selected for further analysis by mass spectrometry. Figure 1 shows, as an example, the results obtained for spot 916.

Statistical image analysis. The results obtained for spot 916, as an example, showing one replicate for the different conditions under study: condition 1 (bacteria) and condition 2 (bacteria infected with the filtrated phage SD1)
Results
Phage structural characteristics
The structural protein composition of the three phages under study was initially made through the separation of protein bands by 1-DE and their further analysis and identification by nanoLC-MS/MS. The 1-DE analysis of SD1, SD2, and SD3 gave rise to 13, 17, and 10 Coomassie-stained bands, respectively, ranging from 6.5 to 212 kDa. Of these, three structural proteins were successfully identified for each phage (SD1, SD2, and SD3) (Fig. 2A and Table 1). For SD1, sequence coverage ranges from 17% to 71%; Major capsid protein, Tail sheath monomer, and Putative baseplate wedge tail fiber connector were identified (Fig. 2A and Table 1).

Mass Spectrometry-Identified Protein of the SD1, SD2, and SD3 Phages 1-DE Profile
Regarding SD2, the sequence coverage was similar to that obtained for SD1 (24–64%), the RNA polymerase ADP-ribosylase, Major capsid protein, and Putative baseplate wedge tail fiber connector were identified. With a sequence coverage of 25–56%, major capsid protein, the Peptidase S74 domain containing protein, and the Putative long tail fiber were identified in SD3 bacteriophage (Fig. 2A and Table 1). The most abundant protein for phages SD1 and SD2 was the Major capsid protein, while for phage SD3, they were the Peptidase S74 domain-containing protein and Putative long tail fiber.
Some bacterial proteins were also identified in some bands. To further analyze the proteome of the bacteriophages, we performed 2-DE before the analysis by mass spectrometry. The different proteins identified by this approach are listed in Table 2 and Figure 2 B–D, which gather information about the accession number, molecular weight, Mascot score, sequence coverage, the number of significant unique sequences, and the predicted function of the proteins.
Mass Spectrometry-Identified Protein of the SD1, SD2, and SD3 Phages 2-DE Profile
The use of pH 4–7 IPG strips resulted in a well-spread display of protein spots, which made spot excision and image identification more accurate. From the 2-DE profile of the phage SD1, it was possible to identify three distinct proteins, namely the Tail sheath monomer, the Capsid vertex protein, and the Major capsid protein, with a sequence coverage ranging from 42% to 54% (Table 2 and Fig. 2B).
The analysis of phage SD2 detected 14 structural proteins with a sequence coverage ranging from 32% to 69%, including proteins of Dc2, RNA polymerase ADP-ribosylase, Thymidine kinase, Putative long tail fiber proximal subunit, Major capsid protein, Tail sheath monomer, Capsid vertex protein, Putative baseplate wedge subunit and tail pin, Baseplate wedge subunit, Putative tail fiber protein, Inhibitor of prohead protease, Prohead core scaffold protein, Portal protein, and the Fibritin_C domain-containing protein (Table 2 and Fig. 2C). For SD3 phage, nine structural proteins were detected with a sequence coverage ranging from 50% to 74%. These proteins included uncharacterized protein, Tail tube protein, DarA, Putative baseplate wedge tail fiber connector, Tail sheath monomer, Capsid vertex protein, Major capsid protein, Structural protein, and the Putative major head protein (Table 2 and Fig. 2D).
During extraction, the use of reducing agents can result in the separation of different subunits of the identified protein, which despite corresponding to different spots are part of the same protein (Ribeiro et al., 2020). In the 2-DE gels of protein extracts from the three phages, different spots corresponding to the same protein are broadly distributed across the gel, with different isoelectric points and molecular weights probably due to posttranslational modifications affecting molecular mass and/or isoelectric point (Roberts et al., 2004) (Fig. 2). For SD2 proteins, the Putative long tail fiber was identified with two molecular masses of 49.9 and 136.5 kDa (Table 2 and Fig. 3B).

2-DE spot pattern of proteins from the host bacterial cells infected by phage SD1. The insets show differentially expressed proteins, which were further analyzed by MS (Fig. 4).
For SD3 proteins, the Major capsid protein comes in two alternative forms of 37.7 and 56.5 kDa. Two alternative forms for the Tail tube protein were identified with molecular mass of 17 and 18.3 kDa (Table 2 and Fig. 2D). Probably, the two alternative forms of Putative long tail fiber of SD2 phage are two subunits of the same protein. These two proteins split into minor subunits, which were previously stabilized by noncovalent interactions (Bishop et al., 1974). Same case for the two proteins of SD3 phage that have different alternative forms (the Major capsid protein and the Tail tube protein). Possibly, theses subunits are parts of a bigger protein. In addition, the physical proximity between the excised spots can also explain these results (Ribeiro et al., 2020).
Phage-infected E. coli characterization
Protein samples from ESBL-producing E. coli C3570 host strain (condition 1) and host bacterial cells subjected to phage SD1 infection (condition 2) were distinctly separated and most formed a single spot on the 2-DE gel. There was a significant change in the 2-DE profile of the condition 2 when compared to the control (condition 1). In this sense, we performed an image analysis from three independent replicates for the two conditions under study using the SameSpots software (Nonlinear Dynamics Limited). The spots that presented significant differences between the two conditions (ANOVA p-value ≤0.05 and a FDR q-value ≤0.05) were physically located in the gels for excision and further analyzed by mass spectrometry (Fig. 3).
The nanoLC-MS/MS analysis detected 4 bacterial proteins that were significantly less expressed when the cells are infected by the virus, namely the Inorganic pyrophosphatase (19.7 kDa, sequence coverage of 48%), the RNA polymerase-binding transcription factor DksA (17.5 kDa, sequence coverage of 58%), the Aconitate hydratase B (93.4 kDa, sequence coverage of 59%), and D-ribose transporter RbsB (30.9 kDa, sequence coverage of 67%) (Fig. 4).

Differentially expressed proteins identified in the proteome of the host bacterial cells infected by phage SD1. Dark and light shading are used to facilitate data analysis through a gradient proportional to the fold change. aSpots are shown in Figure 3; MW, molecular weight; MS coverage, percentage of amino acids in sequence matched by peptides detected by mass spectrometry.
On the contrary, 10 bacterial proteins were found upregulated after phage infection. These proteins have a sequence coverage ranging from 45% to 80% (Figs. 3 and 4). The proteins identified included Putative glucose-6-phosphate1-epimerase (32.6 kDa, 45%), Aspartate ammonia-lyase (52.3 kDa, 48%), Two-component response regulator (27.2 kDa, 56%), Phosphoglycerate kinase (PGK) (41 kDa, 57%), Cysteine desulfurase IscS (45 kDa, 60%), Peptidase PmbA (47.9 kDa, 62%), Superoxide dismutase (SOD) (21.2 kDa, 62%), Formate dehydrogenase-H (62.4 kDa, 65%), ADP-L-glycero-D-manno-heptose-6-epimerase (34.8 kDa, 73%), and Formate dehydrogenase-H (62.4 kDa, 80%).
Of the proteins showing an increase in their expression under phage infection, most were assigned as having functions in stress response. For example, the SOD (Ramos et al., 2016) as well as Formate dehydrogenase H that acts as a stress protein and is involved in stress oxidative tolerance (Iwadate et al., 2017). Furthermore, Cysteine desulfurase IscS is other protein identified in the host bacterium E. coli with an increased expression under phage infection stress. This protein functions as a selenium delivery protein in the pathway for the biosynthesis of selenophosphate (Schwartz et al., 2000).
In addition, the PGK is a multifunctional enzyme that is involved in stress tolerance. The PGK is a glycolytic enzyme that catalyzes one of the two ATP-producing reactions in the glycolytic pathway, through the conversion of 1,3-bisphosphoglycerate to 3-phosphoglycerate (Rojas-Pirela et al., 2020). The level of expression of this protein was increased during host bacterium infection by the phage SD1. Aspartate ammonia-lyase belongs to the family of lyases, which has catalytic activity by catalyzing the reversible conversion of L-aspartic acid to fumarate and ammonia (Rudolph and Fromm, 1971), and Peptidase PmbA has a metallopeptidase activity and is involved in the regulation of posttranscriptional activity during stress condition (Chandrasekhar et al., 2014); both are enzymes with increased expression upon infection of E. coli by phage SD1.
Thus, an overexpression in the levels of the putative glucose-6-phosphate1-epimerase (catalytic activity) (Wurster and Hess, 1973), ADP-L-glycero-D-manno-heptose-6-epimerase (catalyzes the interconversion between ADP-D-glycero-beta-D-manno-heptose and ADP-L-glycero-beta-D-manno-heptose via an epimerization at carbon 6 of the heptose) (Morrison and Tanner, 2007), and the two-component response regulator (it is a system that detect and respond to changes in many adverse environmental conditions such as oxidative stress) (Stock et al., 2000) levels was also noted during infection with the SD1 phage.
Discussion
This study assessed the proteome of three lytic bacteriophages SD1, SD2, and SD3 as well the differentially expressed proteins of their host strain extended spectrum ß-lactamase-producing E. coli C3570 when infected by SD1 bacteriophage.
Phage proteome
The SD1 and SD2 phages share with the SH7 phage the same sizes of the following structural proteins: Major capsid protein (56.5 kDa), Tail sheath monomer (72.5 kDa), and the RNA polymerase ADP-ribosylase (76.2 kDa) (Hamdi et al., 2017). Interestingly, other than the phage SD1 and SD2 structural proteins, RNA polymerase ADP-ribosylase was determined with a relatively high sequence coverage value (43%), which corresponds to an enzyme not usually present in the phage structure. The RNA polymerase ADP-ribosylase was also detected by mass spectrometry in SH7 and AR1 phages (Hamdi et al., 2017; Liao et al., 2011).
The ADP-ribosylase targets host RNA polymerase, and it is not essential for phage development (Goff and Setzer, 1980; Wilkens et al., 1997). Thus, it is possible that this protein was identified in our proteome data due to copurification with the phage particles (Goff and Setzer, 1980; Wilkens et al., 1997). The increase in gene expression after phage infection is under the control of ADP-ribosylation of RNA polymerase. However, the presence of this enzyme is not necessary for the phage development, but it plays an important role in shutting off host transcription; it is a virion component incorporated into the phage head.
While for phage SD2, the difference is obvious in the size of these two proteins: Putative long tail fiber, this protein is present in two alternative forms (49.6, 136.5 kDa), and Prohead core scaffold protein (29.5 kDa). However, SH7 proteome characterization showed that the two previously mentioned proteins have the following sizes 140 and 16 kDa. The SD2 phage also differs from SH6 phage by the Portal protein size (Hamdi et al., 2017).
The major difference between the proteome of phages SD2 and vB_Eco4M-7, which was previously described by Necel et al., 2020, was the protein size. Nevertheless, this study showed that there is a correspondence in the identified proteins of these two phages. Citing the SD2 identified sizes: Tail sheath protein (72.5 kDa), Portal protein (60.4 kDa), Baseplate wedge subunit (37.6 kDa), and Prohead core scaffold protein (29.5 kDa). However, vB_Eco4M-7 values were as follows: 54.8, 88.7, 12.8, and 41.7 kDa, respectively. On the contrary, the results obtained in this work were different to those reported by Xu et al., 2018, SD2 and vB_EcoS-B2 phages have one single common protein but with different size. For phage SD2, the Putative tail fiber protein presented 54.2 kDa and phage vB_EcoS-B2 presented 125.8 kDa (Xu et al., 2018).
For phage SD3, the results of the Major capsid protein (37.7 kDa) are in agreement with the data described by Yazdi et al., 2020, who reported a Major capsid protein of VB_EcoS-Golestan infecting multidrug-resistant E. coli with a similar size. In addition, the Putative major head protein (37.8 kDa) reassembles with that of vB_EcoS-B2 (Xu et al., 2018). On the contrary, the SD3 and vB_EcoS-B2 phages have another common protein, but with different size. For phage SD3, the Putative tail fiber protein (54.1 kDa) and the Structural protein (50.2 kDa) had different sizes when compared to the phage vB_EcoS-B2 counterparts, 125.8 and 130.5 kDa, respectively (Xu et al., 2018).
The comparison between the structure proteins identified of the SD3 phage and that of vB Ec4M-7 phage showed a similarity of one protein with only a difference in protein size. The tail sheath monomer is the only common protein between SD3 and vB Ec4M-7 phages (Necel et al., 2020). Again, the only difference is in the protein size. The protein of SD3 phage had the size of 72.5 kDa, unlike vB Ec4M-7 phage that presented 54.8 kDa. The current study indicates that our results are in accordance with the previous studies. In general, the phages share the same proteins, presenting some important differences regarding its size.
The SD3 phage has two alternative forms for the Major capsid protein 37.7 and 56.5 kDa. The second form is like that of SH7 phage (Hamdi et al., 2017). Also, they have in common the Tail sheath monomer (72.5 kDa), while they have another common protein but with different size. In SH7 phage, we find the Putative baseplate wedge tail fiber connector (23 kDa), but this value is different in SD3 phage (30.4 kDa).
The identification of two or more alternative forms for a protein can be due to the separation of different subunits of the same protein. These subunits are widely distributed on the gel and can be linked by covalent or noncovalent bonds resulting in the formation of a large protein (Ribeiro et al., 2020; Stone et al., 2019). For example, capsids are protein shells that surround and protect the viral genome. Capsid proteins often self-assemble with a quasi-equivalent arrangement of individual subunit. The capsid subunits use similar interactions throughout the assembly (Fokine and Rossmann, 2014; Stone et al., 2019).
For the phage SD3, as reported in the results section, we found two alternative forms of the Major capsid protein (56.5 and 37.7 kDa). Possibly, these subunits are parts of a bigger protein. Our results are in agreement with Fokine and Rossmann, 2014, and Stone et al., 2019, who demonstrated that the major capsid protein is a set of subunits arranged and linked by hydrogen bonds and salt bridges, which stabilize intracapsomeric interactions. So, these noncovalent interactions between the Major capsid protein subunits provide rigidity of the protein structure (Fokine and Rossmann, 2014; Ross et al., 2005; Stone et al., 2019).
Furthermore, bacteriophage tails (long tail fiber and short tail fiber) are fascinating molecular machines created to recognize the host cells, penetrate the cell envelope barrier, and deliver DNA into the cytoplasm (Fokine and Rossmann, 2014). The tail fibers are a complex protein structure that functions at the beginning of the infection process (Hyman and van Raaij, 2018). For SD2 proteins, the Putative long tail fiber was identified with two molecular masses of 49.9 and 136.5 kDa, which is in accordance with the results of Bishop et al., 1974. The putative long tail fiber protein is a large protein; its assembly takes place by noncovalent bonds between structural subunits, suggesting a rigid joining and structure stability.
In addition, Arnaud et al. (2017) demonstrated that two forms of tail tube protein were detected in E. coli T5 phage. This result is in accordance with our data as two forms of tail tube protein were found with two different sizes (17.1 and 18.3 kDa). According to the works of Langlois et al., 2015, and Špakova et al., 2019, the Tail tube protein is constituted by stacked rings of subunits and these subunits are connected by stable hydrogen bonds. This demonstrates that the two forms are two subunits interconnected by hydrogen bonds to form a bigger molecule of tail tube protein.
Our results strongly suggest that these three proteins that presented two alternative forms (Major capsid protein, the Putative long tail fiber, and the Tail tube protein) are part of bigger proteins stabilized by noncovalent interactions, which split into minor subunits. In fact, in the 2-DE analysis, the Major capsid protein, the Putative long tail fiber, and the Tail tube protein are three structural proteins, of which none resolved into a single discreet spot (Roberts et al., 2004). Obtaining two alternative forms of each of these three proteins may be due to posttranslational modifications, this hypothesis in agreement with the previous results of Martin et al., 1976, and Roberts et al., 2004.
Among the structural proteins identified in phage SD3, we found the Peptidase S74 (101.9 kDa). This protein is a chaperone of Endosialidase, which acts as a tail spike protein. This chaperone is responsible for the host polysialic acid capsule recognition, binding and degradation activity (Stummeyer et al., 2005).
The proteome characterization of SD1, SD2, and SD3 phages identified a variety of tail fiber proteins (Putative tail fiber proteins, Putative baseplate wedge tail fiber connector, Putative baseplate wedge subunit and tail pin, Tail tube protein, and Putative long tail fiber). Tail fibers in the phage tail play a primordial role in the initiation of the phage coupling with its bacterial receptors. In addition, these fibers have a role in the host specificity (Li et al., 2016; Yazdi et al., 2020). It is only after successful adsorption that a phage is properly posed to release its genetic material into the cytoplasm of the host cell, in which the viral infectious cycle can continue. Completion of adsorption step is the key signal of DNA injection (McPartland and Rothman-Denes, 2009).
Before phage genomic material delivery by the Tail Sheath protein, there will be attachment and coupling step with the bacterial receptors located in the host outer membrane envelope. This fixation is provided by phage basal plate that is decorated by some tail fiber proteins (Arisaka et al., 2016). Based on our results, the tail sheath protein presented a high sequence coverage (57%). This protein is the main element that allows the delivery and release of the genetic material of the phage through the host cell envelope, in other words, it is the way of the establishment of a direct connection between phage-host also as an event signaling for DNA injection (Kurochkina et al., 2018; McPartland and Rothman-Denes, 2009).
Thus, the Prohead core scaffold protein has an outer shell, which is formed by the major capsid protein, and it has an inner core made from scaffold proteins. Usually, the Prohead has a protease, which is activated during the maturation of the capsid to destroy the inner core and free up space for the genome (Fokine and Rossmann, 2016). While the portal protein multimerizes as a single ring-shaped homododecamer arranged around a central channel by forming the portal vertex of the capsid, this protein has different critical roles in head assembly, viral DNA genome packaging, neck/tail attachment, and genome ejection through host cell envelope (Hua et al., 2014; Rao and Feiss, 2008).
Cellular response to phage infection
The phage infection stress changed the expression of several E. coli proteins. A downregulated expression of Aconitate hydratase B (fold of −0.72), D-ribose transporter RbsB (fold of −0.53), Inorganic pyrophosphatase (fold of −2.2), and RNA polymerase binding transcription factor DKsA (fold of −0.41) was noticed.
The Aconitate hydratase B in E. coli is involved in the catabolism of short-chain fatty acids via the tricarboxylic acid and the 2-methylcitrate cycle I. This protein catalyzes the reversible isomerization of citrate to isocitrate via cis-aconitate. Also catalyzes the hydration of 2-methyl-cis-aconitate to yield (2R,3S)-2-methylisocitrate (Tang et al., 2002). Several studies demonstrated that the Aconitate hydratase B serve as a protective buffer against the basal level of oxidative stress that accompanies aerobic growth by acting as a sink for reactive oxygen species and by modulating translation of the sodA transcript (Lopez-Campistrous et al., 2005; Tang et al., 2002; Tonella et al., 2001; Yan et al., 2002).
D-ribose transporter RbsB is a ribose-binding protein, involved in molecules transporter. Similarly to our results, there was a dramatic reduction in the level of RbsB in Bacillus subtilis under phage φ29 infection (Mojardín and Salas, 2016). D-Ribose is one of the metabolites that bacteria can actively transport into the cell to use as carbon and energy source. Previously reported work demonstrated that in phage infection stress, this protein was downregulated (Mojardín and Salas, 2016).
The microbial Inorganic pyrophosphatase plays a primordial role in macromolecular biosynthesis and is essential for the viability of E. coli (Triccas and Gicquel, 2001). This enzyme catalyzes hydrolysis of Inorganic pyrophosphate, Tripolyphosphate, and Tetrapolyphosphate (Josse, 1966). General oxidative stress did not increase the activity of the Inorganic pyrophosphatase in Mycobacterium tuberculosis in vitro (Triccas and Gicquel, 2001). But regarding the intracellular oxidative stress, it has resulted in a stable expression level of the Inorganic pyrophosphatase (Abu Kwaik, 1998).
This study demonstrated that this protein was specifically induced in response to the intracellular environment of that host. The decrease in expression of this protein under stress exerted during host viral infection (intracellular stress) was confirmed by other works (Abu Kwaik, 1998; Triccas and Gicquel, 2001). So, this may suggest that the Inorganic pyrophosphatase reacts only under conditions of intracellular stress.
In E. coli, DksA is a transcriptional regulator that modulates gene expression at the levels of transcription initiation and elongation (Haugen et al., 2008; Roghanian et al., 2015). RNA polymerase-binding transcription factor DksA was found downregulated in our study. It belongs to a family of proteins that can insert themselves into the two channels of RNAP. The best characterized system of DksA regulation is stress response (Kolmsee et al., 2011). DksA is a key player in bacterial survival under various environmental changes.
For example, DksA of Salmonella enterica is hypersusceptible to the bacteriostatic effects of nitric oxide free radicals and is attenuated in macrophage and murine models of infection (Henard and Vazquez-Torres, 2012). The Shigella flexneri DksA has decreased Hfq transcription, causing the loss of virulence (Sharma and Payne, 2006), and in E. coli, DksA has been shown to be important for survival of dehydration (Chen and Goulian, 2018). Again, various studies have shown that DksA provokes global changes in transcriptional expression in host cells under various stresses, such as nitrosative, oxidative, and nutrient stresses (Crawford et al., 2016; Holley et al., 2015). To our knowledge, no study has reported whether DksA contributes or not to defense of host bacterial cells against phage infection.
This study suggests that the downregulation of some bacterial proteins is necessary to promote a rapid change in host metabolism for an optimal viral development.
On the contrary, some key oxidative stress-related proteins were found upregulated after SD1 phage infection, namely SOD, Formate dehydrogenase-H, Cysteine desulfurase IscS, Aspartate ammonia-lyase, Peptidase PmbA, PGK, Putative glucose-6-phosphate 1-epimerase, ADP-L-glycero-D-manno-heptose-6-epimerase, and the two-component response regulator.
The SOD is an antioxidant enzyme involved in the defense mechanism against oxidative stress by catalyzing the dismutation of toxic superoxide anion radicals produced in stressed cells (Ramos et al., 2016; Schellhorn and Hassan, 1988). The increase in the expression of SOD during infection of E. coli by phage SD1 indicates that the host bacterial cells are under unfavorable and stressful conditions, which generate a state of oxidative stress. This suggests that the SOD protein is a putative marker for viral infection in bacteria.
Formate dehydrogenases are enzymes that catalyze the oxidation of formate to carbon dioxide coupled to NAD+ reduction into NADH (Savin and Tishkov, 2010). These enzymes act as stress proteins in pathogenic microorganisms such as E. coli (Colas des Francs-Small et al., 1993). The formate dehydrogenase-H is an enzyme detected in our host strain of E. coli that showed a very high level (sequence coverage of 80%, fold of 5.8) of expression during stress induced by SD1 phage. This result is in agreement with the study carried out by Iwadate et al., 2017 who highlighted the role of formate dehydrogenase-H in contributing for the stationary phase oxidative stress tolerance in E. coli. In this sense, it is possible that the contribution of this protein for the oxidative stress tolerance generated the host bacteria cells following phage infection.
Cysteine desulfurase IscS produces sulfur and l-alanine from l-cysteine. This protein was involved in the recovery of [Fe-S] clusters (Ding et al., 2005; Schwartz et al., 2000). Oxidative stress leads to the formation of reactive oxygen species, hence cellular stress (Han and Lee, 2006). Maintaining cellular redox balance is found to be a process that involves molecules synthesized from reduced sulfur extracted from the environment. As other authors suggested that expression of IscS in E. coli protects against the denaturation of cytoplasmic proteins during oxidative stress (Dai and Outten, 2012; Fuentes et al., 2007). In this case, it can also be suggested that Cysteine desulfurase IscS is involved in the contribution of tolerance to oxidative stress caused by infection of phage SD1.
Another bacterial protein, which presented a high level of expression (sequence coverage of 57%, fold of 1.8) under the stress of SD1 phage infection, is the PGK. This is a glycolytic enzyme that catalyzes one of the two ATP-producing reactions in the glycolytic pathway, through the conversion of 1,3-bisphosphoglycerate to 3-phosphoglycerate (Rojas-Pirela et al., 2020). In E. coli, PGK has been cataloged as an enzyme that is part of a complexome, associated with proteins involved in glycolysis and the stress response (Joshi et al., 2016; Pan et al., 2011). The host bacteria E. coli underwent oxidative stress during the phage infection, and based on our results it can be deduced that PGK appears to be a potential candidate for increasing bacterial tolerance to stress exerted during viral infection.
Aspartate ammonia-lyase was also a protein found upregulated for condition 2. Aspartase belongs to the family of lyases, ammoniacal lyases, which cut carbon-nitrogen bonds. It is found in various bacteria, including E. coli. Aspartate ammonia lyase has catalytic activity by catalyzing the reversible conversion of L-aspartic acid to fumarate and ammonia (Rudolph and Fromm, 1971). The Peptidase PmbA was another protein showed a notable change in its expression after exposure to stress condition through the presence of phage SD1 (sequence coverage of 62%, fold of 2.1).
It has been shown previously that the Peptidase PmbA was downregulated in E. coli under stress conditions (Chandrasekhar et al., 2014). This result is in disagreement with our results that this protein increased the level of expression under phage infection. During stress, this protein was involved in the regulation of posttranscriptional activity (Chandrasekhar et al., 2014). So, we can only agree with the predicted function for peptidase PmbA during the E. coli stress mechanism.
Similarly, the two-component response regulator has also presented an increased expression. It is a two-component regulatory system that serves as a basic stimulus-response coupling mechanism to enable organisms especially in gram-negative bacteria (which in our case: E. coli) to detect and respond to changes in many adverse environmental conditions such as oxidative stress (Stock et al., 2000). During the infection process, bacteria encounter different strapping conditions, which cause a cellular imbalance that subsequently generates a state of oxidative stress. The two-component response regulator system pathway is a primary means of responding to external stimuli (Zheng et al., 2018).
Previous work has shown that this system in E. coli is involved in cellular adaptation to oxidative stress (Park et al., 2020; Zheng et al., 2019). So, this prompted us to admit the same suggestion about the role of the two-component response regulator system by adapting and increasing the tolerance of bacterial host cells of E. coli infected to oxidative stress generated during viral infection.
Finally, two other proteins were upregulated for condition 2, namely the ADP-L-glycero-D-manno-heptose-6-epimerase (sequence coverage of 73%, fold of 2.5) and the putative glucose-6-phosphate 1-epimerase (sequence coverage of 45%, fold of 2.2).
Based on these findings, SD1 phage infection puts the host bacterium E. coli into a state of oxidative stress. To cope with oxidative stress, E. coli cells trigger rapid global responses designed to eliminate reactive oxygen species (ROS), repair oxidative damage, bypass damaged functions, and induce adapted metabolism, thus allowing the cells to persist under high ROS conditions. The bacterial cells response to stress is usually orchestrated by antioxidant defense mechanisms (Srivastava and Kumar, 2015). Proteomic analyses have further revealed that the level of antioxidant enzymes (SOD, Cysteine desulfurase IscS, and Formate dehydrogenase-H), the PGK, the Peptidase PmbA, and the two-component response regulator system is increased to have key regulatory functions in cellular detoxification of E. coli cells infected by the phage.
Conclusion
This study allowed us to get new insights into the proteome of three novel E. coli bacteriophages and the identification of important structural proteins for SD1-SD3 phages was accomplished. The proteome of each phage showed a different protein profile (Mw and pI) with common proteins, namely the Major capsid protein, Tail sheath monomer, and Capsid vertex protein. To better understand the cellular response of the host bacterium to the phage infection, a gel-based quantitative proteomics study was performed. Several bacterial proteins were found differentially expressed, particularly related to stress response. Antioxidant enzymes (SOD, Cysteine desulfurase IscS, and Formate dehydrogenase-H), the PGK, the Peptidase PmbA, and the two-component response regulator system seemed to be involved in oxidative stress tolerance during phage infection.
In general, these results indicate that E. coli bacterium undergoes oxidative stress. The knowledge of how the bacterium's proteome responds physiologically to phage infection stress, such as the increased expression of the antioxidant enzymes to tolerate the stress situation, can be an important step, preliminary though, for the better understanding of the mechanisms and interactions between bacteriophages and their hosts. Considering that antimicrobial resistance is increasing, while the rate of discovery of new antibiotics is decreasing, the identification and characterization of target-specific bacteriophages take on special importance and raise some expectations to face a possible postantibiotic era.
Footnotes
Author Disclosure Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Funding Information
This work was supported by the Associate Laboratory for Green Chemistry—LAQV which is financed by national funds from FCT/MCTES (UIDB/50006/2020).
