BACTERIAL CONJUGATIVE SYSTEM AND THERAPEUTIC USES THEREOF
TECHNOLOGICAL FIELD
The present disclosure relates to a bacterial conjugative system for transferring, in vivo, a nucleic acid cargo from a conjugative bacterium to a recipient bacterium.
BACKGROUND
Bacterial communities play an important role in human and animal health. It is now clearly established that imbalances in gut microbial populations, also known as dysbiosis, are linked to several severe pathologies such as: cancer, diabetes, Crohn’s disease, and irritable bowel syndrome to name only a few. Being able to precisely manipulate bacterial communities to restore and/or maintain healthy microbiomes would therefore be of great interest to cure those diseases. For example, having a technology allowing the selective elimination or inactivation of pathogenic bacteria, while maintaining the beneficial flora of a microbiome intact, would be a highly valuable therapeutic tool. In addition, having the possibility to modify certain bacteria of a microbial community so they can locally provide therapeutic agents to the organ they colonize would also be an important approach for improving human and animal health. In summary, developing a tool that would allow precise in situ manipulation of human or animal microbiomes could unleash new promising therapeutic possibilities.
Parallel to this, public health is also challenged by the alarming emergence of antibiotic resistant bacteria that infects the gut (e.g. Campylobacter, Escherichia coli and Salmonella), the urinary tract (e.g. Escherichia coli), and wounds (e.g. Staphylococcus aureus). This is a major concern because the development of new antibiotic molecules has declined drastically over the last decades. It is estimated that by 2050 antibiotic resistant bacteria will be responsible for more human deaths than cancer. Therefore, to palliate the growing inefficiency of conventional antibiotics, there is an urgent need to develop new alternative drugs to fight antibiotic resistant bacteria.
Bacterial conjugation is a natural process through which a donor bacterium transfers genetic material, via a conjugative element, into a recipient bacterium. Owing to the recent advances in synthetic biology, bacteria (such as probiotics) could be engineered to use bacterial conjugation in order to transfer a genetic cargo containing the CRISPR-cas9 RNA-guided nuclease system into a target bacterium. This new class of drug, based on a probiotic capable of delivering CRISPR-cas9 to target bacteria, could provide an efficient way to manipulate microbiomes, or treat bacterial infections, in situ. For instance, such probiotics could be used to transfer the CRISPR-cas9 RNA-guided nuclease system into target bacteria to delete antibiotic resistance genes or to eliminate pathogenic bacteria by inducing double strand breaks in their chromosomes. This principle could also be directly applied to the
treatment of dysbiosis by targeting over-represented species of bacteria, hereby editing the microbiome with great precision.
While engineering bacteria, including probiotics, that use bacterial conjugation to deliver CRISPR-cas9 system to modify microbiomes is a promising avenue, this approach faces some serious technological challenges that need to be addressed. As a matter of facts, to be useful, this technology requires that: (1) the engineered probiotic is capable of carrying out bacterial conjugation in vivo inside the environment of human or animal body (e.g. in the gut, the urinary tract, or a wound), and (2) the in vivo conjugation efficiency must be high enough to achieve satisfactory therapeutic effects.
Up to now, bacterial conjugation has been studied almost exclusively in vitro in Petri dishes, an environment that significantly differs from the conditions encountered in vivo in a human or animal body. Very little is known about which conjugative bacterial systems are actually capable of functioning in vivo, and at what efficiency. In stark contrast to drugs derived from chemical molecules (e.g. traditional antibiotics), for which the in vitro activity in a Petri dish is indicative of the in vivo activity, drugs based on living organisms (e.g. bacteria) are not as predictable. For instance, contrary to inert chemical molecules, bacteria are living organisms that are adapted to certain conditions, and that respond to their environment via complex mechanisms affecting a plethora of cellular processes. Therefore, it is difficult to predict if a bacterium capable of conjugation in vitro will be able to perform conjugation under in vivo conditions. In short, for drugs that use living organisms as therapeutic vectors, the in vitro efficacy is not sufficient to predict the ability of the drug to function in vivo.
In sum, a new class of therapeutics based on bacterial conjugation is a very promising therapeutic avenue to manipulate bacterial communities in situ. However, in order to become a viable approach, this technology requires the development of a bacterial system actually capable of carrying out conjugation in vivo, and this, with high-efficiency. Such bacterial system can then be used as a universal platform for the transfer and delivery of CRISPR- cas9, or any other type of genetic cargo that can eliminate or modify target bacteria.
BRIEF SUMMARY
According to a first aspect, the present disclosure provides a conjugative bacterial host cell for transferring, in vivo, a genetic cargo to a recipient bacterial cell. The conjugative bacterial host cell comprises (i) the genetic cargo (wherein the genetic cargo comprises a transport module operatively associated with a payload module); (ii) a type IV secretion system module, (iii) a mating pair stabilization module comprising a type IV adhesion pilus, the type IV adhesion pilus comprising an adhesin; and (iv) a mobilization module. The transport module is capable of being recognized by the transport machinery encoded by the
mobilization module. In an embodiment, the type IV adhesion pilus and/or the adhesin comprises at least one of the following proteins: PilL, PUN, PilO, PUP, PilQ, PilR, PUS, PUT, TraB, PilU, PilV or TraN. In another embodiment, the type IV adhesion pilus is derived from at least one of the following family of bacterial plasmids: IncA, IncB/O (Ind O), IncC, IncD, IncE, IncFI , lncF2, IncG, IncHM , lncHI2, Inch , Incl2, IncJ, IncK, IncL/M, IncN, IncP, IncQI , lncQ2, IncR, IncS, IncT, IncU, IncV, IncW, IncXI , lncX2, IncY, IncZ, ColE1 , ColE2, ColE3, p15A, pSC101 , lncP-2, lncP-5, lncP-7, lncP-8, lncP-9, Ind , Inc4, Inc7, Inc8, Inc9, Inc1 1 , Inc13, Ind 4 or Ind 8. In still another embodiment, the type IV secretion system module comprises at least one of the following proteins: VirB1 , VirB2, VirB3, VirB4, VirB5, VirB6, VirB7, VirB8, VirB9, VirB10, VirB1 1 or VirD4. In yet another embodiment, the type IV secretion system module is derived from at least one of the following family of bacterial plasmids: MPFT, MPFf, MPF,, MPFFATA, MPFb, MPFfa, MPFG or MPFC. In yet a further embodiment, the genetic cargo is located on a first extrachromosomal vector and further comprises a first vegetative replication module, the conjugative bacterial host cell comprises a first maintenance module encoding a first replication machinery, and the first vegetative replication module is capable of being recognized by the first replication machinery encoded by the first maintenance module. In such embodiment, the first maintenance module comprises at least one of the following proteins: RepA, ParA, ParB, a DNA primase, YgiA, a toxin, Vcrx028, YcfA, an antitoxin, Vcrx027, YcfB, a DNA topoisomerase, YdiA or YdgA. In still another embodiment, the first vegetative replication module or the first maintenance module is derived from at least one of the following family of bacterial plasmids: IncA, IncB/O (Ind O), IncC, IncD, IncE, IncFI , lncF2, IncG, IncHM , lncHI2, Inch , Incl2, IncJ, IncK, IncL/M, IncN, IncP, IncQI , lncQ2, IncR, IncS, IncT, IncU, IncV, IncW, IncXI , lncX2, IncY, IncZ, ColE1 , ColE2, ColE3, p15A, pSC101 , lncP-2, lncP-5, lncP-7, lncP-8, lncP-9, Ind , Inc4, Inc7, Inc8, Inc9, Ind 1 , Ind 3, Inc14 or Ind 8. In still a further embodiment, the genetic cargo comprises the mobilization module. In an embodiment, the conjugative bacterial host cell comprises a transfer machinery located on a second extrachromosomal vector, wherein the transfer machinery comprises the type IV secretion system module, the mating pair stabilization module and a second vegetative replication module; the conjugative bacterial host cell comprises a second maintenance module encoding a second replication machinery; and the second vegetative replication module is capable of being recognized by the second replication machinery machinery encoded by the second maintenance module. In an embodiment, the second maintenance module comprises at least one of the following proteins: RepA, ParA, ParB, a DNA primase, YgiA, a toxin, Vcrx028, YcfA, an antitoxin, Vcrx027, YcfB, a DNA topoisomerase, YdiA or YdgA. In a further embodiment, the second vegetative replication module or the second maintenance module is derived from at least one of the following family of bacterial plasmids: IncA, IncB/O (Ind O), IncC, IncD, IncE, IncFI ,
lncF2, IncG, IncHM , lncHI2, Inch , Incl2, IncJ, IncK, IncL/M, IncN, IncP, IncQI , lncQ2, IncR, IncS, IncT, IncU, IncV, IncW, IncXI , lncX2, IncY, IncZ, ColE1 , ColE2, ColE3, p15A, pSC101 , lncP-2, lncP-5, lncP-7, lncP-8, lncP-9, Ind , Inc4, Inc7, Inc8, Inc9, Inc1 1 , Inc13, Inc14 or Inc18. In an embodiment, the transfer machinery further comprises the mobilization module. In yet another embodiment, the conjugative bacterial host cell comprises a transfer machinery located in the bacterial chromosome, wherein the transfer machinery comprises the type IV secretion system module, the mating pair stabilization module and the mobilization module. In an embodiment, the conjugative bacterial host cell further comprises an exclusion module, a selection module and/or a regulatory module. In an embodiment, the regulatory module comprises at least one of the following regulatory protein or non-coding RNA: YajA, YafA, FinO, Fur, Fnr, KorA, AcaC, AcaD, Acr1 , Acr2, StbA, TwrA, ResP, KfrA, ArdK, dCas9, crRNA, ZFN, TALEN, taRNA, toehold switch, AraC, TetR, Lad or Laclq. In another embodiment, the mobilization module comprises at least one of the following proteins: VirC1 , NikB or NikA. In still another embodiment, the mobilization module is derived from at least one of the following family of bacterial plasmids: MOBF, MOBP, MOBv, MOBH, MOBc or MOBQ. In a further embodiment, the mating pair stabilization module further comprises a shufflase for modifying a shufflon associated with the gene encoding the adhesin. In still another embodiment, the shufflon is derived from at least one of the following family of bacterial plasmids: IncA, IncB/O (Ind O), IncC, IncD, IncE, IncFI , lncF2, IncG, IncHM , lncHI2, Inch , Incl2, IncJ, IncK, IncL/M, IncN, IncP, IncQI , lncQ2, IncR, IncS, IncT, IncU, IncV, IncW, IncXI , lncX2, IncY, IncZ, ColE1 , ColE2, ColE3, p15A, pSC101 , lncP-2, lncP-5, lncP-7, lncP-8, lncP-9, Ind , Inc4, Inc7, Inc8, Inc9, Ind 1 , Ind 3, Inc14 and/or In 8. In a further embodiment, the shufflase is encoded by a rci gene. In a further embodiment, the sufflase is derived from at least one of the following family of bacterial plasmids: IncA, IncB/O (Ind O), IncC, IncD, IncE, IncFI , lncF2, IncG, IncHM , lncHI2, IncU , Incl2, IncJ, IncK, IncL/M, IncN, IncP, IncQI , lncQ2, IncR, IncS, IncT, IncU, IncV, IncW, IncXI , lncX2, IncY, IncZ, ColE1 , ColE2, ColE3, p15A, pSC101 , lncP-2, lncP-5, lncP-7, lncP-8, lncP-9, Ind , Inc4, Inc7, Inc8, Inc9, Ind 1 , In 3, Inc14 and/or In 8. In still a further embodiment, the payload module encodes a nuclease. In yet another embodiment, the nuclease is a clustered regularly interspaced short palindromic repeat (CRISPR) associated DNA-binding (Cas) protein or a Cas protein analog and the payload module further encodes a CRISPR RNA (crRNA) molecule recognizable by the Cas protein or the Cas protein analog. In still another embodiment, the crRNA molecule is substantively complementary to a DNA molecule in the recipient bacterium. In yet a further embodiment, the DNA molecule corresponds to a gene in the recipient bacterium. In an embodiment, the gene encodes a virulence factor in the recipient bacterium. In still another embodiment, the payload module further encodes a transactivating CRISPR RNA recognizable by the Cas protein or the Cas protein analog. In
another embodiment, the payload module encodes a therapeutic protein. In another embodiment, the therapeutic protein allows for the production or the degradation of a metabolite. In an embodiment, the conjugative bacterial host cell has an in vivo conjugation efficiency of at least 103 bacterial transconjugant/recipient CFU and/or a ratio of in vitro conjugation efficiency obtained in a liquid medium when compared to a corresponding conjugation efficiency obtained in a solid medium higher than 0.1 %. In still another embodiment, the conjugative bacterium is a probiotic bacterium. In a further embodiment, the conjugative bacterium is an enteric bacterium. In some embodiment, the conjugative bacterial host cell is from the genus Escherichia, for example, from the species Escherichia coli and in some specific embodiments, from the strain Escherichia coli Nissle 1917.
According to a second aspect, the present disclosure provides a composition comprising the conjugative bacterial host defined herein and an excipient. In some embodiments, the composition is formulated for oral administration.
According to a third aspect, the present disclosure provides a process for making the conjugative bacterial host cell defined herein, the process comprises introducing the genetic cargo and at least one of the type IV secretion system module, the mating pair stabilization module or the mobilization module defined herein in a bacterium to provide the conjugative bacterial host cell. In some embodiments, the process further comprises introducing at least one of the vegetative replication module, the maintenance module, the regulatory module, the selection module or the exclusion module as defined herein in the bacterium to provide the conjugative bacterial host cell.
According to a fourth aspect, the present disclosure provides a conjugative recombinant bacterial host cell obtainable or obtained by the process described herein.
According to fifth aspect, the present disclosure provides a process for making the composition defined herein, the process comprising formulating the conjugative bacterial host cell defined in herein with an excipient.
According to a sixth aspect, the present disclosure provides a composition obtainable or obtained by the process described herein.
According to a seventh aspect, the present disclosure provides a conjugative recombinant bacterial host cell defined herein or a composition defined herein for transferring, in vivo in a subject suspected of having a recipient bacterium, the genetic cargo from the conjugative bacterial host cell to the recipient bacterium. The present disclosure also provides the use of a conjugative recombinant bacterial host cell defined herein or a composition defined herein for transferring, in vivo in a subject suspected of having a recipient bacterium, a genetic cargo from the conjugative bacterial host cell to the recipient bacterium. The present
disclosure further provides the use of a conjugative recombinant bacterial host cell defined herein or a composition defined herein for the manufacture of a medicament for transferring, in vivo in a subject suspected of having a recipient bacterium, a genetic cargo from the conjugative bacterial host cell to the recipient bacterium. The present disclosure also provides a method for transferring, in vivo in a subject suspected of having a recipient bacterium, a genetic cargo from a conjugative bacterial host cell to the recipient bacterium, the method comprising administering an effective amount of a conjugative recombinant bacterial host cell defined herein or a composition defined herein to the subject under conditions to allow the transfer of the genetic cargo to the recipient bacterium. In some embodiments, the conjugative bacterial host cell is a probiotic bacterial host cell and/or an enteric bacterium. In another embodiment, the modification system of the conjugative bacterial host cell is substantially similar to the restriction system of the recipient bacterium. In another embodiment, the payload module encodes a heterologous protein, such as for example a therapeutic protein, a heterologous protein allowing for the production or the degradation of a metabolite, and/or a nuclease. In an embodiment, the nuclease is a clustered regularly interspaced short palindromic repeat (CRISPR) associated DNA-binding (Cas) protein and the payload module further encodes a guide RNA (gRNA) molecule recognizable by the Cas protein. In some embodiments, the gRNA molecule is substantively complementary to a DNA molecule in the recipient bacterium. In a further embodiment, the DNA molecule is a gene in the recipient bacterium. In yet a further embodiment, the gene encodes, in the recipient bacterium, a virulence factor, a protein involved in a resistance to an antibiotic, a toxin or a pilus. In some embodiments, the conjugative bacterial host cell can be used for the treatment or the alleviation of symptoms of a dysbiosis or an infection caused by the recipient bacterium.
BRIEF DESCRIPTION OF THE DRAWINGS
Having thus generally described the nature of the invention, reference will now be made to the accompanying drawings, showing by way of illustration, a preferred embodiment thereof, and in which:
Figure 1.A to D. Generation of Escherichia coli Nissle 1917 (EcN) strain derivatives required to easily distinguish the donor and recipient strains in conjugation experiments. The donor strain, termed KN01 , was generated by insertion of the first cassette, including a specialized metagenomics sequencing (16S) tag and aac/7, a spectinomycin resistance gene (Figure 1.A). KN02 harbored a different cassette containing a different metagenomics sequencing (16S) tag, strAB for streptomycin resistance, an IPTG inducible NeonGreen fluorescent reporter, and the cat gene conferring resistance to chloramphenicol (Figure 1.B). KN02 was used both as a recipient and as a target strain. KN03 had an insert that contains strAB for
streptomycin resistance, the same metagenomics sequencing (16S) tag as KN02 and tetB for tetracycline resistance (Figure 1.C). KN03 was used as both a recipient and a non-target control. All genes in Figure 1.A to C are shown to scale with the total amount of base pair (bp) shown below each DNA construct. All inserts were cloned into pGRG36’s Smal and Xhol restriction sites located between attLTnJ and attRTnJ sites. (Figure 1.D) The resulting plasmids were transformed in EcN and the expression of the Tn7 system was induced with arabinose. This led to the excision of the DNA fragment located between the attL/RTnJ of pGRG36, which was was next integrated at the 3' end of the glmS gene. Since the pGRG36 replication machinery is heat-sensitive, an incubation at a non-permissive temperature of 42°C was finally used to cure the empty vector backbone from the cell.
Figure 2. Impact of dapA deletion on E. coli's metabolism. The deletion of dapA prevents the transformation of L-asparate-semialdehyde into 4-hydroxy-2,3,4,5-tetrahydrodipicolinate (THDP-OH). This reaction is essential for synthesis of both forms of DAP in E. coli. L,L-DAP and meso-DAP can however be imported from the environment to complement the mutation and allow the synthesis of both peptidoglycan and lysine. This makes DAP an essential medium additive for growth of the dapA deletion mutants. Abbreviations and designations: THDP-OH: 4-hydroxy-2,3,4,5-tetrahydrodipicolinate; THDP, (S)-2, 3,4,5- tetrahydrodipicolinate; Succinyl-AKP, N-succinyl-L-2-amino-6-ketopimelate; Succinyl-DAP, N-succinyl-L,L-2,6-diaminopimelate; L,L-DAP, LL-2,6-diaminopimelate; meso-DAP, meso- 2,6-diaminopimelate; DapA, dihydrodipicolinate synthase; DapB, dihydrodipicolinate reductase; DapD, THDPA succinylase; SerC, succinyl-DAP aminotransferase; DapE, succinyl-DAP desuccinylase; DapF, DAP epimerase; LysA, DAP decarboxylase.
Figures 3.A to G. Streptomycin (Sm) treatment improves mouse gut colonization by Escherichia coli Nissle 1917 (EcN). The ability of different Sm concentrations to deplete endogenous Enterobacteriaceae (white area) and to promote EcN colonization (gray area) was evaluated by quantifying CFU in feces from mice treated with 0 mg/L (Figure 3.A), 50 mg/L (Figure 3.B), 100 mg/L (Figure 3.C), 250 mg/L (Figure 3.D), 450 mg/L (Figure 3.E) and 1000 mg/L (Figure 3.F) in drinking water starting at day -2. The colonization levels of KN01 in several sections of the intestinal tract of untreated (O) and streptomycin-treated (·) mice was evaluated (Figure 3.G). The dotted line in panel G indicates sufficient CFU levels for conjugation frequency evaluation. This was based on the minimal in vitro conjugation rate on solid support for all tested systems (about 1x103 transconjugants/recipient). Horizontal black lines show the mean values under each condition.
Figures 4.A to E. Evaluation of transfer efficiencies for conjugative plasmid candidates. Six different conjugative plasmids were first tested for transfer efficiency both on agar (solid) and in broth (liquid) for 2 hours at 37°C (Figure 4.A). Conjugative plasmids were also tested for
their transfer efficiency in the murine gut using 4 mice per experiment. The proportion of transconjugants per recipient bacteria was evaluated in feces (Figure 4.B) throughout 3 days and compared to the ratio found in the caecum at day 3 (Figure 4.C). TP1 14’s ability to transfer was confirmed using an additional set of 4 mice, and compared to the conjugation frequencies obtained in vitro on agar for the same time points (Figure 4.D). Conjugation experiments with TP1 14 and R6K in both Sm treated and untreated mice were performed to compare transfer efficiencies within partly depleted or intact microbiota. Transfer rates were measured in the feces after 12 hours of conjugation in 4 mice per conditions per plasmid (Figure 4.E). Error bars show standard deviation of the mean (black lines) from at least 3 biological replicates.
Figures 5.A to F. Raw colony forming units (CFUs) data used to calculate in vivo transfer rates showed in Figure 4.B. The CFUs for donors (·), recipients (■) and transconjugants (▼) bacteria were counted from feces samples at day 1 , 2, 3 on selective MacConkey agar plates for each plasmid. CFUs from conjugation of pOX38 (Figure 5.A), R6K (Figure 5.B), TP1 14 (Figure 5.C), pVCR94 (Figure 5.D), R388 (Figure 5.E) and RK24 (Figure 5.F) are shown.
Figures 6.A to D. Effect of time between recipient and donor strains colonization on conjugation efficiencies. Sm-treated mice were fed with the recipient bacteria either 2 or 12 hours before the introduction of the donor strain. Conjugation efficiency of TP1 14 (Figure 6.A) and R6K (Figure 6.B) was followed throughout four days in the feces of 4 mice to evaluate the impact of recipient bacteria colonization prior to conjugation. The recipient’s colonization was also followed with CFU from feces throughout the experiment with both TP1 14 (Figure 6.C) and R6K (Figure 6.D). The time points are given relative to the donor strain introduction at time = 0 day.
Figure 7. Gene annotation of TP1 14 as predicted by RAST, CDsearch and BLAST. TP1 14 was sequenced using both lllumina and Oxford Nanopore technologies and coding genes were first annotated using RAST. A locus tag was attributed to each CDS with the prefix TP1 14-0 and a number referring to gene order based on the starting position of TP1 14’s sequence as deposited on genbank: MF521836.1. The annotation was further refined using CDsearch and BLAST to attribute putative functions to the genes. Names were attributed to genes based on their putative homolog. General functions were then manually attributed to different modules that mediate a specific function such as type 4 secretion system (T4SS), mating pair stabilization, mobilization, maintenance, regulation, selection and unknown function.
Figures 8.A to D. Sequence homology between TP1 14 and other plasmids of the Incl family. Sequence homology was evaluated using BRIGG, a BLAST-like program that shows
sequence identity in circular pattern. Sequence identity threshold were set at 100%, 70% and 50% for all analysis. TP1 14’s sequence was first compared to seven members of the Incl2 subfamily based on the nucleic acid sequence (Figure 8.A), and the amino acid sequence of its coding genes (Figure 8.B). Gene conservation among Incl2 plasmids is also compiled in Table 6. TP1 14 was then compared to seven members of the Inch subfamily based on the nucleic acid sequence (Figure 8.C), and the amino acid sequence of its coding genes (Figure 8.D). Homology regions for the Inch subfamily only comprised the repA replication initiation gene, the shufflon and its associated shufflase rci. Numbers on homology rings correspond to the plasmids in the legend with 1 being the innermost ring and 7 being the outermost ring.
Figure 9 High-density transposon mutagenesis (HDTM) experiment overview. An EcN containing TP1 14 was bombarded with transposons using MFDp/r+ containing pFG051 (SEQ ID NO: 147) (a mobilizable Tn5 transposition plasmid) and pFG036 (SEQ ID NO: 146) (a plasmid repressing Tn5 transposon machinery in the donor strain). This resulted in the random insertion of Tn5 in both EcN’s chromosome and TP1 14, generating HDTM Library 1. A conjugation experiment both in vitro (on solid support) and in vivo (in the murine gut) was performed to select for TP1 14::Tn5 mutants that were still able to transfer in those specific environments. The in vivo transfer of HDTM Library 1 was carried for 2 days in two mice per biological replicate Two HDTM libraries were generated from this experiment: transconjugants retrieved from feces samples constituted the HDTM Library 4, whereas HDTM Library 6 is composed of transconjugants retrieved from the caecum of the mice. The in vitro transfer resulted in HDTM Library 2 which was again used as donors for in vitro solid mating (generating the HDTM Library 3) and in vivo conjugation in the gut (generating the HDTM Library 5 and 7 consisting of transconjugants found in the feces and in the caecum respectively). HDTM Library 8 was generated by conjugation of TP1 14::fefS (SEQ ID NO: 166) in HDTM Library 2 to investigate the exclusion mechanism of TP1 14.
Figure 10 HDTM library analysis. HDTM libraries were sequenced and reads were used to precisely locate transposon insertion sites in TP1 14. Read mapping was then visualized the using UCSC Genome Browser. Black lines represents an insertion site, the height of the line represents the density of reads at a given position in TP1 14. Tracks shown are representative of the background noise found in the HDTM library 2 as compared with library 3. Data of biological replicate #2 is shown for HDTM Libraries 1 , 2 and 3.
Figure 11. Correlation between HDTM samples. The normalized number of reads mapped onto each 100 bp bin on TP1 14 was correlated between replicates and conditions using Pearson correlation. A grayscale was applied to the data in order to visually identify the samples which strongly correlates (1.0 in dark grey) or weakly correlates (0.0 in white)
between each other. Samples were identified following a three numbers format X.X.X, where the first position refers to the HDTM library number, the second position refers to the biological replicate of the HDTM library, and the third number refers to the mouse identity when experiments were in vivo.
Figure 12. Essential genes for plasmid maintenance. HDTM libraries were sequenced and reads were mapped based on their insertion point on TP1 14. Read mapping was then visualized using the UCSC Genome Browser. Vertical lines represent transposon insertion sites, with their respective height corresponding the density of reads at this position. The selected racks presents three biological replicates of HDTM library 1 that were analyzed for any reproducible drop in transposon insertion coverage. These low coverage regions were considered to represent essential maintenance genes. However, some genes contained low mappability regions, which also appeared as low coverage regions and were filtered out of the analysis. The remaining genes with low coverage are are shown within a dotted frame (see also Table 8) and considered important for plasmid maintenance.
Figures 13.A to D. Distribution of gene count ratios of HDTM libraries 2 and 3. This procedure was repeated to determine gene importance in HDTM library 4 to 7. Gene counts were first determined by calculating the normalized number of reads mapping within a given gene under a specific condition. Gene counts were then compared to the gene counts in HDTM Library 1 using the formula (gene count in condition X - gene count in condition 1) / gene count in condition 1. Max and average values, black and gray lines respectively, were calculated using a set of genes suspected to be essential in the test Library X but not in Library 1. The gene count ratio distribution is shown for HDTM Library 2 (Figure 13.A) with the dashed section zoomed in (Figure 13.B). The gene count ratio distribution is shown for HDTM Library 3 (Figure 13.C) with the dashed section zoomed in (Figure 13.D). All genes with a gene count ratio bellow the maximal value threshold were considered important in the given condition.
Figure 14. pil genes are essential for transfer in vivo but not in vitro. The transposon insertion sites were mapped onto TP1 14 and visualized using the UCSC Genome Browser. Transposon insertion density is shown both for in vitro and in vivo conjugation experiments. Black lines represent insertion sites, and their height represent read density at a given position in TP1 14. Tracks shown represents HDTM Libraries 1 , 3, 6 and 7 for biological replicate 2. While HDTM Library 1 shows the complete insertion profile for conjugation results in vitro, HDTM Library 3 shows insertion densities after two in vitro conjugation and HDTM Library 6 and 7 shows the effect of in vivo conjugation on the insertion densities. Comparison of the tracks clearly reveals a diminution in insertion signal intensity for the pil
genes only for the two in vivo conjugation experiments (genes in dashed selection). Gene essentiality for in vivo conjugation is summarized in Table 10.
Figure 15. Distribution of core and essential genes of TP1 14 for maintenance, in vitro conjugation and in vivo conjugation. Data for the conservation of genes and for gene essentiality as determined by HDTM were mapped onto TP1 14’s sequence. Only essential genes with high confidence (black) and core genes (grey) are shown. Gene functions were attributed based on Figure 7.
Figure 16. A to E. Effect of T4P abolition on mating pair stabilization of TP1 14 in vitro. The pilS gene was deleted and complemented in T4P mutants of TP1 14 for conjugation under solid support, liquid static and agitating liquid conditions (Figure 16.A). Briefly, ~108 CFUs of KN01 strain containing either TP1 14 pilS::cat or TP1 14 pilS::cat+ pPilS were mixed with an equal amount of the recipient strain KN03 to assess the importance of the T4P on TP1 14 conjugation efficiency. The resulting mixtures were incubated on solid medium or in liquid with and without shaking for 2 hours at 37°C. Conjugants were then resuspended (solid) or diluted (liquid) in 800 pl_ total volume, and plated on LB medium with appropriate antibiotics to evaluate the proportion of transconjugants in the entire recipient cell population. Error bars show standard deviation of the mean from at least 3 biological replicates. Asterisk indicates that frequency of transconjugant formation was below the limit of detection (<108). The entire pi IV gene or theshufflon, which can re-organize the C-terminusof pilV (including the 3’- end of the pilV gene), was replaced by a Flag tag to generate a second set of T4P mutants. A plasmid allowing the expression of a pilV variant (TP'\ '\4 pilV::cat+ pPNV4’) was able to complement this phenotype using the same experimental conditions as described above (Figure 16. B). Each possibility of locked pilV variant was assessed for conjugation under solid support, liquid static and agitating liquid conditionsunder the same conditions as described above (Figure 16. C to E, respectively).
Figures 17.A toC. Effect of T4P inactivation on TP1 14 in vivo transfer rates. The ability of a TP1 14 Dr/VS mutant to transfer in vivo was compared to TP1 14. Briefly, groups of 5 mice were treated with 1 g/L of streptomycin two days prior to strain introduction. Mice were administered the recipient strain KN03 2 hours prior to donor strain introduction. The proportion of transconjugants per recipient bacteria was then monitored in feces for four days (Figure 17.A). On the fourth day, mice were sacrificed and the proportion of transconjugants was compared between the caecum and the feces (Figure 17.B). Error bars show standard deviation of the mean from at least 5 biological replicates. A similar experiment was conducted for TP1 14 pilV::cat as well as two locked pilV variants ( ΊR ADeIhiίΐIohn.ranΐ-qBί and TP1 '\4 shufflon::pilV4’-cat, which respectively failed and succeeded conjugation in vitro
to a E. coli recipient strain), thus revealing the essential role of specific pilV variants for conjugation towards a given target bacterium (Figure 17.C).
Figures 18.A to D. Incompatibility and exclusion hinder the transfer of conjugative plasmids. Incompatibility and exclusion mechanisms are specific to each Inc plasmid families. KN02 containing TP1 14 was used as a donor for conjugation towards recipient bacteria bearing different plasmids (pOX38, R6K, TP1 14::fe/S, pVCR94, R388, RP4). TP1 14’s transfer rate into a recipient bacterium devoid of any conjugative plasmid is shown by the dotted line, with standard deviation shown in gray (Figure 18.A). Exclusion ratios were calculated based on the data of panel A. Briefly, the conjugation frequency of TP1 14 into a recipient devoid of any conjugative plasmid was divided by the transfer rate into a recipient already containing a plasmid. In trans mobilization of pCloDF13 by TP1 14 into a recipient containing or not a copy of TP1 14 was also assesed (Figure 18. B). Exclusion was then tested in vivo. Streptomycin- treated mice fed with either KN02 + TP1 14::fe/S (SEQ ID NO: 166) or KN02 first, and subsequently fed with KN01 + TP1 14 2 hours later. The proportion of transconjugants per recipient cells in feces was then analyzed daily for four consecutive days (Figure 18.C). On the fourth day, mice were sacrificed and the proportion of transconjugants per recipient bacteria was compared between caecum and feces (Figure 18.D). Error bars show standard deviation of the mean from at least 3 biological replicates.
Figures 19.A to C. Exclusion is abolished in specific TP1 14 mutants from HDTM Library 8. TP1 14::fefS (SEQ ID NO: 166) was transferred by conjugation into a mutant pool from HDTM Library 1. The resulting transconjugants were referred to as HDTM Library 8, and were mostly deficient for exclusion. Individual HDTM Library 8 mutants were isolated and then used as donor strain to isolate exclusion deficient clones of TP1 14::Tn5. After two successive rounds of conjugative transfers, the ability of TP1 14::Tn5 to exclude TP1 14 v.tetB (SEQ ID NO: 166) was verified in a conjugation experiment between KN02 + TP1 14 v.tetB (SEQ ID NO: 166) and KN01 , KN01 + TP1 14 or KN01 + TP114::Tn5 as a recipient (Figure 19.A). The exclusion ratio of TP1 14 or TP1 14::Tn5 was also compared to an empty recipient as previously described in Figure 18 (Figure 19. B). Finally, the capacity of a TP1 14::Tn5 mutant to transfer at expected rates was verified on solid medium for 2 hours at 37°C using KN01 as the donor cell and KN03 as the recipient (Figure 19. C). Error bars show standard deviation of the mean from at least 3 biological replicates.
Figure 20. Genes limiting TP114’s transfer efficiency. Transposon insertion sites were aligned onto TP1 14 and visualized using the UCSC Genome Browser. Representative insertion density tracks for in vitro (HDTM Library 3) and in vivo (HDTM Library 7) conditions are shown. Vertical black lines represents transposon density at a given postion of the TP1 14 genome. The tracks shown represent HDTM Library step 1 , 3 and 7 for biological
replicate #2 as presented in Figure 9 Only two genes showed enrichments from FIDTM Library 1 to FIDTM Libraries 3 and 7: TP1 14-005 (previously shown to mediate exclusion and a gain in conjugation frequency) and yaeC (a homolog of transcription repressor finO). Both genes are boxed in a dotted frame.
Figures 21. A to G. Plasmids and gRNAs used to test cargos KilM and KiN3. Maps of the KilH insertion device (Figure 21.A), Kill3 insertion device (Figure 21. B), pREC1 (SEQ ID NO: 160) (Figure 21.C), pBXB1 (SEQ I D NO: 145) (Figure 21. D) and pT (Figure 21. E) are shown to scale. Total length in base pair (bp) is displayed bellow the plasmid name. gRNAs (an engineered fusion of the tracrRNA and the crRNA) from Kill3 are designed with the same promoters and terminators as KilM’s gRNA. The asterisks in the cat gene in pT’s map represent the protospacer of the gRNAs. All gRNAs were designed to target cat, a chloramphenicol resistance gene, which was introduced in the target’s genome or present on a plasmid. The gRNA’s spacers sequences match the target sequence in the cat gene (Figure 21. F). The complete nucleotide sequence of the cat gene (SEQ I D NO: 87) shows the location of gRNA 1 (SEQ ID NO: 88, light gray), gRNA 2 (SEQ ID NO: 89, gray) and gRNA 3 (SEQ ID NO: 90, dark gray) protospacers and their protospacer-associated motif (PAM) (framed with a solid line) (Figure 21. G)
Figures 22.A to D. Introduction of a genetic cargo in the transfer machinery by Double Recombinase Operated Insertion of DNA (DROID). The DROID method is exemplified by the insertion of KilM in the transfer machinery TP1 14 (Figure 22.A) The first step is to insert the tetB loading dock in the transfer machinery by recombineering. Then, the Bxb1 integrase operates the fusion between the attB and attP sites located on the loading dock and on the genetic cargo insertion device respectively Lastly, a FLP recombinase is expressed to knock out the insertion device segment between the two newly joined FRT sites (tetB, pSC101 ts and the attL site). PCR verifications for the recombineering (Figure 22. B), the DROID step 1 (Figure 22. C) and the DROID step 2 (Figure 22. D) are shown for KillTs insertion. Amplicons of each lanes are identified on the drawings in bold letters or numbers except for lane D which is the right junction between tetB and TP1 14 in TP1 14: tetB.
Figure 23.A and B. Examples of conjugative delivery system configurations The bacterium can be decomposed into several components organized hierarchically (Figure 23. A) The genetic cargo can be delivered in several configurations, three of which are shown in example III (Figure 23. B). The first approach is to deliver a genetic cargo by cis mobilization, where the genetic cargo is directly inserted in the transfer machinery to form a single vector encoding the Conjugative Delivery System. A second method is to deliver the genetic cargo through constrained cis mobilization, where the essential replication genes are relocated in the chromosome of the donor bacterium to prevent replication of the Conjugative Delivery
13
RECTIFIED SHEET (RULE 91.1)
System outside the donor strain. Finally, the genetic cargo and Transfer machinery can be encoded on two or more vectors to allows in trans mobilization. In this set-up, the genetic cargo needs a transport module which is recognized by the transfer machinery and mediates its transfer from the donor strain to the recipient strain. Each delivery mode presents different levels of biosafety, which are represented by an X (not biocontained), a + (contained) and a ++ (more strictly contained). Replication and transfer capacity in both the donor and the recipient strains are shown by an X (not possible) or check marks (possible). Replication in the recipient for in trans mobilization is dependent on the maintenance module of the genetic cargo.
Figure 24. Transformation efficiencies of KNI1 and KNI3 genetic cargos assessed by transformation into a recipient cell harboring a target plasmid (pT). 50 ng of each genetic cargo insertion device were electroporated in biological triplicates into KN03 + pT and plated to select only the genetic cargo (black bars) or to select both the genetic cargo and pT (white bars). Transformation efficiencies are shown as transformants per mg of electroporated DNA. Error bars show standard deviation of the mean from at least 3 biological replicates. Asterisk indicates the absence of CFUs on plate.
Figures 25.A and 25. B. TP1 14::Kill1 to selectively eliminates a target bacterium from a mixed population in vitro. The COP is used for the specific targeting of an E. coli Nissle 1917 (Figure 25.A) or Citrobacter rodentium (Figure 25. B) carrying a chromosomal copy of the cat gene and this, in a mock bacterial population composed of three other Enterobacteriaceae. Equal amounts of each strain were mixed, and then incubated with the COP strain, or with KN0'\AdapA + TP1 14 control strain, for 2 hours on solid medium at 37°C. The graphic shows the relative abundance (%) of transconjugants for TP1 14::Kill1 as compared with TP1 14 for each strain of the mock population. In both cases, the abundance of the targeted strain transconjugants was specifically decreased by ~1000 fold. Error bars show standard deviation of the mean from at least 3 biological replicates.
Figure 26. Identification of the TP1 14 origin of replication ( oriV) locus. The locus encoding the replication protein RepA was predicted in silico, and cloned in three different configurations in a pir- dependent plasmid backbone.“repA + up” corresponds to the repA coding sequence (CDS)+ 1 ,000 bp from the upstream region;“repA + down” represents the repA CDS and putative promoter + 1 ,000 bp in the downstream region; “repA + both” encompasses the repA CDS + 1 ,000 bp from the upstream and downstream regions. All three plasmid versions were transformed in a pir- or pir+ strain to test the activity of TP1 14 oriV. Only the repA CDS with both upstream and downstream regions could replicate in a pir- strain. Error bars show the standard deviation of the mean from 3 biological replicates.
Figures 27.A and 27. B. Constrained cis mobilization delivery can prevent plasmid persistence in the environment. TP1 14 and TP1 14ArepA::caf-o/7VR6K were transferred by conjugation from EC100Dp/r+ towards a pir- devoid strain (black bars) and a pir+ strain (white bars) at 37°C on solid LB medium for 2 hours. No transconjugant was detected for TP1 14Arep/4::caf-or/VR6K towards the pir- devoid strain (asterisk) and reduced frequency was observed for TP1 14Arep/4::caf-or/VR6K towards the pir+ strain. In stark contrast, wild-type TP1 14 could transfer at normal frequency (~10 3) (Figure 27.A). TP1 14 and TP1 14ArepA::caf-or/VR6K were transferred by conjugation from KN05 towards a EC100Dp/r+ strain (gray bars) at 37°C on solid LB medium for 2 hours (Figure 27. B). Under these conditions, both plasmids could conjugate at similar rates. Error bars show the standard deviation of the mean from 3 biological replicates.
Figures 28.A and 28. B. In trans mobilization of or/TTpn4-containing plasmids. Shuttle plasmid pNA01 contains or/TV and therefore should be mobilizable by TP1 14. pNA02 presents a 7-bp deletion centered on the nicking site of or/TTpi i4. In trans mobilization frequencies for plasmids pNA01 (Figure 28.A) and pNA02 (Figure 28. B) using a donor strain containing TP1 14 and a shuttle plasmid (either pNA01 or pNA02). Conjugation frequencies were calculated for transconjugants containing TP1 14 (black bars), the shuttle plasmid pNA01 or pNA02 (gray bars) and for transconjugants harbouring both TP1 14 and a shuttle plasmid (white bars). Error bars show the standard deviation of the mean from 3 biological replicates.
Figure 29. Localization of TP1 14’s origin of transfer (oriT) nicking site by pairwise sequence alignment. The oriT allows the recognition of a plasmid and is essential for mobilization. This recognition is based on the presence of repeats within the oriT sequence. The relaxosome then specifically binds the oriT and nicks (single strand break) the DNA to initiate conjugative transfer. TP1 14’s oriT (SEQ ID NO: 141) was aligned with previously characterized R64 minimal oriT (SEQ ID NO: 142). Important repeats were mapped onto the alignment to allow for prediction of the nicking site. While sequence alignment was weak, repeats were present both in TP1 14 and R64, suggesting the putative localization of the nicking site. Lines ‘|’ represents a perfect sequence alignement, dots 7 shows low similarity regions and a blank space‘‘ is a gap of a mismatch.
Figure 30.A and 30. B. Impact of the deletion of the origin of transfer (oriT) on TP1 14 conjugation frequency. The approximate location of the nicking site in TP1 14’s oriT was deleted by recombineering, creating TR1 14DOG/T. Conjugation frequency was first evaluated using transfers from E. coli MG1655NxR into E. coli MO I QddR^ and compared to wild-type conjugation rates (Figure 30.A). The transfer rate was drastically reduced in TR1 14DO/-/T, with residual transfer events (~ 106) likely due to partial oriT recognition or by the presence of
a cryptic oriT sequence in TP1 14. Transfer of TR1 14DO/-/T was then tested towards E. coli MΰIqddR^ and, no transconjugants were detectable (asterisk) (Figure 30. B). Error bars show the standard deviation of the mean from 3 biological replicates.
Figure 31. In trans- mobilization of shuttle plasmids pKN30 and pKN31 by a non-mobilizable transfer machinery. In trans mobilization frequencies for plasmids pKN30 (Figure 28.A) and pKN31 (Figure 28. B) using a donor strain containing both TP1 14 and a shuttle plasmid (either pKN30 or pKN31). Plasmids pKN30 and pKN31 are kanamycin resistant variants of pNA01 and pNA02, respectively. Both pKN30 and pKN31 contain TP1 14’s oriT, but pKN31 has a 7-bp deletion in the nicking site region to prevent its transfer. Conjugation frequencies were calculated for transconjugants containing TP'\ '\4 oriT::cat-tetB, the shuttle plasmid (pKN30 or pKN31) and for transconjugants harbouring both TP1 14 and a shuttle plasmid. Asterisk indicates that frequency of transconjugant formation was below the limit of detection (< 108) . Error bars show the standard deviation of the mean from 3 biological replicates.
Figure 32. Schematic representation of in trans mobilization as described in Example III. The TP1 14 oriT sequence was identified in silico and cloned into pNA01. The oriT sequence is recognized by the TP1 14 nickase to mediate pNA01 in trans mobilization. However, a 7-bp deletion centered on the predicted nicking site (essential for nickase activity) prevents in trans mobilization of pNA02.
Figures 33.A and 33. B. The COP system can transfer DNA conferring a beneficial phenotype to a target bacterium in vivo. TP1 14 was used as a Conjugative Delivery System to transfer the kanamycin resistance gene to a target bacteriumin the gut of mice. Mice were fed with KN02 two hours prior to KN01 + TP1 14 introduction. Proportion of recipient bacteria that have acquired the resistance phenotype (transconjugants) relative to total recipients was followed for 4 days in feces (Figure 33.A). On the fourth day, mice were sacrificed and the proportion of transconjugants per recipients was compared in the caecum and in the feces (Figure 33. B). Error bars show the standard deviation of the mean from 4 biological replicates.
Figures 34.A to D. Application of the conjugative probiotic (COP) as presented in the Example IV. The COP system is based on a probiotic cellular chassis delivering a genetic cargo by conjugative transfer. In the present example, the genetic cargo encodes CRISPR- Cas9 which can be transferred to a population of bacteria and target specific strains for elimination based on sequence specific criteria (Figure 34.A). Conjugation is mediated by the transfer machinery encoded by a highly efficient conjugative plasmid, which in this example, directly harbours the genetic cargo hereby forming the conjugative delivery system. Conjugative plasmids are also usually modular, with genes grouped according to their
function (Figure 34.B). Once in the target cell, Cas9 endonuclease is expressed and assembles with the gRNA and scans the entire DNA content of the cell. Once a protospacer sequence is found downstream of a protospacer-adjacent motif (PAM) sequence, Cas9 mediates the double stranded cleavage of the DNA (Figure 34.C). The bacterial COP can be used to selectively target cells in a complex microbial community. The bacterial COP will transfer the genetic cargo to recipient cells; however, the Cas9-gRNA system will only target specific strains from the community. If the target sequence is genomic, the target cells will die from DNA compromised genome integrity; if the target is plasmidic (e.g., virulence associated gene), the plasmid will be cured leading to target cell disarmament (Figure 34.D).
Figures 35.A to D. COP can mediate loss of phenotypic traits through CRISPR-Cas9 extra- chromosomal sequence targeting. TP1 14 (control) or TP1 14::Kill1 was transferred from KN01 to KN02 (harbouring the target plasmid pT) within the mouse intestinal tract. Target strain disarming (pT plasmid curation) was followed through time in feces for 4 days by analyzing colony fluorescence on plate allowing growth of all target recipient cells (Figure 35.A). One-way ANOVA was performed on the raw percentage of the four mice from the control group and three of the test group mice in response to the COP treatment. A representative image shows how green fluorescence was discernable between the colonies that had lost the plasmid and those who did not (Figure 35. B). On the fourth day, plasmid curation results were compared in the feces and in the caecum of the mice and showed high consistency (Figure 35. C). When selecting for transconjugants only, TP1 14::Kill1 achieved 100% disarming rates in all mice, but TP1 14 transfer showed no such activity (Figure 35. D).
Figures 36.A to E. COPs can be administered prophylactically to prevent colonization by an invading strain in vivo. Schematic description of the experiment (Figure 36.A). A probiotic donor strain bearing the TP1 14 plasmid with or without the CRISPR-Cas9 system (COP and control, respectively) was administered (~108 CFUs) 12 hours prior to the target/non-target strain mix. The abundance (Figure 36. B and D) and competitive index (Figure 36. C and E)of the target and non-target strains per mg of feces are presented as a function of time after gavage of the donor strain. The competitive index shows the relative abundance of the target or non-target bacteria. The dotted line in panel B and D is the upper limit for detection of CFU while the y axis is set to the lower limit of detection. In panel C and E, the dotted line represent a competitive ratio equivalent for both strains, in a situation where both strains would have exactly the same fitness.
Figures 37.A to E. COPs can be administered therapeutically to selectively eliminate a target strain in vivo. Schematic description of the experiment (Figure 37.A). The target/non target strain mix was administered (~108 CFUs) 12 hours prior to the probiotic donor strain bearing the TP1 14 plasmid with or without the CRISPR-Cas9 system (COP and control,
respectively). The abundance (Figure 37. B and D)and competitive index (Figure 37. C and E)of the target and non-target strains per mg of feces are presented as a function of time after gavage of the target and non-target strain mix. The competitive index shows the relative abundance of the target or non-target bacteria. The dotted line in panel B and D is the upper limit for detection of CFU while the y axis is set to the lower limit of detection. In panel C and E, the dotted line represent a competitive ratio equivalent for both strains, in a situation where both strain would have exactly the same fitness.
Figures 38.A to C. The genetic cargo can generate beneficial and detrimental effects on bacterial populations. In this experiment, TP1 14::KiM3 encoded a kanamycin resistance gene and a CRISPR-Cas9 system targeting the gene responsible for chloramphenicol resistance. Both TP1 14::KiM3 and TP1 14 (control) were transferred by conjugation in a recipient bacterium bearing a target plasmid (pT). Plasmid curation efficiency was first monitored through antibiotics resistance where co-existence of the plasmid pT with TP1 14 or TP1 14::KiM3 was assessed. Co-selection of pT and TP1 14::KiM3 led to a clear drop in CFU which suggests that selection of the genetic cargo would lead to higher pT loss within the recipient bacteria population. Asterisk indicates that frequency of transconjugant formation was below the limit of detection (<108) (Figure 38.A). Plasmid curation was then evaluated by quantifying the number of fluorescent colonies (containing the pT plasmid) while selecting the acquisition of the genetic cargo using kanamycin (Figure 38. B). A representative picture of bacterial spots from serially-diluted conjugative mixtures after growth of target cells with kanamycin selection (Figure 38. C). Error bars show the standard deviation of the mean from 3 biological replicates.
Figures 39.A and B. Conjugative transfer rates of several plasmids between the EcN donor and different recipient strains from various bacterial species. In vitro transfer of several conjugative plasmids spanning six incompatibility families towards strains representing some of the most infamous multidrug resistant Enterobacteriaceae species. Transfer experiments were performed both on agar (Figure 39.A) and in broth (Figure 39. B). Transfer frequency normalized on recipient CFUs is represented using a grayscale gradient. Data shown are the average of at least 3 biological replicates.
Figure 40. Protection from restriction systems can be acquired through DNA modification. Restriction modification systems are a barrier to horizontal gene transfer. Using donor that possess a modification system compatible with the recipient’s restriction system improves the conjugative efficiency. In the legend, MG1655 = E. coli MG1655 and EcN = E. coli Nissle 1917. Transfer is done between the donor-recipient couple. Data shown are the average of at least 3 biological replicates.
DETAILED DESCRIPTION
The present disclosure relates to the methods and systems for developing and using a conjugative bacterial cell specifically engineered to deliver a payload, such as a therapeutic genetic cargo, in vivo to a recipient bacterial host cell. In a specific embodiment, the conjugative bacterial cell can be used in vivo (e.g., in the gastro-intestinal tract environment or in the bladder, for example). In some embodiments, the conjugative bacterial cell can be used to (1) treat microbiota dysbiosis, (2) modify a microbiota to express beneficial factors, (3) suppress antibiotic resistance and/or the spread of antibiotic resistance, (4) eliminate a specific pathogen, and (5) suppress the expression of bacterial virulence factors.
As used in the context of the present disclosure, the term “derived” refers to the use of genetic material that has been obtained or modified from a naturally-occurring organism.
Components of the conjugative delivery system
The conjugative delivery system of the present disclosure comprises genetic elements present natively or genetically introduced in a bacterium allowing the bacterium to transfer in vivo its genetic cargo to a recipient bacterial cell. The conjugative delivery system comprises two main components : a transfer machinery (which includes the genetic elements required to transfer the genetic cargo) and the genetic cargo itself. The components of the transfer machinery can be located on one or more extrachromosomal vector and/or integrated in the bacterial’s chromosome. The components of the transfer machinery can be located in cis or in trans with respect to each other. The genetic cargo has been genetically enginereed in the conjugative bacterial host cell either by positioning a transport module in operative association with the payload module or by introducing an heterologous genetic cargo in the conjugative bacterial host cell. The genetic cargo which includes a transport module operatively associated with a payload module can be located on an extrachromosomal vector or integrated in the bacterial’s chromosome. The transport module is“operatively associated” with the payload module which allows the transfer of the payload module when the proteins encoded by the mobilization module (e.g., the transport machinery) recognize and act upon the transport module. Therefore, on the genetic cargo, the payload module is located in cis to the transport module at a position allowing the transfer of the payload module when the proteins of the mobilization module associate with the transport module.
In an embodiment, the components of the transfer machinery and of the genetic cargo are exclusively located on one or more extrachromosomal vector. In a specific embodiment, the components of the transfer machinery and of the genetic cargo are located on a single extrachromosomal vector. In another embodiment, the components of the transfer machinery and of the genetic cargo are located on more than one extrachromosomal vectors. For
example, the components of the transfer machinery and of the genetic cargo can be organized in two distinct chromosomal vectors as shown in Figure.23A.
In another embodiment, the components of the transfer machinery and of the genetic cargo are/can be located exclusively in the bacterial’s chromosome.
In yet a further embodiment, the components of the transfer machinery and of the genetic cargo are/can be located on one or more extrachromosomal vectors as well as in the bacterial’s chromosome. For example, the components of the transfer machinery can be located exclusively in the bacterial chromosome and the components of the genetic cargo can be located exclusively in an extrachromosomal vector. In another example, some of the components of the transfer machinery can be located in the bacterial chromosome as well as in one or more extrachromosomal vectors while the components of the genetic cargo can be located exclusively in one or more extrachromosomal vector (such as, for example, the embodiments shown in Figure.23B). In still another example, the components of the transfer machinery can be located exclusively in the bacterial chromosome while some of the components of the genetic cargo can be located in the bacterial chromosome and others can be located in one or more extrachromosomal vector. In still a further example, some of the components of the transfer machinery can be located in the bacterial chromosome as well as in one or more extrachromomal vectors while some of the components of the genetic cargo can be located in the bacterial chromosome and others can be located in one or more extrachromosomal vector.
As used herein, the term "genome" refers to the whole hereditary information of an organism that is encoded in the DNA including both coding and non-coding sequences. The term “module” refers to a group of genes that contribute to a same function. In an embodiment, all genes from a same module are physically linked (in cis) on the same DNA molecule. In yet another embodiment, the genes can be contained on more than one DNA molecule.
As used herein, the term “extrachromosomal vector” refers to a genetic element which is physically distinct from the bacterial genome. The extrachromosomal vector is usually capable of independent replication from the bacterial genome due to the presence of a vegetative replication module. In some embodiments, the extrachromosomal vector is a plasmid, such as, for example, a circular plasmid. Vectors can be circular plasmids, usually when it is intended that the vector is independently replicating from the genome of the donor bacterium, or vectors can be linear DNA molecules integrated in the genome of the donor bacterium. In embodiments in which more than one vector is present, they can be provided in the same or different forms.
In an embodiment the transfer machinery and the genetic cargo can be part of the same nucleic acid molecule or different nucleic acid molecules. The nucleic acid molecules can be circular or linearized (and intended for integration in the bacterial’s chromosome).
The transfer machinery and the genetic cargo include modules comprising genes which can encode one or more proteins, variants thereof or fragments thereof. The protein can be a variant of a a protein known to be encoded by the module. A variant comprises at least one amino acid difference when compared to the amino acid sequence of the native/known protein. As used herein, a variant refers to alterations in the amino acid sequence that do not adversely affect the biological functions of the protein. A substitution, insertion or deletion is said to adversely affect the protein when the altered sequence prevents or disrupts a biological function associated with the heterologous protein. For example, the overall charge, structure or hydrophobic-hydrophilic properties of the protein can be altered without adversely affecting a biological activity. Accordingly, the amino acid sequence can be altered, for example to render the peptide more hydrophobic or hydrophilic, without adversely affecting the biological activities of the heterologous protein. The protein variants have at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identity to the heterologous protein described herein. The term“percent identity”, as known in the art, is a relationship between two or more protein sequences or two or more nucleotide sequences, as determined by comparing the sequences. The level of identity can be determined conventionally using known computer programs. Identity can be readily calculated by known methods, including but not limited to those described in: Computational Molecular Biology (Lesk, A. M., ed.) Oxford University Press, NY (1988); Biocomputing: Informatics and Genome Projects (Smith, D. W., ed.) Academic Press, NY (1993); Computer Analysis of Sequence Data, Part I (Griffin, A. M., and Griffin, H. G., eds.) Humana Press, NJ (1994); Sequence Analysis in Molecular Biology (von Heinje, G., ed.) Academic Press (1987); and Sequence Analysis Primer (Gribskov, M. and Devereux, J., eds.) Stockton Press, NY (1991). Preferred methods to determine identity are designed to give the best match between the sequences tested. Methods to determine identity and similarity are codified in publicly available computer programs. Sequence alignments and percent identity calculations may be performed using the Megalign program of the LASERGENE bioinformatics computing suite (DNASTAR Inc., Madison, Wis.). Multiple alignments of the sequences disclosed herein were performed using the Clustal method of alignment (Higgins and Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAP PENALTY=10, GAP LENGTH PEN ALT Y= 10). Default parameters for pairwise alignments using the Clustal method were KTUPLB 1 , GAP PENALTY=3, WINDOW=5 and DIAGONALS SAVED=5.
The variant protein described herein may be (i) one in which one or more of the amino acid residues are substituted with a conserved or non-conserved amino acid residue (preferably a conserved amino acid residue) and such substituted amino acid residue may or may not be one encoded by the genetic code, or (ii) one in which one or more of the amino acid residues includes a substituent group, or (iii) one in which the mature polypeptide is fused with another compound, such as a compound to increase the half-life of the polypeptide (for example, polyethylene glycol), or (iv) one in which the additional amino acids are fused to the mature polypeptide for purification of the polypeptide. A“variant” of the protein can be a conservative variant or an allelic variant.
The protein can be a fragment of a protein encoded by one of the genes of the module or a fragment of a variant protein. In an embodiment, the fragment corresponds to the known/native protein to which the signal peptide sequence has been removed. In some embodiments, heterologous protein“fragments” have at least at least 100, 200, 300, 400, 500, 600, 700, 800, 900 or more consecutive amino acids of the protein. A fragment comprises at least one less amino acid residue when compared to the amino acid sequence of the known/native heterologous protein and still possess the enzymatic activity of the full- length heterologous protein. In an embodiment, the fragment corresponds to the amino acid sequence of the protein lacking the signal peptide. In some embodiments, fragments of the heterologous protein can be employed for producing the corresponding full-length heterologous by peptide synthesis. Therefore, the fragments can be employed as intermediates for producing the full-length proteins.
Conjugative bacterial cell
As used herein, the terms“conjugative bacterial (host) cell”, “recombinant bacterium” or “donor bacterium” refer to a bacterium capable of horizontal gene transfer via bacterial conjugation. As used in the context of the present disclosure the terms “bacterial conjugation”, “conjugation”, “conjugative transfer” or “transfer” refer to a mechanism of horizontal gene transfer where genetic material (referred to as the genetic cargo) is delivered from a donor bacterium to a target bacterium (also referred to as a recipient bacterial cell) through a conjugative pore forming a channel between the two bacterial cells. The conjugative bacterial cell can be, in some embodiments, modified prior to being used in conjugation so as to remove or inactivate one or more virulence factors. In some embodiments, the conjugative bacterial cell can be a probiotic bacterium which can be referred to as a“conjugative probiotic” or“COP”. As used herein, the term“probiotic” refers to a bacterium that, once administered in adequate amount and via adequate routes, has no detrimental effects and may also provide beneficial effects to its host.
The present disclosure thus provides a bacterium, which can be a probiotic, which has been genetically engineered to bear the conjugative delivery system of the present disclosure. Thus, the present disclosure also provides a process for obtaining the recombinant bacterium by introducing the conjugative delivery system of the present disclosure in a bacterium.
Bacterial genera referred to as probiotic to a human or animal subject and that could be the COP of the present disclosure include, but are not limited to, Bacillus sp., Bifidobacterium sp., Enterococcus sp., Escherichia sp., Lactobacillus sp., Lactococcus sp., Leuconostoc sp., Pediococcus sp. and Streptococcus sp. As such, the present disclosure provides a probiotic recombinant bacterium from the genus Bacillus sp., Bifidobacterium sp., Enterococcus sp., Escherichia sp., Lactobacillus sp., Lactococcus sp., Leuconostoc sp., Pediococcus sp. and Streptococcus sp. as well as a process for making such probiotic bacteria by introducing the conjugative bacterial vector or system in a probiotic bacterium of the genus Bacillus sp., Bifidobacterium sp., Enterococcus sp., Escherichia sp., Lactobacillus sp., Lactococcus sp., Leuconostoc sp., Pediococcus sp. or Streptococcus sp. Bacterial species which are considered probiotic to human subjects include, but are not limited to Bacillus coagulans (e.g., strain GBI-30 or 6086), Bifidobacterium animalis subsp. lactis (e.g., strain BB-12), Bifidobacterium longum subsp. infantis, Enterococcus durans (e.g. strain LAB18s), Escherichia coli (e.g., strain Nissle 1917), Lactobacillus acidophilus (e e.g., strain NCFM), Lactobacillus bifidus, Lactobacillus johnsonii (e.g., strain Lai, LCI or NCC533), Lactobacillus paracasei (e.g. , strain Stl 1 or NCC2461), Lactobacillus plantarum (e.g. , strain 299v), Lactobacillus reuteri (e.g., strain ATCC 55730, SD21 12, Protectis, DSM 17938, Prodentis, DSM 17938, ATCC 55730, ATCC PTA 5289, RC-14), Lactobacillus rhamnosus (e.g., strain GG, GR-1) and Lactococcus thermophiles, Leuconostoc masenteroides (e.g. strain B7), Pediococcus acidilactici (e.g. strain UL5), and Streptococcus thermophilus. As such, the present disclosure provides a probiotic recombinant bacterium from the bacterial species which are considered probiotic to human subjects as well as a process for making such probiotic bacteria by introducing the conjugative bacterial vector or system in a probiotic bacterium of the strains of bacteria considered probiotic in humans. In a specific embodiment, the probiotic is from the genus Escherichia, for example the species Escherichia coli, e.g. E. coli Nissle 1917. The present disclosure provides a process for making such probiotic recombinant bacteria by introducing the conjugative bacterial vector or system in a probiotic species Escherichia coli, e.g. E. coli Nissle 1917.
Transfer machinery
The conjugative bacterial host cell comprises a genetic cargo, a type IV secretion system module, a mobilization module and a mating pair stabilization module comprising a type IV
adhesion pilus, the type IV adhesion pilus comprising an adhesin. In some embodiments, the modules that are not part of the genetic cargo can be organized into the transfer machinery.
The transfer machinery is responsible for allowing the formation of a conjugative pore and the subsequent physical transfer of the genetic cargo into the recipient bacterium. The transfer machinery includes genes and regulatory elements that are divided in different modules further described below. The genes present within those modules can optionally be organized in the form of one or more operons.
As used herein, the term“gene" refers to a nucleic acid molecule containing the sequence information necessary for expression of a protein or a non-coding RNA (e.g. tracrRNA, crRNA, gRNA, rRNA, tRNA, anti-sense RNA). When the gene encodes a protein, it includes the promoter and the structural gene open reading frame sequence (ORF), as well as other sequences involved in the expression of the protein. When the gene encodes a non-coding RNA, it includes the promoter and the nucleic acid that encodes the untranslated RNA. As indicated above, genes may be expressed in the form of one or more operons. As used herein, the term“operon”, as it is known in the art, is a functional unit containing a cluster of genes under the control of a single promoter.
The term regulatory element refers to promoters, activator/repressor binding sites, terminators, enhancers and the like. In an embodiment, more than one promoter is included in the bacterial conjugative delivery system of the present disclosure. In yet another embodiment, only one promoter is included in the conjugative delivery system of the present disclosure.
When present, a promoter can be constitutive or inducible. The terms“constitutive” and “inducible” refer to the dynamic state of expression. A constitutive expression is stable overtime whereas an inducible expression allows a significant change in the level of expression of a gene. An inducible expression can be achieved in various ways such as the activation of transcription by a transcription activator, the repression of transcription by a transcription repressor or the control of translation by a functional 5’ untranslated region commonly referred to as a riboswitch.
In an embodiment, the transfer machinery comprises a type IV secretion system (T4SS) module, a mating pair stabilization module and a mobilization module. In some embodiments, the transfer machinery can optionally comprise a transport module, a regulatory module, a vegetative replication module, a maintenance module, a selection module and/or an exclusion module.
The T4SS module includes genes and regulatory elements responsible for the formation of a type IV secretion system. The T4SS is a protein assembly capable of establishing a
conjugation pore that forms a channel between the donor bacterium and the recipient bacterium. It is through this conjugation pore that the genetic cargo is transferred from the donor bacterium to the recipient bacterium. In some embodiments, the T4SS module (which can be heterologous to the conjugative bacterial cell) is integrated in the genome of the conjugative bacterial cell. In another embodiment, the T4SS module is located in one or more extrachromosomal vectors (such as plasmids) which may be endogenous or heterologous to the conjugative bacterial cell. The genes present in the T4SS module include, but are not limited to, one or more of virB1 (TP1 14-012 : traB), virB2 (TP1 14-013 : traC), virB3 (TP1 14-014 : traD), virB4 (TP1 14-015 : traE), virBS (TP1 14-004 : trbJ ), virB6 (TP1 14-003 : traA), virB7 (TP1 14-01 1 : ygeA ), virB8 (TP1 14-017 : traG), virB9 (TP1 14-018 : traH ), virBIO (TP1 14-019 : tral ), virB11 (TP1 14-020 : traJ) and/or virD4 (TP1 14-021 : traK). As such, the T4SS module can include one or more genes encoding one or more proteins of a T4SS. In addition, one or more T4SS conjugative pore, as well as, one or more different types of T4SS can be encoded by the T4SS module and expressed by the donor bacterium. In an embodiment, the genes encoding the T4SS can be derived from one or more of the following family of bacterial conjugative plasmids MPFT, MPFf, MPFI, MPFFATA, MPFb, MPFfa, MPFQ and/or MPFC. In another embodiment, the genes encoding the T4SS can be derived from one of the MPFT family of bacterial conjugative plasmids. For example, the genes encoding the T4SS can be derived from the bacterial plasmid TP1 14. In another example, the genes encoding the T4SS can be derived from the bacterial plasmid R6K. In yet another embodiment, the genes encoding the T4SS can be derived from one of the MPFF family of conjugative plasmids. In yet another embodiment, the genes encoding the T4SS can be derived from the bacterial vector F (or pOX38).
The transfer machinery also includes a mating pair stabilization module. The mating pair stabilization module includes genes and regulatory elements responsible for the stabilization of the physical interaction of the donor bacterium with the target bacterium. As shown in Example II below, stabilizing the interaction between the donor bacterium and the target bacterium favors maintaining a physical proximity necessary for the establishment of the T4SS conjugative pore, which is important for the subsequent transfer of the genetic cargo in an unstable environment (in vivo or liquid for example). The stabilization of the interaction between the donor bacterium and the target bacterium is particularly important in vivo (e.g., in the gastro-intestinal environment or the bladder) where perturbations could affect transfer from the conjugative bacterial cell to target bacterium. The mating pair stabilization module (which may be heterologous to the conjugative bacterial cell) can be integrated in the genome of the conjugative bacterial cell or located on one or more bacterial vectors.
The mating pair stabilization module includes genes and regulatory elements responsible for the formation of a type IV adhesion pilus. The mating pair stabilization module (which may be heterologous to the conjugative bacterial cell) can be integrated in the genome of the conjugative bacterial cell or located on one or more bacterial vectors. Type IV adhesion pilus, as used herein, are protein assemblies forming long thin filaments that protrude from, and retract into, bacterial cells. The presence of type IV adhesion pilus on the membrane of a donor bacterium is believed to facilitate the“capture” of a target bacterium by physically “grabbing” it and“pulling” it. The presence of type IV adhesion pilus on the membrane of a donor bacterium thus stabilizes the interaction of the donor bacterium with the target bacterium. Type IV adhesion pilus genes include, but are not limited to one or more of pilL (TP 1 14-009), pilN (TP 1 14-022), p/VO (TP 1 14-023), pilP (TP 1 14-024), pilQ (TP 1 14-025), pilR (TP1 14-026), pilS (TP1 14-027), pilT (TP1 14-028), traN, traB (TP1 14-012), pilU (TP1 14-029) and/or pilV (TP1 14-030). As such, the mating pair stabilization module can include one or more genes encoding one or more proteins of a type IV adhesion pilus. In addition, one or more type IV adhesion pilus, as well as, one or more different types of type IV adhesion pilus can be encoded by the mating pair stabilization module and expressed by the donor bacterium. In an embodiment, the genes encoding the type IV adhesion pilus can be derived from one of the following family of bacterial conjugative plasmids IncA, IncB/O (Ind O), IncC, IncD, IncE, IncFI , lncF2, IncG, IncHM , lncHI2, Inch , Incl2, IncJ, IncK, IncL/M, IncN, IncP, IncQI , lncQ2, IncR, IncS, IncT, IncU, IncV, IncW, IncXI , lncX2, IncY, IncZ, ColE1 , ColE2, ColE3, p15A, pSC101 , lncP-2, lncP-5, lncP-7, lncP-8, lncP-9, Ind , Inc4, Inc7, Inc8, Inc9, Ind 1 , Ind 3, Inc14, Ind 8. In another embodiment, the genes encoding the type IV adhesion pilus can be derived from one of the incompatibility family of bacterial conjugative plasmids capable belonging to the l-complex: IncU , Incl2, Incly, IncB/O (Ind O), IncK and/or IncZ. In another embodiment, the genes encoding the type IV adhesion pilus can be derived from one of the Incl2 family of bacterial conjugative plasmids. For example, the genes encoding the the type IV adhesion pilus can be derived from the bacterial vector TP1 14.
The type IV adhesion pilus comprises an adhesin. Adhesin are proteins which can, when displayed on the surface of a donor bacterium membrane, interact with various molecules present on the outer membrane of a target bacterium (e.g., proteins, sugars, lipids). For example, the PilV adhesin from the Incl2 family of bacterial conjugative plasmids interacts with receptors such as lipopolysaccharides (LPS), which are molecules typically found on the outer membrane of Gram-negative bacteria. Hence, if a donor bacterium displays a PilV adhesin on its outer membrane, PilV will bind to the LPS of a Gram-negative target bacterium and stabilize the interaction of the two cells. Adhesins include, but are not limited to, one or more of pilV (TP1 14-030) from TP114, pilV from R64, traN from pOX38. In some
embodiments, one or more adhesin, as well as, one or more different types of adhesin can be encoded by the mating pair stabilization module and expressed by the donor bacterium. In an embodiment, adhesins can be displayed on the surface of the donor bacterium by either being part of an accessory pili protein assembly (e.g., like type IV adhesion pilus), and/or by being part of a T4SS conjugative pili protein assembly, and/or by being part of any molecular complex allowing the adhesin to be displayed on the surface of the bacterium. In another embodiment, the genes encoding adhesins can be derived from one of the following family of bacterial conjugative plasmids IncA, IncB/O (Ind O), IncC, IncD, IncE, IncFI , lncF2, IncG, IncHM , lncHI2, Inch , Incl2, IncJ, IncK, IncL/M, IncN, IncP, IncQI , lncQ2, IncR, IncS, IncT, IncU, IncV, IncW, IncXI , lncX2, IncY, IncZ, ColE1 , ColE2, ColE3, p15A, pSC101 , IncP- 2, lncP-5, lncP-7, lncP-8, lncP-9, Ind , Inc4, Inc7, Inc8, Inc9, Inc1 1 , Inc13, Inc14, Inc18. In another embodiment, the genes encoding the type IV adhesion pilus can be derived from one of the incompatibility family of bacterial conjugative plasmids capable belonging to the I- complex: IncU , Incl2, Incly, IncB/O (Ind O), IncK and/or IncZ. In yet another embodiment, the genes encoding the adhesins can be derived from one of the Incl2 family of bacterial conjugative plasmids. For example, the genes encoding adhesins can be derived from the bacterial vector TP114. In yet another embodiment, the genes encoding the adhesins can be derived from one of the IncFII family of bacterial conjugative plasmids. For example, the genes encoding adhesins can be derived from the bacterial vector pOX38. In yet another embodiment, the genes encoding the adhesins can be derived from one of the IncX family of bacterial conjugative plasmids. For example, the genes encoding adhesins can be derived from the bacterial vector pR6K.
Adhesin genes can optionally be rearranged by the presence of shufflons and the activity of a shufflase. A shufflon is a cluster of multiple DNA inversions segments which can be located in the 3’ end of an adhesin gene. Under the action of a shufflase, an enzyme with a recombinase activity, the sequential order of different segments of the shufflon can be randomly rearranged. Following this rearrangement, the one segment that aligns with the adhesin gene becomes the end of the adhesin gene. Therefore, when an adhesin gene is associated with a shufflon, the distal section of the gene is variable and can potentially be any of the different DNA inversions segments included in the shufflon. As a result, when an adhesin gene with a shufflon is transcribed and translated, the C-terminus end of the adhesin is also variable and corresponds to a shufflon’s inversion segment that ends the adhesin gene. Each shufflon’s segment confers specific binding affinities to the adhesin protein. For example, for the shufflon adjacent to the pilV adhesin gene of the Incl2 family of conjugative plasmids, each shufflon’s segment confers to the PilV adhesin binding affinities to specific receptors. Therefore, when a shufflon segment is aligned to an adhesin gene, it modulates
the binding affinity of the corresponding adhesin protein. When a donor bacterium displays an adhesin, a shufflon can thus be used to influence the stability of the interaction between the donor bacterium and the target bacterium. Shufflons include, but are not limited to the following DNA sequences Shufflase recognition sites 5’-GTGCCAATCCGGTNNNTGG-3’ (SEQ ID NO: 140, abbreviated srs), alternative ORF to be re-arranged ( altORFs ). As such, one or more genes encoding one or more adhesin proteins present in the mating pair stabilization module can possess a shufflon. In an embodiment, the DNA sequence of the shufflon can be derived from one of the following family of bacterial conjugative plasmids IncA, IncB/O (Ind O), IncC, IncD, IncE, IncFI , lncF2, IncG, IncHM , lncHI2, Inch , Incl2, IncJ, IncK, IncL/M, IncN, IncP, IncQI , lncQ2, IncR, IncS, IncT, IncU, IncV, IncW, IncXI , lncX2, IncY, IncZ, ColE1 , ColE2, ColE3, p15A, pSC101 , lncP-2, lncP-5, lncP-7, lncP-8, lncP-9, Ind , Inc4, Inc7, Inc8, Inc9, Inc1 1 , Inc13, Inc14, Inc18. In another embodiment, the DNA sequence of the shufflon can be derived from one of the incompatibility family of bacterial conjugative plasmids belonging to the l-complex: IncU , Incl2, Incly, IncB/O (Ind O), IncK and/or IncZ. In another embodiment, the DNA sequence of the shufflon can be derived from one of the Incl family of bacterial conjugative plasmids. In yet another embodiment, the DNA sequence of the shufflon can be derived from one of the Incl2 family of bacterial conjugative plasmids. In yet another embodiment, the DNA sequence of the shufflon can be derived from the bacterial vector TP1 14.
When the gene encoding the adhesin comprises a shufflon, the mating pair stabilization module comprises one or more genes encoding a shufflase. Shufflases are recombinases capable of reorganizing the shufflon’s DNA inversions segments which, as indicated above, can affect the binding activity and specificity of adhesin proteins. Shufflases include, but are not limited to one or more of rci (TP1 14-031). As such, the mating pair stabilization module can include the one or more genes encoding the one or more shufflase proteins. In addition, one or more shufflase, as well as, one or more different types of shufflase can be encoded by the mating pair stabilization module and expressed by the donor bacterium. In an embodiment, the genes encoding shufflases can be derived from one of the following family of bacterial conjugative plasmids IncA, IncB/O (Ind O), IncC, IncD, IncE, IncFI , lncF2, IncG, IncHM , lncHI2, IncU , Incl2, IncJ, IncK, IncL/M, IncN, IncP, IncQI , lncQ2, IncR, IncS, IncT, IncU, IncV, IncW, IncXI , lncX2, IncY, IncZ, ColE1 , ColE2, ColE3, p15A, pSC101 , lncP-2, lncP-5, lncP-7, lncP-8, lncP-9, Ind , Inc4, Inc7, Inc8, Inc9, Ind 1 , Ind 3, Inc14, Ind 8. In another embodiment, the shufflon and/or shufflase can be derived from one of the incompatibility family of bacterial conjugative plasmids belonging to the l-complex: Inch , Incl2, Incly, IncB/O (Ind O), IncK and/or IncZ. In another embodiment, the genes encoding the shufflases can be derived from one of the Incl family of bacterial conjugative plasmids. In
yet another embodiment, the genes encoding the shufflases can be derived from one of the Incl2 family of bacterial conjugative plasmids. For example, the genes encoding shufflases can be derived from the bacterial vector TP1 14.
The mobilization module encodes a relaxosome, e.g., a protein complex capable of recognizing the transport module (an origin of transfer (or/7)) which is operatively associated with the payload module and subsequently transfers the genetic cargo through the conjugative pore into the recipient bacterium. The mobilization module (which can be heterologous to the conjugative bacterial cell) can be integrated in the chromosome of the conjugative bacterial cell or can be located on one or more bacterial vectors. The mobilization module includes, but are not limited to one or more of virC1 (TP1 14-68 : parA), (TP1 14-41 : nikB) and/or (TP1 14-42 : nikA). The mobilization module can be derived from at least one of the following conjugative families MOBF, MOBP, MOBv, MOBH, MOBc and/or MOBQ. In another embodiment, the genes encoding the mobilization machinery can be derived from one of the MOBP family of bacterial conjugative plasmids. For example, the genes encoding the mobilization machinery can be derived from the bacterial vector TP1 14. In yet another example, the genes encoding the mobilization machinery can be derived from the bacterial vector R6K. In another embodiment, the genes encoding the mobilization machinery can be derived from one of the MOBP family of bacterial conjugative plasmids. For example, the genes encoding the mobilization machinery can be derived from the bacterial vector pOX38.
The transport module is a component of the genetic cargo which can also be present in the transfer machinery when the elements of the genetic cargo and of the transfer machinery are in cis organization. The transport module includes one or more functional DNA elements acting as an origin of transfer (or/7) of the genetic cargo into the recipient bacterium. The transport module may be heterologous to the conjugative bacterial cell. The transport module is cis- acting and is thus found on the genetic component (chromosome or extrachromosomal vector) comprising the genetic cargo. As indicated above, the transport module is acted upon by the mobilization module. As used in the context of the present disclosure, the term “mobilization” refers to the process by which a conjugative plasmid accomplishes the transfer, from a donor bacterium to a recipient bacterium, of a DNA molecule that contains an origin of transfer (or/7). The term “origin of transfer” (abbreviated or/7) refers to a DNA sequence that, when present in a DNA molecule, is recognized by the corresponding mobilization proteins and allows its mobilization.
The regulatory module, when present in the transport machinery, can include one or more genes and regulatory elements encoding one or more proteins or non-coding RNAs capable of regulating the expression of genes or capable of being used to regulate the expression of
genes (e.g., an activator, a repressor, a riboswitch, CRISPR-Cas9, Zinc Finger Nuclease (ZFN), a TALE, taRNA). The regulatory module (which can be heterologous to the conjugative bacterial cell) can be integrated in the genome of the conjugative bacterial cell or located on one or more bacterial vector. In an embodiment, the regulatory genes and elements can be on a distinct nucleic acid molecule than the modules of the transfer machinery or of the conjugative delivery system. In another embodiment, the regulatory genes and elements can be isolated from different sources such as, but not limited to, the same plasmid as the other modules, another plasmid, a bacterial chromosome, a phage, a eukaryote chromosome, an archaebacterium. In yet another embodiment, the regulatory genes and elements can be engineered or evolved from naturally occurring genes. The regulatory proteins or non-coding RNAs encoded by the regulatory module can be used to induce or repress genes located on the chromosome of the bacterium hosting the delivery system, as well as to induce or repress genes located on any of the modules of the transfer machinery or of the genetic cargo. In an embodiment, the regulatory module includes one or more genes encoding a one or more regulatory proteins or non-coding RNAs such as, but not limited to, yajA (TP1 14-058), yafA (TP1 14-069), yaeC (TP1 14-070), yheC (TP1 14- 085), fur, fnr, korA, acaCD, acr1, acr2, stbA, twrA, ResP, kfrA, ardK, Cas9, crRNA, ZFN, TALEN, taRNA, toehold switch, araCJetR, lad and/or laclq.
When the transfer machinery is located (in total or in part) in an extrachromosomal vector, the extrachromosomal vector includes a vegetative replication module. The vegetative replication module of the transfer machinery can be the same or different from the vegative replication module of the genetic cargo. A vegetative replication module is required when the transfer machinery is located on one or more vector that replicate independently from the genome of the bacterial host. In that case, the one or more extrachromosomal vectors containing the transfer machinery need a vegetative replication module to replicate and be maintained in the bacterial host. The vegetative replication module comprises one or more functional DNA elements acting as an origin of vegetative replication ( oriV ). The oriV is a DNA sequence present on a genetic element that, when recognized by the replication machinery encoded by the maintenance module, allows a plasmid to replicate in a bacterial host. For versatile use, and for the maintenance of vectors in a large range of bacterial hosts, the oriV of the vegetative replication module can be a broad-host-range oriV (i.e. and oriV recognized by a broad range of bacterial host species). In some embodiments, where the maintenance of vectors needs to be restricted to a limited range of bacterial hosts, it may be preferable to use an oriV with a restricted or narrow host range (i.e. an oriV recognized by a limited range of bacterial host species).
When the transfer machinery is located (in totally or in part) on an extrachromosomal vector, the conjugative bacterial host cell includes a maintenance module (which can be considered, in some embodiments, part of the transfer machinery). The maintenance module includes proteins (referred to as replication machinery) capable of recognizing the oriV of the vegetative replication module and allows the replication of extrachromosomal vectors (e.g., plasmids) containing the vegetative replication module. The maintenance module includes proteins capable of recognizing the oriV of the vegetative replication module and allows the replication of plasmids containing the vegetative replication module. The maintenance module can be heterologous to the conjugative bacterial cell. When the maintenance of vectors needs to be restricted to the donor bacterium, it may be preferable to locate (e.g., integrate) the maintenance module into the donor bacterium chromosome. Alternatively, the maintenance module may be located on one or more extrachromosomal vector. The maintenance module can also comprise one or more genes and regulatory elements responsible for adequate DNA partitioning. The genes responsible for partitioning may include proteins responsible for equal segregation of plasmid copies into daughter cells, toxin and anti-toxin stabilization system and/or helicase and DNA primase which helps the replicative machinery in the replication of the plasmid. The proteins of the maintenance module include, but are not limited to one or more of proteins often annotated as repA (TP1 14-083 : repA ), TP1 14-082, parA (TP1 14-068: parA), parB, DNA primase (TP1 14-006: ygiA ), a toxin (e.g. vcrx028 from pVCR94, TP1 14-051 : ycfA from TP1 14), an antitoxin (e.g. vcrx027 from pVCR94, TP1 14-050: ycfB from TP1 14), DNA topoisomerases (TP1 14-035: ydiA and TP1 14-036: ydgA). As such, the maintenance module includes the one or more genes encoding the one or more proteins of the replicative machinery and can also include zero or more genes and regulatory elements coding for DNA partitioning protein. In addition, one or more replicative machinery, as well as, one or more different types of replicative machinery can be present in the maintenance module. In an embodiment, the maintenance module and/or the vegetative replication module can be derived from one of the following family of bacterial vectors IncA, IncB/O (Ind O), IncC, IncD, IncE, IncFI , lncF2, IncG, IncHM , lncHI2, Inch , Incl2, IncJ, IncK, IncL/M, IncN, IncP, IncQI , lncQ2, IncR, IncS, IncT, IncU, IncV, IncW, IncXI , lncX2, IncY, IncZ, ColE1 , ColE2, ColE3, p15A, pSC101 , lncP-2, lncP-5, lncP-7, lncP-8, lncP-9, Ind , Inc4, Inc7, Inc8, Inc9, Inc1 1 , Inc13, Inc14 and/or Inc18. In an embodiment, maintenance module and/or the vegetative replication module can be derived from one of the Incl2 family of bacterial vectors. For example, maintenance module and/or the vegetative replication module can be derived from the bacterial vector TP1 14.
The transfer machinery can also include one or more selection module. The selection module includes one or more genes conferring a selectable trait for identifying bacteria
bearing one or more modules of the transfer machinery. The selection module is operatively connected with the one or more bacterial vector and/or the integrated modules of the transfer machinery. The selection module of the transfer machinery can be the same or different from the selection module of the genetic cargo. The selectable trait can be, but is not limited to, an antibiotic resistance gene, a gene coding for a fluorescent protein (including a green fluorescent protein), an auxotrophic selection marker, a gene coding for a b-galactosidase (e.g. , the bacterial lacZ gene), a gene coding for a luciferase, a gene coding for a chloramphenicol acetyltransferase (e.g. , the bacterial cat gene), a gene coding for a b- glucuronidase.
The exclusion module, when present in the transfer machinery, includes one or more of genes encoding exclusion proteins. The exclusion module (which can be endogenous or heterologous to the conjugative bacterial cell) can be located in the bacterial chromosome or in one or more extrachromosomal vectors. Exclusion proteins limit the horizontal transfer of genetic material by rendering a bacterium resistant to conjugative plasmids. For example, a bacterium that expresses exclusion proteins (e.g., excAB) against a specific bacterial conjugative plasmid (e.g., R64) can no longer receive this plasmid through conjugation. This phenomenon can be used to avoid futile conjugative transfer between conjugative bacterial cell bacteria. For instance, if conjugative bacterial cell bacteria are designed to express an exclusion protein directed against their own transfer machinery used to propagate the genetic cargo, transfer between conjugative bacterial cell bacteria can no longer occur (or at significantly lower rates). Exclusion proteins include, but are not limited to one or more of TP1 14-05 from TP1 14, excA and excB from plasmid R64, trbK from RP4, traS and traT from plasmid F (pOX38). As such, the exclusion module can include one or more genes encoding one or more exclusion proteins. In an embodiment, the genes encoding exclusion proteins can be derived from one of the following family of bacterial conjugative plasmids IncA, IncB/O (Ind O), IncC, IncD, IncE, IncFI , lncF2, IncG, IncHM , lncHI2, Inch , Incl2, IncJ, IncK, IncL/M, IncN, IncP, IncQI , lncQ2, IncR, IncS, IncT, IncU, IncV, IncW, IncXI , lncX2, IncY, IncZ, ColE1 , ColE2, ColE3, p15A, pSC101 , lncP-2, lncP-5, lncP-7, lncP-8, lncP-9, Ind , Inc4, Inc7, Inc8, Inc9, Ind 1 , Ind 3, Inc14, Ind 8. In another embodiment, the genes encoding exclusion proteins can be derived from one of the Incl2 family of bacterial conjugative plasmids. For example, the genes encoding exclusion proteins can be derived from the bacterial vector TP1 14.
Genetic cargo
The genetic cargo is intended to be delivered by the conjugative bacterial cell donor bacterium to a target bacterium via the transfer machinery. The genetic cargo includes genes and regulatory elements which are divided in different modules further described below. The
genes present within those modules can optionally be organized in the form of one or more operons.
The genetic cargo comprises a payload module which is operatively associated with a transport module. The transport module is“operatively associated” with the payload module which allows the transfer of the payload module when the proteins encoded by the mobilization module associate with the transport module. The genetic cargo is heterologous to the conjugative bacterial host cell because at least one of the payload module or the transport module has been genetically introduced in the conjugative bacterial host cell in order to operatively associate the transport module with the payload module. The genetic cargo can optionally include a selection module, a vegetative replication module and/or a mobilization module.
The payload module can include, but is not limited to, genes, regulatory elements, noncoding RNAs (such as siRNAs, shRNAs and miRNAs for example), transposons, genomes (e.g. , phage, or bacterial). In a specific embodiment, the payload module encodes a guide RNA (gRNA) and/or a CRISPR-array (crRNA and tracrRNA) that can be recognized and acted upon by the recipient cell. The payload module can encode for one or more proteins, and/or one or more non-coding genetic elements (such as RNA for example). The payload module can also be a combination of one or more genes, and/or regulatory elements, and/or non-coding RNA, and/or transposons, and/or genome.
In a specific embodiment, the payload module includes one or more heterologous genes encoding one or more heterologous proteins or functional RNA which are intended to be expressed in a recipient bacterium. In the context of the present disclosure, the expression of the heterologous gene(s) in the recipient bacterium can be beneficial, neutral or detrimental to the recipient bacterium. An heterologous gene is considered beneficially expressed in a recipient bacterium when its expression causes a biological advantage to the recipient bacterium. Beneficially expressed heterologous genes include, but are not limited to lacZ, lacY, lacA, galE, galT, galK, gadD, gadT, gadP, scrA, scrB, merA, AN-PEP. An heterologous gene is considered neutrally expressed in a recipient bacterium when its expression does not provide a biological advantage and also fails to provide a biological disadvantage to the recipient bacterium. Neutrally expressed heterologous genes include, but are not limited to, proteins exhibiting a therapeutic benefit to the subject having the recipient bacterium (e.g. , therapeutic proteins such as eukaryotic growth factors, hormones (e.g., glucagon-like peptide-1 or GLP-1 , insulin, etc.), cytokines including interleukins (e.g., interleukin 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16 or 17), and/or chemokines (e.g., CC chemokines, CXC chemokines, C chemokines or CX3C chemokines). An heterologous gene is considered detrimentally expressed in a recipient bacterium when its expression provides a biological
disadvantage to the recipient bacterium (for example, a reduction in cell growth, an increase in sensitivity to an antibiotic and/or an increase in mortality). Detrimentally expressed heterologous gene include, but are not limited to, nucleases (for example, transcription activator-like effector nucleases (TALEN), zinc finger nucleases (ZFN) and clustered regularly interspaced short palindromic repeat (CRISPR) associated DNA-binding (Cas) proteins and analogs thereof, endonuclease restriction enzymes (e.g., ApaLI, BamHI, Bglll, Dpnl, EcoR1 , EcoRV, Hindlll, Pvul, Pvull, Xhol), and toxins or protein toxic for the recipient bacterium (e.g. Lysins, Vcrx028, MazF, HicB, KikA, CcdB, microcins).
In a specific embodiment, the heterologous protein encoded by the payload module is a Cas protein or a Cas protein analog. As used in the context of the present disclosure, a Cas protein or an associated analog is an endonuclease capable of mediating a double-strand cut (either blunt or staggered) in a DNA molecule at the specific location where a CRISPR RNA (crRNA) localizes on the DNA molecule. The Cas protein can be a type I, type II, or type III CRISPR RNA-guided endonuclease. In the context of the present disclosure, a“Cas protein analog” refers to a variant of the Cas protein, or to a fragment of the Cas protein, capable of mediating a double-strand cut (either blunt or staggered) in a DNA molecule at the specific location where a CRISPR RNA (crRNA) localizes on the DNA molecule, or capable of mediating a single stand cut in a DNA or RNA molecule at the specific location where a CRISPR RNA (crRNA) localizes on the DNA or RNA molecule.
A Cas protein variant comprises at least one amino acid difference when compared to the amino acid sequence of the native Cas protein. As used herein, a variant refers to alterations in the amino acid sequence that does not adversely affect the biological functions of the Cas protein analog. A substitution, insertion or deletion is said to adversely affect the protein when the altered sequence prevents or disrupts a biological function associated with the Cas protein. For example, the overall charge, structure or hydrophobic-hydrophilic properties of the protein can be altered without adversely affecting a biological activity. Accordingly, the amino acid sequence can be altered, for example to render the peptide more hydrophobic or hydrophilic, without adversely affecting the biological activities of the Cas protein. The Cas protein variants have at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identity to the Cas proteins described herein. The term“percent identity”, as known in the art, is a relationship between two or more polypeptide sequences or two or more polynucleotide sequences, as determined by comparing the sequences. The level of identity can be determined conventionally using known computer programs. Identity can be readily calculated by known methods, including but not limited to those described in: Computational Molecular Biology (Lesk, A. M., ed.) Oxford University Press, NY (1988); Biocomputing: Informatics and Genome Projects (Smith, D. W., ed.) Academic Press, NY
(1993); Computer Analysis of Sequence Data, Part I (Griffin, A. M., and Griffin, H. G., eds.) Humana Press, NJ (1994); Sequence Analysis in Molecular Biology (von Heinje, G., ed.) Academic Press (1987); and Sequence Analysis Primer (Gribskov, M. and Devereux, J., eds.) Stockton Press, NY (1991). Preferred methods to determine identity are designed to give the best match between the sequences tested. Methods to determine identity and similarity are codified in publicly available computer programs. Sequence alignments and percent identity calculations may be performed using the Megalign program of the LASERGENE bioinformatics computing suite (DNASTAR Inc., Madison, Wis.). Multiple alignments of the sequences disclosed herein were performed using the Clustal method of alignment (Higgins and Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAP PENALTY=10, GAP LENGTH PEN ALT Y= 10). Default parameters for pairwise alignments using the Clustal method were KTUPLB 1 , GAP PENALTY=3, WINDOW=5 and DIAGONALS SAVED=5.
The variant Cas proteins described herein may be (i) one in which one or more of the amino acid residues are substituted with a conserved or non-conserved amino acid residue (preferably a conserved amino acid residue) and such substituted amino acid residue may or may not be one encoded by the genetic code, or (ii) one in which one or more of the amino acid residues includes a substituent group, or (iii) one in which the mature protein is fused with another compound, such as a compound to increase the half-life of the protein, or (iv) one in which the additional amino acids are fused to the mature protein for purification of the polypeptide. A“variant” of the Cas protein can be a conservative variant or an allelic variant.
The Cas protein analog can be a fragment of a known/native Cas proteins. Cas protein “fragments” (including baking enzyme“fragments”) have at least at least 100, 200, 300, 400, 500, 600, 700, 800, 900 or more consecutive amino acids of the Cas protein. A fragment comprises at least one less amino acid residue when compared to the amino acid sequence of the known/native Cas protein and still possess the endonucleic activity of the full-length Cas protein. In some embodiments, fragments of the Cas proteins can be employed for producing the corresponding full-length Cas proteins by peptide synthesis. Therefore, the fragments can be employed as intermediates for producing the full-length proteins. In some embodiments, the Cas protein fragments can have at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identity to the Cas proteins described herein.
In an embodiment, the Cas protein is a Cas9 protein and allows for the formation of blunt ends at the cleavage site. In an embodiment, the Cas9 protein can be derived, for example, from Streptococcus pyogenes. The Cas9 protein acts in collaboration with a CRISPR RNA (crRNA) moiety and trans-activating CRISPR RNA (tracrRNA) moiety to specifically cleave
double-stranded DNA. The crRNA moiety can be specific to a nucleic acid sequence in a double stranded DNA (present in the recipient bacterium for example), and in the presence of such nucleic acid sequence and the Cas9 protein, forms a duplex with the nucleic acid sequence to specifically direct the Cas9 endonuclease activity in the duplex region. The tracrRNA specifically binds to the Cas9 protein and allows a close association with the crRNA. In an embodiment in which the Cas9 protein is the heterologous protein, the payload module can also include a gene encoding the crRNA and/or the tracrRNA. In another embodiment in which the Cas9 protein is the heterologous protein, the payload module nucleic acid molecule can comprise a gene coding for a guide RNA (gRNA). The gRNA includes, on the same gene transcript, both a CRISPR RNA (crRNA) and trans-activating CRISPR RNA (tracrRNA).
In another embodiment, the Cas protein is a Cpf1 protein and allows for the formation of staggered ends at the cleavage suite. In an embodiment, the Cpf1 protein can be derived, for example, from Francissella novicida. Unlike the Cas9 protein, the Cpf1 protein only requires the presence of crRNA to mediate specific cleavage of the double stranded DNA. As such, in embodiments in which the Cpf1 protein is used as the Cas protein, the payload module includes a CRISPR RNA (crRNA) and does not need to include a trans-activating CRISPR RNA (tracrRNA).
The present disclosure provides that the crRNA found on the payload module is recognizable by the Cas protein. This means that the crRNA is able to direct the endonuclease of a type I or type II Cas protein to a specific location on a double stranded DNA molecule, or to direct the endonuclease of a type III Cas protein to a specific location on a RNA molecule. Since that, in such embodiments, what is required is that the crRNA forms a duplex at one or more specific location (e.g., one or more target location) in the recipient bacterium genome, or at one or more specific location on RNA molecules of the recipient bacterium, then the crRNA must be substantially complementary to the one or more target location on the genome in the recipient bacterium, or on RNA molecules present in the recipient bacterium. As used herein, the term "genome" includes the chromosomal and plasmidic DNA of a bacterium. As also used herein, the term“substantially complementary” refers to the sequence of the crRNA having a minimal level of complementary so as to allow it to form a specific duplex with the one or more target location in the recipient bacterium genome, or RNA molecules present in the recipient bacterium.
In one embodiment, the crRNA is substantially complementary to a target sequence present in single or multiple copies in the recipient bacterium. In such embodiment, the transfer of the genetic cargo in the recipient bacterium will allow for the expression of the crRNA (which will form a plurality of duplexes in the recipient bacterium) and the Cas protein in the recipient
bacterium which will eventually lead to the formation of multiple double-strand DNA cuts in the target bacterial genome. These multiple double-strand DNA cuts will eventually lead to a reduction in the viability of the recipient bacterium, most likely, in the death of the recipient bacterium.
In embodiments in which killing of the recipient bacterium is not desired (for example to avoid inflammatory reactions triggered by the death of a population of recipient bacterium), the crRNA can be substantially complementary to a single location in the genome of the recipient bacterium, for example, a specific gene in a recipient bacterium. The payload module would also have to contain a DNA molecule that can be used as a template to repair the target locus and introduce an inactivating mutation that also can protect from the crRNA targeting. For example, the crRNA can be substantially complementary to a gene coding for a virulence factor in the recipient bacterium, or an RNA coding for a virulence factor in the recipient bacterium. In such embodiments, the introduction of the payload module will lead to the inactivation of the virulence factor by introducing the mutated reparation template into the virulence factor gene without altering the viability of the recipient bacterium or causing deleterious effects in the subject bearing the recipient bacterium. The virulence factor can be located on the chromosome of the recipient bacterium or on a plasmid of the recipient bacterium.
The virulence factor in the recipient bacterium can be for example a gene conferring resistance to a drug, such as, for example, an antibiotic. The term "antibiotic resistance gene" encompasses a gene, or the encoding portion thereof, which encodes a protein or transcribes a functional RNA that confers antibiotic resistance. For example, the antibiotic resistance gene may be a gene or the encoding portion thereof which contributes to (1) an enzyme which degrades an antibiotic, (2) an enzyme which modifies an antibiotic, (3) a pump such as an efflux pump for the antibiotic, or (4) a mutated target which suppresses the effect of the antibiotic. Gene coding for an antibiotic resistance trait include, but are not limited to, the aadA2, aadA, aacC,aacA1 , aphA, strAB, pbp1A, pbp1B, pbp2A, pbp2B, dac, blaCMY-2, floR, cmlA, cat, cmx, ermA, mph2, met, erm(x), mecA, aadAla, sul1, sul2, tetA, tet(W), blaSHV-1, dhfr, van(A), van(B) and blaNDMi.
The virulence factor in the recipient bacterium can be, for example, a gene encoding a toxin. Gene coding for a toxin include, but are not limited to, ccdB, relE, parE, doc, vapC, hipA, stl, espA, pag, ctxA, ctxB, tcpA, exoU, exoS, exoT, SgiT and hipB.
The virulence factor can be a structure or a component, such as a pilus, a fimbriae, a flagella or pumps. Gene encoding for virulent component include, but are not limited to fimA, csgD, toxT, cps, ptk, epsA, mia, ssrB, acrA, acrB, tolC and csgA.
In a specific embodiment, the crRNA is specific to genes, or to RNA molecules derived from genes, coding for a virulence factor found in an Escherichia sp., such as, for example, a gene coding for a virulence factor in Escherichia coli. Virulence factors found in Escherichia coli include, but are not limited to, those described in WO2015/148680. In a specific embodiment, genes encoding a virulence factor include antibiotic resistance genes and shiga toxin genes in Escherichia coli (e.g., multidrug resistance shiga-toxin producing E. coli). In another specific embodiment, the genes encoding the virulence factor include gene coding for a pilus (e.g., for example a type 1 pilus) in Escherichia coli (e.g., adherent-invasive E. coli).
The transport module is a component of the genetic cargo and includes a functional DNA locus responsible for the physical transport of the genetic cargo into the recipient bacterium. The transport module comprises an origin of transfer (oriT), e.g., a nucleic acid sequence allowing the transfer of a vector from the donor bacterium to the recipient bacterium. The transport module may be heterologous to the conjugative bacterial cell. The transport module is cis- acting and is thus found on the genetic component (chromosome or extrachromosomal vector) comprising the genetic cargo. As indicated above, the transport module is acted upon by the mobilization module. As used in the context of the present disclosure, the term “mobilization” refers to the process by which a conjugative plasmid accomplishes the transfer, from a donor bacterium to a recipient bacterium, of a DNA molecule that contains an origin of transfer (oriT). The term “origin of transfer” (abbreviated or/7) refers to a DNA sequence that, when present in a DNA molecule, is recognized by the corresponding mobilization proteins and allows its mobilization.
The genetic cargo can also include one or more selection module. The selection module includes one or more genes conferring a selectable trait for identifying bacteria bearing one or more modules of the genetic cargo. The selection module is operatively connected with the one or more bacterial vector and/or the integrated modules of the genetic cargo. The selection module of the genetic cargo can be the same or different from the selection module of the conjugative delivery system. The selectable trait can be, but is not limited to, an antibiotic resistance gene, a gene coding for a fluorescent protein (including a green fluorescent protein), an auxotrophic selection marker, a gene coding for a b-galactosidase (e.g. , the bacterial lacZ gene), a gene coding for a luciferase, a gene coding for a chloramphenicol acetyltransferase (e.g. , the bacterial cat gene), a gene coding for a b- glucuronidase.
When the genetic cargo is located (in total or in part) in an extrachromosomal vector, the extrachromosomal vector includes a vegetative replication module. The vegetative replication module of the genetic cargo can be the same or different from the vegative replication
module of the conjugative delivery system. A vegetative replication module is required when the transfer machinery is located on one or more vector that replicate independently from the genome of the bacterial host. In that case, the one or more extrachromosomal vectors containing the genetic cargo need a vegetative replication module to replicate and be maintained in the bacterial host. The vegetative replication module comprises one or more functional DNA elements acting as an origin of vegetative replication ( oriV ). The oriV is a DNA sequence present on a genetic element that, when recognized by the replication machinery encoded by the maintenance module, allows a plasmid to replicate in a bacterial host. For versatile use, and for the maintenance of vectors in a large range of bacterial hosts, the oriV of the vegetative replication module can be a broad-host-range oriV (i.e. and oriV recognized by a broad range of bacterial host species). In some embodiments, where the maintenance of vectors needs to be restricted to a limited range of bacterial hosts, it may be preferable to use an oriV with a restricted or narrow host range (i.e. an oriV recognized by a limited range of bacterial host species).
When the genetic cargo is located (in totally or in part) on an extrachromosomal vector, the conjugative bacterial host cell includes a maintenance module (which can be considered, in some embodiments, part of the transfer machinery). The maintenance module includes proteins (referred to as the replication machinery) capable of recognizing the oriV of the vegetative replication module and allows the replication of extrachromosomal vectors (e.g., plasmids) containing the vegetative replication module. The maintenance module includes proteins capable of recognizing the oriV of the vegetative replication module and allows the replication of plasmids containing the vegetative replication module. The maintenance module can be heterologous to the conjugative bacterial cell. When the maintenance of vectors needs to be restricted to the donor bacterium, it may be preferable to locate (e.g., integrate) the maintenance module into the donor bacterium chromosome. Alternatively, the maintenance module may be located on one or more of an extrachromosomal vector. The maintenance module can also comprise one or more genes and regulatory elements responsible for adequate DNA partitioning. The genes responsible for partitioning may include proteins responsible for equal segregation of plasmid copies into daughter cells, toxin and anti-toxin stabilization system and/or helicase and DNA primase which helps the replicative machinery in the replication of the plasmid. The proteins of the maintenance module include, but are not limited to one or more proteins often annotated as repA (TP1 14- 083 : repA), TP1 14-082, parA (TP1 14-068: parA), parB, DNA primase (TP1 14-006: ygiA), a toxin (e.g. vcrx028 from pVCR94, TP1 14-051 : ycfA from TP1 14), an antitoxin (e.g. vcrx027 from pVCR94, TP1 14-050: ycfB from TP1 14), DNA topoisomerases (TP1 14-035: ydiA and TP1 14-036: ydgA). As such, the maintenance module includes the one or more genes
encoding the one or more proteins of the replicative machinery and can also include zero or more genes and regulatory elements coding for DNA partitioning protein. In addition, one or more oriV and replicative machinery, as well as, one or more different types of oriV and replicative machinery can be present in the maintenance module. In an embodiment, the maintenance module and/or the vegetative replication module can be derived from one of the following family of bacterial vectors IncA, IncB/O (Ind O), IncC, IncD, IncE, IncFI , lncF2, IncG, IncHM , lncHI2, Inch , Incl2, IncJ, IncK, IncL/M, IncN, IncP, IncQI , lncQ2, IncR, IncS, IncT, IncU, IncV, IncW, IncXI , lncX2, IncY, IncZ, ColE1 , ColE2, ColE3, p15A, pSC101 , IncP- 2, lncP-5, lncP-7, lncP-8, lncP-9, Ind , Inc4, Inc7, Inc8, Inc9, Inc1 1 , Inc13, Inc14 and/or Inc18. In an embodiment, maintenance module and/or the vegetative replication module can be derived from one of the Incl2 family of bacterial vectors. For example, maintenance module and/or the vegetative replication module can be derived from the bacterial vector TP1 14.
The mobilization module encodes a relaxosome, e.g., a protein complex capable of recognizing the transport module (an origin of transfer (or/7)) which is operatively associated with the genetic cargo and subsequently transferring the genetic cargo through the conjugative pore into the recipient bacterium. The mobilization module (which can be heterologous to the conjugative bacterial cell) can be integrated in the chromosome of the conjugative bacterial cell or can be located on one or more bacterial vectors. The mobilization module includes, but are not limited to one or more of virC1 (TP1 14-68 : parA), (TP1 14-41 : nikB) and/or (TP1 14-42 : nikA). The mobilization module can be derived from at least one of the following conjugative families MOBF, MOBP, MOBv, MOBH, MOBc and/or MOBQ. In another embodiment, the genes encoding the mobilization machinery can be derived from one of the MOBP family of bacterial conjugative plasmids. For example, the genes encoding the mobilization machinery can be derived from the bacterial vector TP1 14. In yet another example, the genes encoding the mobilization machinery can be derived from the bacterial vector R6K. In another embodiment, the genes encoding the mobilization machinery can be derived from one of the MOBP family of bacterial conjugative plasmids. For example, the genes encoding the mobilization machinery can be derived from the bacterial vector pOX38.
Configurations of the conjugative delivery system
In a specific embodiment, the conjugative delivery system is designed to provide cis mobilization to allow exponential dissemination of the genetic cargo. In such embodiment, all of the modules of the system are located on a single extrachromosomal vector (in some embodiments, a circular plasmid). Using a system based on cis mobilization provides very limited to no containment, allowing the transfer of the conjugative plasmid to the recipient cell
and subsequent rounds of transfers from the recipient cell to other recipient cells as well as the replication of the conjugative plasmid in the recipient cells.
In another specific embodiment, the conjugative delivery system is designed to provide a constrained cis mobilization to allow rapid dissemination of the genetic cargo and provide a certain degree of containment. In such embodiment, the maintenance module is located in the conjugative bacterial cell’s chromosomes and the remaining modules of the system are located on a single extrachromosomal vector (in some embodiments, a circular plasmid). Using a system designed to provide constrained cis mobilization offers some level of containment, allowing transfer of the conjugative plasmid to the recipient cell, and subsequent transfers from the recipient cell to other recipient cells, but preventing its replication in the recipient cells.
In a further specific embodiment, the conjugative delivery system is designed to provide in trans mobilization to increase the level of containment of the genetic cargo. In such embodiment, the entire transfer machinery is located in the conjugative bacterial cell’s chromosomes or is located on one or many extrachromosomal vector (in some embodiments, a circular plasmid) but lacks the transport module. The modules of the genetic cargo (payload module and transport module) of the system are located on a single extrachromosomal vector (in some embodiments, a circular plasmid). In such embodiment, the genetic cargo would also include a vegetative replication module. Using a system designed to provide in trans mobilization offers the highest level of containment, preventing the transfer of the mobilized plasmid from a recipient cell to another recipient cell, and preventing replication in the recipient cells.
The modules of the genetic cargo can also be integrated in the bacterial chromosome. In such embodiment, the genetic cargo could either be excised or include a vegetatvive replication module upstream (in operative association) with the payload module.
The system of the present disclosure is designed to allow the transfer of a genetic cargo to a recipient bacterium in order to express one or more heterologous proteins and/or one or more non-coding DNA or RNA molecules, in the recipient bacterium. The system, when introduced in a donor bacterium, allows the genetic cargo to be transferred to target bacteria at an acceptable conjugation efficiency of in vivo (e.g. in the gastro-intestinal environment). As used in the context of the present disclosure,“conjugation efficiency” refers to a measure of the transfer of the genetic cargo from the donor bacterium to the recipient bacterium. A conjugation efficiency can be determined in vivo (e.g., in a subject) or in vitro (e.g., outside a subject, in a (liquid or solid) culture medium, for example) in numerous ways by the person skilled in the art. In embodiments in which the conjugation efficiency is measured in vivo, it
can be provided as the number of bacterial exconjuguants (e.g., the number of bacteria that have received the genetic cargo from the conjugative bacterial cell bacterium) per total available recipient bacterium. Conjugation efficiency can be measured in a specific location in the subject, for example in the gut of the subject (and in such instance, a level of enteric conjugation efficiency is provided). As used in the context of the present disclosure, the expression “an acceptable level of in vivo conjugation efficiency” refers to a level of conjugation, observed in vivo, capable of providing sufficient transfer of the genetic cargo to mediate significant impact on, or by, the target cell population. In some embodiments, the system has an in vivo conjugation efficiency of at least 103, 102 or 101 transconjugant bacterium/recipient bacterium. In a specific embodiment, the system has an in vivo conjugation efficiency of at least 103 transconjugants bacterium/recipient bacterium. In a specific embodiment, the system has an in vivo conjugation efficiency of at least 102 transconjugant bacterium/recipient bacterium. In a specific embodiment, the system has an in vivo conjugation efficiency of at least 10 1 transconjugant bacterium/recipient bacterium.
As shown in the present disclosure, in some embodiments, conjugation efficiencies are compared in several mating conditions. As such, a measure of in vitro conjugation under certain conditions can be used as a proxy for determining if the in vivo transfer efficiency of a vector is acceptable. Consequently, in some embodiments, conjugation efficiency of the system under hypoxic conditions, presence of feces in the medium, physiologically relevant temperature (e.g., 37°C), unstable mating environment (e.g. a static or agitating broth) is at least 103, 102 or 10 1 transconjugant bacterium/recipient bacterium as compared with standard solid medium mating. In a specific embodiment, the system has an in vitro conjugation efficiency, when measured in any of the above conditions, of at least 103 transconjugant bacterium/recipient bacterium. In a specific embodiment, the system has an in vitro conjugation efficiency, when measured in any of the above conditions, of at least 10 2 transconjugant bacterium/recipient bacterium. In a specific embodiment, the system has an in vitro conjugation efficiency, when measured in any of the above conditions, of at least 10 1 transconjugant bacterium/recipient bacterium.
In some embodiments, a ratio between the conjugative efficiency in a liquid medium vs. a solid medium can be used as a as a proxy for determining if the in vivo transfer efficiency of a vector is acceptable. As shown in the Examples below, a ratio of conjugative efficiency higher than 0.1 , 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9 or 1.0% is indicative that the conjugative bacterium has acceptable in vivo transfer efficiency. In some embodiments, a ratio of conjugative efficiency higher than 0.1 % is indicative that the conjugative bacterium has acceptable in vivo transfer efficiency.
The present disclosure also includes a method for determining the efficiency of in vivo transfer by measuring the ability of a bacterial system to conjugate in a liquid medium. Such method includes contacting a conjugative bacterial host cell and a recipient bacterial host cell in a liquid medium and determining the conjugation efficacy in such liquid medium. In an embodiment, the liquid medium has a viscosity substantially similar to water, when measured at a specific temperature (37°C for example). If the conjugation efficacy is at least 103, 102 or 10 1 transconjugant bacterium/recipient bacterium as compared with standard solid medium mating, then it is determined that the conjugative bacterial cell will successfully be able to conjugate in vivo (in the gastro-intestinal tract of a subject for example). Alternatively or in combination, the method can include determining a ratio between the conjugative efficiency in a liquid medium vs. a solid medium. In such embodiment, a ratio of conjudative efficiency higher than 0.1 , 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9 or 1.0% is indicative that the conjugative bacterium has acceptable in vivo transfer efficiency. The contact between the conjugative bacterial cell and the recipient bacterial cell can be done at a specific temperature which is the same or substantially similar to the in vivo environment, e.g., between 30 and 40°C (37°C for example). The contact between the conjugative bacterial cell and the recipient bacterial cell can be done in static conditions or in the presence of an agitation.
Probiotic recombinant donor bacteria, compositions comprising same and processes for making same
The present disclosure also provides a recombinant bacterial host cell (referred to as a conjugative bacterial cell) that can act as a donor bacterium capable of conjugation to transfer the genetic cargo described herein into a target (recipient) bacterium. The conjugative bacterial cell bacterium comprises the transfer machinery and the genetic cargo described herein. In some embodiments, the transfer machinery and the genetic cargo can be independently replicating from the genome of the recombinant bacterium. In such embodiment, the transfer machinery can be operatively associated with the genetic cargo nucleic acid molecule and form, for example, a single unitary vector (e.g., a single plasmid). In another embodiment, the transfer machinery can be integrated in the chromosome of the conjugative bacterial cell bacterium (at a single location or at multiple locations) and the genetic cargo nucleic acid molecule can be independently replicating from the genome of the donor bacterium. In another embodiment the donor bacterium can comprise at least two distinct vectors (e.g., two distinct plasmids): a first one comprising the transfer machinery and a second one comprising the genetic cargo nucleic acid molecule.
Since the transfer machinery of the present disclosure has a high in vivo conjugation efficiency, the amount of conjugative bacterial cell bacteria necessary to achieve a desired
therapeutic effect in the subject is going to be equal or lower than other recombinant bacteria lacking the system of the present disclosure.
In some embodiments, the conjugative bacterial cell can be a pathogenic bacterial cell that has been modified to reduce or eliminate its pathogenicity. Alternatively, the conjugative bacterial cell of the present disclosure is considered to be a probiotic bacterium since these are, at the very least, not harmful (e.g., not pathogenic) to the subject, and in some embodiments, probiotics can by themselves confer a health benefit to the subject. The present disclosure thus provides a bacterium which has been genetically engineered to bear the delivery system of the present disclosure. Thus, the present disclosure also provides a process for obtaining the conjugative bacterial cell by introducing the system of the present disclosure in a bacterial cell. Optionally, the system can include a gene conferring one or more selectable traits.
Bacterial cells that can be used as conjugative bacterial cells include, but are not limited to, Bacillus sp., Bifidobacterium sp., Enterococcus sp., Escherichia sp., Lactobacillus sp., Lactococcus sp., Leuconostoc sp., Pediococcus sp. and Streptococcus sp. As such, the present disclosure provides a probiotic recombinant bacterium from the genus Bacillus sp., Bifidobacterium sp., Enterococcus sp., Escherichia sp., Lactobacillus sp., Lactococcus sp., Leuconostoc sp., Pediococcus sp. and Streptococcus sp. as well as a process for making such probiotic bacteria by introducing the conjugative bacterial vector or system in a probiotic bacterium of the genus Bacillus sp., Bifidobacterium sp., Enterococcus sp., Escherichia sp., Lactobacillus sp., Lactococcus sp., Leuconostoc sp., Pediococcus sp. and Streptococcus sp. Bacterial species which as considered probiotic to human subjects include, but are not limited to Bacillus coagulans (e.g., strain GBI-30 or 6086), Bifidobacterium animalis subsp. lactis (e.g., strain BB-12), Bifidobacterium longum subsp. infantis, Enterococcus durans (e.g. strain LAB18s), Escherichia coli (e.g., strain Nissle 1917), Lactobacillus acidophilus (e e.g., strain NCFM), Lactobacillus bifidus, Lactobacillus johnsonii (e.g., strain Lai, LCI or NCC533), Lactobacillus paracasei (e.g., strain Stl 1 or NCC2461), Lactobacillus plantarum (e.g., strain 299v), Lactobacillus reuteri (e.g., strain ATCC 55730, SD21 12, Protectis, DSM 17938, Prodentis, DSM 17938, ATCC 55730, ATCC PTA 5289, RC-14), Lactobacillus rhamnosus (e.g., strain GG, GR-1) and Lactococcus thermophiles, Leuconostoc masenteroides (e.g. strain B7), Pediococcus acidilactici (e.g. strain UL5), Streptococcus thermophilus. As such, the present disclosure provides, in some embodiments, a conjugative bacterial cell recombinant bacterium from the bacterial species which are considered probiotic as well as a process for making such probiotic bacteria by introducing the conjugative bacterial vector or system in a probiotic bacterium of the strains of bacteria considered probiotic in humans. In a specific embodiment, the probiotic is from the genus Escherichia, for example the species
Escherichia coli, e.g. E. coli Nissle. The present disclosure provides a process for making such probiotic recombinant bacteria by introducing the conjugative bacterial vector or system in a probiotic species Escherichia coli, e.g. E. coli Nissle.
In an embodiment, the recombinant donor bacterium is an enteric recombinant bacterium because it is capable of colonizing the gastro-intestinal tract of the subject receiving the recombinant bacteria. In an embodiment, the enteric recombinant bacterium is capable of colonizing the stomach, the intestine (including the small and the large intestine) and/or the colon of the subject receiving the recombinant bacteria
In an embodiment, the recombinant bacterium can be formulated as a composition (which can be a probiotic composition). The composition can also comprise an excipient, one or more antibiotic(s), a selection pressure (for selecting the cells having the selectable trait) and/or one or more chemically active molecules, and/or one or more strains of probiotic (nonrecombinant) bacterium. In the composition, the recombinant bacterium can be provided as a solution/suspension or in a dried form. The composition can be provided for administration by any routes and, in an embodiment, the composition can be provided for oral administration, for injection, for inhalation, etc. When the composition is intended for oral administration and is used with the intention of colonizing the gastro-intestinal tract of a subject, care should be taken to formulate the recombinant bacterium to preserve its viability and its ability to perform conjugation until it reaches the desired location (suspected of comprising the recipient bacterium).
The present disclosure thus provides a process for making the composition. Broadly, the process comprises combining the recombinant bacterium with an excipient and optionally additional probiotic bacteria and/or antibiotics and/or chemically active molecules. The process can comprise making a solution/suspension of the recombinant bacterium or drying the recombinant bacterium. When the composition is intended for oral administration, the process for making the composition and the excipient used in the composition are designed/selected for allowing oral administration.
Therapeutic uses of recombinant bacterial host cells and compositions comprising same
The recombinant conjugative bacterial host cell of the present disclosure acts as a donor bacterium to transfer the genetic cargo to a target (recipient) bacterium. The transfer can occur in a subject (human or animal) to which a conjugative bacterial cell is to be administered. The subject can be suspected or is known to bear the recipient bacterium. The subject can be a human subject or an animal subject (such as, for example, a non-human mammal). In an embodiment in which the recombinant bacterium is an enteric bacterium, the transfer is intended to occur in the gastro-intestinal tract of the subject.
In an embodiment, the recombinant bacterium is selected or engineered to have a modification module which is the same or similar to the restriction-modification system of the intended recipient bacterium. In bacteria, there are four known restriction-modification systems (type I, II, III and IV) involved in the bacterial defence system against foreign DNA. Similarity in modification module will facilitate the introduction of the genetic cargo molecule in the recipient bacterium by protecting the DNA from restriction, thus increasing conjugation efficiency. For example, in embodiments in which the recipient bacterium has a type I restriction-modification system, a recombinant bacterium having a similar type I modification system, for example a recombinant bacterium from the species Escherichia coli, can be selected and used. In an embodiment, the restriction modification system is endogenous to the donor bacterium and is part of the exclusion module. In another embodiment, the restriction modification system is heterologous to the donor bacterium and is incorporated in an exclusion module. In an embodiment, the restriction modification system of the donor bacterium and the restriction modification system of the recipient bacterium comprises/is a type I restriction modification system. In another embodiment, the restriction modification system of the donor bacterium and the restriction modification system of the recipient bacterium comprises/is a type II restriction modification system. In a further embodiment, the restriction modification system of the donor bacterium and the restriction modification system of the recipient bacterium comprises/is a type III restriction modification system. In yet another embodiment, the restriction modification system of the donor bacterium and the restriction modification system of the recipient bacterium comprises/is a type IV restriction modification system.
The present disclosure thus provides a method of transferring a genetic cargo from a donor bacterium to a recipient bacterium in the microbiota of a subject in need thereof. The transfer can be done in a liquid (urine or blood for example) or in a solid surface (an epithelium for example). The microbiota may be located on a solid surface (such as the gastro-intestinal epithelium, the bladder epithelium or the lung epithelium) or in a liquid (such as in the urine of the bladder or the urethra, the blood in a blood vessel, the gastric juices or the stomach or the lymph in a lymph node for example). The method comprises administering a therapeutically effective amount of the conjugative bacterial cell bacterium of the present disclosure to the subject in need thereof. As used in the context of the present disclosure, a therapeutically effective amount refers to an amount (dose) effective in mediating a therapeutic benefit to the subject. It is also to be understood herein that a“pharmaceutically effective amount” may be interpreted as an amount giving a desired therapeutic effect, either taken in one dose or in any dosage or route, taken alone or in combination with other therapeutic agents. The method can also comprise determining the presence of the recipient
bacterium in the subject prior to the administration of the recombinant bacterium. The method can further comprise determining if the restriction-modification system of the recombinant bacterium is substantially similar to the restriction-modification system of the intended recipient bacterium.
Advantageously, as indicated above, since the system of the present disclosure has high in vivo conjugation efficiency, the amount of recombinant bacteria necessary to achieve a desired therapeutic effect in the subject receiving the recombinant bacterium is going to be lower than other recombinant bacterium lacking conjugative delivery system of the present disclosure.
In the embodiments in which the genetic cargo encodes for one or more heterologous protein, and/or a non-coding RNA, and/or a phage genome, and/or a bacterial genome, the recipient bacterium can be any type of bacterium present in the subject which would accept conjugation from the recombinant bacterium. In such instances, the recipient bacterium can be, for example, part of the enteric microbiota (which can be or not pathogenic to the subject) which include, but is not limited to Aeromonas sp., Bacillus sp., Bifidobacterium sp., Campylobacter sp., Citrobacter sp., Clostridium sp., Enterobacter sp., Escherichia sp., Klebsiella sp., Hafnia sp., Helicobacter sp., Lactobacillus sp., Lactococcus sp., Morganella sp., Plesiomonas sp., Proteus sp., Providencia sp., Pseudomonas sp., Salmonella sp., Serratia sp., Shigella sp., Staphylococcus sp., Vibrio sp. and Yersinia sp.
In an embodiment in which the genetic cargo encodes one or more heterologous proteins, the bacterium receiving the genetic cargo can subsequently express one or more heterologous proteins. For example, the recipient bacterium can express one or more of a eukaryotic growth factor, and/or hormone, and/or cytokine (including an interleukin and/or a chemokine). The expression of the heterologous protein is intended to provide a therapeutic benefit to the subject having received the recombinant bacterium. For example, when the therapeutic protein is a hormone, like a GLP-1 peptide, the present disclosure provides using the recombinant bacterium to prevent, treat or alleviate the symptoms associated with a condition which would benefit from an increase in GLP-1 , such as, for example, diabetes. In another example, when the therapeutic protein is an interleukin, the present disclosure provides using the recombinant bacterium to prevent, treat or alleviate the symptoms associated with a condition which would benefit from an increase in interleukin, such as, for example, an inflammatory condition.
In an embodiment in which the heterologous protein encoded by the genetic cargo is a programmable nuclease, the recipient bacterium can be modified to express one or more of a TALEN, a zinc finger nuclease or a Cas protein. The expression of the programmable
nuclease is intended to provide a therapeutic benefit to the subject having received the recombinant bacterium and being afflicted by the recipient bacterium. The administration of the conjugative bacterial cell recombinant bacteria can, for example, kill the recipient bacterium, sensitize the recipient bacterium to an antibiotic, or modify the recipient bacterium in order to suppress the expression of a protein, or a non-coding RNA, contributing to the pathogenicity. When the heterologous protein is a Cas protein, like a Cas9 protein, the present disclosure provides using the conjugative bacterial cell recombinant bacterium to prevent, treat or alleviate the symptoms associated with an infection or a dysbiosis caused by the intended recipient bacterium. In an embodiment, the infection and/or dysbiosis caused by the intended recipient bacterium is located in the gastro-intestinal tract and the recombinant bacterium is administered to prevent, treat or alleviate the symptoms of such infection and/or dysbiosis. For example, when the subject is infected with a multidrug resistant shiga toxin-producing E. coli, the recombinant bacterium can be used to restore drug sensitivity in the recipient bacterium and/or inhibit the expression of the shiga toxin. In yet another example, when the subject is afflicted by an adherent-invasive E. coli and is also afflicted by Crohn’s disease or an inflammatory bowel disease linked to a dysbiosis, the recombinant bacterium can be used to inhibit the expression of an adhesion pilus to render the recipient bacterium less adherent to the gastro-intestinal wall and, in some embodiments, treat the dysbiosis. In still another example, when the subject is afflicted with a urinary tract infection or a blood septicemia linked to a dysbiosis, the recombinant bacterium can be used to inhibit the expression of an adhesion pilus or a virulence factor to render to recipient bacterium less virulent and, in some embodiments, treat the dysbiosis.
In an embodiment in which the genetic cargo encodes one or more non-coding RNA, the recipient bacterium can subsequently express one or more non-coding RNA. For example, the recipient bacterium can express one or more crRNA, and/or tracrRNA, and/or anti-sense RNA, and/or gRNA, and/or rRNA, and/or tRNA. The expression of the non-coding RNA is intended to provide a therapeutic benefit to the subject having received the recombinant bacterium. For example, when the therapeutic non-coding RNA is an antisens-RNA, it can knock down the expression of a virulence factor thus rendering the recipient unable to infect the subject.
In an embodiment in which the genetic cargo encodes one or more non-coding RNA and one or more heterologous proteins, the bacterium receiving the genetic cargo can subsequently express one or more non-coding RNA and one or more heterologous proteins. For example, the recipient bacterium can express one or more crRNA, and one or more Cas proteins. The expression of the crRNA and Cas protein is intended to provide a therapeutic benefit to the subject having received the conjugative bacterial cell recombinant bacterium. For example,
the simultaneous presence of crRNA and Cas9 at specific loci in the recipient bacterium’s genome will result in double-strand cleavage at those sites. These cuts will subsequently induce the death of the recipient bacterium.
The recombinant bacterium can optionally be used in combination with an antibiotic. Examples of antibiotics include, without limitation, aminoglycosides, ansamycins, carbapenems, cephalosporins, glycopeptides, lincosamides, lipopeptides, macrolides, monobactams, nitrofurans, oxazolidonones, penicillins, quinolones, sulfonamides, tetracyclines, and combinations thereof. Examples of aminoglycosides include, without limitation, amikacin, gentamicin, kanamycin, neomycin, netilmicin, tobramycin, paromomycin, spectinomycin and combinations thereofs. Examples of ansamycins include, without limitation, geldanamycin, herbimycin, rifaximin (streptomycin) and combinations thereof. Examples of carbapenems include, without limitation, ertapenem, doripenem, imipenem/cilastatina, Meropenem and combinations thereof. Examples of cephalosporins include, without limitation, cefadroxil, cefazolin, cefalotin or cefalothin, cefalexin, cefaclor, cefamandole, cefoxitin, cefprozil, cefuroxime, cefixime, cefdinir, cefditoren, cefoperazone, cefotaxime, cefpodoxime, ceftazidime, ceftibuten, ceftizoxime, ceftriaxone, cefepime, ceftaroline fosamil, ceftobiprole and combinations thereof. Examples of glycopeptides include, without limitation, teicoplanin, vancomycin, telavancin and combinations thereof. Examples of lincosamides include, without limitation, clindamycin, lincomycin and combinations thereof. An example of a lipopeptide includes, without limitation, daptomycin. Examples of macrolides include, without limitation, azithromycin, clarithromycin, dirithromycin, erythromycin, roxithromycin, troleandomycin, telithromycin, spiramycin and combinations thereof. An example of a monobactams includes, without limitation, aztreonam. Examples of nitrofurans include, without limitation, furazolidone, nitrofurantoin and combinations thereof. Examples of oxazolidonones include, without limitation, linezolid, posizolid, radezolid, orezolid and combinations thereof. Examples of penicillins include, without limitation, amoxicillin, ampicillin, azlocillin, carbenicillin, cloxacillin, dicloxaciUin, flucloxaciUin, mezlocillin, methicillin, nafcillin, oxacillin, penicillin G, penicillin V, piperacillin, penicillin G, temocillin, ticarcillin and combinations thereof. Examples of quinolones include, without limitation, ciprofloxacin, enoxacin, gatifloxacin, levofloxacin, lomefloxacin, moxifloxacin, nalidixic acid, norfloxacin, ofloxacin, trovafloxacin, grepafloxacin, sparfloxacin, temafloxacin and combinations thereof. Examples of sulfonamides include, without limitation, mafenide, sulfacetamide, sulfadiazine, silver sulfadiazine, sulfadimethoxine, sulfamethizole, sulfamethoxazole, sulfanamide (archaic), sulfasalazine, sulfisoxazole, trimethoprim- sulfamethoxazole(Co-trimoxazole) (TMP-SMX), sulfonamidochrysoidine(archaic) and
combinations thereof. Examples of tetracyclines include, without limitation, demeclocycline, doxycycline, minocycline, oxytetracycline, tetracycline and combinations thereof.
The present invention will be more readily understood by referring to the following examples which are given to illustrate the invention rather than to limit its scope.
EXAMPLE I - IN VITRO BACTERIAL CONJUGATIVE TRANSFER EFFICIENCY IS NOT PREDICTIVE OF IN VIVO BACTERIAL CONJUGATIVE TRANSFER EFFICIENCY
Strains, plasmids and growth conditions. All strains and plasmids used in this Example are described in Table 1 . All oligonucleotide sequences are provided in Table 2. Cells were typically grown in Luria broth Miller (LB) or on Luria broth agar Miller medium supplemented, when needed, with antibiotics at the following concentrations: ampicillin (Ap) 100 pg/mL, chloramphenicol (Cm) 34 pg/mL, kanamycin (Km) 50 pg/mL, nalidixic acid (Nx) 4 pg/mL, spectinomycin (Sp) 100 pg/mL, streptomycin (Sm) 50 pg/mL, sulfamethoxazole (Su) 160 pg/mL, tetracycline (Tc) 15 pg/mL, and trimethoprim (Tm) 32 pg/mL. Diaminopimelic acid (DAP) auxotrophy was complemented by adding DAP at a final concentration of 57 pg/mL in the medium. All cultures were routinely grown at 37°C. Cells with thermosensitive plasmids (pSIM6, pCP20, pGRG36) were grown at 30°C. No bacterial cultures over 18 hours of age were used in the experiments.
Table 1 . List of strains and plasmids used in the Examples.
Strain or plasmid Relevent phenotype or genotype Source/Reference
Citrobacter rodentium
DBS100 Murine enteric pathogen ATCC 51459
KN04 DBS100, SmR, CmR Example III
Enterobacter aerogenes
ATCC 35029 Synonym Klebsiella aerogenes, opportunistic pathogen ATCC 35029
E. coli
BW25113 F, DE(araD-araB)567, lacZ4787(del)::rrnB-3, LAM , rph- CGSC# : 7636
1 , DE(rhaD-rhaB)568, hsdR514
EC100Dpir+ F- mcrA A(mrr-hsdRMS-mcrBC) <|>80dlacZAM15 #ECP09500 (Lucigen)
AlacX74 recA1 endA1 araD139 A(ara, leu)7697 galU
galK l- rpsL nupG pir+ (DHFR)
K-12 J53 F+ met pro DSM-4246 (DSMZ)
KN01 SmRSpR Nissle 1917 Example I
KN01 Map A MapA KN01 Example I
KN02 SmRCmR Nissle 1917 Example I
KN03 Sm
RTc
R Nissle 1917 Example I
TP114 : : KΪII3 (SEQ ID NO : 165) TP1 14::fefB with inserted Kill 3 insertion device Example III
DNA manipulations. A detailed list of oligonucleotide sequences used in this Example is found in Table 2. Plasmids were prepared using EZ10-Spin Column Plasmid Miniprep kit (BIOBASIC #BS614) whereas genomic DNA (gDNA) minipreps were prepared using Quick gDNA miniprep (ZYMO RESEARCH) according to the manufacturer’s instructions. PCR amplifications were performed using Veraseq DNA polymerase (Enzymatics) or TaqB (Enzymatics) for DNA parts amplification and screening respectively. Digestion with restriction enzymes were incubated for 1 hour at 37°C following manufacturer’s recommendations. Plasmids were assembled by Gibson assembly using the NEBuilder Gibson Assembly mix (NEB) following manufacturer’s protocol.
Recombineering. AW recombineering experiments were performed using pSIM6 as described previously (PMID: 16750601). Briefly, the E. coli strain containing pSIM6 was cultured at 30 °C until an optical density of 0.4 to 0.8 at 600 nanometers was reached. Then, the cells were heat-shocked for 15 minutes at 42 °C, washed and the recombineering cassette was electroporated in the heat shocked E. coli cells. Cells were then incubated overnight at room temperature before plating on selective medium. The colonies were then screened by PCR to identify positive clones.
Table 2. Description of the oligonucleotides used in the Examples
Purpose Name Sequence3, 11 Template To amplify pGRG_SmSp oGSS1 GGATCCTAGTAAGCCACGTTTTAATTAATCAGATCCCGGGCC g Block_16S 16S tag
-F TACGGGAGGCAGCAGTGG fSEQ ID NO : 29) agR
oGSS1 CGGGGAACTAGGAGGGTATGGTGCGCGCATGGAAAGACTAC
-R CAGGGTATCTAATCCTGTT fSEQ ID NO : 30)
oGSS2 CGCACCATACCCTCCTAGTTCCCCGGTTATCTCTCCTGTCTC pFG051 aad7
-F TTATACACATCTGACGCT 1SEQ ID NO : 3T) (unpublishe
d)
oGSS2 GGGGTCGACGCGGCCGTGGCGCGCCTCCTAGGTGCTCGAG
-R GCAAGCGAACCGGAATTGCC fSEQ ID NO : 32)
pGRG_SmC oGSC1 GATCCTAGTAAGCCACGTTTTAATTAATCAGATCCCGGGCTA SXT strA-strB m -F GTATGACGTCTGTCGCAC (SEQ ID NO : 33)
oGSC1 TGCT AGCTTT GAAAATT AAGAGGT AT AT ATT ATT G AAT CG AAC
-R1 TAATATTTTTTTTGGTG (SEQ ID NO : 34)
oGSC1 GTGTCAACGTTTACAGCTAGCTCAGTCCTAGGTATTATGCTA Add P1 -U8
-R2 GCTTTGAAAATTAAGAGG fSEQ ID NO : 35)
oGSC2 TACCTAGGACTGAGCTAGCTGTAAACGTTGACACCATCGAAT pTRC-HisB lacl-Ptrc -F1 GGTGCAAAACCTTTCGCG 1SEQ ID NO : 36)
oGSC2 AT ATCCCGAAT GT GCAGTT AACG ACGTT GACACCATCGAAT G
-F2 GTGCAAAACCTTTCGCGG fSEQ ID NO : 37)
oGSC2 T AAT AT AT ACCTCTTT AATTTTT AAT AAT AAAGTT AATCG fSEQ
-R ID NO : 38)
OGSC3 T ATT ATT AAAAATT AAAG AGGT AT AT ATT AATGGTTTCT AAAGG gBlock_Ne NeonGreen
-F AG AAG AAAAAAAT AT G fSEQ ID NO : 39) onGreen oGSC3 CCTCTTTCCACTGCTGCCTCCCGTAGGTTATTTATATAATTCA
-R TCCATTCCCATAACATC fSEQ ID NO : 40)
OGSC4 AGCATTT ACAGATGTT ATGGGAATGGAT G AATT AT AT AAAT AA gBlock_16S 16S tag -F CCT ACGGG AGGCAGCAG fSEQ ID NO : 41) agD
OGSC4 ACCTCTTACGTGCCCGATCAACTCGAGGCATGCCTGCAGGA
-R CTACCAGGGTATCTAATCC fSEQ ID NO : 42)
OGSC5 GAACAGGATTAGATACCCTGGTAGTCCTGCAGGCATGCCTC pSB1 C3 cat -F GAGTTGATCGGGCACGTAA fSEQ ID NO : 43)
OGSC5 GGTCGACGCGGCCGTGGCGCGCCTCCTAGGTGCTCGAGCT
-R CGAGGCTTGGATTCTCACCA fSEQ ID NO : 44)
0GSC6 GAGCTTGGCGTAATCATGGTCATAGCTGTTTCCTGTGTGACG pGRG36 Screen -F CTT AAT GCGCCGCT ACAG fSEQ ID NO : 93) bacbone plasmid presence
0GSC6 CTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACC
-R GCGCAATTAACCCTCACTA fSEQ ID NO : 94)
pGRG_SmTc oGST1 GATCCTAGTAAGCCACGTTTTAATTAATCAGATCCCGGGCTA SXT strA-strB
-F GTATGACGTCTGTCGCAC (SEQ ID NO : 23)
oGST1 GT AGGTT ATTT AT AT AATTCATCCATT CCCAT AACATCT GTTTA
-R CAGCTAGCTCAGTCCT (SEQ ID NO : 24)
oGST2 AAGCT AGCAT AAT ACCT AGGACT GAGCT AGCTGTAAACAGAT gBlock_16S 16S tag
-F GTTATGGGAATGGATGAA (SEQ ID NO : 25) JagD
oGST2 CCCCAAAACTTTCCCCAAAACCCTTCCCCAAAACT GGCTATA
-R CTCGAGGCATGCCTGCAG (SEQ ID NO : 26)
OGST3 GGATTAGATACCCTGGTAGTCCTGCAGGCATGCCTCGAGTAT pFG018 tetB -F AGCCAGTTTTGGGGAAGG fSEQ ID NO : 27)
OGST3 GGGGTCGACGCGGCCGTGGCGCGCCTCCTAGGTGCTCGAG
-R CTAGGTCGACGCTTGGATTC (SEQ ID NO : 28)
dapA deletion oDTD1 CGCGTGCCTCGGCAAAATGCCCTTCTGCTGCCAGTTTGCAG pKD4 aph-llla
-F TGTAGGCTGGAGCTGCTTC (SEQ ID NO : 45)
oDTD1 AAACGTACCATTGAGACACTTGTTTGCACAGAGGATGGCCCT
-R GGGAATTAGCCATGGTCC (SEQ ID NO : 46)
oDTD2 CA*A*A*T AGTTT GTT GTGTAAT GGCATCAGACGCT GATT AAT A Previous Add homology ACGCGTGCCTCGGCAAAAT (SEQ ID NO : 47)
Purpose Name Sequence3, 11 Template To amplify
-F amplicon oDTD2 AC*G*G*TTCTGTCT GCTTGCTTTT AAT GCCATACCAAACGT AC
-R CATT GAGACACTT GTTT GC (SEQ ID NO : 48)
ODTD3 pKD4 Screen dapA
CCGTTTCTGCGGACTGGCTT (SED ID NO : 49)
-int deletion
ODTD2 Nissle 1917 Screen da A TAGCGGCTATCACCAACATC (SEQ ID NO : 50)
3-F deletion
ODTD3
GTGAAGCGCCTTATGAACAATG (SEQ ID NO : 51)
-R
TP114 sanger oTPS1 TP114 1 bp Sanger
TGCTGAACCAGTAACAACCACC (SEQ ID NO : 67)
sequencing -F sanger sequencing
TP1 14 oTPS1
GTCGCCGCTGTGGATTCAAC (SEQ ID NO : 68)
-R
oTPS2 TP1 14 10 Sanger GTTCAATACACATTACAGCCCACC (SEQ ID NO : 69)
-F kb sanger sequencing
TP1 14 oTPS2
CTGCGCTCAAAGTCACGTATGG (SEQ ID NO : 70)
-R
OTPS3 TP1 14 20 Sanger
TT ACGCAACAG AAT CT GAAAGCAC (SEQ ID NO : 71)
-F kb sanger sequencing
TP1 14
OTPS3
GAAGGTGGCCTGTCATCGAG (SEQ ID NO : 72)
-R
OTPS4 TP1 14 25 Sanger TGTCCGATTCGTCCTGGTTG (SEQ ID NO : 73)
-F kb sanger sequencing
TP1 14
OTPS4
GTATTTGTCCAGCGCCCGG (SEQ ID NO : 74)
-R
OTPS5 TP1 14 37 Sanger TTCAGATGCGTCGTGCAATG (SEQ ID NO : 75)
-F kb sanger sequencing
TP1 14
OTPS5
CACACTTGAGCGTCTTTCTGA (SEQ ID NO : 76)
-R
0TPS6 TP1 14 42,5 Sanger AGAAGCTCTTGAGTCCGACC (SEQ ID NO : 77)
-F kb sanger sequencing
TP1 14
0TPS6
GACTTATTCCGCCAACCCAAATT (SEQ ID NO : 78)
-R
OTPS7 TP1 14 47 Sanger GGCCCGCTCAAGGTCTTTC (SEQ ID NO : 79)
-F kb sanger sequencing
TP1 14
OTPS7
GCTGGAGAACACCCTGATTATGT (SEQ ID NO : 80)
-R
0TPS8 TP1 14 50 Sanger AAAGTTCTTTGCGCCTGTCATAGC (SEQ ID NO : 81)
-F kb sanger sequencing
TP1 14
0TPS8
GAAGCCAGGTTTGTTGCTGTG (SEQ ID NO : 82)
-R
OTPS9 TP1 14 52,5 Sanger TTTCTCTGCTACAGCATCTTTCTTC (SEQ ID NO : 83)
-F kb sanger sequencing
TP1 14
OTPS9
GGAACTGCCTCGGTGAAT (SEQ ID NO : 84)
-R
oTPS1 TP1 14 57,5 Sanger GGCATAAGGCGTGGACAATGG (SEQ ID NO : 85)
0-F kb sanger sequencing
TP1 14 oTPS1
CAAACGTGCTAATCGCCTGGC (SEQ ID NO : 86)
0-R
pBXB1 oBXB1 ATTTCCCCGAAAAGT GCCACCT GACGTCTAAGAAACCATTCA gBIock-Bxbl bxb1 integrase
-F CGAGGCAGAATTTCAGAT (SEQ ID NO : 1)
oBXB1 ATCCCCT GATTCT GT GGAT AACCGTATT ACCGCCTTT GAGCG
-R CCCTGCAGGAAATAATAA1SEQ ID NO : 2)
oBXB2 GATTATTAATCCGGCTTTTTTATTATTTCCTGCAGGGCGCTCA pSB1 A3
-F AAGGCGGTAATACGGTT (SEQ ID NO : 3)
Purpose Name Sequence3, 11 Template To amplify oBXB2 AAGCTAAGGATTTTTTTTATCTGAAATTCTGCCTCGTGAATGG
-R TTTCTTAGACGTCAGGT (SEQ ID NO : 4)
KNI3 Cargo 0KILI- AACCACCGCGGTCTCAGTGGTGTACGGTACAAACCCCGACC pGRG36 ori\/Di insertion F GACAGTAAGACGGGTAAGC (SEQ ID NO : 5)
device
0 Kl L 1 - T AT ACTTTCT AG AG AAT AGGAACTTCGG AAT AGGAACTTCGG
R CT GAAAGCGCT ATTT CTT fSEQ ID NO : 6)
OKIL2- AAGTTCCTATTCTCTAGAAAGTATAGGAACTTCGTGCACTGAT pKN02 gRNA 1 F TAAGCATTGGTAACAGG fSEQ ID NO : 7) (unpublishe
d)
OKIL2- TTTCGGGACATTCAGGAGATTTTCGCCGGACGTACGCATTTT
R CAGCACACTGAGACTTGT (SEQ ID NO : 8}
OKIL3- AATGCGTACGTCCGGCGAAAATCTCCTGAATGTCCCGAAAG pKN02 gRNA 2 F GGCAGAAAGATGAATGACT fSEQ ID NO : 9) (unpublishe
d)
OKIL3- CTAAAACAGGGATTGGCTGAGACGAAAGAATCTATTATACAG
R AAAAATTTTCCTGAAAGC fSEQ ID NO : 101
OKIL4- TTCTGTATAATAGATTCTTTCGTCTCAGCCAATCCCTGTTTTA pKN02 gRNA 2 F GAGCTAGAAATAGCAAG fSEQ ID NO : 1 T) (unpublishe
d)
0 Kl L4- AACCGAGT GACCAAG AGAGG AT GAAGCATTT GCCG AGT AGT
R TCAGCACACTGAGACTTGT fSEQ ID NO : 12Ί
OKIL5- CT ACTCGGCAAATGCTTCATCCTCTCTT GGTCACTCGGTT GG pKN02 gRNA 3
F GCAGAAAGATGAATGACTGTC fSEQ ID NO : 13~) (unpublishe
d)
OKIL5- CTAAAACTATTGGCCACGTTTAAATCAGAATCTATTATACAGA
R AAAATTTTCCT GAAAGC fSEQ ID NO : 141
0KIL6- TTCTGTATAATAGATTCTGATTTAAACGTGGCCAATAGTTTTA pKN02 gRNA 3 F GAGCTAGAAATAGCAAG fSEQ ID NO : 15Ί (unpublishe
d)
0KIL6- GTCGGAAAAGTGGCCATCATTTGACGAACTACAGCCCGGGT
R TCAGCACACTGAGACTTGT fSEQ ID NO : 16~)
OKIL7- CCCGGGCTGTAGTTCGTCAAATGATGGCCACTTTTCCGACG pKD4 aph-llla F GTGCT GACCCCGG AT GAAT fSEQ ID NO : 171
OKIL7- ATCATTCTATAGTATTAAGTATTGTTTTATGGCTGATAAAAGC
R GCTTTTGAAGCTGGGGT fSEQ ID NO : 18~)
0KIL8- GTTCTTCGCCCACCCCAGCTTCAAAAGCGCTTTTATCAGCCA pKN02 cas9 F T AAAACAAT ACTT AAT AC fSEQ ID NO : 19)
0KIL8- CACCACTGAGACCGCGGTGGTTGACCAGACAAACCACGACT
R CAGTCACCTCCTAGCTGAC fSEQ ID NO : 20t
KilM Cargo OKIL9- AAGTTCCTATTCTCTAGAAAGTATAGGAACTTCGTGCACTGAT pKN02 gRNA 1 insertion F TAAGCATTGGTAACAGG fSEQ ID NO: 21t (unpublishe
device d)
OKIL9- GTCGGAAAAGTGGCCATCATTTGACGAACTACAGCCCGGGT
R TCAGCACACTGAGACTTGT fSEQ ID NO: 22)
pREC1 oREC1 GACGACGGCGGTCTCCGTCGTCAGGATCATCCGGGCGGAG pKD4 oriVR6K
-F GATATTCATATGGACCATGG fSEQ ID NO : 52)
oREC1 T AT ACTTTCT AG AG AAT AGGAACTTCGG AAT AGGAACTTCTTT
-R TGCGGCCGCAAGATCCG fSEQ ID NO : 53)
oREC2 TATTCCGAAGTTCCTATTCTCTAGAAAGTATAGGAACTTCATA pFG018 tetB
-F GCCAGTTTT GGGGAAGG fSEQ ID NO : 54t
oREC2 TCCTGACGACGGAGACCGCCGTCGTCGACAAGCCGGCCGA
-R CTAGGTCGACGCTTGGATTC (SEQ ID NO : 55)
TP1 14::tetB 0TRTI- ACAT GGCAAAGGT AGCGTTGCCAAT GATGTT ACAGAT G AGC pREC1 FRT-fefB-
F GGATCTTGCGGCCGCAAAA (SEQ ID NO : 56) attBBM
0TRTI- GCCG AT GCGCCAACCGCATTCATT AAAGACT AACT ACCAT GG
R TCCATATGAATATCCTCC fSEQ ID NO : 571
OTPT2- TP1 14::fefB Screen tetB
GAGCCATATTCAACGGGAAACGTC (SEQ ID NO : 58)
F insertion
OTPT2-
CACATCGGTGAAAGCTATGCC (SEQ ID NO : 59)
R
Screen oTPK1 GCTCGCTTGGACTCCTGTTG (SEQ ID NO : 60) pREC1 and screen ..tetB in
oTPK1
CGTTGGCAAGACTGGCATGAT (SEQ ID NO : 61)
-R
oTPK2 TP114::Kill Screen TTGAAGGGTAGTCCAGAAGATAACG (SEQ ID NO : 62)
-F 3 or 1 ::Kill3/Kill1
TP1 14 oTPK2
GGTAAATGGCACTACAGGCGC (SEQ ID NO : 63)
-R
OTPK3 TP114::Kill Screen FRT CCTGTTACCAATGCTTAATCAGTGCAC (SEQ ID NO : 64)
3 or 1 recombination
OTPK4 TP114::Kill Screen ARNg GTGCACTGATTAAGCATTGGTAACAGG ( SEQ ID NO : 65)
-F 3 or 1 number
OTPK4
GGTCAGCACCGTCGGAAAAG (SEQ ID NO : 66)
-R
pNA22 NAI- ATT ACACGT CTT GAGCGATT CGCG ACT ATCACCGGAAGAGCA TP114 repA + 1 ,000
F GA (SEQ ID NO : 95) bp
NAI- GAAGCAGCTCCAGCCTACACTCGAGTTTATTCCGCAAGTGAT
R TAA (SEQ ID NO : 96)
oNA2- TTAATCACTTGCGGAATAAACTCGAGTGTAGGCTGGAGCTGC pKD3 pKD3's F TTC (SEQ ID NO : 97) backbone oNA2- TCTGCTCTTCCGGTGATAGTCGCGAATCGCTCAAGACGTGTA
R AT (SEQ ID NO : 98)
pNA23 oNA3- ATT ACACGT CTT GAGCGATT CGCG ATCCCCGGAAT ACAGCGT TP114 1 ,000 bp +
F CAT (SEQ ID NO : 99) repA oNA3- GAAGCAGCTCCAGCCTACACGAATTCTAGTGGGGTGGCGAA
R GCTG (SEQ ID NO : 100)
oNA4- CAGCTTCGCCACCCCACTAGAATTCGTGTAGGCTGGAGCTG pKD3 pKD3's F CTTC (SEQ ID NO : 101) backbone oNA4- ATGACGCTGTATTCCGGGGATCGCGAATCGCTCAAGACGTG
R TAAI (SEQ ID NO : 102)
pNA24 oNA3- ATT ACACGT CTT GAGCGATT CGCG ATCCCCGGAAT ACAGCGT TP114 1 ,000 bp +
F CAT (SEQ ID NO : 103) repA + 1 ,000 bp
NAI- GAAGCAGCTCCAGCCTACACTCGAGTTTATTCCGCAAGTGAT
R TAA 1SEQ ID NO : 1041
oNA2- TTAATCACTTGCGGAATAAACTCGAGTGTAGGCTGGAGCTGC pKD3 pKD3's F TTC (SEQ ID NO : 105) backbone oNA4- ATGACGCTGTATTCCGGGGATCGCGAATCGCTCAAGACGTG
R TAAT 1SEQ ID NO : 1061
pGRG-pir+ 0 ir1.p CCTAGTAAGCCACGTTTTAATTAATCAGATCCCGGGCATGAG~ EC100Dpir pir
m TGG AT AGT ACGTT GCT AA (SEQ ID NO : 107)
opirl - R GTCAGTTTAGGTTAGGCGCCATGCATCTCGAGGCTTGGTCA
m CCCCTTAGCTTTTTTGGGA fSEQ ID NO : 1081
opir2-F CCAAGCCT CG AGAT GCAT GG fSEQ ID NO : 1091 pFG018 rrnB terminator opir2-R CGCGGCCGTGGCGCGCCTCCTAGGTGCTCGAGGGACGTCT
m AAG AAACC ATT ATT ATC ATG fSEQ ID NO : 1101
opir3-F GATGCTGGTGGCGAAGCTGT fSEQ ID NO : 11 11 attBTn7 ScreenpGRGin tegrations opir3-R GATGACGGTTTGTCACATGGA fSEQ ID NO : 1121
opir3-in ACGTCCATCATGACCTTGAGTCTCATAAAAAAACCTCATCATT pir
m CTGTATATCTTAACGCC fSEQ ID NO : 1 131
TP114 ArepA.·. oRepl - CGTTAAAAAAGACCCGGTCACTGGTCAGAACACACTCAGAGT pKD3 Replace cat-ori I/R6K F GTAGGCTGGAGCTGCTTC (SEQ ID NO : 1 14) TP114-083 by cat-ori I/R6K oRepl - AAGGGGCTGAAGAGAGT GCCGATTGT ATCAGGCAGCT AAAC
R TGTCAGACCAAGTTTACTC (SEQ ID NO : 115)
oRep2- Screen
GATTGTATCAGGCAGCTAAA fSEQ ID NO : 1 161 TP1 1‘
F P A repAv.c at-ori l/R6K oRep2-
CT GGTCAGAACACACTCAGA fSEQ ID NO : 117)
pNA01 oNA5- CTTAATTAATTAATCCAGAGGCATCGCCTGCCCCTCCCTTTT pSIM7 oriV pBBRi for
Purpose Name Sequence3, 11 Template To amplify
F GGTGTCCA fSEQ ID NO : 1 181 pNA01 and pNA02 oNA5- TCTAGATTCAGCTGAATTCCCGGGTGCACCGAGGCGGCTAC
R AGCCGATAGTCTGGAA fSEQ ID NO : 1191
oNA6- CAGGCGATGCCTCTGGATTAATTAATTAAGGCTTGGATTCTC pREC1 tetB for pNA01
F ACCAATAA fSEQ ID NO : 1201 and pNA02 oNA6- TTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATAGCCAGT
R TTTGGGGAAGG (SEQ ID NO : 121)
oNA7- GTGCACCCGGGAATTCAGCT G AAT CT AG AT ACGTAGT ACT AA TP114 or/Trp for
F GTCCTCAAGGTTCGTAGA fSEQ ID NO : 1221 pNA01 and pNA02 oNA7- AT AT ACTTT AG ATT GATTT AAAACTTCATTTTT AATTTGTT ACAT
R TTCCTCTCTCTCTCT fSEQ ID NO : 1231
rNA02 oNA8- GGCTGTATGCACGGCATTTTTTTGTCCTTCTAAAACACATAAG TP114 Nicking site
F CTTTGTACACAAGCCCG fSEQ ID NO : 1241 deletion oNA8- ACATGCCCGGAACGGGCTTGTGTACAAAGCTTATGTGTTTTA
R GAAGGACAAAAAAATGCC fSEQ ID NO : 1251
pKN30 and OKN01 CATGCTGGAGTTCTTCGCCCACCCCAGCTTCAAAAGCGCTCT pNA01 or pNA backbone pKN31 -F GCCTGCCCCTCCCTTTTG fSEQ ID NO : 1261 pNA02
OKN01 TGCGCCCTGAGTGCTTGCGGCAGCGTGAGGGGATCTTTTGT
-R TACATTTCCTCTCTCTCTC fSEQ ID NO : 1271
OKN02 T ATGCCAAT GT AGGAGGT AG AGAGAG AG AGG AAATGTAACA pKD4 aph-llla
-F AAAGATCCCCTCACGCTGC fSEQ ID NO : 1281
OKN02 CCCCGTCGAGCCGGTTGGACACCAAAAGGGAGGGGCAGGC
-R AGAGCGCTTTTGAAGCTGGG fSEQ ID NO : 1291
TP114Aor/T::c oriT1-F CAAGCATT GT AACAT GCCCGGAACGGGCTT GTGTACAAAGG KD3 cat at-tetB TGTAGGCTGGAGCTGCTTC fSEQ ID NO : 1301 m
oriT1-R GGTGATTATGTGGGTTGTTTTGTGGGTTGTCAATGGTGGGAA
TTAGCCATGGTCCATATG (SEQ ID NO : 131)
oriT2-F GCGCCGTTCGGGGTTGCAAAGGGGCGTCCCCTTTGGCACAA oriT1 -F + Add homology GCATT GT AACAT GCCCGGA ISEQ ID NO : 1321 oriT1-R for onT ptu oriT2-R TACCTTATTTAAAGCAATTTGCTCGCCGTTTGTGTGGGTGATT product
ATGTGGGTTGTTTTGTG ISEQ ID NO : 1331 m oriT3-F CCCAACTT AACT G AGAAAGACACC ISEQ ID NO : 1341 TP1 14 Screen oriT3-R GCTAGTTCTGTCTTGGTGTTGTTGT fSEQ ID NO : 135) deletion clones
TP114 pilV opilV- TCACCATCACGGCGACT ACAAAGACGAT GACGACAAGTAAAT pKD3 Delete the
Ashufflon cat F1 GGGAATTAGCCATGGTCC ISEQ ID NO: 1711 shufflon opilV- TAAAAAATCACTCTGTAGTTGTTCTATCATAAGTGAATCCGTG
R1 T AGGCTGGAGCTGCTTC fSEQ ID NO: 172)
opilV- CAGTACAGGGGCGATACTTTCGTGCCAATCCGGTCGGTGGC
F2 ATCACCATCACCATCACGGCGACTACA ISEQ ID NO: 1731
opilV- TP1 14 Screen
GGAGGAATATGCTGTCCTGG (SEQ ID NO: 174)
F3 deletion opilV-
TCTGGT GGCAAT AAAGT G AACT (SEQ ID NO: 175)
R3
TP114ApilS::c opilSI - ATGT CTT CT ATT AAT ATTTT AAAT ATGCGTTCT GTTTTTTGTGT pKD3 cat at F AGGCTGGAGCTGCTTC ISEQ ID NO : 1361
opilSI - TCAGG AATCAGT GOT GAAGGT CAGCGT ATT GCTGTCAG AT AT
R GGGAATTAGCCATGGTCC fSEQ ID NO : 137)
TP1 14 Screen pits
°pllS2" TAACGTCCTGCAACACTAAT (SEQ ID NO : 138)
deletion clones opilS2-
GCTTATCCGATGCACATGAA (SEQ ID NO : 139)
R
TP114 Arc/.vca orci1 -F CAAGAATCTCTACCTTCCCCCCTTTTTTGTCTGGAGGGGATT pKD3 cat t GGGAATTAGCCATGGTCCfSEQ ID NO : 176)
orci1-R TAAAAAATCACTCTGTAGTTGTTCTATCATAAGTGAATCCTGT
GTAGGCTGGAGCTGCTTfSEQ ID NO : 177)
orci2-R GTGATATTGCATTTCGAAGCAAGfSEQ ID NO : 178) TP1 14 Screen rci deletion clones orci2-R CCAGGACAGCATATTCCTCCfSEQ ID NO : 179)
Purpose Name Sequence3, 11 Template To amplify
TP1 14 ApilV::c opilVI - CTCACTCGTACCGGGAATATTATTCTGAGGATTAAGAGCAAT pKD3 cat at F GGGAATTAGCCATGGTCCfSEQ ID NO : 180)
opilVI - TAAAAAATCACTCTGTAGTTGTTCTATCATAAGTGAATCCGTG
R TAGGCTGGAGCTGCTTC fSEQ ID NO : 181)
opilV2- CCGTTCCTTTGTGGCGGAATfSEQ ID NO : 1821 TP1 14 Screen p HV
F deletion clones opilV2- CCAGGACAGCATATTCCTCCfSEQ ID NO : 183)
R
TP1 14Ashuffl opilV3- CAGTACAGGGGCGATACTTTCGTGCCAATCCGGTCGGTGGT TP1 14 pilV1 on .pUV1-cat F CTGGTGGCAATAAAGTGAAfSEQ ID NO : 184)
opilV3- GGACCATGGCTAATTCCCATTTAACCGAAGGGGCAACAATfS
R EQ ID NO : 185)
opilV4- ATTGTTGCCCCTTCGGTTAAATGGGAATTAGCCATGGTCCfSE pKD3 cat
F Q ID NO : 186)
opilVI - TAAAAAATCACTCTGTAGTTGTTCTATCATAAGTGAATCCGTG
R TAGGCTGGAGCTGCTTC (SEQ ID NO : 187)
TP1 14Ashuffl opilV5- CAGTACAGGGGCGATACTTTCGTGCCAATCCGGTATCTGGA TP1 14 pilV2 on :.pilV2-cat F CAACGGCAAAAGTGAACTTfSEQ ID NO : 188)
opilV5- GGACCATGGCTAATTCCCATTTAAATCCCAGCACCAGGAAfS
R EQ ID NO : 189)
opilV6- TTCCTGGTGCTGGGATTTAAATGGGAATTAGCCATGGTCC(S pKD3 cat
F EQ ID NO : 190)
opilVI - TAAAAAATCACTCTGTAGTTGTTCTATCATAAGTGAATCCGTG
R TAGGCTGGAGCTGCTTC fSEQ ID NO : 191)
TP1 14Ashuffl opil - CAGTACAGGGGCGATACTTTCGTGCCAATCCGGTACGTGGA TP1 14 pilV3 on .pUV3-cat F AAAAAATTGGCGCAGGTGAfSEQ ID NO : 192)
opil - GGACCATGGCTAATTCCCATTCACTGGCAAATGGCGTAAAfS
R EQ ID NO : 193)
opilV8- TTTACGCCATTTGCCAGTGAATGGGAATTAGCCATGGTCCfS E pKD3 cat
F Q ID NO : 194)
opilVI - TAAAAAATCACTCTGTAGTTGTTCTATCATAAGTGAATCCGTG
R TAGGCTGGAGCTGCTTC fSEQ ID NO : 1951 _
TP1 14Ashuffl opilV9- CAGTACAGGGGCGATACTTTCGTGCCAATCCGGTACGTGGA TP1 14 pilV3' on .pUV3'-cat F GAAGGGCATCAGGTAGCACfSEQ ID NO : 196)
opilV9- GGACCATGGCT AATTCCCATTCACTGGCAAACCACGAT GTfS
R EQ ID NO : 197)
opilVI 0 ACAT CGTGGTTT GCCAGT G AAT GGG AATT AGCCAT GGT CCfS pKD3 cat -F EQ ID NO : 198)
opilVI - TAAAAAATCACTCTGTAGTTGTTCTATCATAAGTGAATCCGTG
R TAGGCTGGAGCTGCTTC fSEQ ID NO : 1991
TP1 14Ashuffl opilV1 1 CAGTACAGGGGCGATACTTTCGTGCCAATCCGGTGTGTGGA TP1 14 pilV4 on \.pilV4-cat -F GGGCATTAGGTGGAAAGCTfSEQ ID NO : 200)
opilV1 1 GGACCATGGCT AATTCCCATTT AATT G AGAGTT ACACAGGfS E
-R Q ID NO : 201 )
opilVI 2 CCTGTGTAACTCTCAATTAAATGGGAATTAGCCATGGTCCfS E pKD3 cat -F Q ID NO : 202)
opilVI - TAAAAAATCACTCTGTAGTTGTTCTATCATAAGTGAATCCGTG
R TAGGCTGGAGCTGCTTC fSEQ ID NO : 2031
TP1 14Ashuffl opilVI 3 CAGTACAGGGGCGATACTTTCGTGCCAATCCGGTGTGTGGA TP1 14 pilV4' on .pUV4'-cat -F GAACTTCCGGTTCCTCTAAfSEQ ID NO : 204)
opilVI 3 GGACCATGGCT AATTCCCATTTAAGTTTGGTATCCAAAAAfSE
-R Q ID NO : 205)
opilVI 4 TTTTTGGATACCAAACTTAAATGGGAATTAGCCATGGTCCfS E pKD3 cat -F Q ID NO : 206)
opilVI - TAAAAAATCACTCTGTAGTTGTTCTATCATAAGTGAATCCGTG
R TAGGCTGGAGCTGCTTC fSEQ ID NO : 207)
TP1 14Ashuffl opilVI 5 CAGTACAGGGGCGATACTTTCGTGCCAATCCGGTACGTGGC TP1 14 pi! V5 on .pUV5-cat -F AGAAAAATGGCGGCGGTACfSEQ ID NO : 208)
opilVI 5 GGACCATGGCTAATTCCCATTCACTGACACAATGCATAAGfSE
-R Q ID NO : 209)
opilVI 6 CTTATGCATTGTGTCAGTGAATGGGAATTAGCCATGGTCCfSE pKD3 cat -F Q ID NO : 210)
opilVI - TAAAAAATCACTCTGTAGTTGTTCTATCATAAGTGAATCCGTG
R TAGGCTGGAGCTGCTTC fSEQ ID NO : 21 11
TP1 14Ashuffl opilVI 7 CAGTACAGGGGCGATACTTTCGTGCCAATCCGGTTCGTGGA TP1 14 pi! V5' on .pUV5'-cat -F AATCAATAGGTTCATGTGCfSEQ ID NO : 212)
opilVI 7 GGACCATGGCT AATTCCCATTTAGCGGAAGCAGTGAACAGfS
-R EQ ID NO : 213)
opilVI 8 CTGTTCACTGCTTCCGCTAAATGGGAATTAGCCATGGTCCfSE pKD3 cat -F Q ID NO : 214)
opilVI - TAAAAAATCACTCTGTAGTTGTTCTATCATAAGTGAATCCGTG
R TAGGCTGGAGCTGCTTC fSEQ ID NO : 2151
pPHS opPNS- CTGTCAGACCAAGTTTACTCfSEQ ID NO : 216) pBAD30 ori 1/
P15A, araC
a. Oligonucleotide's priming site are underlined.
b. Mutation introduced in the oligonucleotides are in bold.
DNA purification. Purification of DNA was performed between each step of plasmid assembly to avoid buffer incompatibility or stop enzymatic reactions. PCR reactions were generally purified by Solid Phase Reversible Immobilization (SPRI) using Agencourt Ampure XP DNA binding beads (Beckman Coulter) according to the manufacturer’s guidelines. When DNA samples were digested with restriction enzymes, DNA was purified using DNA Clean and Concentrator (ZYMO RESEARCH) following manufacturer’s recommendation for cell suspension DNA purification protocol. After purification, DNA concentration and purity was routinely assessed using a Nanodrop spectrophotometer when necessary.
DNA transformation into E. coliby electroporation. Routine plasmid transformations were performed by electroporation. Electrocompetent E. coli strains were prepared from 20 mL of LB broth. Cultures reaching exponential growth phase of 0.6 optical density at 600 nanometers (OD60onm) were then washed three times in sterile distilled water. Cells were then resuspended in 200 pL of water and distributed in 40 pL aliquots. The DNA was then added to the electrocompetent cells and the mixture was transferred in a 1 mm electroporation cuvette. Cells were electroporated using a pulse of 1.8 kV, 25 pF and 200W for 5 ms. Cells were then resuspended in 1 mL of non-selective LB medium and recovered for 1 hour before plating on selective media.
DNA transformation into E. coliby heat-shock. Heat-shock transformation was mostly used to clone Gibson assembly products. Chemically competent cells were prepared according to the rubidium chloride protocol as described previously (Green et al., 2013). Chemically competent cells were flash-frozen and conserved at -80°C before use. Gibson assembly products were directly transformed into EC100Dp/r+ chemically competent cells at a 1/10 volume ratio. Routinely, up to 10 pL of DNA was added to 100 pL competent cells before transformation by a 45 seconds heat shock at 42°C. Cells were then resuspended in 1 mL of non-selective LB medium and let to recover for 1 hour at 37°C before plating on selective media.
Introduction of selection markers in Escherichia coli Nissle 1917 (EcN) strains for conjugation quantification. The modified EcN strains were obtained by Tn7 insertion of the antibiotic resistance cassettes as described previously (McKenzie et al., 2006). Integration was verified by PCR using corresponding primers as described in table 2. Loss of ampicillin resistance was confirmed to verify plasmid elimination. More specifically, the pGRG36 vector was purified from E. coli EC100D pir+ and digested with Smal + Xhol. The inserts were amplified by PCR using their corresponding primers (Table 2) and inserted by Gibson assembly between attLTn 7 and attRTn 7 sites of the digested pGRG36 plasmid (Figure 1). The Gibson assembly products were then transformed in chemically competent E. coli EC100D pir+ strain. The resulting plasmids were analyzed using restriction enzymes, and positive clones
were transformed into E. coli MFDpir+ (Ferrieres et al., 2010). Plasmids were mobilized from E. coli MFDpir+ to EcN by conjugation. To mediate cassette insertion into the terminatorof glmS, EcN was first cultivated at 30°C in LB with arabinose until 0.6 OD60onm· Cells were next heat-shocked at 42°C for 1 hour and incubated at 37°C overnight to allow for plasmid clearance. An aliquot of the bacterial culture was then streaked onto a LB agar plate. >20 colonies were analyzed, and colonies that only grew in the absence of ampicillin, but contained the insert’s selection markers were then investigated by PCR using the appropriate primers listed in table 2.
Construction of E. coli KNOlAdapA. A DAP auxotrophic variant was also obtained through the deletion of the dapA gene in EcN (Born et al., 1999) by recombineering using pSIM6. DAP auxotrophy was shown to be a good marker to discriminate donor and recipient strains for conjugation without hindering transfer frequencies as DAP auxotroph reversion was never reported (Ronchel et al, 2001) and, when complemented, DAP auxotrophy has little impact on the fitness of the bacterium (Allard et al., 2015). To generate a DAP auxotrophic strain, the aph-llla resistance cassette of pKD4 was amplified by PCR with added homology for the regions flanking dapA. A second PCR round on the purified PCR product then allowed to increase the length of homology. Recombineering was performed in EcN using pSIM6 as described previously (Datta et al., 2006). Briefly, EcN containing pSIM6 was electroporated with the purified PCR product. Kanamycin resistant bacteria were selected and DAP auxotrophy confirmed. Insertion of the cassette and deletion of dapA were also verified by PCR with corresponding primers (Table 2). After the confirmation of the cassette insertion in the dapA gene locus, the strain was cured from pSIM6 by heatshock at 42°C for 1 hour followed by overnight incubation at 37°C. The culture was then streaked on selective plates to identify Ap sensitive clones, which were next transformed with pCP20 to eliminate the resistance cassette as described previously (Datsenko et al., 2000). pCP20 plasmid was cured by heat-shock following the same procedure as before. Next, the SmSp insert was added in the genome of EcNAc/apA strain to complete KNOI Ac/apA.
In vitro conjugation assay. For all in vitro conjugation assays, donor strains were KNOlAdapA and recipient strains were KN02 unless specified otherwise. The strains were grown from frozen stocks 18 hours prior to conjugation experiments, mixed at a 1 :1 volume ratio (100 pL each), centrifuged at maximum speed for one minute and washed in 200 pL of LB without antibiotics. The bacteria mix was then spun down and, either resuspended in 5 pL of LB broth and deposited on a LB agar plate with DAP, or resuspended to 1.0 OD60onm in LB broth with DAP. The cell mix was then incubated at 37°C for the desired conjugation time before being resuspended in 800 pL sterile PBS and diluted 1/10 serially in sterile PBS to avoid growth during dilution and plating. 5 pL of each dilution were then spotted in duplicates on LB
plates with appropriate antibiotics to select donors, recipients and transconjugants, and the number of Colony Forming Units (CFU) was counted. All conjugation frequencies were calculated by dividing the number of transconjugants by the total number of recipient CFUs. The conjugation frequencies per donor were however equivalent (data not shown) since cells were always mixed 1 :1. All conjugation experiments were repeated with at least three independent biological replicates.
Mouse model. All mice-related protocols were designed in compliance with our institution Animal Care Comity Guidelines and were strictly evaluated to avoid animal suffering. Animals were provided with water and regular chow ad libitum throughout all experiments. Animals were housed in individually ventilated cages and no more than 5 individuals shared the same cage. All animals used were C57 BL/6 females of 16-20 g (Charles River) and were given a 3 days adaptation period upon arrival. Animal weight and health was monitored daily throughout each experiment. No significant health or weight loss was noted for any mouse in any experiment. For Sm treated mice groups, Sm working concentration was first evaluated to maximize Enterobacteriaceae clearance and EcN colonization (Kotula et al., 2014). A concentration of 1 g/L of Sm was chosen and added to drinking water 2 days prior to gavages for all Sm-treated mice groups. From that point, water bottles were refreshed every 3 days to maintain optimal Sm activity. Bacterial load was monitored in feces sampling at specified time points. At the end of the experiment, animals were anesthetized with isoflurane and sacrificed by cervical dislocation. Animals were then dissected to reveal the colonization pattern and gut bacterial content was evaluated by CFU.
Mice inoculum preparation. Two days prior to gavages, the appropriate strains were streaked from frozen stocks onto MacConkey selective plates and incubated overnight at 37°C. The next day, colonies were inoculated in selective LB broth at 37°C. Three to four hours prior to mice oral challenge, the strains were subcultured again with a large inoculum (200 pL or 500 pL) in 20 mL selective LB broth and incubated at 37°C until 0.6 ± 0.1 OD60onm was reached. The cells were then washed once in PBS and concentrated in a volume equivalent to 6.0 OD60onm· An aliquot of the inoculum was used to evaluate cell concentration. 100 pL of the final cell suspension were administered orally to each mouse (approximately 1x108 CFU).
Feces and tissues processing. Collection tubes were prepared prior to the experiment by adding 500 pL of PBS and a single 0.2 mm glass bead to a sterile 1.5 mL microtube. Then, tubes were weighted before and after sampling to normalize CFU by sample weight. Samples were homogenized in a FastPrep-24 (MP) bead beater for 1 minute at maximum speed. Then, the homogenates were centrifuged at 500 x g for 30 second to avoid possible pipetting of larger debris. Centrifugation has shown no significant impact on retrieved CFU (data not shown). The samples were then serially diluted 1/10 in sterile PBS from 10° to 107
of the initial concentration and 2.5 mI_ of each dilution was spotted on selective MacConkey plates in technical duplicates. CFUs/mg sample were calculated as a function of the sample measured weight. For each experiment, total Enterobacteriaceae clearance was also followed on MacConkey plates without antibiotics as a control for Sm treatment (data not shown).
Mice dissection and EcN colonization pattern assessment. The mice were sacrificed on day 4 and dissected to extract the duodenum, jejunum, ileum, caecum, ascending colon and descending colon. To distinguish the parts of the intestine, the first 3 centimeters (cm) of small intestine attached to the stomach were considered to be the duodenum, the 6 central cm the jejunum, and the last 6 cm (closest to the caecum) the ileum. The ascending and descending colons were the exact halves of the colon. Two spaced quarters of each section were sampled for CFUs analysis. The longitudinal half of the caecum was used for CFU as well. Since the caecum is a large and distinct structure of the mouse intestine, and since EcN colonizes strongly the caecum, this region was chosen as a representative part of the intestine to study colonization and conjugative bacterial cell treatments.
In vivo conjugation mouse model. For in vivo conjugation experiment, mice were orally challenged with the recipient strain 2 or 12 hours prior to the introduction of donor strain. This, in order to avoid possible plasmid transfer in the PBS solution prior to gavage. Conjugation was then monitored by feces sampling at specified time points. Mice were sacrificed at the end of the experiment and caecum was extracted to verify conjugation levels in the murine gut. Feces were homogenized and CFU were acquired on MacConkey plates as described in the Feces and tissue processing section.
Statistical analysis. Statistical significance was performed on the logarithmic value of the data using One-way ANOVA unless specified otherwise. P-values are directly indicated on the graphs and represent statistical significance of the difference between the two data groups. Differences in the data were considered significant when the P-value was bellow 0.05.
1.1 Escherichia coli Nissle 1917 modified strain construction
Generation of antibiotic-resistant EcN variants. Since EcN had no natural antibiotic resistance phenotype (Sonnenborn et al. , 2009), efficiency of conjugation between two strains of EcN was impossible to quantify. The use of two different resistance markers was essential to distinguish between the donor and recipient strains. Furthermore, the presence of an antibiotic resistance marker on a conjugative plasmid allowed for the distinction between recipients and transconjugants. Several strains of EcN were therefore developed to allow quantification of conjugation efficiency. One way to generate an antibiotic resistant
variant of a strain was to insert a resistance gene in its chromosome. Integration of DNA in the chromosome of a bacterium was efficiently achieved using a Tn7-based system (McKenzie, 2006). This system used a plasmid, pGRG36, as a vector for the expression of the Tn7 machinery, but also as a backbone for the insertion of a DNA sequence of interest. The DNA sequence of interest required cloning between the attLTnJ and attRTnJ sites of pGRG36 so that it could be inserted in the terminator sequence of glmS. The Tn7 strategy was used to insert antibiotic resistance cassettes into EcN and created three different strains (Figure 1). Those strains were all resistant to Sm which was used to hinder the microbiota in vivo. Additional resistance phenotypes were unique for each three strains. For instance, the donor KN01 was also resistant to Sp, the recipients KN02 was also resistant to Cm, and KN03 was also resistant to Tc.
Auxotrophy as a selection marker for conjugation. As opposed to antibiotic resistance which allows a cell to grow in the presence of an antibiotic, auxotrophy prevents a cell from growing under normal conditions. This can be particularly useful to further distinguish donor and recipient strains in a conjugation experiment as no known reversion mechanisms were yet reported and auxotrophic donor strains present no defect in their ability to conjugate. EcN was first transformed with pSIM6, a plasmid that expressed the lambda red recombination system. Then, the dapA gene was replaced with an antibiotic resistance cassette as previously described (Datsenko et al., 2000). The deletion of dapA interrupted the lysine biosynthesis pathway as well as the peptidoglycan wall synthesis (Figure 2). The cell therefore became unable to synthesize its cell wall and the lysine amino acid. Both functions are essential for cell survival under normal conditions. However, the mutation could be complemented by an exogenous source of DAP (Allard et al., 2015). Plasmid pGRG36- SmSp was then used to insert SmSp resistance gene in EcNAc/apA’s chromosome thereby creating KN0'\AdapA. This donor strain was not able to grow without DAP and allowed for a better distinction of transconjugants and recipients. Also, as a control, it was verified that the deletion of dapA did not affect the conjugation efficiency, which it did not (data not shown).
EcN colonized the murine gut. In order to compare conjugation efficiencies of several conjugative plasmids in vitro and in vivo, the ability of EcN to colonize the murine gut was verified. Sm was previously shown to increase colonization stability of E. coli in mice, and since KN01 had the lowest Minimal Inhibitory Concentration (MIC) for Sm (Table 3), it was used in colonization assays to determine the concentration of Sm needed to (1) clear the Enterobacteriaceae from the microbiome and (2) facilitate colonization of the donor and recipient strains. Concentrations of 1 ,000 mg/L (Figure 3.A), 400 mg/L (Figure 3.B), 250 mg/L (Figure 3.C), 100 mg/L (Figure 3.D), 50 mg/L (Figure 3.E) and 0 g/L (Figure 3.F) of Sm were tested. 1 g/L of Sm in drinking water cleared Enterobactericeae while allowing stable
colonization of the KN01 strain. This concentratiion was used in subsequent experiments.
Colonization pattern of EcN was also addressed both in Sm treated and untreated mice by analyzing KNOTS CFU density in the duodenum, jejunum, ileum, caecum, ascending colon and descending colon (Figure 3.G). Colonization was higher in all part on the intestine of Sm treated mice and was particularly high (>103 CFU/mg tissue) for parts of the intestine between the ileum and the anus. EcN was therefore a good strain for the quantification of conjugation in vivo because of its ability to colonize different regions of the intestinal tract.
However, the caecum was used for subsequent conjugation quantification, as it is a distinct structure that yielded higher density of KN01 (104 CFU/mg tissue).
Table 3. Minimal inhibitory concentration (MIC) of EcN strains.
Strain Sm Sp Ap Cm
EcN 25 mg/L 50 mg/L < 25 mg/L < 8 mg/L
KN01 800 mg/L 3 g/L < 25 mg/L < 8 mg/L
KN02 > 1 .6 g/L 50 mg/L < 25 mg/L 264 mg/L
1.2 - Comparison of conjugative transfer efficiency in vitro versus in vivo
Selection of bacterial conjugative plasmids. To find the most efficient bacterial conjugative system for the transfer of DNA in vivo, six conjugative plasmids were chosen. Those six plasmids span six different incompatibility families (Table 4). Incompatibility families are a classification based on the ability of two plasmids to co-exist at the same time in a cell. For two plasmids to belong to the same incompatibility family, they have to be unable to be maintained simultaneously in a cell. There are two major ways plasmids can be incompatible
(1) by inhibiting the transfer of the other plasmid inside the hosting cell and (2) by strong similarity between their maintenance modules. By selecting plasmids from different incompatibility families, plasmids had a higher chance of being more phylogenetically distant from one another. The plasmids were also selected for their reported in vitro transfer efficiencies (Bradley et al., 1980).
Table 4. Conjugative plasmid selected for this example.
Size Accession
Name Incompatibility CDS Resistances* Supplier/Source
(bp) number
pOX38 IncFI 59,705 62 SpR, TcR, SuR MF370216.1 ** Laura Frost's research group
R6K lncX2 39,872 52 ApR, SmR None***
Name Incompatibility CDS Resistances* Supplier/Source
(bp) number
(DSMZ)
TP1 14 Incl2 64,818 92 KmR MF521836.1 DSM-4246
(DSMZ) pVCR94AX3 IncC 121 ,195 152 KmR KF551948.1** Carraro et al.
2017
R388 IncW 33,913 48 SuR, TmR NC 028464 DSM-5189
(DSMZ)
RP4 IncPI a 60096 70 ApR, TcR BN000925.1 DSM-3876
(DSMZ)
*As experimentally tested in 96 well plates
**NCBI available sequence are engineered variant resistant to other antibiotics
*** Available at http://www.sanger.ac.uk/resources/downloads/plasmids/
Bacterial conjugation efficiency was affected by the physical properties of the environment.
The six conjugative plasmids were transferred into the KNOI Ac/apA and KN01 strains, which constituted the donor strains for the following experiments. Conjugation experiments, between KNOIAc/apA containing one of the conjugative plasmids and KN02 as the recipient, were carried both on agar plates (solid mating) and in broth (liquid mating) in an effort to predict the conjugation efficiency of conjugative plasmids in vivo (Figure 4.A). Conjugation on solid support allows conjugative plasmids to transfer without the need for mating pair stabilization as cells are immobilized on a solid surface and no shearing forces are present
(Bradley, 1984). However, conjugation in liquid is a much more instable environment as cells must firmly grip to each other to avoid conjugation interruption by shearing forces as cells are constantly moving in the liquid. Most plasmids transferred efficiently on agar plates, but pOX38 and R6K were able to conjugate at similar frequencies in both agar and broth.
Conjugation mediated by TP1 14 seemed only mildly affected under liquid transfer conditions.
Discrepancies of conjugation efficiency between in vivo conditions and in vitro laboratory condition. The conjugative plasmids which could be of interest for therapeutic applications (in vivo) and could constitute the most appropriate transfer machinery for the bacterial conjugative bacterial cell system was determined. Conjugation between KN01 and KN02, used as donor and recipient, respectively, was performed in a Sm-treated conjugation mouse model. The mice were fed with the recipient strain 2 hours prior to the introduction of the donor strain. The proportion of transconjugants was monitored for three days in feces (Figure
4.B). On the third day of the experiment, mice were sacrificed and the proportion of transconjugants was addressed in the caecum. The conjugation results in the caecum were consistent with those found in the feces (Figure 4.C). Raw CFUs data for donors, recipients, and transconjugants are shown for each plasmid in Figure 5. Only three plasmids (pOX38, R6K and TP1 14) had been able to conjugate reproducibly in mice. Among those, TP1 14 had a transfer rate nearly 100-fold higher in vivo than in vitro (Figure 4.A). Also, TP1 14 was able to transfer between ~240 and >1 ,400, 000-fold more efficiently than all other tested plasmids at any of the tested timepoint. In vivo transfer activity of TP1 14 was also confirmed in an additional group of 5 mice and was also compared to the one measured in vitro on solid medium using the same time course (Figure 4.D). Conjugation of TP1 14 over 48 hours or more yielded very little improvement in transfer rates in vitro compared to in vivo conditions. To verify if conjugation in Sm-treated mice reflects the transfer rates in an undisturbed microbiome, 12-hour conjugation experiments were carried in both Sm-treated or untreated mouse model with TP1 14 and R6K (Figure 4.E). Transfer rates were similar in both conditions, hereby showing that the presence of the intestinal microbiome did not affect conjugation by TP1 14 or R6K. The influence of colonization time between recipient and donor introduction was also investigated with 2 or 12 hours between gavages. This parameter had little influence on the overall conjugation frequencies of TP1 14 (Figure 6.A) and R6K (Figure 6.B). In addition, colonization levels of recipient strains for both TP1 14 and R6K were similar regardless of the period between gavages (Figure 6.C and 6.D). However, TP1 14 was the only conjugative plasmids tested capable of transferring in vivo with an efficiency rate of nearly 100%, it was therefore considered the most interesting candidate to use as the transfer machinery of the COP.
EXAMPLE II - IDENTIFICATION OF GENES AND GENETIC ELEMENTS REQUIRED FOR
IN VIVO CONJUGATIVE DELIVERY OF DNA BY TP114
Strains, plasmids and growth conditions. All strains and plasmids are described in Table 1. All plasmid sequences are provided in the sequence appendix. Oligonucleotides used in this example, strain growth conditions, DNA manipulation, plasmid construction, recombineering and routine transformation can be found in the Material and Method section of the Example I.
Sequencing of TP114. TP1 14 was acquired from DSMZ (DSM-4246) and transferred from E. coli K12 J53-2 by conjugation into E. coli MG1655NxR. The resulting strain was grown at 37°C in selective LB broth to obtain sufficient DNA for sequencing. An lllumina library was prepared using the QIAseq FX Library kit (Qiagen) from size-selected genomic DNA fragments of approximately 400 to 600 bp. The lllumina library was sequenced on a MiSeq instrument using paired-end reads of 300 bp to assemble longer composite reads covering
the entire insert (Rodrigue et al., 2010). A MinlON (Oxford Nanopore Technologies, UK) sequencing library was also prepared using 1.5 pg of high-molecular weight genomic DNA and the R9 Nanopore sequencing kit (SQK-NSK007, Oxford Nanopore Technologies, UK). Illumina sequencing reads were assembled with the Roche gsAssembler version 2.6 either de novo or using reference sequences from other conjugative plasmids from the Incl2 family (R721 , AP002527.1 ; pChi7122, FR851304; pRM 12761 , CP007134.1 ; pSLy21 , NZ_CP016405.1). Large de novo and reference contigs were then manually assembled and scaffolded with high-quality MinlON reads using BLASTn. Finally, 10 regions of 1.5 kb selected based on lower read coverage were re-sequenced by Sanger sequencing to confirm the assembly with corresponding primers (Table 2). The resulting circular sequence of 64,818 bp (43% G+C content) was submitted to the RAST annotation server (Aziz et al., 2008), and a total of 92 open reading frames (ORF) were predicted. The annotation was then adjusted to name homologous genes consistently between TP1 14 and the reference Incl2 plasmid R721 (GenBank: AP002527.1).
Analysis of TP114 gene function. In silico analysis of TP1 14 gene function was performed using both CDsearch (Marchler-Bauer et al., 2017) and BLASTp (Altschul et al., 1990). A protein multi-fasta file was first generated for all 92 Open Reading Frames (ORF) predicted by RAST (Aziz et al., 2008). The multi-fasta file was processed by CDsearch to find conserved protein domains and attribute protein families, or superfamilies, to each protein coding genes of TP1 14. The multi-fasta file was also submitted to BLAST to identify putative protein homologues when CDsearch would fail to identify any protein domain with high confidence (e-value <1x10 15). Both analyses were performed using default parameters. BLAST hits with high identity levels were used to attribute putative functions only when more than five hits showed the same result. Proteins that failed at matching these criteria were considered of unknown function.
Comparative genomics. Gene content comparison was performed on TP1 14 against a database of 7 randomly selected plasmids of the Inch and Incl2 subfamilies based only on sequence availability (Table 5). The BRIG stand-alone software (Alikhan et al., 201 1) was used to perform BLAST based homology analysis between TP1 14 and each plasmid group. Homology was analysed using both nucleotide sequence of the whole plasmids and amino- acid sequence of the coding genes. Conservation of genes was evaluated using the sequence identity cut-offs of 100%, 70%, and 50 %. The identity percentage was calculated by attributing scores of -2 for mismatches, +1 for matches and a linear cost for insertion/deletion. Genes were then categorized as core genes when present in 100% of the plasmids, soft core genes when present in above 50% of the plasmids, or accessory genes when present in less than 50% of the plasmids.
Deletion of pits in TP114. An FRT flanked cat gene was amplified from pKD3 was used to delete pilS in TP1 14 by recombineering (Datsenko et al., 2000). The recombinant clones of Mΰ I qddRί were then screened using appropriate primers (Table 2). The pilS deletion generated TP1 14Ap/7S::ca7, which was then transferred to E. coli strain KN01 . The ability of wild type and its pilS mutant to transfer from E. coli KN01 to E. coli KN03 was assayed under solid, liquid (static), liquid with agitation and in vivo conditions.
Deletion of the pWVadhesin, the pil V C-terminus shuffion and locking of C-terminus variants of pilV. A cassette containing theca/ chloramphenicol resistance gene flanked by FRT sequences was amplified from pKD3 using appropriate primers (Table 2) providing homology to the regions adjacent to the shufflase gene rci. The cassette was then inserted in TP1 14 by recombineering using pSIM6 in MΰI qddRί to generate TP1 14Arc/::ca/. pSIM6 was then cured by incubation of the strain at a non-permissive temperature, and nexttransformed with pE-FLP. This resulted in the excision of the cat gene from TP1 14Arc/::ca/, creating TP1 14Ara, a variant of TP1 14 lacking the recombination capabilities provided by rci. Then, a cassette containing a FRT flanked cat gene with homology to regions adjacent to pilV N- terminus as well as the previous rc/deletion was amplified and used for a second round of recombieering again using pSIM6 in a MG1655RfR strain. The resulting strain TP1 14 pilV- rcr.cat was then treated with pE-FLP, generating TP1 14 pilV-rci. Alternatively, a cassette containing a FLAG-tag and an FRT flanked cat gene was amplified from pKD3 (with the FLAG-tag being provided by the PCR primer). The cassette was inserted in TP1 14 to replace the 3' end of pilV, the shuffion and the deleted shufflase gene region by recombineering (Datsenko et al., 2000). Recombinant clones of E. coli M01655R^ were then screened using appropriate primers (Table 2). The deletion generated TP1 14p/7VAshufflon-rc/'::cafin which the shuffion is replaced by a FLAG-tag. EachC-terminalvariants of pilVwere also amplified by PCR and fused to an FRT flanked chloramphenicol resistance cassette. The complete cassette contained homology regions for the pilV gene and the shufflase deletion scar. Recombineering using these cassettes generated “locked” configurations for eachpilV variants (TP1 14p/7\/4lshufflon::p/7\/7-ca/, TP1 14p/7\/4lshufflon::p/7\/2-ca/,
TP1 14p//Mdshufflon::p//V3-ca/, TP 1 14p//\/ kshufflon: : pilV3’-cat, TP1 14p//\/ kshufflon: :p//\/4- cat, TP'\ '\4pilVAshuff\orw\pilV4’-cat, T P 1 14 p/7 \/Zl s h u ff I o n : : p/7 \/5- ca f,
TP'\ '\4pilVAshutf\on::pilV5’-caf). Mutant versions of TP1 14, including TP1 14Dr/7\/-/Ό/, TP1 14Ap/7\/Ashufflon-rc/::ca7, and the variants of the pilV adhesins (TP1 14p/7½4shufflon::p/7\/7-ca/, TP1 14p/7½4shufflon::p/7\/2-ca/, TP1 14p/7 \shufflon : :p/7\/3- cat, TP1 '\4pilVAshuff\on::pilV3’-cat, TP'\ '\4pilVAshuff\on::pilV4-cat,
TP1 14p//½4shufflon::p/7\/4’-ca/, TP1 14p/7½4shufflon::p/7\/5-ca/, TP1 14p/7 l s h u ff I o n : : p/7 \/5 - cat) were transferred to E. coli strain KN01 . The ability of the wildtype TP1 14 and its pilV
mutant versions of TP114 to transfer from E. coli KN01 to E. coli KN03 was assayed under solid, liquid (static) and liquid with agitation conditions.
Construction and use of pPHS and pPHV4’. Plasmid pPilS was constructed by amplifying the pilS gene from TP1 14, oriVp^k-araC-PBkD from pBAD30 and cat from pSB1 C3 using primers listed in Table 2 and joining them by Gibson assembly. Plasmid pPilS was then transformed into KN01 + TP1 14Ap/7S for complementation studies. In a similar way, plasmid pPilV4’ was constructed by amplifying the pilV4’ gene from TP1 14, oriVp^k-araC-PBkD from pBAD30 and cat from pSB1 C3 using primers listed in Table 2 and joining them by Gibson assembly. Plasmid pPilV4’ was then transformed into KN01 + TP1 14 pilV for complementation studies. The pilS and pilV4’ genes are under the regulation of AraC40, providing arabinose inducible expression. For complementation experiments, donor and recipient strains were grown overnight at 37°C. Two hours before conjugation, arabinose was added to the donor strain cultures at a final concentration of 1 % w/v. Then, OD60onm of each culture was measured and cells were washed in LB + 1 % arabinose then resuspended in a volume equivalent to 40 OD60onm in LB + 1 % arabinose. A volume of 2.5 pL of the donor and recipient strains were then mixed together and deposited on an LB + 1 % arabinose plate for solid conjugation or mixed with 195 pL of pre-warmed LB + 1 % arabinose for conjugation under both liquid static and liquid shaking conditions. Matings were then performed at 37°C for 2 hours. Additionally, conjugations under the liquid shaking condition were placed on a rotary agitator. After incubation, the matings were serially diluted 1/10 and plated on selective media for CFU analysis of the donor, recipient, and transconjugant strains.
In vitro conjugation assay. All in vitro conjugation experiments were performed as described in the Material and Method section of Example I. Of note, for liquid mating with agitation, cell mixes were incubated at 37°C for 2 hours on a rotary mixer instead of the standard static incubation.
High-density transposon mutagenesis (HDTM). A conjugation assisted random transposon mutagenesis experiment was performed. The transposition system was composed of pFG036 (a plasmid coding for a cl transcription repressor), pFG051 (a pir- dependent suicide plasmid coding for the Tn5 transposon machinery under the repression of cl, a RP4-based origin of transfer and a SpR transposon) and MFDp/r+ (Ferrieres et al., 2010) (which has an RP4 conjugative machinery, diaminopimelic acid auxotrophy and the Pi protein necessary for pFG05Ts maintenance in the cell). The HDTM experiment was performed in several successive steps in order to clearly identify the function of genes involved at each one of these steps. First, pFG051 was transferred by conjugation from MFDp/r+ to EcN containing TP1 14 for 2 hours at 30°C on LB + DAP plates in triplicates. Once in EcN, Tn5 machinery was expressed from pFG051 to mediate random transposon insertions in TP1 14. Then,
transconjugants were entirely plated onto 6 plates per replicates and incubated overnight at 37°C. After the incubation, transconjugant clones formed a cell lawn that was collected using a cell scrapper and subsequently resuspended in LB broth with selective antibiotics. Transconjugants, which forms the mutant library, were then washed, resuspended in 4.5 mL of LB + 25% glycerol and frozen for storage. Also, 100 pL of the mutant library was used in two subsequent conjugative transfer experiments towards KN02 and then towards KN03, which were both carried in parallel in vitro and in vivo.
Mouse model for in vivo HDTM library conjugation. Mice related experiments were done as described in the Material and Method section of Example I with only minor modifications. The donor strain inoculum was prepared 3 to 4 hours prior to mice oral gavage. 500 pL of a frozen stock of the High-Density Transposon Mutagenesis (HDTM) mutant library was inoculated in 20 mL selective LB broth and incubated at 37°C for 4 hours before gavage. When ready, cells were washed once in PBS and concentrated in a volume equivalent to 6.0 OD60onm· Mice were orally challenged with the recipient strain 3 hours prior to the introduction of the donor strain. Conjugation was then monitored by feces sampling at 24 and 48 hours. At 48 hours, mice were sacrificed and the caecum was extracted. Also, for transconjugants, 4 x 100 pL per mice were also plated in order to obtain a large number of transconjugant clones for the sequencing.
HDTM libraries sequencing. For each sample, a 1.5 mL frozen stock aliquot of mutant library was thawed on ice for 15 minutes. The aliquot was centrifugated and cells were resuspended in 300 pL of Cell lysis buffer from the Quick gDNA Miniprep kit (ZymoResearch). DNA was fragmented using a Bioruptor Plus (Diagenode) for 12 cycles of 30 seconds ON, 30 seconds OFF at 4°C. After fragmentation, the Quick gDNA Miniprep kit’s protocol for cell suspension was followed and DNA was eluted in 50 pL of molecular grade water. 10 pg of DNA was then end-repaired using End-repair Mix HC (Enzymatics) followed by DNA purification using AMPure DNA XP magnetic beads (Agencourt). Purified DNA was then adenylated using TaqB (Enzymatics) supplemented with dATP for 30 minutes at 68°C and purified again with AMPure DNA XP beads (Agencourt). Nextera adaptator B was then generated by annealing two oligonucleotides: 5’-P04-CTGTCTCTTATACACATCTCCGAGCCCACGAGAC-lnvdT-3’ (SEQ ID NO: 91) and 5’-
CAAGCAGAAGACGGCATACGAGATTCGCCTTAGTCTCGTGGGCTCGGAGATGTGTATA AGAGACAGT-3’ (SEQ ID NO: 92) together. Annealing was performed by heating 40 pM of each oligonucleotide in annealing buffer (10 mM Tris NaCI pH 7.5, 50 mM NaCI) to 98°C and then slowly decreasing 0.1 °C each 10 seconds until 4°C was reached. Nextera adaptator B was ligated using T4 DNA ligase (Enzymatics) overnight at 16°C. DNA was purified again using DNA Ampure XP beads (Agencourt) and barcoding was performed in a qPCR machine
using Veraseq DNA polymerase (Enzymatics). Amplification reaction was stopped at the end of the exponential phase. DNA was purified again and quantified using Quant-it PicoGreen DNA assay. Quality and size distribution of the amplified mutant library was assessed on Bioanalyzer using a High Sensitivity DNA Chip. Mutant libraries were then pooled and sequenced by lllumina using the Nextera technology.
HDTM mutant analysis. Reads were first trimmed based on their quality and the presence of the Nextera lllumina adapter using Trimmomatic, version 0.32, with the parameters SLIDINGWINDOW:4:20 and MINLEN:30 (Bolger et at, 2014). The quality of the reads, before and after trimming, was assessed with FastQC using the default parameters (Andrew, 2010). Reads mapping on EcN’s chromosome were filtered out and the remaining reads were mapped onto TP1 14. These alignments were done with BWA MEM using the default parameters (Li, 2013). Alignments with a mapping quality score lower than 30 were discarded. The position of the middle base pair of the 9 bp Tn5 insertion site duplication was then used to represent every corresponding alignment (Goryshin et al., 1998). Insertion sites only represented by one read were discarded in an attempt to filter out sequencing noise. The insertion maps with normalized reads count (based on the library size) were then visualized using UCSC Genome Browser in a Box (Haeussler et al., 2015). The essentiality of the genes in condition 1 was verified manually, searching for low coverage regions that were mappable and reproducible in all of the three replicates. A gene count table was then generated by calculating the normalized read count (based on library size) of each TP1 14’s gene for each condition. Insertion sites in the first 5% and last 15% of the gene were not considered in the read count as they may lead to functional gene fragments. The genes important for in vitro and in vivo conjugation were determined based on the gene read count ratio between condition 1 and the test condition. The formula used to compute the gene read count ratios is: (Read count x - Read count 1)/Read count 1. A core set of genes which were considered to be essential for conjugation in vitro ( traABCDEGHIJK , trbJ, nikAB) and in vivo (pilLNOPQRSUV) were then used to set the maximal ratio value for each condition. All genes with gene count ratios below the maximal value were considered essential in the given condition.
2.1 - TP114 conjugative plasmid comparative genomics
TP114 sequencing and annotation. In Example I, TP1 14 was identified to be the most potent conjugative plasmid for DNA cargo delivery in vivo. Therefore, it was the most interesting plasmid to be used as transfer machinery for the COP system. However, little is known about TP1 14. The first step toward the comprehension of TP1 14’s transfer efficiency in vivo was thus to determine its complete sequence. TP1 14 was sequenced within an E. coli MG1655 strain using lllumina and Oxford Nanopore sequencing technologies. Sequence was then
assembled in several ways including reference mapping onto related plasmid R721 from the Incl2 plasmid family and de novo sequence assembly. Then, the plasmid was automatically annotated using RAST to find potential ORFs. Annotation was then rectified by comparing them to annotations from R721 based on sequence homology. Genes with over 98% nucleotide homology to a gene on R721 were re-annotated to be consistent between plasmids. TP1 14’s full sequence and annotation was then submitted to Genbank under accession number: MF521836.1. Plasmid TP1 14 is 64,818 bp long containing 92 CDS and has an average G + C proportion of 43%. TP1 14’s genes were further characterized using BLASTnand CDsearchto find functional homologs. As conjugative plasmids tend to be modular, each gene was then attributed to a specific module with a specific function (type IV secretion system (T4SS), mating pair stabilization, maintenance, regulation, selection and unknown function). Genes were then mapped onto TP1 14 to generate a first graphical map of TP1 14’s genes (Figure 7).
TP114 gene conservation analysis. One way to determine the importance of a specific gene present on a conjugative plasmid is to analyse its conservation. The conservation of genes can be evaluated by sequence homology against closely related conjugative plasmids. Fortunately, conjugative plasmids have been categorized in incompatibility families based on their ability to be stably maintained in the same cell or to be targeted by the same bacteriophage. The inability of two plasmids to share a same host is often linked to similarity between replication protein sequences. As such, since the primary sequence of TP1 14’s replication protein is highly similar to the one of R721 , which belongs to the Incl2 plasmid subfamily, TP1 14 was classified as an Incl2 plasmid. The Incl plasmid family is divided in two subfamilies, Inch and Incl2 and it is still unclear how much both groups share sequence homology. Therefore, comparative genomics analysis was carried on both plasmid subfamilies. Seven plasmids of both Inch and Incl2 subfamilies were selected based on the availability of their full genome sequence in Genbank (NCBI) (Table 5). These plasmids were then used as database for homology analysis with TP1 14 using the stand-alone BRIG software. TP1 14’s genes were mostly highly conserved throughout the Incl2 plasmids both at the nucleic acid and amino acid levels (Figure 8.A and Figure 8.B). This suggested that most Incl2 plasmids could share TP1 14’s ability to transfer DNA efficiently in vivo. Homology towards Inch plasmids was really scarce both at the nucleic acid and amino acid levels (Figure 8.C and Figure 8.D). Even though gene functions are similar between Inch and Incl2 groups, high sequence divergence was found between these two subfamilies. Therefore, for the analysis of gene conservation, only the Incl2 subfamily had been considered since sequence homology with the Inch group was too low. Genes were categorized into 3 groups: core, soft core and accessory (Table 6). Core genes were conserved in all plasmids of the
Incl2 family whereas the soft-core genes were conserved in >50% of the plasmids. Finally, genes were considered accessory if they were present in <50% of the plasmids.
Table 5. Plasmids used in the comparative genomic analysis.
Plasmid name Incompatibility group Lenght (bp) NCBI accession number
pESBL-117 Inch 89503 CP008734
pESBL-315 Inch 93037 CP008738
pESBL-12 Inch 96463 CP008735
pE17.16 Inch 101321 CP008733
pESBL-305 Inch 107552 CP008737
pESBL-283 Inch 110137 CP008736
R64 Inch 120826 NC_005014.1
TP114 Incl2 64818 MF521836.1
pChi7122-3 Incl2 56676 FR851304
LN623683.2 Incl2 62139 LN623683.2
pHN1122-1 Incl2 62196 JN797501
pSly21 Incl2 63329 NZ_CP016405.1
pSH146_65 Incl2 65030 JN983044
pHNY2 Incl2 65358 KF601686.2
R721 Incl2 75582 AP002527.1
Table 6. TP1 14 in silico gene function prediction by CDsearch and BLAST.
TP114- 047 TP114-047 None Unknown Accessory
TP114- 048 TP114-048 None Unknown Accessory
TP114- 052 yceB None Unknown Core
TP114- 055 TP114-055 None Unknown Core
TP114- 057 TP114-057 None Unknown Core
TP114- 059 yaiB None Unknown Accessory
TP114- 060 TP114-060 None Unknown Core
TP114- 061 yaiA None Unknown Accessory
TP114- 062 yahA None Unknown Accessory
TP114- 063 TP114-063 None Unknown Soft-core
TP114- 064 TP114-064 None Unknown Core
TP114- 065 yagA None Unknown Core
TP114- 066 TP114-066 None Unknown Core
TP114- 067 TP114-067 None Unknown Core
TP114- 071 TP114-071 None Unknown Core
TP1 14- 077 TP114-077 None Unknown Accessory
TP1 14- 081 TP114-081 None Unknown Core
TP1 14- 082 TP114-082 None Unknown Core
TP1 14- 084 TP114-084 None Unknown Core
TP1 14- 086 yheB pfam06688 Unknown Core
TP1 14- 088 TP114-088 None Unknown Accessory
TP1 14- 090 TP114-090 None Unknown Core
TP1 14- 091 yhbB None Unknown Soft-core
TP1 14- 092 TP114-092 None Unknown Soft-core
'This gene might have a function in more than one module.
TP114 encoded for a mating pair stabilization module. One interesting feature shared by I- complex plasmids (IncB/O (Ind O), Inch , Incl2, IncK and incZ alike) is the presence of genes encoding a functional type IV pilus (T4P) (Sekizuka et al., 2017). As observed with other plasmids from the l-complex, TP114 encoded a T4P that was independent from the traditional T4SS. Such an apparatus is thought to improve mating pair stabilization (hereby named mating pair stabilization module) by binding directly to the recipient’s membrane and retracting the pilus to facilitate donor/recipient direct contact (Bradley, 1984). Very few plasmid families are known to encode T4P (e.g. Inch (Ishiwa et al., 2003), Incl2 (Sekizuka et al., 2017), IncB/O (Ind O) (Papagiannitsis et al., 201 1), IncK (Seiffert et al., 2017), and IncZ (Venturini et al., 2013)).
2.2 - High density transposon mutagenesis of TP114.
HDTM experiment design. Several HDTM experiments were needed to fully characterize TP1 14 genes functions. As such, a scheme describing the HDTM is presented to fully comprehend the extent of the experiment (Figure 9). HDTM step 1 generated a full library of TP1 14 mutants. At this point, if a transposon interrupts a gene required for short-term plasmid maintenance in the cell, this would lead to plasmid loss. By using antibiotics to select for TP1 14 and the Tn5 transposon, cells that have lost the plasmid will not survive. Therefore, insertion of Tn5 within essential maintenance genes will prevent those genes from being sequenced resulting on low coverage of their loci. This HDTM library 1 was then used in two different conjugation experiments, one where the TP1 14::Tn5 were transferred by solid mating in vitro (Step 2) and one where the mating was carried in vivo (Step 4 where transconjugants originated from the feces and Step 6 where the transconjugants were retrieved from the caecum). The HDTM library 2 revealed genes essential for conjugation in vitro whereas HDTM library 4 and 6 revealed genes essential for conjugation in vivo, which allowed us to discriminate between genes required in both environments and genes only required in one specific environment. The HDTM library 2 was further used as donors in two supplementary mating experiments in vitro (Step 3) and in vivo (Step 5 for feces extracted transconjugants and Step 7 for caecum extracted transconjugants). This second transfer is expected to enrich genes which once inactivated by the Tn5 transposon have a positive effect on conjugation efficiency (e.g. a transcription repressor). Furthermore, a second transfer step ensured minimal background by diluting the initial donor cells. Finally, HDTM step 8 required the transfer of the modified TP1 14::fefS into HDTM library 2. Conjugative plasmids of a same incompatibility family can usually prevent the acquisition of another related plasmids through a mechanism called exclusion. Since TP1 14 can mediate exclusion, TP1 14::fefS can only enter the cell if the exclusion related gene(s) are interrupted by the Tn5 transposon.
HDTM analysis consideration. Analysis of the HDTM mutant libraries revealed an average coverage of 9.68 insertions per bp in TP1 14. This high resolution allowed us to assess the essentiality of even the smallest annotated TP1 14 gene. However, TP1 14 encodes a set of 7 genes containing repeated regions in which reads cannot be mapped (termed 0-mappability regions) (Table 7). Those genes appeared under-represented, but were not necessarily essential and were analyzed by considering only the portion of the gene that was mappable. The HDTM experiment also accounted for possible donor DNA contamination. By doing successive transfer experiment with the HDTM library, background contamination was drastically reduced and showed consistent results. As such, a clear drop in background level was seen from HDTM library 2 to HDTM library 3 and consistent results were observed between libraries (Figure 10). The HDTM analysis also took advantage of a high number of
replicates and conditions to allow for comparison of read count between conditions and between biological replicates. Similarity between replicates and conditions was evaluated by Pearson Correlation and revealed well correlated replicates and weaker correlation between the different conditions (Figure 1 1). This kind of distribution was expected as different conditions will put different selective pressure onto the population of mutants, selecting the mutants with the best fits for each specific condition. Only replicates from mice group C of HDTM library 4 and 6 was weakly correlated to others and was discarded from the analysis.
Table 7. O-mappability regions in TP1 14.
Gene Length (bp) 0-map bp % mappable
TP1 14-049 327 22 93.27%
TP1 14-057 216 156 27.78%
TP1 14-063 216 77 64.35%
TP1 14-064 147 70 52.38%
TP1 14-066 138 76 44.93%
TP1 14-067 165 67 59.39%
ycgB 450 22 95.1 1 % HDTM identified important features for TP114 replication and maintenance. The first step of HDTM was to generate a mutant library with insertion in all genes (Library 1). In this set-up, only insertions in the sequences important for the replication and maintenance of TP1 14 should produce non-viable clones. Therefore, genes important for replication should be underrepresented in the read coverage as compared to other genes. As suspected, most genes had high insertion coverage except for a core set of 6 genes which were reproducibly under-represented (Figure 12). Among those, repA was already suspected to be a critical actor in plasmid maintenance, and the aph-lll gene was used for transconjugant selection and therefore would appear essential. The list of essential genes for replication and maintenance is shown in Table 8. Most of these genes were already suspected to be part of the maintenance module or were highly conserved in the Incl2 family. However, ygiA (TP1 14-006), ydiA (TP1 14-035), ydgA (TP1 14-036), traL (TP1 14-044) and TP1 14-072 that were predicted to be implicated in plasmid maintenance were found to be dispensable by HDTM. There was a possibility that these genes were not required for plasmid maintenance in E. coli due to functional redundancy, but could be involved in plasmid maintenance in other host species. Some genes, like traL, might be implicated in another function of the conjugative plasmid. Nonetheless, the gene function prediction was mostly in agreement with the HDTM results.
Table 8. Revised list of essential maintenance genes.
Predicted module Locus tag name in TP1 14 Predicted Function Conservation maintenance TP1 14-050 ycfB hicB Anti-toxin Core maintenance TP1 14-051 ycfA hie A toxin Core maintenance TP1 14-068 parA Partition Core selection TP1 14-076 aph-lll Kanamycin resistance Accessory unknown function TP1 14-082 TP1 14- 082 Unknown Core maintenance TP1 14-083 repA Replication initiation Core
Identification of TP114 genes essential for in vitro conjugation on solid medium. Gene importance for in vitro mating on solid medium was evaluated by gene count ratios. Gene count ratios were calculated by comparing the number of reads that map in a given gene in two different contexts. Briefly, genes which became under-represented following conjugative transfer in vitro (libraries 2 and 3 as compared to library 1) gave negative gene count ratio. To assess gene essentiality and account for any bias, a set of gene, which were predicted to be essential for conjugation ( traABCDEFGHIJK , trbJ, nikAB), was used to evaluate the maximal and average gene count ratio of essential genes. However, traF had a high gene count ratio and was considered an outlier and not essential for conjugation. Gene distribution was plotted for both HDTM library 2 and 3 (Figure 13) and all genes bellow the maximum gene count ratio value were considered essential for conjugation in the given library. Genes that were already determined to be implicated in plasmid maintenance were discarded. The list of genes essential for TP1 14 conjugation in vitro is shown in Table 9.
Table 9. Important genes for in vitro conjugation.
Predicted Locus tag name in TP1 14 Predicted Function Conservation Library module
Unknown TP 1 14- yhaB Unknown Core 2, 3 function 001
Unknown TP 1 14- yhaA Unknown Core 3 function 002
T4SS TP 1 14- traA VirB6 Core 2, 3
003
T4SS TP 1 14- trbJ VirB5 Core 2, 3
004
Unknown TP 1 14- ygeA Unknown Core 3 function 01 1
T4SS TP 1 14- traB VirB1 Core 2, 3
012
T4SS TP 1 14- traC VirB2 Core 2, 3
013
T4SS TP 1 14- traD VirB3 Core 2, 3
014
T4SS TP 1 14- traE VirB4 Core 2, 3
015
T4SS TP 1 14- traG VirB8 Core 2, 3
017
T4SS TP 1 14- traH VirB9 Core 2, 3
018
T4SS TP 1 14- tral VirBI O Core 2, 3
019
T4SS TP 1 14- traJ VirB1 1 Core 2, 3
020
T4SS TP 1 14- traK VirD4 Core 2, 3
021
Mobilization TP 1 14- nikB Relaxase Core 2, 3
041
Mobilization TP 1 14- nikA mobC Core 2, 3
042
Selection TP1 14- TP1 14-043 Putative zinc Soft-core 2
043 transporter
Regulation TP 1 14- yajA Transcription Accessory 2, 3
058 regulator
Unknown TP1 14- TP1 14-060 Unknown Core 2, 3 function 060
Regulation TP 1 14- yafA cogG Transcription Core 3
069 regulator
Identification of genes important for TP114 conjugation in vivo. Gene essentiality for in vivo conjugation of TP1 14 was carried out similarly to the in vitro analysis. However, the set used to fix the maximum gene count ratio was composed of pilLMNOPQRSTUV as it was apparent that p/7 genes were essential only for in vivo conjugation (Figure 14). The pilM and pilT genes were removed from the set as they were outliers. It was critical to use genes which are suspected to be important only in vivo, at least for HDTM library 5 and 7. Indeed, the HDTM library 5 and 7 were generated by in vivo conjugation of HDTM library 2, which in turn is generated from an in vitro transfer of HDTM library 1. As such, since the first in vitro mating put a strong selective pressure for plasmids that can transfer in vitro and genes that are essential in vitro can also be essential in vivo, seeding with these genes excluded the majority of the genes that were only essential in vivo. Nonetheless, as 4 different conditions, including 12 total mice, were used for in vivo conjugation, gene essentiality can also be evaluated by consistency between replicates from the same or different conditions. Genes which were below the threshold for any of the in vivo conditions are listed in Table 10. However, genes which were already proven to be important for plasmid maintenance were not included in this table, as the gene count ratio would often lead to division by 0. When HDTM library 1 was used as donor cells for in vivo conjugation (conditions 4 and 6), 40 genes were found to be essential in the feces samples, and 36 in the caecum samples. Of these genes, 31 were shared in both conditions. Consistently, for the conjugation of HDTM library 2 in vivo (conditions 5 and 7), 40 genes were found to be important in feces samples and 37 genes were found to be important in the caecum samples, with 33 shared genes. In total, 40 genes were found to be important in at least one condition, of which 31 were shared
in all conditions. Interestingly, most of the genes important for in vitro conjugation were also important for in vivo conjugation. As condition 5 and 7 follow an in vitro conjugation, this was expected. Additionally, most of pil genes were essential for in vivo transfer, with pilM and pilT being the only two exceptions.
Table 10. List of important genes for in vivo conjugation.
Predicted module Locus tag name in Predicted Function Conservation Library
TP1 14
Unknown function TP1 14-001 yhaB Unknown Core 4, 5, 6, 7
Unknown function TP1 14-002 yhaA Unknown Core 4, 5, 6, 7
T4SS TP1 14-003 traA virB6 Core 4, 5, 6, 7
T4SS TP1 14-004 trbJ virB5 Core 4, 5, 6, 7 selection TP1 14-008 kikA p-toluenesulfonate Core 4, 5, 6 degradation ( trbM)
Mating Pair TP1 14-009 pilL tcpQ Core 4, 5, 6, 7
Stabilization
Unknown function TP1 14-01 1 ygeA Unknown Core 4, 5, 6, 7
T4SS TP1 14-012 traB virB1 Core 5, 6, 7
T4SS TP1 14-013 traC virB2 Core 4, 5, 6, 7
T4SS TP1 14-014 traD virB3 Core 4, 5, 6, 7
T4SS TP1 14-015 traE virB4 Core 4, 5, 6, 7
T4SS TP1 14-017 traG virB8 Core 4, 5, 6, 7
T4SS TP1 14-018 trahi virB9 Core 4, 5, 6, 7
T4SS TP1 14-019 tral virBIO Core 4, 5, 6, 7
T4SS TP1 14-020 traJ virB11 Core 4, 5, 6, 7
T4SS TP1 14-021 traK virD4 Core 4, 5, 6, 7
Mating Pair TP1 14-022 pilN pilN Core 4, 5, 6, 7
Stabilization
Mating Pair TP1 14-023 pilO pilO Core 4, 5, 6, 7
Stabilization
Mating Pair TP1 14-024 pilP pilP Core 4, 5, 6, 7
Stabilization
Mating Pair TP1 14-025 pilQ virB11 Core 4, 5, 6, 7
Stabilization
Mating Pair TP1 14-026 pilR T2SSF Core 4, 5, 6, 7
Stabilization
Mating Pair TP1 14-027 pilS pi IS Core 4, 5, 6, 7
Stabilization
Mating Pair TP1 14-029 pilU Peptidase_A24 Core 4, 5, 6, 7
Stabilization
Mating Pair TP1 14-030 pilV Shufflon N Core 4, 5, 6, 7
Stabilization
Mobilization TP1 14-041 nikB Relaxase Core 4, 5, 6, 7
Mobilization TP 1 14-042 nikA mobC Core 4, 5, 6, 7 selection TP1 14-043 TP 1 14-43 Putative zinc Soft-core 5, 7 transporter
maintenance TP1 14-044 traL parA Soft-core 4, 5, 7
Regulation TP1 14-058 yajA Transcription Accessory 4, 5, 6, 7
Predicted module Locus tag name in Predicted Function Conservation Library
TP1 14
regulator
Unknown function TP1 14-059 yaiB Unknown Accessory 4, 5, 6, 7 Unknown function TP1 14-060 TP1 14-060 Unknown Core 4, 5, 6, 7 Regulation TP1 14-069 yafA cogG Core 4, 5, 6, 7
Transcription
regulator
Regulation TP1 14-085 yheC CaiF/GrIA Core 4
transcriptional
regulator
Unknown function TP1 14-086 yheB Unknown Core 4, 5, 6, 7 selection TP1 14-087 ydhA Stomatin-like Core 4, 5, 6, 7
protein
Unknown function TP1 14-088 TP1 14-088 Unknown Accessory 4
selection TP1 14-089 yhcA Regulation of Core 4
membrane
protease activity
Unknown function TP1 14-090 TP1 14-090 Unknown Core 5
Unknown function TP1 14-091 yhbB Unknown Soft-core 5
Unknown function TP1 14-092 TP1 14-092 Unknown Soft-core 4
TP114 possessed a core set of genes that were important for conjugation. Performing HDTM
with a high number of biological replicates allowed us to attribute confidence level to the
importance of each gene for different functions of TP1 14 (Table 1 1). The confidence level was based on reproducibility of the result, with ++ being the highest confidence level and - being the lowest. For plasmid maintenance, confidence level was attributed differently than
other conditions. A confidence level of ++ meant that the genes were essential in all replicates and - meant it was not essential in at least one condition. For all other conditions,
confidence level is based on the degrees of reproducibility between conditions, with ++
meaning the gene was essential in all conditions, + meaning it was essential in only one of
the conditions and - meaning it was never essential
Table 1 1 . Confidence level of the importance of TP1 14 genes for conjugation and
maintenance
Essentiality confidence level _
Locus name in maintenance in vitro in vivo in vitro and in vivo Conservation tag TP1 14 conjugation conjugation only conjugation
TP1 14- yhaB ++ ++ ++ Core 001
TP1 14- yhaA + ++ ++ Core 002
TP 1 14- traA ++ ++ ++ Core 003
TP 1 14- trbJ ++ ++ ++ Core
004
TP114- TP114- Accessory
005 005
TP114- ygiA Soft-core 006
TP114- yggB Core 007
TP114- kikA ++ + Core 008
TP 114- pilL ++ ++ Core 009
TP114- pilM Core 010
TP114- ygeA - + ++ ++ Core
011
TP114- traB - ++ + ++ Core
012
TP114- traC - ++ ++ ++ Core
013
TP114- traD - ++ ++ ++ Core
014
TP114- traE - ++ ++ ++ Core
015
TP114- traF Core
016
TP 114- traG - ++ ++ ++ Core
017
TP 114- traH - ++ ++ ++ Core
018
TP 114- tral - ++ ++ ++ Core
019
TP114- traJ - ++ ++ ++ Core
020
TP114- traK - ++ ++ ++ Core
021
TP 114- pilN ++ ++ Core 022
TP 114- pilO ++ ++ Core 023
TP 114- pilP ++ ++ Core 024
TP 114- pilQ ++ ++ Core 025
TP 114- pilR ++ ++ Core 026
TP 114- pilS ++ ++ Core 027
TP 114- pilT Core 028
TP 114- pilU ++ ++ Core 029
TP 114- pilV ++ ++ Core 030
TP 114- rci Core 031
TP 114- yebA Core 032
TP 114- yeaA Core 033
TP114- YdjA Accessory 034
TP114- ydiA Accessory 035
TP114- ydgA Core 036
TP114- hha Core 037
TP114- TP114- Accessory
038 038
TP114- TP114- Accessory
039 039
TP 114- yddA Core 040
TP 114- nikB ++ ++ ++ Core 041
TP 114- nikA ++ ++ ++ Core 042
TP114- TP114- + ++ Soft-core
043 043
TP 114- traL + ++ Soft-core 044
TP114- TP114- Soft-core
045 045
TP 114- ycgB Accessory 046
TP114- TP114- Accessory
047 047
TP114- TP114- Accessory
048 048
TP114- TP114- Core
049 049
TP 114- ycfB ++ ++ ++ ++ Core
050
TP 114- ycfA ++ ++ ++ ++ Core
051
TP 114- yceB Core 052
TP 114- hicB Accessory 053
TP114- TP114- Accessory
054 054
TP114- TP114- Core
055 055
TP 114- yajC Accessory 056
TP114- TP114- Core
057 057
TP114- yajA ++ ++ ++ Accessory 058
TP114- ya/S ++ ++ Accessory
059
TP114- TP114- ++ ++ ++ Core
060 060
TP 114- yaiA Accessory 061
TP114- yahA Accessory 062
TP114- TP114- Soft-core
063 063
TP114- TP114- Core
064 064
TP114- yagA Core 065
TP114- TP114- Core
066 066
TP114- TP114- Core
067 067
TP 114- par A ++ ++ ++ ++ Core
068
TP 114- yafA ++ ++ ++ Core 069
TP 114- yaeC Core 070
TP114- TP114- Core
071 071
TP114- TP114- Core
072 072
TP 114- yadA Accessory 073
TP 114- yadB Accessory 074
TP 114- tnpA Accessory 075
TP 114- aph-lll Accessory 076
TP114- TP114- Accessory
077 077
TP114- TP114- Accessory
078 078
TP114- TP114- Accessory
079 079
TP114- TP114- Core
080 080
TP114- TP114- Core
081 081
TP114- TP114- ++ ++ ++ ++ Core
082 082
TP 114- repA ++ ++ ++ ++ Core
083
TP114- TP114- Core
084 084
TP114- yheC + Core 085
TP114- yheB ++ ++ Core 086
TP 1 14- ydhA ++ ++ Core 087
TP1 14- TP1 14- + - Accessory
088 088
TP1 14- yhcA + - Core 089
TP1 14- TP1 14- + Core
090 090
TP1 14- yhbB + Soft-core 091
TP1 14- TP1 14- + - Soft-core
092 092
Confirmation of the TP114 mating pair stabilization module’s essentiality for in vivo conjugation. The role of the T4P (p/7 genes) in mating pair stabilization in vitro was suspected for R721 , another model plasmid of the Incl2 family. However, TP1 14 HDTM data suggested that the genes predicted to be involved in the formation of the T4P were essential for conjugation in vivo. This meant that the T4P was not required in vitro for solid mating and that the differences in environmental conditionsmade the T4P essential in vivo. To confirm this hypothesis, a complete abolition of the T4P functions would be desirable. The prepilin gene pilS was first selected for deletion as the HDTM data revealed a strong dependence on this gene for in vivo conjugation, and because of its crucial role in the structure of the T4P. The prepilin is a major subunit forming the T4P, it is first processed into pilin by specific peptidase and then secreted and assembled into a pilus. Deleting pilS is thus expected to abolish the formation of the pilus and prevent its assembly. The pilS gene was deleted from TP1 14 using a recombination approach. The resulting TP1 14Ap/7S: :caf was then transferred by solid mating conjugation in KN01 for further testing. As such, TP1 14Ap/7S::caf was tested for its ability to conjugate from KN01 to KN03 in solid, liquid, and agitating liquid conditions (Figure
16.A). Whereas the ability of TP1 14Ap/7S::caf to transfer during solid mating remained unchanged, a drastic drop in transfer efficiency was observed in the presence or absence of shaking in liquid. This suggests that conjugation efficiency of TP1 14Ap/7S::cafwas impaired under unstable conditions compared to wild type TP1 14. Complementation using plasmid pPNS expressing pilS in trans resulted in significant recovery of conjugation rates. To test the involvement of the T4P for conjugation in the intestinal tract, a conjugation experiment was performed using a mouse model. A total of 5 Sm-treated mice were first fed with KN03 as a recipient strain and then with either KN01 + TP1 14 or TP1 14Ap/7S::caf as donors.
Conjugation efficiency was monitored through feces collection for four consecutive days
(Figure 17.A). On the fourth day, mice were sacrificed to compare the conjugation efficiency found in the feces and in the caecum (Figure 17. B). While wild type TP1 14 was able to conjugate at its expected frequency (>10 1), TP1 '\4ApilS: .cat was unable to transfer in vivo.
This reveals the crucial role of TP1 14’s T4P for in vivo conjugation, and potentially explains the conservation of T4P across Incl2 conjugative plasmids.
Role of the pilV adhesin variants and the shufflon in TP114. Although the HDTM data indicated that only the N-terminus portion of pilV was essential for in vivo conjugation, it was suspected that this was in fact an artifact of the HDTM method. This anomaly is due to the presence of a shufflon at the C-terminus of pilV that re-organize the end of the gene to produce several variants. The pilV gene encodes an adhesin thought to be responsible for the recognition of the recipient cell by a donorbacterium. The adhesin is believed to be present on the tip of the T4P to establish contact and stabilize the interaction between the two cells. The shufflase was thus deleted to lock the shufflon in a stable conformation in TP1 14Ara. Then, to assess the role of pilV and the shufflon for in vivo conjugation, the entire pilV gene was deleted in a first experiment, generating TP1 14 pilV-rci. In another variant, only the shufflon was replaced by a Flag-tag, generating TP1 14p/7\/Ashufflon-rc/::caf. In yet another series of variants, the pilV gene variants were locked in a specific conformation (TP1 14p/7Wlshufflon::p/7\/7-ca/, TP1 14p//\/Zlshufflon::p//\/2-caf, TP1 14p//\/Zlshufflon::p//\/3- cat, TP1 14p/7 \/Zl s h u ff I o n : : p/7 \/3 - ca f, T P 1 14p/7 \/Zl s h u ff I o n : : p/7 \/4- ca f,
TP1 14p//½4shufflon::p/7\/4’-ca/, TP1 14p/7½4shufflon::p/7\/5-ca/, TP1 14p/7 l s h u ff I o n : : p/7 \/5 - cat. This was accomplished to test if the C-terminus portion of pilV was essential for PilV- mediated adhesion, and to elucidate the importance of the different pilV variants. As such, TP1 14Dr/7\/-/Ό/, TP1 14p/7\/Ashufflon-rc/::caf and all locked p/7 V variants were transferred to KN01 to test their role in mating pair stabilization. Each construct was transferred to KN03 recipient bacteria by conjugation under solid, liquid static and agitating liquid conditions. The pilV adhesin was found to be essential for conjugation in both liquid conditions, confirming the role of the adhesin in mating pair stabilization (Figure 16.B). More precisely, the ability of TP1 14p/7\/Ashufflon-rc/::ca7 to conjugate on a solid support was slightly decreased as compared to the control, but the ability to conjugate in liquid (shaking or not) was completely abolished (Figure 16. B). This showed that the C-terminus portion of pilVwas essential to generate functional adhesins, contrary to what was suggested by the HDTM data. Furthermore, transfer of TP1 14 pilV variants between two E. coli cells resulted in high conjugation efficiencies for all PilV configurationson solid medium (Figure 16.C) while only some variants (namely TP'\ '\4pilVAshutf\on::pilV3’-cat, TP'\ '\4pilVAshutf\on::pilV4-cat, and TP'\ '\4pilVAshutf\on::pilV4’-caf) were proficient for mating pair stabilization and conjugation understatic liquid (Figure 16. D) and agitating liquid conditions (Figure 16. E). This indicated that pilV variants recognized different structures at the surface of bacteria, and that these interactions were needed for plasmid transfer. As such, variantsTPI 14p/7Wlshufflon::p/7\/3 - cat, TP'\ '\4pilVAshutf\on::pilV4-cat, and TP1 \4pilVAshutf\on::pilV4’-cat recognized structures
present at the surface of EcN, most likely lipopolysaccharides, while other variants recognized structures not found on EcN. To confirm that this interaction is required for conjugation in vivo, the pilV deletion mutant TP1 14 pilV-rci, one variant recognizing EcN (TP1 14p/7Wlshufflon::p/7\/4 -ca/)or not (TP1 14p//\/ kshufflon: :p//\/T-caf) was used for an in vivo conjugation assay using KN03 as a recipient (Figure 17.C). In this assay, only TP'\ '\4pilVAshutf\on::pilV4’-catwas able to conjugate in vivo, confirming that the binding of a PilV adhesin to the surface of a recipient bacterium is needed for in vivo conjugation.
Identification of an exclusion protein of TP114. Exclusion and incompatibility are two mechanisms that prevent two plasmids to share a same cell. Whereas incompatibility is passive and occurs when two replication or maintenance system are too similar, exclusion is an active mechanism that prevent either cell-cell contact or DNA entry in the recipient. The existence and extent of exclusion in TP1 14 was first verified. To do so, a conjugation experiment from KN02 to KN01 bearing six different conjugative plasmids was carried. The selected plasmids spanned six incompatibility families and those families were first categorized based on incompatibility and exclusion, only TP1 14 should be able to mediate its own exclusion. As expected, the exclusion phenomenon was only observed with TP1 14 transferring to a TP1 14 bearing recipient (Figure 18.A) (Figure 18.B). However, in this experiment, it was impossible to discern incompatibility from maintenance and exclusion. This was addressed using a mobilizable plasmid pCloDF13 capable of hijacking TP1 14’s T4SS to conjugate, but which uses a completely different replication mechanism. As shown in Figure 18. B, the exclusion ratio of both TP1 14 and pCloDF13 were similar which proved that TP1 14 is subjected to exclusion. In a last effort to characterize this resistance mechanism, an exclusion experiment was carried in vivo where KN01 + TP1 14 and KN02 + TP1 14 v.tetB were used as conjugation pairs. Exclusion was found to be even more stringent in this environment, barely even producing transconjugants over a period of 4 days (Figure 18.C). Of note, results were consistent between the feces and the caecum (Figure 18.D). Using the HDTM library 2, the gene responsible for exclusion was investigated. In the final step of HDTM, TP1 14::fefS was transferred to the HDTM library 2, and transconjugants were sequenced. Most of the transconjugants had transposon insertions in TP1 14-005, a gene of previously unknown function (Table 12). To confirm that the exclusion protein was TP1 14- 005, clones from the HDTM library 8 were transferred first to MG1655NxR, and then back into KN02. The use of two successive rounds of transfers should isolate the TP1 14::Tn5 clone since in vitro conjugation efficiencies allows TP1 14 to transfer to 1 % of the cells, hereby reducing the transfer to a same recipient cell to a minimum. Also, even with deficient exclusion, incompatibility should prevent the persistence of two different clones of TP1 14 within a same cell through maintenance incompatibility mechanisms. As insertions in TP1 14-
005 are more frequent, they should also be easily isolated. Exclusion capacity of the three clones of TP1 14::Tn5 was tested by transferring TP1 14::fefS from KN01 into KN02. Most of the clones were incapable of exclusion as shown by the high conjugative frequencies (Figure 19.A) and exclusion ratio (Figure 19.B). Surprisingly, while testing the ability of these clones to conjugate, exclusion deficient clones were nearly 10-fold more efficient than wild type TP1 14 (Figure 19.C). Therefore, it is understood that the TP1 14-005 gene and the corresponding protein were involved in the exclusion process.
Table 12. Over-representation of some genes in the exclusion HDTM library 8.
Gene Replicate
TP1 14-
005 70.6% 77.6% 76.9%
ydjA 1.0% 0.7% 0.8%
yaeC 7.7% 3.8% 8.9%
pilQ 0.7% 0.9% 1.0%
ydgA 0.7% 0.6%
yaiA 1.3% 1.0%
yadA 0.7% 0.5%
TP1 14-
080 0.7% 0.7%
TP1 14-
048 1.6% 1.5%
pilR 1.5% 1.5%
yebA 0.8%
TP1 14-
045 1.2%
pilN 0.9%
pilM 1.2%
nikB 0.8%
hicB 0.7%
Identification of genes with detrimental effects on TP114 conjugation. The HDTM experiment helped to discover genes which limit TP1 14 conjugation transfer frequency. These genes can be determined by looking at genes having a count ratio that increased following each transfer steps. In our experiments, only two genes were found to be detrimental to TP1 14’s transfer both in vitro and in vivo. These genes were TP1 14-005 (which was previously identified as the exclusion protein) and yaeC (TP1 14-070) (Figure 20). Interestingly, gene function analysis revealed that yaeC is a homolog to finO, a transcription factor that represses genes in the IncF family of conjugative plasmid. The yaeC gene is therefore most likely a transcription repressor that limits TP1 14’s transfer, both in vitro and in vivo. The deletion of this gene could be desirable to enhance TP1 14 transfer both in vitro and in vivo.
EXAMPLE III - CONSTRUCTION OF CONJUGATIVE DELIVERY SYSTEMS DERIVED
FROM TP114
Strains, plasmids and growth conditions. All strains and plasmids used in this example are described in Table 1. All plasmid sequences are provided in the sequence appendix. Oligonucleotides used in this example are listed in Table 2. Details on strain growth conditions, DNA manipulation, plasmid construction, recombineering and routine transformation can be found in the Material and Method section of the Example I.
Construction of the loading dock pREC1. pREC1 is a plasmid used as a template to amplify a loading dock cassette and to insert it into the transfer machinery in one simple step. In this example, pREC1 was used to insert a loading dock into the TP1 14 transfer machinery. To do so, pREC1 was first assembled into a plasmid by Gibson assembly (amplifying the tetB resistance gene from pFG018 and the oriVR6K from pKD4). An FRT and attPBx bi sites were provided by the primer assembly tails. The resulting plasmid pREC1 (Figure 21. C) was then used as a template for subsequent PCR amplification. The FRT-fefS-affPBXbi cassette was inserted in TP1 14 by recombineering, creating TP1 14::fefS plasmid (Datsenko et al., 2000). This version of TP1 14 can then be used for the insertion of the genetic cargo by Double Recombinase Operated Insertion of DNA (DROID).
Construction of pBXB1. The plasmid pBXB1 contains the integrase Bxb1 and was assembled by amplifying the or/VPM BI and the ampicillin resistance gene ( bla ) from pSB1A3, and the Bxb1 integrase gene from gBIock-Bxbl (Figure 21. D). The promoter of the Bxb1 integrase allows constitutive expression of the Bxb1 integrase. The two PCR products were purified by SPRI and pBXB1 was constructed by Gibson assembly before transformation in chemically competent EC100Dp/r+.
Double Recombinase Operated Insertion DNA (DROID). To combine on a same vector the genetic cargo and the transfer machinery, a new process hereby termed DROID, was developed (Figure 22.A). In the present example, the DROID method was used to combine two DNA molecules together without destabilizing the plasmid replication. The DROID method requires both DNA molecules to contain specific integration and recombination sites. For the transfer machinery, the recombination sites are provided by the loading dock present in the tetB insert of TP1 14::fefS (Figure 22. B). For the genetic cargo, the recombination sites are provided during its assembly in a temporary backbone termed «insertion device» (Figure 21. AB), which is removed during the DROID procedure. Both pBXB1 and one of the genetic cargo insertion devices were transformed in the same EC100Dp/r+ strain by electroporation. The modified TP1 14::fefS was then transferred by conjugation at 30°C into EC100Dp/r+ containing pBXB1 and one of the KNI1 or 3 insertion devices. Since the integrase is
constitutively expressed, a single subculturing step is enough to promote integration. The resulting plasmid TP1 14::fefS-KNI1 or 3 was then transferred by conjugation to E. coli MG 1655NXr. Transconjugants bearing both the selection module from the genetic cargo and from TP1 14::fefS were selected. From this point forward, the strains were cultivated at 37°C to avoid activation of the thermosensitive origin of replication (oriVpS doits) from the integrated genetic cargo that could destabilize TP1 14. E. coli MG1655NxR + TP1 14::fefS-Kill1 or 3 were then transformed with pCP20 by electroporation. pCP20 expresses the FLP recombinase which recognizes both the FRT site in the recombineering cassette and the FRT site in the genetic cargo, and recombines them together to knock out tetB, the attLbxb1 site, and the or/Vpscioits which constitutes the insertion device (Figure 22.A). Integrity of the inserts was assessed by PCR for TP1 14, TP1 14::tefS (Figure 22.B), TP1 14::tefS-KiN1 (Figure 22.C) and TP1 14::Kill1 (Figure 22. D) (data not shown for TP1 14::KiM3). The DROID technique could also rapidly be adapted for several applications such as chromosomal insertion of large constructs and assembly of very large plasmids or exogenous chromosomes.
Construction of the Cas9 test genetic cargo insertion devices. Two genetic cargos coding for cas9 and gRNA(s) were inserted in the transfer machinery using the DROID method. While assembly of such a large cluster of genes is usually complex and requires multiple steps, the DROID approach considerably simplifies the process. One of the most important aspects of the cas9-gRNA gene cluster is to design highly specific gRNAs. To design such gRNAs, DNA 2.0 (ATUM) web-based software was run with the chloramphenicol acetyl-transferase gene ( cat) as the target sequence and E. coli K-12’s genome as the off-target sequence. The most potent gRNA spacers (highest AG, lowest off-targets) were then run into BLASTn (Altschul et ai, 1990) against Enterobacteriaceae and EcN genomes to eliminate any candidate gRNAs with high off-targeting. This bioinformatic strategy identified three gRNAs (namely gRNA 1 , 2 and 3) (Figure 21.FG). The genetic cargo was introduced into the transfer machinery vector, in a two-step approach. To do so, a first plasmid was created by combining the cas9 gene from pKN02, the kanamycin resistance cassette from pKD4, and the or/Vpscioits from pGRG36 using corresponding primers (Table 2). gRNA 1 was amplified entirely from a previous construct (pKN02), and the gRNA 2 and 3 spacers were added directly in PCR primers. A first genetic cargo containing the cas9 gene with gRNA 1 (KilM) was generated. Another genetic cargo encoding the cas9 gene with gRNA 1 , 2 and 3 (KNI3) was also obtained. For genetic cargo Kill3, assembly tags were placed between each gRNAs to prevent miss-assembly due to the repetitive nature of the gene locus. Other fragment junctions facilitated the addition of short sequences in the primer tail such as an FRT site, between gRNA 1 and the oriVpS cions, and the attPbx bisite, between oriVpS cions and cas9. Both genetic cargo insertion devices were constructed by Gibson assembly. All gRNA sequences
were confirmed by Sanger sequencing (data not shown). Complete plasmid maps of KNI1 and KΪII3 are shown in Figures 21.A and 21.B respectively.
In vitro conjugation assay. In vitro conjugation experiments were performed as described in the example I Material and method section.
Generation of chloramphenicol resistant Citrobacter rodentium. Details about the use of pGRG36 and Tn7 mediated insertion of DNA is provided in the material and method section of Example I and in Figure 1. Specifically, integration plasmid pGRG36-SmCm was transformed in C. rodentium DBS100 by electroporation. To mediate cassette insertion into glmS gene’s terminator, C. rodentium DBS100 was first cultivated at 30°C in LB with arabinose until OD60onm 0.6. Then, cells were heat-shocked at 42°C for 1 hour and incubated at 37°C overnight to allow for plasmid clearance. Cells were then streaked onto a LB agar plate selecting only the insert. Plasmid curation and insertion of the cassette in the genome was confirmed using the primers 0GSC6-F, 0GSC6-R, OGSC5-F, opir3-F and opir3-R from Table 2.
Construction and test of pNA22, pNA23 and pNA24. The pNA22 to 24 plasmid suite was designed to delimit the minimal DNA sequence responsible for the replication of TP1 14. Replication initiation sequences are usually located near the repA gene previously identified in TP1 14 as TP1 14-083 (see Example II) (Praskier et ai, 2005). In order to isolate the minimal DNA sequence for replication, repA and a 1 ,000-bp region located either upstream or downstream of repA , or both, were cloned into pKD3 by Gibson assembly to yield pNA22 to pNA24. Since pKD3’s replication is pir- dependent, the constructions were transformed into chemically competent E. coli EC100D pir+ (Metcalf et ai, 1994). Then, functionality of the replication origin from TP1 14 was tested by transformation in E. coli BW251 13.
Generation of pir+ EcN (KN05) strain. The pir gene was amplified from EC100Dp/r+ and assembled with rrnB terminator from pFG018 and a Smal + Xhol digested pGRG36 backbone. The resulting plasmid pGRG -pir+ (Kvitko et ai, 2012) was then transformed in MFDp/r+ (Ferrieres et ai , 2010) and mobilized towards EcN by RP4 mediated conjugation. To induce pir insertion into glmS gene’s terminator, EcN was first cultivated at 30°C in LB with arabinose until 0.6 OD60onm· Then, cells were heat-shocked at 42°C for 1 hour and incubated at 37°C overnight to allow for plasmid clearance. Plasmid curation was tested by streaking >20 colonies on plates with or without ampicillin to select for pGRG36’s backbone. The clones for which plasmid curation was confirmed were then screened by PCR for pir insertion using the respective primers presented in Table 2.
Host constrained replication of the transfer machinery. A cassette coding for cat and oriVR6 K was amplified from pKD3 and used to replace TP1 14-083’s CDS (repA) in an E. coli
EC100Dp/r+ strain. Recombinant clones were screened by PCR using the corresponding primers from Table 2. Replication of the resulting plasmid TP1 14Arep/A::caf-or/VR6K should be dependent on a pir gene located in E. coli ECl OODp/'r+’s chromosome pir- dependent replication of TP1 14Arep/A::caf-or/VR6K was verified by conjugative transfer in pir+ (KN05 and EC100Dp/r+) or pir- hosts (KN01).
In silico oriTTpiu prediction. The origin of transfer (or/7) is usually located near the promoter of the nickase in conjugative and mobilizable plasmids. In TP1 14, a large intergenic region is located near the promoter of nikAB (TP1 14-041 and TP1 14-042) that encodes the predicted nickase proteins. In silico analysis of the potential oriT of TP1 14 (or/TTpi i 4) sequence was first compared to other Incl2 plasmids listed in Table 5 using BLASTn to find highly conserved regions in the potential or/TTpn . This allowed to narrow down or/TTpn to a 138- bp core sequence. This core or/TTp114 was then compared to the minimal or/TR64 (Furuya and Komano, 2000) by Pairwise Sequence Alignment using EMBOSS Needle web-based software (Rice et al., 2000). The alignment was performed using default settings (a cost of 10 for gaps creation, 0.5 for extension). Results were then manually annotated to indicate the positions of important repeats, and putative binding and nicking sites.
Construction ofpNAOI and derivatives. The whole intergenic region comprising the predicted or/TTpn was amplified, cloned, and assembled with the broad host-range oriVPB BRI from pSIM7, and tetB from pFG018, using Gibson assembly hereby creating pNAOl An alternative plasmid (pNA02) contained a 7-bp deletion centered on the predicted nicking site of or/TTpn sequence and was assembled with the same backbone as pNA01 . A kanamycin resistant variant of pNA01 and pNA02 (pKN30 and pKN31 respectively) was generated by amplifying the backbone of pNA01 , or pNA02, and by amplifying the kanamycin resistance gene from pKD4. PCR fragments were then purified and assembled together using Gibson assembly.
Deletion of oriTTpiu. An FRT-flanked cat cassette was amplified from pKD3 to delete, by recombineering, a portion of the predicted or/TTp114 comprising the nicking site. Deletion clones were verified by PCR and were then used in conjugation experiments to assess the impact of the deletion on transfer frequency.
Statistical analysis. Statistical significance was performed as described in the material and method section of the Example I.
3.0 - Conformations of the conjugative delivery system
Variations in the genetic cargo’s delivery mode. The genetic cargo nucleic acid molecule can be delivered using different approaches, each with their advantages and potential inconvenients. In this section, COP delivery modes will be explored. First, the COP can be
decomposed in several component as shown in figure 23.A. The COP is first composed of a probiotic bacterium as well as a conjugative delivery system. The conjugative delivery system is itself composed of the transfer machinery and the genetic cargo which themselves contains functional gene modules. As such, the conjugative delivery system can be decomposed into several vectors and can be partially integrated in the chromosome of the probiotic donor. A first strategy to mobilize the genetic cargo is to insert it into the transfer machinery to form a single vector which can autonomously propagate in a population of bacterial cells. While this method can be desirable for applications where maximal conjugal transfer is required, it can also lead to persistence of the conjugative delivery system in the microbiota. Therefore, alternative methods of propagation that provide a transient expression of the genetic cargo should also be considered. For example, it can be necessary for the genetic cargo to be expressed at different levels through time or for a short period of time (e.g. expression of insulin stimulating peptides like GLP-1). Alternatively, if the genetic cargo mediates a negative effect on a bacterial population, its persistence in the environment could lead to the emergence of resistant bacteria. Examples of such alternative method includes constrained cis mobilization and in trans mobilization which are presented in Figure 23. B and are discussed in the next sub-sections.
3.1 - Genetic cargo delivery by cis mobilization
Cis mobilization as a potent conjugative delivery system mode. Cis mobilization is a delivery mode in which both the transfer machinery and the genetic cargo are found on the same vector. In this setup, the Conjugative Delivery System is transferred to the recipient bacterium, which in turn becomes a new donor. Through this process, the conjugative delivery system and the genetic cargo are transferred at an exponential rate, since one donor bacterium can create multiple donor cells and trigger a chain reaction. Due to this exponential diffusion, cis mobilization is theoretically the most efficient delivery mode to propagate the genetic cargo within a bacteria population, but it also leads to conjugative delivery system persistence in the environment. One of the main challenges of the cis- mobilization strategy is to link the Transfer machinery and the genetic cargo on a same vector.
DROID is a potent method for the fusion of DNA molecules. Several methods can be employed to mediate the association of two DNA molecules, for example Gibson assembly, digestion-ligation, Golden Gate, USER-cloning and other derivatives. One alternative is to use recombination-based techniques such as Recombineering, Gene doctoring, or NO SCAR. However, although those techniques can easily delete large DNA fragments, they are limited by the length of the DNA molecule that can be inserted into a specific location. The DROID method presents several advantages over existing methods as it can be done very
quickly and its use of the serine-integrases makes it highly specific to orthogonal insertion sites. DROID allows easy insertion of large DNA molecule that carry complex gene clusters since the fusion of both DNA molecules together is independent of their size. After the fusion of the DNA molecules, undesirable DNA sequences can be easily removed through the action of a second recombinase that excises specific DNA fragments. The DROID method leaves signature scars at either end of the insertion site (one FRT site and one attRBx M site) which are not homologous to one another. This prevents recombination between scars that could knock-out the inserted DNA molecule.
Functional test of the genetic cargo insertion devices. Two genetic cargo insertion devices were constructed to test the DROID method. Since in this example the genetic cargo is composed of a cas9-gRNA complex targeting the cat gene, a regular transformation of KilM or Kill3 insertion devices, in a cell containing a plasmid bearing the cat gene (pT) should lead to plasmid loss. Therefore, as a control to validate that the genetic cargos were functional, the genetic cargo insertion devices were first directly transformed in a cell containing the pT plasmid. Transformation efficiency was found to be significantly higher when selecting only for the genetic cargo insertion devices instead of both the insertion device and the pT plasmid simultaneously (Figure 24). Furthermore, since pT codes for GFP (Figure 21. F), its presence in the cell produces a fluorescent signal that can easily be visualized under blue light. The CFUs from the transformation of both KilM and KNI3 insertion devices were all GFP negative, showing clearance of pT within the transformants. This indicated that both genetic cargos were functional in their insertion devices.
Insertion of two test genetic cargos in the transfer machinery by DROID. The DROID method can be divided in three simple steps (Figure 22.A). The first step is to insert the loading dock into the target DNA molecule by recombineering (Figure 22. B). This step is only required once. The resulting plasmid can then be used to insert several different genetic cargos using the same approach. In the present example, the loading dock is a fragment of pREC1 that encodes for FRT-fefS-affSBxbiand it was inserted in TP1 14, replacing the aph-lll gene and generating TP1 14::fefS. The second step requires both the genetic cargo insertion device and the plasmid pBXB1 to be cloned in the same cell. Two genetic cargo insertion devices were used to test the DROID method: KilM (Figure 21. A) and Kill3 (Figure 21. B). TP1 14 v.tetB was transferred by conjugation towards E. coli EC100D pir+ containing pBXB1 and KilM or 3. Then, TP1 14::fefS-Kill1 or 3 (Figure 22. C) were transferred towards E. coli MG1655NxR. The last step of the method is to transform E. coli MG1655NxR + TP1 14::fefS-KiN1 or 3 with pCP20. This step enables the removal of tetB, affLBxbi and oriVpS Cioits (Figure 22. D). The final conjugative delivery systems, TP1 14::Kill1 and TP1 14::KiM3, were transferred into KN01 and KN0'\AdapA. The whole process from step 1 to 3 requires 1 1 days to complete, but step 1 is
not required each time. This thus reduces the time to perform a clean insertion of the genetic cargo to 7 days.
Cis mobilization of genetic cargos in different cell types. To test whether the genetic cargos were functional after DROID into the transfer machinery (TP1 14), TP1 14::Kill1 was used as a model. The conjugative delivery system TP1 14::Kill1 was transferred into a 1 :1 mix of four different recipient cells: KN02, E. coli MG1655NxR, Enterobacter aerogenes, and Salmonella typhimurium. In this mix, only KN02 is resistant to chloramphenicol and therefore, only KN02 should be targeted by the cas9-gRNA genetic cargo. The efficiency of TP114::Kill1 was evaluated through COP subjected cell survival ratios. These COP-subjected cell survival ratios represent the proportion of cells that received the conjugation delivery system and survived. The COP-subjected cell survival ratios were calculated by dividing the conjugation efficiency of the TP1 14::Kill1 (cells that survived the conjugative delivery system) by the conjugation efficiency of TP1 14 (total cells that should have received a conjugative delivery system). As expected, only KN02 was eliminated by the conjugative delivery system (Figure 25.A). To confirm that the killing was not due to off-targeting in KN02’s genome, a second mix of four recipients was prepared (Citrobacter rodentium KN04, E. coli MG1655NxR, KN03, Salmonella typhimurium). In this mix, only C. rodentium KN04 is resistant to chloramphenicol due to an insertion of the cat gene in its chromosome. The COP subjected cell survival ratios was again used to assess TP1 14::KillTs efficiency. As predicted, only C. rodentium KN04 was affected by the COP system, thereby showing the cas9-gRNA genetic cargo could be expressed in different cells and was highly specific to the cat gene (Figure 25. B).
The DROID method successfully fused the transfer machinery and the genetic cargo in a single vector. In the present example, it was shown that the DROID method could be used to join the transfer machinery with the genetic cargo in three steps. These steps were designed to avoid functional redundancy between the genetic cargo and the transfer machinery. The FRT recombination step was needed to eliminate the selection module from the transfer machinery and the maintenance module from the genetic cargo thereby creating a single DNA molecule with a single selection and vegetative replication module. The present conjugative delivery system was then used in a proof of concept of genetic cargo delivery towards several representatives of Enterobacteriaceae, a taxonomic group of bacteria that often are the cause of antibiotics resistant enteric and urinary infections.
3.2 - Genetic cargo delivery by constrained cis-mobilization
Constrained cis-mobilization as a biosafety measure. A constrained c/s-mobilization system consists of a conjugative delivery system in which the genetic cargo and transfer machinery are located on the same DNA molecule. However, in such a system, the essential replication
initiation gene from the maintenance module is inserted in the chromosome of the donor strain (Figure 23. B). This makes replication of the conjugative delivery system dependent on the COP chromosome. Such a system would still be able to propagate at an exponential rate since the recipient strain would be able to express the genes from the conjugative delivery system. However, the conjugative delivery system will eventually be lost since the system cannot replicate in the recipient bacteria. Therefore, while the system exhibits high conjugative efficiency, it would not persist in the environment, thereby allowing for better control over administration dynamics and better biosafety level.
Localization of oriVTpiu· The first step toward constrained replication is to confirm the localization of the origin of replication of TP1 14 (or/VT P114) since constrained c/s-mobilization requires to relocate repA to the donor cell chromosome. Conjugative plasmid replication is usually initiated by a plasmid-encoded protein called RepA. This protein recruits the DNA replication machinery from the host cell and allows for the replication of plasmid DNA. The replication initiation protein encoded by TP1 14 was predicted by in silico analysis to be encoded by the repA gene TP1 14-083 (see example II). In most conjugative plasmids, the oriV can be found in an intergenic region near the repA gene. However, in TP1 14, repA is flanked by two intergenic regions. It was unclear as to which one is oriVT P114· Consequently, repA was cloned with either 1 ,000-bp upstream, downstream or 1 ,000-bp on both sides of the gene into a plasmid backbone. The plasmid backbone contains oriVR6 K, an origin of replication that is dependent on a chromosomally integrated pir gene. Assemblies were first verified in a pir+ strain E. coli EC100D pir+ and then cloned in a pir- strain BW251 13 for oriVT functionality analysis (Figure 26). From the three plasmids tested, only one could replicate in a pir- host. This plasmid contained repA and 1 ,000 bp on both sides of the gene. This result indicated that TP1 14-083 encoded the actual RepA protein and that oriVT P114 is located near the repA gene.
Constrained cis mobilization of TP114. Constrained c/s-mobilization refers to a conjugative delivery system in which the transfer machinery and the genetic cargo are located on a same vector that replicates under the control of a chromosomally encoded replication protein. In order to develop constrained c/s-mobilization, a characterization of the replication protein and its associated or/V is required. While or/VT P114 and its repA gene were localized, several other factors have to be taken in consideration for the selection of the replication protein and oriV pair. For instance, the replication protein must be rarely found in the chromosome of potential recipient cells to avoid maintenance of the conjugative delivery system in the environment. Also, to reduce incompatibility based rejection of the conjugative delivery system, it can be desirable to use a replication system heterologous to the one of TP1 14. One of the most studied and understood replication system is R6K’s pir- dependent oriVR6K. The Pi protein,
encoded by the pir gene, is naturally found in conjugative plasmids from the IncX family and present the advantage of not being found frequently in bacterial chromosomes. A pir integration system based on the Tn7 integration plasmid pGRG36 was developed and a pir+ EcN strain KN05 was generated. The TP1 14-083 ( repA ) was replaced by the oriVR6 K and a chloramphenicol resistance cassette ( cat) using the commercially available E. coli EC100D pir+ cloning strain generating TP1 14A/-epA::caf-o/-/\/R6K. This plasmid was then transferred into KN05 and KN01 to evaluate the capacity of TP1 14ArepA::caf-or/VR6K to replicate in a pir+ and pir- strain respectively. The conjugation efficiency was compared to wild-type TP1 14 under the same condition (Figure 27.A). As expected, no transconjugants could be retrieved from the transfer of TP1 14ArepA::caf-or/VR6K towards KN01 {pir - strain), but transconjugants were obtained for the conjugation towards KN05 (pir+ strain). However, the conjugation efficiency of TP'\ '\4ArepA::cat-oriVR6K \Nas lower than that of wild type TP1 14. A second conjugation experiment was attempted, this time using KN05 as the donor strain and E. coli EC100D pir+ as the recipient (Figure 27. B). The conjugation efficiency of both plasmids was then equivalent, thereby showing in this case that constrained c/s-mobilization had very little impact on conjugation efficiency. Using such maintenance module, plasmid transfer towards pir- strain should remain unaffected, but replication should be impossible in the recipient cell. This means that expression of the genetic cargo’s payload and the transfer machinery can still be achieved in pir- strains, but this expression is transient. Such transitory expression is desirable in several situations where constant expression can be detrimental to the COP effect, to the environment, or to the subject. Constrained c/s-mobilization is therefore an efficient way to prevent plasmid persistence while still maintaining high conjugation efficiency.
3.3 - Genetic cargo delivery by In trans mobilization
In trans mobilization as a delivery mode for the Genetic Cargo. In trans mobilization is achieved by using a vector system in which the transfer machinery and the genetic cargo are present on two different DNA molecules (Figure 23. B). In this conformation, the genetic cargo must contain a transport module that is compatible with the transfer machinery’s vegetative replication module. The transfer machinery can be immobilized either by insertion in the chromosome or disruption of its own transport module (or/T). This mode of delivery therefore allows the genetic cargo to be mobilized in trans (on a different DNA molecule independent from the transfer machinery) towards a recipient bacterium. The choice of vegetative replication module for the genetic cargo can have great importance over its persistence in the environment. This mode of delivery allows for maximal biocontainment, but more modest transfer efficiency as recipient bacteria cannot propagate the genetic cargo.
Localization of TP114’s origin of transfer (oriT). All mobilizable plasmids are dependent over the recognition of their oriT by a relaxosome to enable the conjugation process to occur. The relaxosome is a multi-heteromeric protein complex that recognizes and binds to specific sequences on the oriT and cleaves a single strand of DNA at a specific location named the nicking site. The single strand of DNA is then guided to the T4SS, and transferred through the T4SS to the recipient cell. Mobilizable plasmids that do not encode a T4SS can use the T4SS encoded by another conjugative plasmid. However, mobilizable plasmids often encode their own relaxosome which are specific to their own oriT sequence. The oriT is therefore one of the most important sequence for genetic cargo transfer by the transfer machinery. Previous examples presented here exploited a physical link between the genetic cargo and the transfer machinery to use the oriT sequence from the transfer machinery in cis for DNA transfer. However, as observed with mobilizable plasmids, it is possible to relocate the oriT sequence from the transfer machinery to the genetic cargo nucleic acid molecule to only mobilize, in trans, the genetic cargo to the recipient cell. Importantly, or/TTp114 first needed to be identified. Based on other conjugative plasmids topology, the oriT sequence is often located near the nickase gene, a subunit of the relaxosome. On TP1 14, the nickase is predicted to be TP1 14-041 (nikB), a core gene that shares the same protein family pfam03432 domain as previously identified nickases. In TP1 14, a 368-bp intergenic region with two diverging genes is located near the promoter region of nikB and represented a potential oriT.
Construction of in trans-mobilizable vectors pNA01. The 368 bp predicted or/TTp114 was cloned into a broad-host-range plasmid backbone containing a vegetative replication module (or/VpBBR-i) and a payload module (tetM). Genetic cargo pNA01 was transformed in KN0'\AdapA containing TP1 14 and then mobilized towards KN02 (Figure 28.A). Conjugation efficiency of pNA01 was comparable to that of TP114, confirming that the 368 bp DNA sequence cloned into pNA01 is or/TTpn .
Identification of TP114’s nicking site. In trans mobilization efficiency could be slightly perfected by immobilization of TP1 14, either by the deletion of the oriT or by integration in the donor’s chromosome. Immobilization of the transfer machinery would prevent its spreading in target bacteria allowing for better biosafety by limiting the persistence of the transfer machinery in the environment. Also, an immobilized transfer machinery can still mobilize the genetic cargo in trans which allows the transfer of only the genes that need to be expressed in the recipients. However, or/T P114 is located in an intergenic region between two diverging genes. This means that this region contains two unannotated promoters, one of which is responsible for the expression of the essential nickase gene nikB. Precise deletion is therefore required to avoid possible impairment of nikB expression of. Sequence
comparison of o/7TTP114 with other plasmids from the Incl2 family (Table 5) provided further information over sequence conservation. Among the tested plasmids, pChi7122-3 only showed homology for the first 138 bp of o/7TTP114 sequence. It is therefore most likely that both the nikB promoter, and the nicking site, could be located in this portion of or/T P114. Bacterial promoters are usually composed of a -10 and -35 box and can optionally contain operator sequences. As nikB is located upstream of or/TTpn and since a -10 box motif was found at position 13, the promoter of nikB is most likely located in the first 100 bp of or/T P114. In the Inch model plasmid R64, the core or/TR64 sequence was determined to be 92-bp long and the nicking site is located at position 77 at a highly conserved guanine. A Pairwise Sequence Alignment performed using EMBOSS Needle software (Rice et al., 2000) revealed 36% homology between the 138 first base pair of or/T P114 and the minimal or/TR64 (Figure 29). This alignment shows that important repeats, and NikA’s binding site, were relatively well conserved between R64 and TP1 14. Using the position of these motifs in TP1 14, the nicking site was predicted to be G at position 124. To validate that the nicking site really was located at position 124, a 7-bp deletion from position 121 to 127 was performed, then cloned into a second vector, namely pNA02. Genetic cargo pNA02 is identical to pNA01 except for the 7-bp deletion and should therefore not be mobilized by TP1 14. In trans- mobilization of pNA02 was tested from KN01 containing TP1 14 to KN02 exactly as performed for pNA01. However, in trans mobilization was virtually abolished by the 7-bp deletion in pNA02’s or/TTp114(Figure 28. B). This indicates that TP1 14’s nicking site is located in this region.
Deletion of oriTTpiu from the transfer machinery of TP114. As discussed above, the mutation of the oriT can be performed to immobilize a conjugative plasmid in a donor strain. In trans mobilization can benefit from this immobilization, otherwise, the transfer machinery and the genetic cargo will compete for transfer through the T4SS. The immobilization of TP1 14 was performed by recombineering-mediated insertion of a FRT flanked cat cassette creating a 138-bp deletion in o/7TTP114 spanning position 1 16 to 254. This deletion covers the nicking site and should prevent the recognition of or/TTP114, and hence the transfer machinery conjugation, while at the same time leaving the expression of nikB unaffected. TP1 14DOG/T: :cat-tetB was generated in MG1655NxR and transferred in KN01 by conjugation. Conjugation efficiency of ΊR 4AoήT:.q3ί-ίoίB was greatly impaired, but still yielded a few transconjugants (Figure 30.A). Such transfer allowed exclusion mechanisms to isolate the TR~\ '\4AoriT. -.cat-tetB from TP1 14 which could potentially still be present in the MG1655NxR strain. Therefore, a second test of conjugation efficiency was performed from KN01 towards MOIQddB^. This time, no transconjugants could be retrieved thereby showing the complete abolition of TP1 14’s transfer capabilities (Figure 30. B). Then, a kanamycin resistant derivative of pNA01 and pNA02, pKN30 and pKN31 respectively, were transformed into
KN01 + JP^ ^AhoriTv.cat-tetB. Using the biocontained transfer machinery, pKN30, and pKN31 were mobilized in trans towards MG1655NxR (Figure 31). Only pKN30 was able to transfer to MG1655NxR, thereby confirming that this confinement method is highly stringent. However, its conjugation efficiency was about 10-fold lower than TP1 14’s. This could be further improved by relocating the relaxosome-associated genes from the transfer machinery to the genetic cargo vector .In summary, in trans mobilization, as demonstrated in the present example, required the relaxosome to be expressed from the transfer machinery (TP1 14). The relaxosome then recognized or/TTp114 on pNA01 and mediated its mobilization through the T4SS. However, the 7-bp deletion of the nicking site in pNA02 prevented the recognition and processing of o/7TTP114 by the relaxosome thus impairing pNA02 transfer (Figure 32).
EXAMPLE IV -GENETIC CARGO DELIVERY AND APPLICATIONS
Strains, plasmids and growth conditions. All strains and plasmid used in this study are described in Table 1. All plasmid sequences are provided in the sequence appendix. Details about strains, plasmids, growth conditions, in vitro conjugation, feces and tissue processing, in vivo conjugation and statistical analysis are provided in the Material and Method section of the Example I.
Fluorescence measurements. To assess the efficacy of the gRNA-cas9 payloads at cutting a specific chromosomal DNA sequence, or a plasmidic DNA sequence, two types of bacterial targets were used. The first one, KN02, possesses a gene coding for a green fluorescent protein (Shaner et al. , 2013) (NeonGreen™) integrated within its genome. The second one, KN03, carries a pT plasmid bearing a chloramphenicol resistance gene and a gfp gene (Ormo et al., 1996). Fluorescence was used as a proxy to measure the integrity of the bacterial genome of KN02 or of plasmid pT in KN03. Briefly, as long as the pT plasmid carrying the chloramphenicol resistance gene is devoid of any cut, it can be maintained in the bacterial host and GFP is expressed. As soon as the CRISPR-Cas9 system cuts the chloramphenicol resistance gene, pT is lost, thereby leading to loss of GFP fluorescence. To determine the efficiency of the COP system to cure pT, the number of green fluorescent and non-fluorescent colonies were counted on recipient and transconjugants selecting plates. Fluorescence was measured using a Typhoon FLA 9500 and images were analyzed using ImageFiji software.
Cell fluorescence induction. To limit possible negative effects of high levels fluorescent proteins on the fitness of the target bacteria, the genes coding for the fluorescent proteins were under the control of inducible promoters. In both genomic and plasmidic targets the fluorescent signal was inducible. The gfp gene from pT is under the repression of AraC, a
protein which action is inhibited by arabinose (Guzman et ai, 1995). gfp on pT is therefore inducible in the presence of 1 % arabinose. In the case of the KN02 cells (chromosomal target), fluorescence is mediated by NeonGreen. E. coli KN02’s NeonGreen gene is inducible with 1 mM IPTG (Lutz et ai, 1997). NeonGreen emits green fluorescence with similar absorption and emission spectrum to GFP, but with brighter fluorescence (Shaner et ai, 2013). Both plasmidic and genomic targets were confirmed for fluorescence emission under a transilluminator using blue light and in a cell sorter (FACSJazz) (data not shown).
Colony photography. To follow pT loss during both in vitro and in vivo experiments, colony fluorescence was detected using a Typhoon FLA 9500 on LB agar plates supplemented with 1 % arabinose. To do so, two images were taken; one for the detection of GFP used a Low Pass Blue filter and a 473 nm laser, the other one for the BrightField image used a Low Pass Red filter and a 635 nm laser. Those two images were merged using ImageFiji software (Schindelin et ai, 2012). Then, the green fluorescent and non-fluorescent colonies were manually counted.
Mortality rate evaluation by FACS. For in vitro experiments where the target was genomic, the mortality rate was investigated using a live/dead approach. Live and dead bacteria were discriminated using propidium iodide (PI) staining. Typically, PI is used to detect dead cells since it can only penetrate through compromised membranes (Davey, 201 1). KN02’s fluorescence was induced throughout the in vitro COP treatment. To detect dead cells, cells were stained with 30 mM PI in NaCI 0.85% for 15 minutes in the dark. Then, cells were washed twice and resuspended in NaCI 0.85% at a density allowing for rapid and accurate cell detection. Green and red fluorescence was immediately evaluated on 100 000 cells per sample using a FACSJazz cytometer.
4.1 - In vivo use of the COP system to transfer a payload with beneficial effects on the recipient bacterium
The COP system can be used to deliver a genetic cargo beneficial to a target bacterium. Such a genetic cargo could encode genes that provide an advantageous phenotype to the recipient bacteria. Providing a phenotypic advantage to certain bacterial species could help rebalance a disturbed microbiota by helping under-represented species to proliferate. For example, this could be achieved by transferring genes allowing the use of a new carbon source, such as lactose. Providing lactose degradation enzymes would benefit the bacteria, but also the subject if lactose intolerant. Another example of beneficial genes that could be contained in the payload would be antibiotics resistance genes. Providing that certain bacteria needs to be enriched in a certain microbiota, transferring a resistance gene to these bacteria before antibiotics treatment would greatly enrich their population.
The COP system can transfer a payload providing beneficial phenotypic traits to target bacteria in vivo. The COP was tested for its ability to transfer a genetic cargo providing a beneficial phenotypic trait to target bacteria, and this, in the gut environment. To do so, the in vivo conjugation mouse model was used with KN01 + TP1 14 as a donor to provide the kanamycin resistance gene to the KN02 recipient bacteria. The COP achieved high level of genetic cargo transfer within the first two days of the experiment as KN02 bacteria became resistant to kanamycin (Figure 33. A). The results obtained were consistent between the caecum and feces (Figure 33. B). Most of the recipient population gained the ability to resist kanamycin throughout the experiment, hereby showing that the COP could be used to transfer and deliver beneficial phenotypic traits to bacteria in vivo.
4.2 - In vivo use of the COP to transfer a payload detrimental for the recipient bacterium
The COP system can be used to deliver the CRISPR-Cas9 system as a payload in vivo. In this example, the COP is used in the gut environment to deliver the CRISPR-cas9 system to inactivate specific genes into target bacteria. For this demonstration, the COP was composed of the probiotic EcN, (Figure 34.A) with a conjugative delivery system having the transfer machinery and the genetic cargo located on the same vector (see cis mobilization in example III) (Figure 34. B). In this configuration, because the transfer machinery and the genetic cargo are linked together, they are both transferred into the target bacterium which becomes a donor contributing to the exponential spreading of the cargo within the local microbiota. Once in a target recipient cell, the Cas9-gRNA payload is expressed and scans the genome for the specific target sequence determined by the tunable gRNA spacer’s sequence. When a target sequence is detected, Cas9 catalyzes a double-stranded break into the cell’s DNA (Figure 34. C). The construction of the conjugative delivery system is detailed in example 3.1. Briefly, the genetic cargo and transfer machinery were linked together using the DROID method. The genetic cargo’s payload was composed of the cas9 gene and one (KilM) or three (KNI3) gRNAs. For this proof of concept, the gRNAs from the genetic cargos were designed to bind specifically to the chloramphenicol resistance gene (cat). The cat target sequence can be naturally found on a plasmid or on the chromosome of target bacteria. When cat is located on the chromosome of the target strain, the target cells were killed as a consequence of the double-strand breaks caused by CRISPR-cas9. On the other hand, when cat is located on a plasmid, the double-strand breaks induced by CRISPR- cas9 promoted the degradation of the plasmid and the cells were“disarmed” (Figure 34. D).
4.2.1 - In vivo use of the COP to deliver a payload that sensitizes bacteria to antibiotics.
The COP can be used to deliver a payload that eliminates a phenotypic trait in target bacteria. One way the COPs can have a detrimental effect mediated by the transfer of its
genetic cargo is by hindering the capacity of the target bacteria to express a given phenotype. One of the most infamous phenotype is the resistance to antibiotics. It could therefore be desirable to design a COP that induces the loss of antibiotic resistance genes in a bacterial population. The use of programmable endonuclease (e. g. cas9) holds great promise for the targeting and elimination of antibiotic resistance genes responsible for the emergence of extremely resistant pathogenic bacteria. Antibiotic resistance genes can be found on both bacterial chromosomes and plasmids. When those genes are located on a horizontally transferable genetic element, like conjugative and mobilizable plasmids, they are more problematic. In this particular scenario, the resistance phenotype is transferable to other species of bacteria. The use of the COP bearing CRISPR-Cas9 allows the targeting of these resistance genes and the disarmament of bacteria bearing such mobile genetic elements in vivo.
The COP can deliver a payload that cures antibiotic resistance plasmids in vivo. The COP was used during an in vivo conjugation experiment to test whether it could deliver a payload in order to eliminate antibiotic resistance plasmids from target bacteria. To do so, the COP was tested for its ability to deliver a payload capable of eliminating pT from a commensal E. coli strain in the murine intestinal tract. KN03 bearing pT was introduced in mice 12 hours prior to KN01 bearing TP1 14 or KN01 bearing TP1 14::Kill1. The presence of pT in the recipient cells was monitored by fluorescence on selective LB plates. A single administration of the COP system could clear as much as 73 % of the target plasmid throughout the four days of the experiment (Figures 35.A and B). No plasmid loss was observed in the control group (KN01 + TP1 14) and all transconjugants of the COP system were devoid of the target plasmid (Figure 35. C). Results from feces samples were consistent with frequencies found in the caecum of the mice at the end of the experiment (Figure 35. D). These results indicated that it was possible to specifically cure a resistance plasmid from a cell in vivo in the gut microbiota using the COP system to deliver a payload containing the CRISPR-cas9, hereby causing the loss of antibiotic resistance phenotype.
42.2 - In vivo use of the COP system to deliver a payload in order to selectively kill bacteria.
The COP system can be used as an alternative to conventional antibiotic drugs. The use of CRISPR-Cas9 to target and eliminate specific bacteria in a mixed community is an application of great interest since the emergence of antibiotic resistance threatens our ability to treat bacterial infections. Using TP1 14::Kill1 , the COP can target chromosomal sequences into pathogenic bacteria’s genomes to eliminate them. Since most bacteria are unable to perform Non-Homologous End Joining (NHEJ), a Cas9 mediated blunt double-stranded cut in their chromosome can result in death. In this in vivo example, a specific strain of bacteria is successfully targeted and eliminated from the community without affecting other closely
related strains. To test the ability of the COP to deliver a payload to eliminate specific bacteria from a complex bacterial community, a set of closely related target and non-target strains was required. For these experiments, KN01 MapA was used as the donor strain, KN02, which carries a targeted chromosomal cat gene was used as the target strain, whereas KN03, which carries a chromosomal tetB gene instead of cat was used as the nontarget strain.
4.2.2.1 - Prophylactic use of the COP system to selectively kill bacteria in vivo.
Applications of prophylactic use for the COP TP114::Kill1 system. The prophylactic treatment implies a daily uptake of the COP to avoid infections or the spreading of unwanted bacterial strain. This type of treatment could be useful in situations where a subject is susceptible to be exposed to a certain type of bacterial infection, e.g. when traveling, or while recovering from an antibiotic treatment. Prophylactic use of COPs could also improve health by targeting antibiotic resistant bacteria thereby lowering the chances of antibiotic resistant infections. A mouse model for the investigation of such prophylactic treatment was designed. Mice were first fed with the COP system bearing TP1 14::KMI1 or TP1 14 (as a control), and then were subjected 12 hours later to a 1 :1 mix of target and non-target bacteria (respectively KN02 and KN03). The respective abundance of the target and the non-target bacteria was then followed over four days in feces (Figure 36. A).
The COP system can be administered prophylactically to deliver a payload that specifically eliminates an invading target bacterium in vivo. The murine prophylactic treatment was followed strictly using COP KN01 + TP1 14::KMI1 or KN01 + TP1 14 (control placebo treatment). The COP prophylactic treatment resulted in as much as 13-fold specific decrease of target strain’s abundance as compared with the non-target strain, and this, in a single day. A clear drop in raw CFUs count of the target strain was observed with only a single dose of COP KN01 bearing TP1 14::KMI1 prophylactic treatment (compare Figure 36. B and 36. D). A competitive ratio was calculated between the target strain and the non-target strain by comparing the CFU count between the two strains. For example, the target strain competitive ratio was calculated by dividing the CFU count of the target strain by that of the total recipient strains CFU. The equivalent was done to calculate the non-target strain competitive ratio. Competitive ratio compares the fitness of the two strains to highlight the impact of the COP system. The strain competitive ratios were consistent with raw CFUs with a clear difference between the recipient strains at days 2, 3 and 4 for the COP KN01 bearing TP1 14::KMI1 treatment, but not the control (compare Figure 36. C and 36. E). The difference in strain colonization suggests that the COP technology can efficiently and specifically eliminate the target strain in vivo, allowing the non-target strain to thrive. This showed that the COP
system could be used prophylactically to deliver a payload in vivo in order to specifically prevent pathogenic bacteria to invade the gut microbiome.
4.2.2.2 - Therapeutic use of the COP system to selectively kill bacteria in vivo.
Examples of therapeutic uses for the COP in vivo. The therapeutic use of the COP implies that the administration of COP occurs after the detection of a given pathogenic bacteria. The COP is administered to eliminate or inactivate the pathogenic bacteria. This type of treatment could be useful in situations where it is preferable to induce mortality in specific pathogen cells. In fact, the high specificity of the COP system would prevent the establishment of opportunist pathogens and would limit the apparition of pathologies such as antibiotic- associated diarrhea. Furthermore, treatment of a subject could be preferable in situations where the target bacteria are resistant to all known antibiotics. Nonetheless, high specificity of the COP could allow precise engineering of the gut microbiota in dysbiosis-affected subjects by efficiently targeting over-represented species in an individual. A mouse model for the investigation of the efficacy of such therapeutic treatment was devised. The mice were first fed with a 1 :1 mix of target and non-target bacteria (respectively KN02 and KN03) and 12 hours later, were treated with the COP containing either the TP1 14::Kill1 system or TP1 14 (control). The abundance of target and non-target bacteria was followed in feces over four days (Figure 37. A).
The COPs can deliver a payload to eliminate a target bacterium in vivo. The murine therapeutic treatment was followed strictly using COP KN01 bearing TP1 14::KMI1 or KN01 bearing TP1 14 (control placebo treatment). A single dose of COP therapeutic treatment yielded outstanding results with as much as 2,1 16-fold diminution of target strain as compared with the non-target strain within just two days. Significant diminution in raw CFUs was observed on days 2, 3 and 4 with COP KN01 bearing TP1 14: :KMI1 -treated mice but not with placebo-treated mice (compare Figure 37. B and 37. D). Target and non-target competitive indexes (as calculated in the prophylactic treatment section) showed clear dominance of the non-target strain for COP KN01 bearing TP1 14::KMI1 treated mice, but not for the control group (compare Figure 37. C and 37. E). This showed that the COP technology could be used therapeutically to deliver a payload specifically designed to eliminate a bacterium in vivo.
4.3 - In vivo use of the COP system for the transfer of a payload with mixed effects on bacterial populations
Utilization of a payload with beneficial and detrimental effects on the recipient bacteria. Using TP1 14::KMI3 delivery system, it is possible to exploit the transfer of both beneficial gene (kanamycin resistance) and detrimental gene (Cas9-gRNA) in a same genetic cargo to
manipulate a population of bacteria. For instance, by transferring the kanamycin resistance gene of TP1 14::KiM3 into target bacteria and exposing those cells to kanamycin, a selective pressure towards the acquisition of the genetic cargo occurs, hereby forcing the recipient bacteria to be subjected to the effects of CRISPR-Cas9. The plasmid pT that contains the target gene cat and expresses GFP was used to demonstrate this approach (Figure 21. E). Since TP1 14 transfers at frequencies around 1 % on solid medium in vitro, CRISPR-Cas9 can potentially mediate plasmid loss in about 1 % of the population without antibiotic selection. TP1 14::KiM3 transfer was measured towards KN03 containing pT by selecting either TP1 14:KiM3 only or both TP1 14::KiM3 and pT. As a control, TP1 14 was transferred to KN03 bearing pT to verify that any plasmid curation was not resulting from plasmid incompatibility (Figure 38.A). The transfer frequency was found to be significantly lower when selecting for both plasmids only with TP1 14::KiM3 but not with TP1 14, thereby suggesting that plasmid curation is due to the gRNA-Cas9 system present on TP1 14::KiM3. Using GFP signals, it was possible to determine the number of recipient bacteria cured of pT following the COP treatment with TP1 14::KiM3. This number varied between 1 and 7% in the absence of selection. However, in the presence of kanamycin to select for TP1 14::KiM3 in the recipient cells, plasmid curation reached 100% (Figures 38. B and 38. C). pT loss was also confirmed by subculturing 63 transconjugant colonies on selective antibiotics media (21 for TP1 14 and 42 for TP1 14::KiM3). Only transconjugants from TP1 14::KiM3 were successfully confirmed for pT loss (42/42). Conjugation of TP1 14 WT plasmid into a KN03 bearing pT strain did not lead to pT loss in transconjugants, showing that plasmid loss was due to the expression of the genetic cargo and not plasmid incompatibility. Using this strategy, the efficiency of the system could be enhanced from ~1 % to 100% in vitro. This type of strategy could be scaled up to use several gRNA that targets several resistance genes at a time. This would allow the system to eliminate several antibiotics resistance genes while spreading only one resistance. It is therefore expected that in some embodiments, it could be desirable to spread a resistance gene and to couple the COP with an antibiotic treatment.
4.4 - COP can deliver a payload across species
COP can deliver a phenotypic trait as a payload across species. The ability of the COP to transfer a beneficial trait to other species of bacteria was first tested by conjugation between E. coli KN0'\AdapA bearing TP1 14 as the COP and diverse bacterial species as individual recipients. The recipient bacteria ( Salmonella enterica substr. Typhimurium SR-11 , E. coli MG 1655NXr, Citrobacter rodentium DBS100, Enterobacter aerogenes ATCC 35029, Klebsiella pneumoniae ATCC 13883, E. coli KN02) were chosen as part of the Enterobacteriaceae family, a family of bacteria most frequently found in the gut microbiome. The process was repeated with alternative transfer machinery (conjugative plasmids
pVCR94, RK24, pOX38, R6K, R388) under solid and liquid mating conditions (Figure 39.A and 39. B). Most of the transfer machineries were able to transfer the antibiotic resistance across species, highlighting the broad host range of these genetic constructs. The transfer frequency of the payload varied between recipients, but all recipient bacteria could successfully acquire the phenotypic trait.
Compatibility in Restriction-Modification (RM) systems benefits Horizontal Gene Transfer (HGT). The discrepancy between transfer frequencies was next investigated. One of the major barriers to HGT is the compatibility of restriction-modification systems. As such, the stability of the transferred genetic cargo depends on the ability of the recipient bacteria to target and eliminate the genetic construct. The targeting of a new DNA molecule often relies on RM system, in which DNA molecules are specifically modified to avoid recognition by specific nucleases. As such, compatibility between the probiotic donor’s modification system and the recipient’s restriction system can influence the recognition rate of the genetic cargo by specific nucleases. It was noted that the transfer of DNA by conjugation seemed limited from E. coli MG1655NxR to EcN and from EcN to MG1655NxR, but not between two MG 1655 or EcN strains for most tested conjugative systems (Figure 40). This result suggested that MG1655’s modification system was not compatible with EcN’s restriction system and vice versa. Therefore, choosing a donor bacterium that has compatible modification systems with the recipient bacteria or engineering one can enhance the payload’s stability in the recipient bacteria. Alternatively, a gene encoding an anti-restriction protein can be added to the genetic cargo to enhance its transmissibility between incompatible bacteria. One of such anti-restriction gene is found on RK24 ( kcIA ), thereby explaining the absence of drop for conjugation from MG 1655 to EcN. The anti-restriction system of RK24 allowed generally better transfer frequency of its antibiotic resistance gene across several species (Figure 39.A). Therefore, adding an anti-restriction protein to the COP system could be desirable to improve the delivery of cargos across species.
Conjugative plasmids are isolated genetic constructs that functions mostly independently from the donor strain. Since conjugative plasmids can spread across several genera of bacteria, their independence from the host chromosome to achieve DNA transfer is important for their persistence through time. While the cell provides resources to a conjugative plasmid, such a plasmid is often auto-regulated and expresses most of the proteins necessary for its adequate functioning. As such, conjugative plasmid’s efficiency to transfer should not be affected significantly by the host bacterium. To test this hypothesis, two distantly related E. coli strains MG1655NxR and E. coli KN01 were chosen to compare conjugation efficiency within the strains. The results from Figure 40 suggested that the conjugation efficiency was not dependent on the donor strain but is rather the same in both tested strains. The transfer
efficiency of all tested conjugative plasmids was the same in both E. coli donor strain, even though those two strains are distantly related. Therefore, the COP system could be easily adapted to operate in a plethora of probiotic hosts.
REFERENCES
Alikhan NF, Petty NK, Ben Zakour NL, Beatson SA. BLAST Ring Image Generator (BRIG): simple prokaryote genome comparisons. BMC Genomics. 201 1 Aug 8; 12:402. doi: 10.1 186/1471-2164-12-402. PubMed PMID: 21824423.
Allard N, Garneau D, Poulin-Laprade D, Burrus V, Brzezinski R, Roy S. A diaminopimelic acid auxotrophic Escherichia coli donor provides improved counterselection following intergeneric conjugation with actinomycetes. Can J Microbiol. 2015 Aug;61 (8):565-74. doi: 10.1 139/cjm-2015-0041. Epub 2015 Jun 3. PubMed PMID: 26166710.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990 Oct 5;215(3):403-10. PubMed PMID: 2231712.
Andrew S. (2010). FastQC: a quality control tool for high throughput sequence data. Available online at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc/
Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O. The RAST Server: rapid annotation using subsystems technology. BMC Genomics. 2008 Feb 8;9:75. doi: 10.1 186/1471-2164-9-75. PubMed PMID: 18261238.
Born TL, Blanchard JS. Structure/function studies on enzymes in the diaminopimelate pathway of bacterial cell wall biosynthesis. Curr Opin Chem Biol.1999 Oct;3(5):607-13. Review. PubMed PMID: 10508663.
Benjamin WH Jr, Turnbough CL Jr, Posey BS, Briles DE. Salmonella typhimurium virulence genes necessary to exploit the Itys/s genotype of the mouse. Infect Immun. 1986 Mar;51 (3):872-8. PubMed PMID: 3512437.
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for lllumina sequence data. Bioinformatics. 2014 Aug 1 ;30(15):21 14-20. doi: 10.1093/bioinformatics/btu170. Epub 2014 Apr 1. PubMed PMID: 24695404.
Bradley DE. Characteristics and function of thick and thin conjugative pili determined by transfer-derepressed plasmids of incompatibility groups 11 , I2, I5, B, K and Z. J Gen Microbiol. 1984 Jun;130(6):1489-502.
Bradley DE, Taylor DE, Cohen DR. Specification of surface mating systems among conjugative drug resistance plasmids in Escherichia coli K-12. J Bacteriol. 1980 Sep; 143(3): 1466-70. PubMed PMID: 6106013.
Carraro N, Matteau D, Luo P, Rodrigue S, Burrus V. The master activator of IncA/C conjugative plasmids stimulates genomic islands and multidrug resistance dissemination. PLoS Genet. 2014 Oct 23;10(10):e1004714. doi: 10.1371/journal.pgen.1004714. eCollection 2014 Oct. PubMed PMID: 25340549.
Ceccarelli D, Daccord A, Rene M, Burrus V. Identification of the origin of transfer (oriT) and a new gene required for mobilization of the SXT/R391 family of integrating conjugative elements. J Bacteriol. 2008 Aug; 190(15):5328-38. doi: 10.1 128/JB.00150-08. Epub 2008 Jun 6. PubMed PMID: 18539733.
Datsenko KA, Wanner BL. One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products. Proc Natl Acad Sci U S A. 2000 Jun 6;97(12):6640-5. PubMed PMID: 10829079.
Datta S, Costantino N, Court DL. A set of recombineering plasmids for gram-negative bacteria. Gene. 2006 Sep 1 ;379:109-15. Epub 2006 May 4. PubMed PMID: 16750601.
Davey HM. Life, death, and in-between: meanings and methods in microbiology. Appl Environ Microbiol. 201 1 Aug 15;77(16) :5571 -6. doi: 10.1 128/AEM.00744-1 1. Epub 201 1 Jun 24. Review. PubMed PMID: 21705550.
Ferrieres L, Hemery G, Nham T, Guerout AM, Mazel D, Beloin C, Ghigo JM. Silent mischief: bacteriophage Mu insertions contaminate products of Escherichia coli random mutagenesis performed using suicidal transposon delivery plasmids mobilized by broad-host-range RP4 conjugative machinery. J Bacteriol. 2010 Dec; 192(24):6418-27. doi: 10.1 128/JB.00621 -10. Epub 2010 Oct 8. PubMed PMID: 20935093.
Furuya N, Komano T. Initiation and termination of DNA transfer during conjugation of Inch plasmid R64: roles of two sets of inverted repeat sequences within oriT in termination of R64 transfer. J Bacteriol. 2000 Jun;182(1 1):3191-6. PubMed PMID: 10809699.
Goryshin IY, Miller JA, Kil YV, Lanzov VA, Reznikoff WS. Tn5/IS50 target recognition. Proc Natl Acad Sci U S A. 1998 Sep 1 ;95(18): 10716-21. PubMed PMID: 9724770.
Green R, Rogers EJ. Transformation of chemically competent E. coli. Methods Enzymol. 2013;529:329-36. doi: 10.1016/B978-0-12-418687-3.00028-8. PubMed PMID:2401 1059.
Guzman LM, Belin D, Carson MJ, Beckwith J. Tight regulation, modulation, and high-level expression by vectors containing the arabinose PBAD promoter. J Bacteriol. 1995 Jul; 177(14) :4121 -30. PubMed PMID: 7608087.
Haeussler M, Raney BJ, Hinrichs AS, Clawson H, Zweig AS, Karolchik D, Casper J, Speir ML, Haussler D, Kent WJ. Navigating protected genomics data with UCSC Genome Browser in a Box. Bioinformatics. 2015 Mar 1 ;31 (5):764-6. doi: 10.1093/bioinformatics/btu712. Epub 2014 Oct 27. PubMed PMID: 25348212.
Ishiwa A, Komano T. Thin pilus PilV adhesins of plasmid R64 recognize specific structures of the lipopolysaccharide molecules of recipient cells. J Bacteriol. 2003 Sep; 185(17):5192-9. PubMed PMID: 12923092.
Kotula JW, Kerns SJ, Shaket LA, Siraj L, Collins JJ, Way JC, Silver PA. Programmable bacteria detect and record an environmental signal in the mammalian gut. Proc Natl Acad Sci U S A. 2014 Apr 1 ;1 1 1 (13):4838-43. doi: 10.1073/pnas.1321321 1 1 1. Epub 2014 Mar 17. PubMed PMID: 24639514.
Kvitko BH, Bruckbauer S, Prucha J, McMillan I, Breland EJ, Lehman S, Mladinich K, Choi KH, Karkhoff-Schweizer R, Schweizer HP. A simple method for construction of pir+ Enterobacterial hosts for maintenance of R6K replicon plasmids. BMC Res Notes. 2012 Mar 20;5: 157. doi: 10.1 186/1756-0500-5-157. PubMed PMID: 22433797.
Li, H (2013) Aligning sequence reads, clone sequences and assembly contigs with BWA- MEM. arXiv:1303.3997v1 [q-bio.GN]
Lutz R, Bujard H. Independent and tight regulation of transcriptional units in Escherichia coli via the LacR/O, the TetR/O and AraC/l 1-12 regulatory elements. Nucleic Acids Res. 1997 Mar 15;25(6): 1203-10. PubMed PMID: 9092630.
Marchler-Bauer A, Bo Y, Han L, He J, Lanczycki CJ, Lu S, Chitsaz F, Derbyshire MK, Geer RC, Gonzales NR, Gwadz M, Hurwitz Dl, Lu F, Marchler GH, Song JS, Thanki N, Wang Z, Yamashita RA, Zhang D, Zheng C, Geer LY, Bryant SH. CDD/SPARCLE: functional classification of proteins via subfamily domain architectures. Nucleic Acids Res. 2017 Jan 4;45(D1):D200-D203. doi: 10.1093/nar/gkw1 129. Epub 2016 Nov 29. PubMed PMID: 27899674.
McKenzie GJ, Craig NL. Fast, easy and efficient: site-specific insertion of transgenes into enterobacterial chromosomes using Tn7 without need for selection of the insertion event. BMC Microbiol. 2006 Apr 28;6:39. PubMed PMID: 16646962.
Metcalf WW, Jiang W, Wanner BL. Use of the rep technique for allele replacement to construct new Escherichia coli hosts for maintenance of R6K gamma origin plasmids at different copy numbers. Gene. 1994 Jan 28; 138(1 -2): 1 -7. PubMed PMID: 8125283.
Ormo M, Cubitt AB, Kallio K, Gross LA, Tsien RY, Remington SJ. Crystal structure of the Aequorea victoria green fluorescent protein. Science. 1996 Sep 6;273(5280): 1392-5. PubMed PMID: 8703075.
Papagiannitsis CC, Tzouvelekis LS, Kotsakis SD, Tzelepi E, Miriagou V. Sequence of pR3521 , an IncB plasmid from Escherichia coli encoding ACC-4, SCO-1 , and TEM-1 beta- lactamases. Antimicrob Agents Chemother. 201 1 Jan;55(1):376-81. doi: 10.1 128/AAC.00875-10. Epub 2010 Oct 18. PubMed PMID: 20956594.
Praszkier J, Pittard AJ. Control of replication in l-complex plasmids. Plasmid. 2005 Mar;53(2):97-1 12. Review. PubMed PMID: 15737397.
Rice P, Longden I, Bleasby A. EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000 Jun;16(6):276-7. PubMed PMID: 10827456.
Rodrigue S, Materna AC, Timberlake SC, Blackburn MC, Malmstrom RR, Aim EJ, Chisholm SW. Unlocking short read sequencing for metagenomics. PLoS One. 2010 Jul 28;5(7):e1 1840. doi: 10.1371/journal.pone.001 1840. PubMed PMID: 20676378.
Ronchel MC, Ramos JL. Dual system to reinforce biological containment of recombinant bacteria designed for rhizoremediation. Appl Environ Microbiol. 2001 Jun;67(6):2649-56. PubMed PMID: 1 1375176; PubMed Central PMCID: PMC92920.
Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, Preibisch S, Rueden C, Saalfeld S, Schmid B, Tinevez JY, White DJ, Hartenstein V, Eliceiri K, Tomancak P, Cardona A. Fiji: an open-source platform for biological-image analysis. Nat Methods. 2012 Jun 28;9(7):676-82. doi: 10.1038/nmeth.2019. PubMed PMID: 22743772.
Seiffert SN, Carattoli A, Schwendener S, Collaud A, Endimiani A, Perreten V. Plasmids Carrying bla(CMY -2/4) in Escherichia coli from Poultry, Poultry Meat, and Humans Belong to a Novel IncK Subgroup Designated lncK2. Front Microbiol. 2017 Mar 15;8:407. doi: 10.3389/fmicb.2017.00407. eCollection 2017. PubMed PMID: 28360894.
Sekizuka T, Kawanishi M, Ohnishi M, Shima A, Kato K, Yamashita A, Matsui M, Suzuki S, Kuroda M. Elucidation of quantitative structural diversity of remarkable rearrangement regions, shufflons, in Incl2 plasmids. Sci Rep. 2017 Apr 19;7(1):928. doi: 10.1038/S41598- 017-01082-y. PubMed PMID: 28424528.
Shaner NC, Lambert GG, Chammas A, Ni Y, Cranfill PJ, Baird MA, Sell BR, Allen JR, Day RN, Israelsson M, Davidson MW, Wang J. A bright monomeric green fluorescent protein derived from Branchiostoma lanceolatum. Nat Methods. 2013 May;10(5):407-9. doi: 10.1038/nmeth.2413. Epub 2013 Mar 24. PubMed PMID: 23524392.
Sonnenborn, U. & Schulze, J. The non-pathogenic Escherichia coli strain Nissle 1917— features of a versatile probiotic. Microb. Ecol. Health Dis. 21 , 122-158 (2009).
Venturini C, Hassan KA, Roy Chowdhury P, Paulsen IT, Walker MJ, Djordjevic SP. Sequences of two related multiple antibiotic resistance virulence plasmids sharing a unique IS26-related molecular signature isolated from different Escherichia coli pathotypes from different hosts. PLoS One. 2013 Nov 4;8(1 1):e78862. doi: 10.1371/journal. pone.0078862. eCollection 2013. PubMed PMID: 24223859.