WO2022213253A1 - 经修饰的Prp43解旋酶及其用途 - Google Patents
经修饰的Prp43解旋酶及其用途 Download PDFInfo
- Publication number
- WO2022213253A1 WO2022213253A1 PCT/CN2021/085609 CN2021085609W WO2022213253A1 WO 2022213253 A1 WO2022213253 A1 WO 2022213253A1 CN 2021085609 W CN2021085609 W CN 2021085609W WO 2022213253 A1 WO2022213253 A1 WO 2022213253A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- prp43
- helicase
- seq
- modified
- protein
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/90—Isomerases (5.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/483—Physical analysis of biological material
- G01N33/487—Physical analysis of biological material of liquid biological material
- G01N33/48707—Physical analysis of biological material of liquid biological material by electrical means
- G01N33/48721—Investigating individual macromolecules, e.g. by translocation through nanopores
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y306/00—Hydrolases acting on acid anhydrides (3.6)
- C12Y306/04—Hydrolases acting on acid anhydrides (3.6) acting on acid anhydrides; involved in cellular and subcellular movement (3.6.4)
- C12Y306/04013—RNA helicase (3.6.4.13)
Definitions
- This application relates to nucleic acid sequencing technology.
- Nanopore sequencing technology is the third-generation nucleic acid sequencing technology, which obtains DNA/RNA sequence information by recording different electrical signals generated by different bases when DNA/RNA strands pass through nanopores.
- One of the challenges of nanopore sequencing technology is that DNA/RNA molecules often travel through nanopores too fast, exceeding the resolution of the instrument, making it difficult to obtain accurate electrical signals reflecting sequence information. Therefore, how to control or slow down the speed of DNA/RNA molecules passing through the nanopore is crucial to improve the accuracy of nanopore sequencing technology.
- an emerging method to characterize polynucleotides includes transmembrane pores, contact and interaction of helicases with polynucleotides, whereby helicases control the movement of target polynucleotides through nanopores to increase polynucleotides Residence time of acids at nanopores.
- patents WO2013057495A3 and US20150191709A1 disclose a new method for characterizing target polynucleotides using pore and Hel308 helicase or molecular motors capable of binding to nucleotides within the target polynucleotide.
- the helicase or molecular motor of the invention can effectively control the movement of the target polynucleotide through the pore.
- patents US20150065354A1 and US9617591B2 disclose a method for characterizing target polynucleotides using XPD helicase, the method utilizing pore and XPD helicase.
- the XPD helicase of this invention can control the movement of the target polynucleotide through the pore.
- patents US 2016O257942A1 and US20180179500 A1 disclose that T4 phage-derived Dda helicase and some homologous proteins thereof can be applied to polynucleotide through-pore sequencing after modification.
- helicases can be divided into six superfamily (Superfamily, SF), of which the SF1 and SF2 superfamily helicases play the translocation and unwinding functions in the form of monomers, and the SF3-SF6 family play in the form of polymers. effect.
- superfamily SF
- helicases that act in monomeric form are easier to use and more uniform in performance.
- Helicases of the SF1 and SF2 superfamily are based on protein sequence homology, domain arrangement, substrate-binding form and specificity, polarity of helical (5'-3' and 3'-5' orientation), and Characteristics such as rotation or translocation mechanisms are classified into different families, the SF1 superfamily includes UvrD/Rep family helicases, Upf1 family helicases and Pif1 family helicases, and the SF2 superfamily includes Rad3/XPD family helicases , Ski2-like family helicase, DEAH/RHA family helicase, NS3/NPH-II family helicase, DEAD-Box (DEAD-Box) family helicase, RIG-I family helicase, RecQ-like family Helicases, RecG-like family helicases, Swi/Snf family helicases, and T1R family helicases.
- the SF1 superfamily includes UvrD/Rep family helicases, Upf1 family helicases and
- RecD and T4 phage-derived Dda helicases both belong to the Pif1-like family of the SF1 superfamily.
- the substrates tend to be single-stranded DNA, translocation and unwinding in the 5'-3' direction.
- the common unwinding belongs to this family.
- the enzyme also includes Pif1 helicase, TrwC helicase, etc.; Hel308 helicase derived from Methanococcoides burtonii strain (as disclosed in US20150191709A1) belongs to the class ski2 family helicase of the SF2 superfamily, and can simultaneously use single-stranded DNA or RNA.
- As a substrate it polarly shifts or unwinds double-stranded nucleic acids in the 3'-5' direction.
- helicases belonging to this family also include ski2 helicase, Brr2 helicase, Mtr4 helicase, etc.;
- XPD helicase belongs to the Rad3/XPD family helicase of the SF2 superfamily. It specifically binds to single-stranded DNA, and shifts or unwinds double-stranded nucleic acid in the 5'-3' polar direction. Enzymes also include Rad3 helicase and the like.
- each helicase has its own advantages and disadvantages and its own applicable environment, and these helicases are still difficult to meet the requirements of scientific research and medical technology.
- novel helicases that can be used in nucleic acid nanopore sequencing technology in order to improve the applicability, accuracy and sensitivity of nanopore sequencing technology.
- Prp43 helicases especially modified Prp43 helicases, can control the movement of polynucleotide molecules through nanopores, and thus can be used in nanopore sequencing technology.
- a first aspect of the present application relates to a modified Prp43 helicase comprising a RecA1 domain, a RecA2 domain and a Ratchet domain, relative to the corresponding wild-type Prp43 helicase or fragment thereof
- the modified Prp43 helicase includes 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 introduced in at least one domain selected from the group consisting of RecA1 domain, RecA2 domain, Ratchet domain Insertion or substitution of one or more cysteines, and/or insertion or substitution of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or more unnatural amino acids .
- a second aspect of the present application relates to a protein construct comprising the modified Prp43 helicase described in the first aspect of the present application, and a co-activator protein fused to the C-terminus or N-terminus of the Prp43 helicase G-Path domains of Paf1 or Paf1 fragments containing G-Path domains.
- the third aspect of the present application relates to a nucleic acid encoding the modified Prp43 helicase described in the first aspect of the present application or the protein construct described in the second aspect of the present application.
- a fourth aspect of the present application relates to an expression vector comprising the nucleic acid of the third aspect of the present application.
- the fifth aspect of the present application relates to the nucleic acid of the third aspect of the present application or a host cell comprising the expression vector of the fourth aspect of the present application.
- the sixth aspect of the present application relates to a method for preparing the protein construct described in the second aspect of the present application, comprising: providing a polypeptide of SEQ ID NO: 1 or a variant thereof and a polypeptide of SEQ ID NO: 26 or a variant thereof Into the polypeptide of SEQ ID NO:1 or a variant thereof, at least one cysteine residue and/or at least one unnatural amino acid are introduced, and then the C-terminus or N-terminus of the resulting polypeptide is fused to the polypeptide of SEQ ID NO:26 A polypeptide or variant thereof, forming the protein construct.
- the seventh aspect of the present application relates to a method for preparing the modified Prp43 helicase described in the first aspect of the present application or the protein construct described in the second aspect of the present application, comprising: culturing the modified Prp43 helicase described in the fifth aspect of the present application.
- the host cells described above were used to induce expression, and then the resulting expression product was purified.
- An eighth aspect of the present application relates to a method for controlling the movement of a polynucleotide molecule, comprising combining the polynucleotide molecule with the modified Prp43 helicase described in the first aspect of the present application or the second aspect of the present application. contact with the protein construct.
- a ninth aspect of the present application relates to a method for characterizing a target polynucleotide, the method comprising:
- a tenth aspect of the present application relates to the use of the modified Prp43 helicase of the first aspect of the present application or the protein construct of the second aspect of the present application in characterizing a polynucleotide of interest or controlling the passage of a polynucleotide of interest through a pore. Use on the move.
- An eleventh aspect of the present application relates to an analytical device for characterizing a target polynucleotide, said analytical device comprising one or more nanopores, one or more modified Prp43 according to the first aspect of the present application A helicase or a protein construct as described in the second aspect of the present application, and one or more containers.
- a twelfth aspect of the present application relates to a method of forming a sensor for characterizing a polynucleotide of interest, comprising providing a nanopore, and the modified Prp43 helicase or the present invention in the nanopore and the first aspect of the present application
- a complex is formed between the protein constructs described in the second aspect of the application.
- the present application provides a novel Prp43 helicase mutant or construct thereof that can be used for nucleic acid nanopore sequencing, which has enhanced ATP hydrolysis activity or unwinding activity due to introduction of mutations and/or introduction of accessory proteins, and/ Alternatively, the binding to the target polynucleotide can be maintained for a long time, thereby allowing continuous and stable control of the movement speed of the polynucleotide. Therefore, the Prp43 helicase mutant of the present application or its construct can continuously control the movement of the target polynucleotide through the pore at an appropriate rate required for sequencing, thereby improving the throughput and accuracy of nanopore sequencing.
- polypeptide refers to a molecule comprising amino acid residues linked by peptide bonds and containing more than five amino acid residues. Polypeptides may generally contain 20 or more amino acids, preferably 50 or more amino acids, or 100 or more amino acids. As used herein, the terms “protein”, “protein” and the term “polypeptide” are considered to have the same meaning; thus, the terms “protein”, “protein” and “polypeptide” are used interchangeably. Polypeptides can optionally be modified (eg, glycosylated, phosphorylated, acylated, farnesylated, prenylated, sulfonated, etc.) to increase their functionality or activity.
- modified eg, glycosylated, phosphorylated, acylated, farnesylated, prenylated, sulfonated, etc.
- Enzymes Polypeptides that exhibit activity under certain conditions in the presence of specific substrates can be referred to as "enzymes”. It will be appreciated that due to the degeneracy of the genetic code, a variety of nucleotide sequences can be generated that encode a given polypeptide.
- nucleic acid described herein is a general term for deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), and is a biological macromolecular compound composed of many nucleotide monomers.
- DNA deoxyribonucleic acid
- RNA ribonucleic acid
- nucleic acid and polynucleotide are considered to have the same meaning; thus, the term “nucleic acid” and the term “polynucleotide” are used interchangeably.
- Nucleotide monomers are composed of five-carbon sugars, phosphate groups, and nitrogenous bases. If the five-carbon sugar is ribose, the polymer formed is RNA; if the five-carbon sugar is deoxyribose, the polymer formed is DNA. Nitrogenous bases in nucleotides may include, but are not limited to, adenine (A), guanine (G), thymine (T), uracil (U), and cytosine (C). The nucleotides may be naturally occurring or artificially synthesized.
- nucleotides include, but are not limited to: adenosine monophosphate (AMP), guanosine monophosphate (GMP), thymidine monophosphate (TMP), uridine monophosphate (UMP), cytosine Pyrimidine nucleoside monophosphate (CMP), cyclic adenosine monophosphate (cAMP), cyclic guanosine monophosphate (cGMP), deoxyadenosine monophosphate (dAMP), deoxyguanosine monophosphate (dGMP), deoxythymidine monophosphate Phosphate (dTMP), deoxyuridine monophosphate (dUMP), and deoxycytidine monophosphate (dCMP).
- the nucleotides are selected from AMP, TMP, GMP, CMP, UMP, dAMP, dTMP, dGMP or dCMP.
- a "fragment" of a polypeptide or polypeptide domain means one or more (eg, several, tens or 100, etc.) at the amino and/or carboxyl terminus of the polypeptide or polypeptide domain ) polypeptides or polypeptide domains in which amino acid residues are deleted, but the fragment retains the desired activity.
- a fragment of a Prp43 helicase represents one or more (eg, 1-5, 1-10, 1-20, 1-50, 1- 100, 1-150, 1-200, or, for example, 20, 30, 40, 50, 60, 70, 80 or 90) amino acid residues deleted but still retain the helicase Active polypeptide sequence.
- a fragment of a polypeptide or domain comprises at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 92%, 94%, 96% of its original sequence length %, 98% or 99% of the length.
- fragments of polypeptides or domains comprise at least 50 amino acids, such as at least 60 amino acids, at least 70 amino acids, at least 80 amino acids, at least 90 amino acids, at least 100 amino acids, depending on the length of the original polypeptide or domain.
- fragments of polypeptides or domains may also comprise less than 700 amino acids, such as less than 600 amino acids, less than 500 amino acids, less than 400 amino acids, less than 300 amino acids, less than 200 amino acids , or less than 100 amino acids.
- expression includes any step involved in the production of a polypeptide, including, but not limited to, transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
- An "expression vector” comprises a polynucleotide encoding a polypeptide operably linked to appropriate control sequences (eg, a promoter, and transcriptional and translational stop signals) for expression and/or translation in vitro.
- the expression vector can be any vector (eg, a plasmid or virus) that can conveniently undergo recombinant DNA procedures and can cause expression of a polynucleotide.
- the choice of vector will generally depend on the compatibility of the vector with the cells into which the vector is to be introduced.
- Vectors can be linear or closed circular plasmids.
- the vector may be an autonomously replicating vector, ie a vector that exists as an extrachromosomal entity that replicates independently of chromosomal replication, eg a plasmid, an extrachromosomal element, a minichromosome or an artificial chromosome.
- the vector may be one that, when introduced into a host cell, integrates into the genome and replicates with the chromosome into which it is integrated.
- the integrating cloning vector can integrate at random or predetermined target loci in the chromosome of the host cell.
- the vector system can be a single vector or plasmid or two or more vectors or plasmids that together contain the total DNA to be introduced into the genome of the host cell, or a transposon.
- control sequence refers to a component involved in the regulation of the expression of a coding sequence in a particular organism or in vitro.
- control sequences are transcription initiation, termination, promoter, leader, signal peptide, propeptide, prepropeptide or enhancer sequences; Shine-Delgarno sequence, repressor or activator efficient RNA processing signals, such as splicing and polyadenylation signals; sequences that stabilize cytoplasmic mRNA; sequences that enhance translation efficiency (e.g., ribosome binding sites); sequences that enhance protein stability; , sequences that enhance protein secretion.
- a "host cell” as defined herein is an organism suitable for genetic manipulation and which can be used to produce a product of interest, such as a Prp43 helicase as described herein.
- the host cell may be a host cell found in nature, or a host cell derived from genetic manipulation or classical mutagenesis of a parent host cell.
- the host cell is a recombinant host cell.
- the host cell can be a prokaryotic, archaeal or eukaryotic host cell.
- Prokaryotic host cells can be, but are not limited to, bacterial host cells.
- Eukaryotic host cells can be, but are not limited to, yeast, fungal, amoeba, algal, plant, animal, or insect host cells.
- nucleic acid or protein when used in reference to a nucleic acid or protein (or enzyme) means that the nucleic acid or protein (or enzyme) has been modified in sequence by human intervention compared to its native form.
- recombinant when referring to a cell (eg, a host cell) means that the genome of the cell has been sequence-modified by human intervention if compared to its native form.
- the terms “recombinant” and “modified” are considered synonymous.
- substitution means that a natural amino acid residue present in the corresponding wild-type polypeptide or enzyme is replaced by another amino acid residue.
- amino acid substitution and “amino acid substitution” are considered synonymous.
- variants or “mutant” as used herein have the same meaning and are used interchangeably. They can refer to polypeptides or nucleic acids. A variant refers to the presence of substitutions, insertions, deletions, truncations, transversions, etc. at one or more positions relative to a reference sequence (usually the wild form of a nucleic acid or polypeptide). Variants can be generated by, for example, site saturation mutagenesis, scanning mutagenesis, insertional mutagenesis, random mutagenesis, site-directed mutagenesis, and directed evolution, as well as various other recombinant methods known to those skilled in the art. Nucleic acid variants can be artificially synthesized by techniques known in the art.
- a "mature polypeptide” is defined herein as a polypeptide in its final form and obtained after translation of mRNA into a polypeptide and post-translational modification of the polypeptide.
- Post-translational modifications include N-terminal processing, C-terminal truncation, glycosylation, phosphorylation, and removal of leader sequences (such as signal peptides and/or propeptides) by cleavage.
- the similarity between two polypeptide sequences or nucleic acid sequences can be expressed in terms of their homology.
- identity or “homology” between two sequences are considered to have the same meaning and are used interchangeably herein.
- sequences are aligned for the best match, which is the identical match between the two sequences over the aligned regions percentage.
- the percent sequence homology between two amino acid sequences or between two polynucleotide sequences can be determined using well-known algorithms, such as the Needleman and Wunsch algorithm for aligning two sequences (Needleman, S.B. and Wunsch, C.D. ( 1970) J. Mol. Biol.
- the "opening” mentioned in this application refers to the opening of the polynucleotide binding domain of the wild-type Prp43 helicase itself, and may also refer to the opening of the polynucleotide binding part that binds to the Prp43 helicase.
- An opening is an opening that allows dissociation of the polynucleotide from the Prp43 helicase, and the opening may not always be present, but contains at least one opening in at least one conformational state.
- a "modified Prp43 helicase” or a construct comprising a modified Prp43 helicase as described herein contains one or more openings. The Prp43 helicase is modified so that two or more moieties are attached on the same monomer of the helicase to reduce the size of the opening.
- One or more”, “at least one”, “one or more” or “one or more” as used in this application includes: one, two, three, four, five, six, seven , eight, nine, ten, eleven, twelve, or more, etc.
- two or more or “two or more” or “two or more” includes: two, three, four, five, six, seven, eight, nine , ten, eleven, twelve, or more, etc.
- the "plurality” used in this application includes, but is not limited to, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or more and many more.
- “comprising”, “containing” or “comprising” are open-ended descriptions, meaning that the specified components or steps described are included, as well as other specified components or steps that do not substantially affect.
- the protein or nucleic acid may either consist of the sequence, or may have an additional protein or nucleic acid at one or both ends of the protein or nucleic acid. amino acid residues or nucleotides, but the protein or nucleic acid still has the activity described herein (eg, its ability to control movement of polynucleotides, etc.).
- Prp43 helicase is a known helicase whose structure and function have been studied and reported in the prior art, for example, see Marcel J. Tauchert et.al, "Structural and functional analysis of the RNA helicase Prp43 from the thermophilic eukaryote Chaetomium thermophilum”, Acta Cryst., 2016, F72, 112–120.
- Prp43 helicase for nanopore sequencing or to control the movement of polynucleotide molecules through nanopores.
- Prp43 helicase belongs to the DEAH/RHA helicase of the SF2 superfamily. It can bind to single-stranded DNA or RNA, translocate or unwind double-stranded DNA or RNA nucleic acid in the 3'-5' direction, and belong to the same family of helicases. Helicases also include Prp22 helicase, Prp2 helicase, MLE helicase, DHX9 helicase, and the like.
- FIG. 1 is a schematic 3D structure of a Prp43 helicase (SEQ ID NO: 1) derived from Chaetomium thermophilum.
- Prp43 helicase also contains several domains: N-terminal domain (M1-L96), C-terminal domain WH domain (Y459-P526), Ratchet domain (L527-V640) and OB domain (S641-A764), etc.
- RecA1 and RecA2 contain seven conserved motifs (Motifs), of which Ia (TQPRRVAA), Ib (TDGQLLR) and IV (LLFLTG) interact with substrate nucleic acids, motifs I (GSGKT), II (DEAH), V ( TNIAETSLT) and VI (QRAGRAGR) are involved in the binding of nucleotides, while motif III (SAT) couples nucleotide hydrolysis and nucleic acid translocation or unwinding.
- Motifs conserved motifs
- Ia TQPRRVAA
- Ib TDGQLLR
- IV LLFLTG
- motifs I GSGKT
- II DEAH
- V TNIAETSLT
- VI QRAGRAGR
- Prp43 helicase is enriched in positively charged amino acids at the top of the RecA1 and RecA2 domains, and together with the C-terminal WH, Ratchet, and OB domains form a channel around single-stranded DNA or RNA.
- Prp43 helicase has a strong affinity with single-stranded DNA or RNA, the process is still a thermodynamic dynamic equilibrium process, and cannot completely control the movement of the via hole of the target nucleic acid, especially the length of the target nucleic acid is relatively long. Long, such as nucleic acids of 1000 bases in length, 5000 bases in length, 10000 bases in length, 100000 bases in length or longer.
- the inventors found that the Prp43 helicase can be modified to ensure the binding of the enzyme to the nucleic acid and continuously control the passage of the nucleic acid through the nanopore.
- the inventors found that the introduction of one or more cysteines or unnatural amino acids into the RecA1 domain, RecA2 domain and/or Ratchet domain of Prp43 helicase can reduce the multinucleation of Prp43 helicase The size of the opening of the nucleotide binding domain, thereby improving the binding capacity of its target nucleic acid.
- a first aspect of the present application relates to a modified Prp43 helicase comprising a RecA1 domain, a RecA2 domain and a Ratchet domain, relative to the corresponding wild-type Prp43 helicase or fragment thereof
- the modified Prp43 helicase includes 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 introduced in at least one domain selected from the group consisting of RecA1 domain, RecA2 domain, Ratchet domain Insertion or substitution of one or more cysteines, and/or insertion or substitution of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or more unnatural amino acids .
- one at least one cysteine residue and/or at least one unnatural amino acid may be introduced in any of the following groups:
- the unnatural amino acids described in this application include, but are not limited to: 4-azido-L-phenylalanine (Faz), 4-acetyl-L-phenylalanine, 3-acetyl-L-phenylalanine Amino acid, 4-acetoacetyl-L-phenylalanine, O-allyl-L-tyrosine, 3-(phenylselenyl)-L-alanine, O-2-propyne- 1-yl-L-tyrosine, 4(dihydroxyboronyl)-L-phenylalanine, 4-[(ethylsulfanyl)carbonyl]-L-phenylalanine, (2S)-2 -Amino-3- ⁇ 4-[(propan-2-ylsulfanyl)carbonyl]phenyl ⁇ propionic acid, (2S)-2-amino-3- ⁇ 4-[(2-amino-3-sulfane (ylpropionyl)amino
- Prp43 helicase in this application should be understood in its broadest sense and is considered to encompass homologous proteins of Prp43 helicase (eg, SEQ ID NO: 1).
- SEQ ID NO: 1 a RecA1 domain, a RecA2 domain and/or a Ratchet domain, and has at least 30% homology to SEQ ID NO: 1, such as at least 35%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96% %, at least 97%, at least 98%, at least 99%, or at least 99.9% homology, it can be considered to be a Prp43 helicase. Accordingly, some helicases known as HrpA helicases or HrpB helicases, such as those listed in Table 1, are also considered to fall within the scope of "
- the Prp43 helicase described in this application can be Prp43 helicase from various conventional sources, for example, the Prp43 helicase can be derived from Chaetomium thermophilum, Bathycoccus prasinos, Uncultured bacterium, Archaeon, Parcubacteria, Sorangium cellulosum, Candidatus Sungbacteria, Mycolicibacterium chitae, Parcubacteria, Thermodesulforhabdus norvegica, Deltaproteobacteria, Puniceicoccales, Desulfobacterium vacuolatum or Desulfobacter sp. or derived from viral metagenome, etc.
- Table 1 gives some examples of homologous Prp43 helicases that can be used in the present application, but the Prp43 helicases of the present application are not limited to these examples.
- Prp43 helicase described in the application is derived from Chaetomium thermophilum.
- the application provides a modified Prp43 helicase comprising a variant of SEQ ID NO: 1 or a fragment thereof, the variant being included in the RecA1 structure Insertion or substitution of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or more cysteines introduced into the domain, RecA2 domain and/or Ratchet domain, and /or insertion or substitution of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or more unnatural amino acids.
- the variant is included in M157, Q161, D165, F181, E182, N183, R324, L328, E332, R335, L351, P352, P353, H354, D321 corresponding to SEQ ID NO: 1 , E320, R358, P563, A564, N565, D603, K605, K606, H609, Y615, R616, S619, N623, A626 or K630 in any one or two or more positions to introduce at least one cysteine residue and/or at least one unnatural amino acid residue. More preferably, the introduced cysteine residue or unnatural amino acid residue is located at a position corresponding to any one or two or more of F181, P352, S619 or N623 of SEQ ID NO:1.
- the modified Prp43 helicase comprises removal of the N-terminal domain, preferably at least 96, at least 90, at least 80, at least 70 from position 1 of the N-terminus , at least 60, at least 50, at least 40, or at least 30 residues.
- M1-N60 is preferably removed, i.e. the T61-A764 fragment of SEQ ID NO: 1 is preferably used, and on this basis an insertion or substitution of one or more cysteines is introduced, and/or insertion or substitution of one or more unnatural amino acids.
- two or more cysteine residues or non- Natural amino acid residues are linked between introduced cysteine and cysteine, between introduced unnatural amino acid and unnatural amino acid, and between introduced cysteine and unnatural amino acid. interlinked, between an introduced cysteine and a natural amino acid, or between an introduced unnatural amino acid and a natural amino acid.
- any number and combination of two or more introduced cysteines and unnatural amino acids can be interconnected.
- 2, 3, 4, 5, 6, 7, 8 or more cysteines and/or unnatural amino acids can be linked to each other.
- One or more cysteines can be linked to one or more cysteines.
- One or more cysteines can be linked to one or more unnatural amino acids such as Faz.
- One or more unnatural amino acids such as Faz can be linked to one or more unnatural amino acids such as Faz.
- One or more cysteines can be attached to one or more natural amino acids on the helicase.
- One or more unnatural amino acids such as Faz can be linked to one or more natural amino acids on the helicase.
- connection can be any connection mode, including temporary connection or permanent connection mode, such as covalent connection or hydrogen bond connection or electrostatic interaction or ⁇ - ⁇ interaction or hydrophobic interaction.
- linkage may be permanent, such as a covalent linkage.
- Covalent attachment can be carried out using chemical crosslinkers, which can vary in length from one carbon (phosgene type linker) to multiple Angstroms.
- chemical crosslinkers can vary in length from one carbon (phosgene type linker) to multiple Angstroms.
- PEGs polyethylene glycol
- PNA polypeptide nucleic acid
- TAA threose nucleic acid
- GNA glycerol nucleic acid
- TMAD TMAD
- TMAD catalytic reagents
- a TMAD catalyst is used to covalently link cysteine residues introduced at positions F181 and N623 or at positions P352 and S619 to cysteine residues.
- the modified Prp43 helicase further comprises substitutions for one or more cysteine residues, more preferably C148, C214, C303, C323 corresponding to SEQ ID NO: 1 , C377, C441, C508, C543, C608 one or more cysteine residues are replaced, more preferably cysteine residues are replaced by alanine, glycine, valine, isoleucine , leucine, phenylalanine, tyrosine, serine, threonine, aspartic acid, glutamic acid, lysine, arginine, histidine, methionine, tryptophan, glutamine , asparagine or proline residues.
- the modified Prp43 helicase further comprises one selected from the following group or multiple amino acid modifications:
- the amino acids that interact with nucleotides and are substituted include, but are not limited to: R152, R153, R180, T195, Q198, R201, E316, E317, G349, T381, N382 corresponding to SEQ ID NO: 1 , K403, K405, L416, P526, P557, R562, Q558, H688, P689, T708, K710, Y712, R714.
- at least one amino acid that interacts with the phosphate group of one or more nucleotides in single-stranded DNA, RNA or double-stranded DNA, RNA is substituted.
- the one or more amino acids related to the binding of NTP and/or divalent metal ions include, but are not limited to: T126, D218, S387, E219, R432 corresponding to SEQ ID NO: 1 , R435, T121, K125, T127, T389, R162, D391, F360.
- the one or more amino acids that interact with the transmembrane pore include, but are not limited to: C303, E336, D288, R287, E286, E284, E291 corresponding to SEQ ID NO: 1.
- At least one amino acid that interacts with the sugar and/or base of one or more nucleotides in single-stranded DNA, RNA or double-stranded DNA, RNA is replaced with an amino acid comprising a larger side chain.
- the larger side chains include an increased number of carbon atoms, have an increased length, have an increased molecular volume, and/or have an increased van der Waals volume.
- the larger side chain increases (i) electrostatic interactions between the at least one amino acid and one or more nucleotides in the single- or double-stranded DNA; (ii) hydrogen bonding and/or (iii) ) cation-pi interaction.
- the amino acid of the larger side chain is not alanine (A), cysteine (C), glycine (G), selenocysteine (U), methionine (M), aspartate acid (D) or glutamic acid (E).
- the Prp43 helicase is further modified to reduce the negative charge on its surface.
- the Prp43 helicase also contains substitutions that increase the net positive charge.
- the Prp43 helicase further comprises substitution or modification of surface negatively charged amino acids, polar or non-polar amino acids.
- the substitution includes the substitution of positively charged amino acids, uncharged amino acids for negatively charged amino acids, uncharged amino acids, aromatic amino acids, polar or non-polar amino acids.
- described positively charged amino acid, uncharged amino acid, polar, non-polar amino acid or aromatic amino acid can be natural or non-natural amino acid, it can be synthetic or modified natural amino acid .
- the amino acid sequence of the corresponding wild-type Prp43 helicase may have at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.9 % homology.
- the Prp43 helicase is a variant of SEQ ID NO: 1 (i.e., derived from Chaetomium thermophilum), and the variant of SEQ ID NO: 1 is included in SEQ ID NO: 1 At least one cysteine residue and/or at least one unnatural amino acid is introduced at the F181 and/or N623 position of the At least one cysteine residue and/or at least one unnatural amino acid is introduced at the position.
- the Prp43 helicase is a variant of SEQ ID NO: 1 (ie, derived from Chaetomium thermophilum), and the variant of SEQ ID NO: 1 further includes SEQ ID NO: 1 at least one or more cysteines are substituted.
- Substituted amino acids can be alanine, glycine, valine, isoleucine, leucine, phenylalanine, tyrosine, serine, threonine, aspartic acid, glutamic acid, lysine acid, arginine, histidine, methionine, tryptophan, glutamine, asparagine, proline.
- the one or more substituted cysteines are C148, C214, C303, C323, C377, C441, C508, C543, C608.
- the Prp43 helicase is a variant of SEQ ID NO: 1 (i.e., derived from Chaetomium thermophilum), and the variant of SEQ ID NO: 1 removes M1 of the N-terminal domain To the N60 sequence, it is further preferred to remove the N-terminal M1 to L96 sequence.
- the present invention is a helicase in which the N-terminal domain M1 to N60 sequence has been removed.
- the Prp43 helicase is a variant of SEQ ID NO: 1 (i.e., derived from Chaetomium thermophilum), and the variant of SEQ ID NO: 1 is the same as the variant of SEQ ID NO: 1
- the amino acid sequence is at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% , at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.9% homology.
- the Prp43 helicase is a modified T61-A764 fragment of SEQ ID NO:1.
- the Prp43 helicase is a modified T61-A764 fragment of SEQ ID NO: 1 (derived from Chaetomium thermophilum), and the modification is F181C/N623C/C508S or P352C/S619C /C508S.
- Prp43 helicases described herein can be modified to aid in identification or purification, for example by adding histidine residues (His tag), aspartic acid residues (asp tag), streptavidin tag, Flag tag, SUMO tag, GST tag or MBP tag, or by adding a signal sequence to facilitate their secretion from cells in which the polypeptide does not naturally contain the signal sequence.
- His tag histidine residues
- asp tag aspartic acid residues
- streptavidin tag Flag tag
- Flag tag Flag tag
- SUMO tag SUMO tag
- GST tag GST tag
- MBP tag MBP tag
- Prp43 helicase described herein may be in the form of a Prp43 helicase oligomer comprising one or more of the Prp43 helicases described herein.
- the Prp43 helicase oligomer may further comprise wild-type Prp43 helicase or other types of helicases.
- the other types of helicases can be Hel308 helicase, XPD helicase, Dda helicase, RecD2 helicase, TraI helicase or TrwC helicase and the like.
- Prp43 helicase and the wild-type Prp43 helicase Preferably, between the Prp43 helicase and the wild-type Prp43 helicase, between the Prp43 helicase and the Prp43 helicase, between the wild-type Prp43 helicase and the wild-type Prp43 helicase, and between the Prp43 helicase
- they can be connected or arranged in a head-to-head, tail-to-tail or head-to-tail manner.
- the Prp43 helicase oligomer comprises two or more Prp43 helicases described in this application, wherein the Prp43 helicases may be different or the same.
- the Prp43 helicase In physiological functions, the Prp43 helicase is involved in the dissociation of the intronic spliceosome composed of U2.U5.U6snRNPs during pre-mRNA processing.
- the function of the enzyme in this process requires the interaction of two glycine-rich moieties.
- the auxiliary proteins Ntr1 and Ntr2 of G-Path motif interact to activate their ATP hydrolysis and unwinding activities; Prp43 helicase is also involved in ribosome synthesis to help the maturation of 18S and 25S rRNAs precursors, This process also requires the activation of G-Path motif-rich proteins Pfa1 and Gno1 proteins.
- Prp43 helicase requires a G-Path domain-containing accessory protein to activate its ATP hydrolysis and helical activities under physiological functional conditions.
- the enzyme has weak activity in the absence of the co-activator protein, it is more preferably more active in ATP hydrolysis and unwinding in the presence of the co-activator protein.
- the inventors found that individual partial fragments containing G-Path domain accessory proteins still have an activating function.
- a protein construct comprising the modified Prp43 helicase described in the first aspect of the present application, and at the C-terminus or N-terminus of the Prp43 helicase Fusion of the G-Path domain of the helper activator protein Paf1 or a fragment of Paf1 containing the G-Path domain.
- the protein construct can also be regarded as a fusion protein.
- This modified Prp43 helicase construct is due to the fusion of the G-Path domain or the fragment containing the G-Path domain of the co-activator protein Paf1 or its homologous protein at the C-terminus or N-terminus of the Prp43 helicase. , the ATP hydrolysis and/or unwinding activity of the modified helicase is significantly enhanced, which is more conducive to controlling the movement of nucleic acid through holes in nanopore nucleic acid sequencing.
- the number of modified Prp43 helicases may be one or more.
- the coactivator protein Paf1 can be Paf1 proteins from various sources routinely used in the art, such as from Chaetomium thermophilum var.thermophilum, Thermothielavioides terrestris, Thermothelomyces thermophilus, Podospora anserina, Neurospora tetrasperma, Paf1 of Coniochaeta sp., Monosporascus sp., Hypoxylon sp., Madurella mycetomatis or Coniochaeta pulveracea.
- Table 2 gives some examples of homologous Paf1 proteins that can be used in the Prp43 helicase constructs of the present application, but the Paf1 proteins of the present application are not limited to these examples.
- the G-path domain sequence is the sequence of the K662-G742 fragment corresponding to SEQ ID NO: 16 (that is, the sequence of SEQ ID NO: 26) in the above-mentioned Pfa1 accessory protein or its homologous protein or a variant thereof. sequence.
- the amino acid sequence of the co-activator protein Paf1 is SEQ ID NO: 16 or has at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97% %, at least 98%, at least 99%, or at least 99.9% homology of the amino acid sequence of the variant, and the co-activator protein Paf1 has the function of activating the Prp43 helicase.
- the G-Path domain of Paf1 is the K662-G742 fragment of SEQ ID NO: 16.
- the Prp43 helicase comprises the sequence of SEQ ID NO: 1 or a variant thereof
- the Pfa1 co-activator protein comprises the sequence of SEQ ID NO: 16 or a variant thereof A variant, or the G-Path domain sequence SEQ ID NO:26 of SEQ ID NO:16 (corresponding to the K662-G742 fragment of the sequence of SEQ ID NO:16) or a variant thereof.
- the protein constructs described herein can be modified to aid in identification or purification, for example by adding histidine residues (His tag), aspartic acid residues (asp tag), streptavidin tag, Flag tags, SUMO tags, GST tags or MBP tags or Strep TagII tags, or by adding a signal sequence to facilitate their secretion from cells in which the polypeptide does not naturally contain the signal sequence.
- His tag histidine residues
- asp tag aspartic acid residues
- streptavidin tag Flag tags
- SUMO tags GST tags or MBP tags or Strep TagII tags
- An alternative way of introducing a genetic tag is to chemically attach the tag to a natural or artificial site on the protein construct.
- the third aspect of the present application provides a nucleic acid encoding the construct of the Prp43 helicase described in the first aspect of the present application and/or the protein described in the second aspect of the present application.
- the fourth aspect of the present application provides an expression vector, the expression vector comprising the nucleic acid described in the third aspect of the present application.
- the nucleic acid is operably linked to a regulatory element in an expression vector, wherein the regulatory element is preferably a promoter.
- the promoter is selected from T7, trc, lac, ara or ⁇ L.
- the expression vector includes but is not limited to plasmid, virus or phage.
- nucleic acid constructs or expression vectors Various methods for inserting nucleic acids into nucleic acid constructs or expression vectors are known to those of skill in the art, see e.g. Sambrook and Russell, Molecular Cloning: A Laboratory Manual, 3rd Edition, CSHL Press, Cold Spring Harbor, NY, 2001.
- the fifth aspect of the present application provides a host cell, the host cell comprising the nucleic acid described in the third aspect of the present application or the expression vector described in the fourth aspect of the present application.
- the host cells include but are not limited to Escherichia coli.
- the host cell is selected from BL21(DE3), JM109(DE3), B834(DE3), TUNER, C41(DE3), Rosetta2(DE3), Origami, Origami B, etc. .
- the sixth aspect of the present application relates to a method for preparing the protein construct described in the second aspect of the present application, comprising: providing a polypeptide of SEQ ID NO: 1 or a variant thereof and a polypeptide of SEQ ID NO: 26 or a variant thereof Into the polypeptide of SEQ ID NO:1 or a variant thereof, at least one cysteine residue and/or at least one unnatural amino acid are introduced, and then the C-terminus or N-terminus of the resulting polypeptide is fused to the polypeptide of SEQ ID NO:26 A polypeptide or variant thereof, forming the protein construct.
- the seventh aspect of the present application relates to a method for preparing the modified Prp43 helicase described in the first aspect of the present application or the protein construct described in the second aspect of the present application, comprising: culturing the modified Prp43 helicase described in the fifth aspect of the present application.
- the host cells described above were used to induce expression, and then the resulting expression product was purified.
- Genetic engineering techniques such as overexpression of enzymes in host cells, genetic modification of host cells, or hybridization techniques are methods known in the art, such as those described in Sambrook and Russel (2001) "Molecular Cloning: A Laboratory Manual (p. 3 editions), Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press or F. Ausubel et al., eds., "Current protocols in molecular biology", Green Publishing and Wiley Interscience, those techniques described in New York (1987).
- the preparation method of the modified Prp43 helicase comprises: according to the amino acid sequence of the Prp43 helicase and or the auxiliary activator protein or the activation domain described in the present application, obtaining a Prp43 helicase encoding the Prp43 helicase.
- the nucleic acid sequence of the gyrase is digested and connected to the expression vector, then transformed into E. coli, induced to express and purified to obtain the Prp43 helicase.
- Prp43 helicase or protein constructs Use of Prp43 helicase or protein constructs
- Prp43 helicase or protein constructs of the present application can be used to control the movement of polynucleotide molecules or to characterize a polynucleotide of interest.
- the Prp43 helicase of the present invention is a useful tool for controlling the movement of target polynucleotides during strand sequencing, when provided with the usual necessary components to facilitate movement, Prp43 helicases along DNA or RNA in 3 '-5' direction, but the orientation of the DNA or RNA in the pore (depending on which end of the DNA or RNA is captured) means that the Prp43 helicase can be used against the direction of the applied field or along the The direction of the field moves the DNA or RNA into the well.
- the opening of the polynucleotide binding domain or polynucleotide binding moiety on the Prp43 helicase or construct can be effectively reduced
- the ATP of the modified Prp43 helicase can be effectively increased Hydrolytic activity or unwinding activity, thereby improving the ability to control the passage of target polynucleotides through the pore.
- An eighth aspect of the present application relates to a method for controlling the movement of a polynucleotide molecule, comprising combining the polynucleotide molecule with the modified Prp43 helicase described in the first aspect of the present application or the second aspect of the present application. contact with the protein construct.
- the control of the movement of the polynucleotide is the movement of the control polynucleotide through the pore.
- the pores are nanopores, and the nanopores are transmembrane pores.
- the pores may be natural or artificial, including but not limited to biological pores, solid state pores, or hybrid biological and solid state pores.
- the method may comprise one or more Prp43 helicases jointly controlling the movement of the polynucleotide.
- a ninth aspect of the present application relates to a method for characterizing a target polynucleotide, the method comprising:
- steps (a) and (b) are repeated one or more times.
- any number of the Prp43 helicases described herein can be used in the method.
- the two or more Prp43 helicases described in this application may be the same or different. Wild-type Prp43 helicases or other types of helicases may also be included.
- two or more helicases can be linked or just arranged to play the function of controlling the movement of the polynucleotide by binding to the polynucleotide respectively.
- the method further comprises the step of applying a potential difference across the pore in contact with the helicase or construct, and the polynucleotide of interest.
- the pores are structures that allow hydrated ions to flow from one side of the membrane to the other layer of the membrane, driven by an applied electrical potential.
- the pores are nanopores, and the nanopores are transmembrane pores.
- the transmembrane pore provides a channel for the movement of the target polynucleotide.
- the pores are selected from biological pores, solid-state pores or hybrid pores of biological and solid-state.
- the pores include, but are not limited to, pores derived from M. smegmatis porin A, M. smegmatis porin B, M. smegmatis porin C, M. smegmatis porin Protein D, hemolysin, lysin, interleukin, outer membrane porin F, outer membrane porin G, outer membrane phospholipase A, WZA or Neisseria autotransport lipoprotein and the like.
- Said membrane may be any membrane existing in the prior art, preferably an amphiphilic layer, ie a layer formed of amphiphilic molecules such as phospholipids having at least one hydrophilic part and at least one lipophilic or hydrophobic part , amphiphilic molecules can be synthetic or naturally occurring.
- the membrane is a lipid bilayer membrane.
- the target polynucleotide can be attached to the membrane using any known method. If the membrane is an amphiphilic layer, such as a lipid bilayer, the polynucleotide is preferably attached to the membrane via a polypeptide present in the membrane or via a hydrophobic anchor present in the membrane.
- the hydrophobic anchor is preferably lipid, fatty acid, sterol, carbon nanotube or amino acid.
- the rate of passage of the target polynucleotide through the pore is controlled by the Prp43 helicase or construct, resulting in an identifiable and stable current level for target determination Characterization of polynucleotides.
- the target polynucleotide is single-stranded, double-stranded or at least partially double-stranded.
- the target polynucleotide can be modified by means of tags, spacers, methylation, oxidation or damage.
- the target polynucleotide is at least partially double-stranded. wherein the double-stranded portion constitutes a Y-adapter structure comprising a leader sequence that preferentially screws into the pore.
- the length of the target polynucleotide may be 10-100,000 bases or more.
- the length of the target polynucleotide can be at least 10, at least 50, at least 100, at least 200, at least 300, at least 400, at least 500, at least 1000, at least 2000, at least 5000, at least 10000, at least 50000 or at least 100000 bases, etc.
- the helicase is incorporated into the internal nucleotides of the single-stranded polynucleotide.
- the target polynucleotide is DNA or RNA.
- the target polynucleotide is RNA
- the RNA in order to improve the ability and efficiency of the RNA to be sequenced passing through the pore, is modified to include a non-RNA polynucleotide.
- the step of RNA modification comprises linking the DNA leader region to the 3' end of the RNA to be tested. It also includes the step of reverse transcription of the RNA to be tested
- the one or more features are selected from the source, length, identity, sequence, secondary structure of the target polynucleotide, or whether the target polynucleotide is modified. Further preferably, the one or more features are performed by electrical and/or optical measurements.
- electrical and/or optical signals are generated by electrical and/or optical measurements, and each nucleotide corresponds to a signal level, and the electrical and/or optical signals are then converted into nucleotide characteristics.
- the electrical measurement includes, but is not limited to, current measurement, impedance measurement, tunnel measurement, wind tunnel measurement, or field effect transistor (FET) measurement, and the like.
- FET field effect transistor
- the electrical signals described herein are selected from measurements of current, voltage, tunneling, resistance, potential, conductivity or lateral electrical measurements.
- the electrical signal is an electrical current through the aperture.
- the characterization further includes applying an improved Viterbi algorithm.
- a tenth aspect of the present application relates to the use of the modified Prp43 helicase of the first aspect of the present application or the protein construct of the second aspect of the present application in characterizing a polynucleotide of interest or controlling the passage of a polynucleotide of interest through a pore. Use on the move.
- An eleventh aspect of the present application relates to an analytical device for characterizing a target polynucleotide, said analytical device comprising one or more nanopores, one or more modified Prp43 according to the first aspect of the present application A helicase or a protein construct as described in the second aspect of the present application, and one or more containers.
- the analysis device is selected from kits, devices or sensors.
- the analysis device is a kit, and the kit further includes a chip comprising a lipid bilayer.
- the pores span the lipid bilayer.
- the kits described herein comprise one or more lipid bilayers, each lipid bilayer comprising one or more of the described pores.
- the kits described herein also include reagents or devices for carrying out the characterization of the polynucleotide of interest.
- the reagents include buffers and tools required for PCR amplification.
- a twelfth aspect of the present application relates to a method of forming a sensor for characterizing a polynucleotide of interest, comprising providing a nanopore, and the modified Prp43 helicase or the present invention in the nanopore and the first aspect of the present application
- a complex is formed between the protein constructs described in the second aspect of the application.
- Figure 1 shows a schematic 3D structure of an N-terminal (M1-N60) truncated wild-type Prp43 helicase (SEQ ID NO: 1) from Chaetomium thermophilum.
- Figure 2 shows N-terminal (M1-N60) truncated wild-type Prp43 helicase, modified Prp43 helicase Prp43-2 (F181C/N623C/C508S), modified Prp43 helicase Prp43-3 (P352C/S619C/C508S), N-terminal (M1-N60) truncated protein construct Prp43-GP, N-terminal (M1-N60) truncated protein construct Prp43-GP-2 (F181C/N623C/C508S) and N-terminal (M1-N60) truncated protein constructs Prp43-GP-3 (P352C/S619C/C508S) single-stranded DNA-dependent ATP hydrolysis activity assay.
- Prp43-GP-3 P352C/S619C/C508S
- Figure 3 shows N-terminal (M1-N60) truncated wild-type Prp43 helicase, modified Prp43 helicase Prp43-2 (F181C/N623C/C508S), modified Prp43 helicase Prp43-3 (P352C/S619C/C508S), N-terminal (M1-N60) truncated protein construct Prp43-GP, N-terminal (M1-N60) truncated protein construct Prp43-GP-2 (F181C/N623C/C508S) and N-terminal (M1-N60) truncated protein constructs Prp43-GP-3 (P352C/S619C/C508S) single-stranded RNA-dependent ATP hydrolysis activity assay.
- Prp43-GP-3 P352C/S619C/C508S
- Figure 4 shows N-terminal (M1-N60) truncated wild-type Prp43 helicase or N-terminal (M1-N60) truncated protein constructs Prp43-GP, N-terminal (M1-N60) under low salt conditions Affinity curve of truncated protein construct Prp43-GP-2 (F181C/N623C/C508S) with single-stranded DNA.
- Figure 5 shows N-terminal (M1-N60) truncated wild-type Prp43 helicase, N-terminal (M1-N60) truncated protein constructs Prp43-GP and N-terminal (M1-N60) truncated protein Results of gel migration experiments for construct Prp43-GP-2 (F181C/N623C/C508S).
- lane 1 is the T44-37-FAM substrate
- lane 2 is the complex bound by the wild-type Prp43 helicase and T44-37-FAM substrate
- lane 3 is the wild-type Prp43 helicase and T44-37-FAM
- lane 4 is the complex of Prp43-GP helicase and T44-37-FAM substrate binding
- lane 5 is Prp43-GP helicase and T44-37-FAM substrate
- lane 6 is the Prp43-GP-2 helicase mutant and T44-37-FAM substrate bound complex
- lane 7 is the Prp43-GP-2 helicase mutant and T44 The product of TMAD-catalyzed treatment after 37-FAM substrate binding.
- Figure 6 shows a schematic diagram of DNA construct X, wherein the 5' end of the corresponding sequence SEQ ID NO: 32 in the A region is connected to 4 iSpC3 spacers (region B), and the spacer region is connected to the corresponding sequence SEQ ID NO in the C region.
- the 3' end of: 33, the 5' end of the C region sequence is connected to the corresponding sequence SEQ ID NO: 34 of the D region, the corresponding sequence SEQ ID NO: 35 of the E region of the construct and the corresponding sequence SEQ ID NO: 36 of the F region ( It has a 3' cholesterol tether) hybridization.
- Figure 7 shows an example of current traces as N-terminal (M1-N60) truncated protein construct Prp43-GP-2 (F181C/N623C/C508S) controls DNA construct X movement through the MspA nanopore (y-axis coordinates are Current (pA, 0 to 100), x-coordinate is time (h:m:s)).
- Figure 8 illustrates: shows a schematic diagram of RNA construct Y wherein SEQ ID NO: 37 (labeled D) has its 3' end linked to 20 iSpC3 spacers (labeled A) and its 5' end is linked to 4 iSpC3 A spacer (labeled B) linked to the 3' end of SEQ ID NO:38 (labeled C), the SEQ ID NO:39 (labeled E) region of the construct is identical to SEQ ID NO:40 ( Labeled F, it hybridizes with a 3' cholesterol tether.
- Figure 9 shows an example of the current trajectory of N-terminal (M1-N60) truncated protein construct Prp43-GP-2 (F181C/N623C/C508S) controlling RNA construct Y through the MspA nanopore (y-axis coordinates are current (pA, 0 to 100), the x-axis coordinate is time (h:m:s)).
- Wild-type Prp43 helicase and modified helicase Prp43 and protein constructs were prepared using standard molecular biology methods, the principles and procedures of which are well known to those skilled in the art (see references cited herein ).
- N-terminal truncated wild-type Prp43 helicase i.e., T61-A764 fragment
- N-terminal truncated Prp43 helicase T61-A764 fragment corresponding to the Prp43 helicase amino acid sequence of SEQ ID NO: 1 with the N removed
- the nucleic acid sequence SEQ ID NO: 28, provided by GenScript Biotechnology Co., Ltd.
- corresponding to the M1 to N60 fragment of the terminal domain was connected to the vector pGS-21a (GenScript Biotechnology Co., Ltd.) by restriction enzyme ligation Company, Cat. No. SD0121), it was transformed into expression competent host cell BL21(DE3) (Beijing Quanshijin Biotechnology Co., Ltd., Cat. No.
- CD601-02 after being verified by sequencing.
- IPTG isopropyl- ⁇ -D-thiogalactoside
- the supernatant is subjected to subsequent protein chromatography purification, including nickel ion affinity chromatography, ion exchange chromatography and molecular sieve separation.
- Target protein the target protein after GST tag removal was detected by SDS-PAGE gel electrophoresis.
- the truncated Prp43 protein (M1 to N60 with the N-terminal domain removed) after the excision of the tag is detected by SDS-PAGE, and it is shown that the size of the target protein is correct, which can be used for subsequent testing and analysis.
- N-terminal truncated Prp43 helicase T61-A764 fragment fusion GP domain protein mutant Prp43-GP-2 (F181C/N623C/C508S) (i.e. SEQ ID NO: 27): according to N-terminal truncated Prp43 helicase T61 -The same preparation method of the A764 fragment was carried out, except that the starting sequence was replaced by SEQ ID NO:30 with the nucleic acid sequence corresponding to the N-terminal truncated Prp43 helicase T61-A764 fragment (SEQ ID NO:28). The protein construct Prp43-GP-2 after excision of the tag was detected by SDS-PAGE and showed that the size of the target protein was correct, which could be used for subsequent testing and analysis.
- modified N-terminal (M1-N60) truncated Prp43 helicases and protein constructs were prepared: Prp43-2 (F181C/N623C/C508S), modified The Prp43 helicase Prp43-3 (P352C/S619C/C508S), N-terminal (M1-N60) truncated protein construct Prp43-GP and N-terminal (M1-N60) truncated protein construct Prp43-GP- 3 (P352C/S619C/C508S).
- Prp43-2 F181C/N623C/C508S
- modified The Prp43 helicase Prp43-3 P352C/S619C/C508S
- N-terminal (M1-N60) truncated protein construct Prp43-GP and N-terminal (M1-N60) truncated protein construct Prp43-GP- 3 P352C/S619C/C
- the ATP hydrolysis activity of GP-3 (P352C/S619C/C508S) was tested when bound or incubated with single-stranded DNA or single-stranded RNA substrates.
- the ATPase hydrolysis activity of Prp43 helicase was detected by absorptiometry.
- the specific steps are to prepare a premixed solution containing 50uM phosphate, transfer 50uL of phosphate standard solution into 950uL of ultrapure water, and number the pipes.
- reaction mixture 160 uL of working reagent was added to each background blank well to stop the reaction. The initial 30 min incubation is not required, after which the background blank reading can be subtracted from the sample reading. Set the reaction combination according to the scheme of Table 4 and Table 5. Each sample, background blank, or negative control reaction requires 70 uL of reaction mix.
- Figures 2 and 3 show the ATP hydrolysis activities of N-terminal (M1-N60) truncated wild-type Prp43 helicase and modified Prp43 helicase or protein constructs after binding to DNA or RNA, respectively. It can be seen from Figure 2 and Figure 3 that after the G-Path activation domain was fused to the C-terminus of Prp43 helicase or mutant, the ATP hydrolysis activity of the enzyme was significantly improved; After cysteine, the ATP hydrolysis activity was also improved.
- This example uses the fluorescence polarization method to detect the N-terminal (M1-N60) truncated wild-type Prp43 helicase or the modified protein constructs Prp43-GP and Prp43-GP-2 (F181C/N623C/C508S) in single-stranded DNA affinity was tested.
- N-terminal (M1-N60) truncated wild-type helicase or modified helicase were diluted according to the following concentration gradients: 800nM, 400nM, 200nM, 100nM, 50nM, 25nM, 12.5nM, 6.25nM, 3.125nM, 1.56nM, BLANK, the enzyme and 10nM single-stranded DNA substrate were incubated in Binding Buffer (10mM HEPES, 50mM KCl, 5% Glycerol, pH7.0) for 20min, and the polarization value was read under 530nM excitation light and 560nM emission light And fit to draw the affinity curve, and set three replicates for each enzyme concentration.
- Binding Buffer (10mM HEPES, 50mM KCl, 5% Glycerol, pH7.0
- the fitting results are shown in Fig. 4.
- the N-terminal (M1-N60) truncated Prp43 helicase is fused with a G-Path domain at the C-terminal, that is, Prp43-GP helicase, or the Prp43-GP helicase is based on Prp43-GP.
- Prp43-GP-2 F181C/N623C/C508S
- the binding of N-terminal (M1-N60) truncated wild-type Prp43 helicase or modified protein constructs Prp43-GP and Prp43-GP-2 was detected by gel shift assay
- the TMAD catalyst catalyzed the enhancement of nucleic acid binding following the formation of a disulfide bond between mutant sites F181C and N623C in mutants.
- the experimental conditions are as follows: 30nM of FAM fluorophore-labeled single-chain polythymidine substrate T44-37-FAM was added to Buffer (10mM HEPES, 50mM KCl, PH7.0), and then the final concentration was 120nM.
- Buffer 10mM HEPES, 50mM KCl, PH7.0
- the wild-type Prp43 helicase and the modified Prp43-2 and Prp43-GP-2 helicases were incubated at 30 °C for 1.5 h; the final concentration of the enzyme was 1000 times the TMAD cross-linking agent to catalyze the mutation site cyste For cross-linking of amino acids, incubate at 30°C for 1.5h.
- DNA construct X as shown in Figure 6 was prepared: the 5' end of the corresponding sequence in the A region (SEQ ID NO: 32) was connected to 4 iSpC3 spacers (region B), which was connected to the corresponding sequence in the C region (SEQ ID NO: 32). ID NO: 33), the 5' end of the C region sequence is connected to the corresponding sequence of the D region (SEQ ID NO: 34), and the corresponding sequence of the E region of the construct (SEQ ID NO: 35) corresponds to the F region
- SEQ ID NO: 36 which has a 3' cholesterol tether
- the A, B, C, and D segments with a concentration of 10uM were synthesized into ligated fragments, and added to the annealing buffer (10mM Tris, pH7.0, 50mM NaCl) in a ratio of 1:1:1 with the E and F fragments.
- annealing buffer 10mM Tris, pH7.0, 50mM NaCl
- annealing was carried out according to the process of 98°C 10min, -0.1°C/0.6s, 300 cycles, 65°C 5min, -0.1°C/0.6s, 400 cycles (among them, A, B, C, D, E,
- the F fragment was provided by Sangon Bioengineering (Shanghai) Co., Ltd.).
- the prepared DNA construct X and modified mutant helicase Prp43-GP-2 (F181C/N623C/C508S) or N-terminally truncated wild-type Prp43-GP were incubated at 25°C in buffer (10 mM HEPES, pH 8.0 , 50mM NaCl, 5% glycerol) were pre-incubated for 30 minutes, and 1000 times the concentration of helicase TMAD catalyst was added for incubation at room temperature for 30 minutes.
- MspA nanopore MspA protein sequence is SEQ ID NO: 31, according to Michael Faller et al., "The Structure of a Mycobacterial Outer-Membrane Channel", Science 303, 1189 (2004); prepared as described in DOI: 10.1126/science.1094114) to obtain electrical measurement signals.
- Montal-Mueller technique ⁇ 25 ⁇ m diameter holes in the PTFE membrane formed a bilayer, separating two ⁇ 100 ⁇ L buffer solutions. All experiments were performed in the described buffer. Single-channel current is measured using an amplifier equipped with a digitizer. The Ag/AgCl electrodes were connected into the buffer such that the cis compartment was connected to the ground of the amplifier and the trans compartment was connected to the active electrode.
- RNA construct shown in Figure 8 was prepared: the 3' end of the corresponding sequence in the D region (SEQ ID NO: 37) was connected to 20 iSpC3 spacers (A region), and its 5' end was connected to 4 iSpC3 spacers (B area), the spacer is connected to the 3' end of the corresponding sequence in the C area (SEQ ID NO: 38), the E area corresponding sequence (SEQ ID NO: 39) and the F area corresponding sequence (SEQ ID NO: 39) of the construct :40) Hybrid.
- the A, B, C, and D segments with a concentration of 10uM were synthesized into ligated fragments, and added to the annealing buffer (10mM Tris, pH7.0, 50mM NaCl) in a ratio of 1:1:1 with the E and F fragments.
- annealing buffer 10mM Tris, pH7.0, 50mM NaCl
- annealing was carried out according to the process of 98°C 10min, -0.1°C/0.6s, 300 cycles, 65°C 5min, -0.1°C/0.6s, 400 cycles (among them, A, B, C, D, E,
- the F fragment was provided by Sangon Bioengineering (Shanghai) Co., Ltd.).
- RNA constructs were pre-incubated with Prp43-GP-2 or N-terminally truncated wild-type Prp43-GP in buffer (10 mM HEPES, pH 7.0, 50 mM NaCl) at 30°C for 30 minutes.
- MspA nanopore MspA protein sequence is SEQ ID NO: 31, according to Michael Faller et al., "The Structure of a Mycobacterial Outer-Membrane Channel", Science 303, 1189 (2004); prepared as described in DOI: 10.1126/science.1094114
- Montal-Mueller technique ⁇ 25 ⁇ m diameter holes in the PTFE membrane formed a bilayer, separating two ⁇ 100 ⁇ L buffer solutions.
- RNA polynucleotide construct and Prp43-GP-2 helicase or N-terminally truncated wild-type Prp43-GP were added to 70 ⁇ L of the cis compartment of the electrophysiology chamber buffer to initiate the capture of the helicase-RNA complex in the nanopore.
- Helicase ATPase activity was activated by adding divalent metal (5 mM MgCl 2 ) and NTP (5 mM ATP) to the cis compartment as needed. Experiments were carried out at a constant potential of +180 mV.
- the results show that the RNA construct is moved by the RNA controlled by the Prp43-GP-2 helicase, and the results of the RNA movement controlled by the Prp43-GP-2 helicase are shown in FIG. 9 .
- the RNA movement controlled by the Prp43-GP-2 helicase was 3 seconds long and corresponded to the translocation of an RNA construct of approximately 30 bp across the nanopore.
- N-terminal truncated wild-type Prp43 (T61-A764 fragment) or N-terminal truncated construct Prp43-GP was difficult to obtain the A/B/C/D fragment of construct Y generated by nanopore. continuous current signal.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- Physics & Mathematics (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Immunology (AREA)
- Analytical Chemistry (AREA)
- Hematology (AREA)
- Pathology (AREA)
- Urology & Nephrology (AREA)
- General Physics & Mathematics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Food Science & Technology (AREA)
- Nanotechnology (AREA)
- Plant Pathology (AREA)
- Enzymes And Modification Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
Description
Claims (46)
- 一种经修饰的Prp43解旋酶,包括RecA1结构域、RecA2结构域和Ratchet结构域,相对于对应的野生型Prp43解旋酶或其片段而言所述经修饰的Prp43解旋酶包括在选自RecA1结构域、RecA2结构域、Ratchet结构域的至少一个结构域中引入的1、2、3、4、5、6、7、8、9、10、11个或更多个半胱氨酸的插入或置换,和/或1、2、3、4、5、6、7、8、9、10、11个或更多个非天然氨基酸的插入或置换。
- 如权利要求1所述的经修饰的Prp43解旋酶,其中所述引入的半胱氨酸残基或非天然氨基酸残基位于对应于SEQ ID NO:1的M157、Q161、D165、F181、E182、N183、R324、L328、E332、R335、P353、L351、P352、H354、D321、E320、R358、P563、A564、N565、D603、K605、K606、H609、Y615、R616、S619、N623、A626或K630中的任一个或两个以上的位置,优选地位于对应于SEQ ID NO:1的F181、P352、S619或N623中的任一个或两个以上的位置。
- 如权利要求1或2所述的经修饰的Prp43解旋酶,其中所述野生型Prp43解旋酶的片段是Prp43解旋酶的N端结构域被去除后所得片段,优选地去除N端的位置1开始的至少96个、至少90个、至少80个、至少70个、至少60个、至少50个、至少40个或至少30个残基。
- 如权利要求1至3任一项所述的经修饰的Prp43解旋酶,其中所述经修饰的Prp43解旋酶还包括对于一个或多个半胱氨酸残基的置换,优选为对应于SEQ ID NO:1的C148、C214、C303、C323、C377、C441、C508、C543、C608的一个或多个半胱氨酸残基被置换,更优选地半胱氨酸残基被置换为丙氨酸、甘氨酸、缬氨酸、异亮氨酸、亮氨酸、苯丙氨酸、酪氨酸、丝氨酸、苏氨酸、天冬氨酸、谷氨酸、赖氨酸、精氨酸、组氨酸、蛋氨酸、色氨酸、谷氨酰胺、天冬酰胺或脯氨酸残基。
- 如权利要求1至4任一项所述的经修饰的Prp43解旋酶,其中所述被引入的半胱氨酸残基和非天然氨基酸残基总个数为2个或以上,并且在至少一个被引入的半胱氨酸或非天然氨基酸残基与另一个被引入的半胱氨 酸或非天然氨基酸残基之间形成相互连接。
- 如权利要求5所述的经修饰的Prp43解旋酶,其中所述连接选自共价连接或氢键连接或静电相互作用或π-π相互作用或疏水相互作用等,优选共价连接。
- 如权利要求6所述的经修饰的Prp43解旋酶,其中所述共价连接是-S-S键或者是通过选自碳酰氯、马来酰亚胺、活性酯、琥珀酰亚胺、叠氮化物、烷烃、烯烃、炔烃、聚乙二醇(PEGs)、多肽、多糖、脱氧核糖核酸(DNA)、肽核酸(PNA)、苏糖核酸(TNA)、甘油核酸(GNA)、聚酰胺或TMAD的交联剂或催化剂实现的共价连接。
- 如权利要求1至7任一项所述的经修饰的Prp43解旋酶,其中所述经修饰的Prp43解旋酶还包含选自以下组的一个或多个氨基酸修饰:(a)一个或多个与核苷酸相互作用的氨基酸被置换;(b)一个或多个与NTP和/或二价金属离子结合相关的氨基酸被置换;(c)一个或多个与跨膜孔相互作用的氨基酸被置换;(d)降低Prp43解旋酶的表面的负电荷的进一步修饰。
- 如权利要求1至8任一项所述的经修饰的Prp43解旋酶,其来源于Chaetomium thermophilum、Bathycoccus prasinos、Uncultured bacterium、Archaeon、Parcubacteria、Sorangium cellulosum、Candidatus Sungbacteria、Mycolicibacterium chitae、Parcubacteria、Thermodesulforhabdus norvegica、Deltaproteobacteria、Puniceicoccales、Desulfobacterium vacuolatum或Desulfobacter sp.或来源于病毒基因组(viral metagenome)。
- 如权利要求1至8任一项所述的经修饰的Prp43解旋酶,其中所述野生型Prp43解旋酶是选自具有以下序列之一的Prp43解旋酶:SEQ ID NO:1、SEQ ID NO:2、SEQ ID NO:3、SEQ ID NO:4、SEQ ID NO:5、SEQ ID NO:6、SEQ ID NO:7、SEQ ID NO:8、SEQ ID NO:9、SEQ ID NO:10、SEQ ID NO:11、SEQ ID NO:12、SEQ ID NO:13、SEQ ID NO:14、SEQ ID NO:15。
- 如权利要求1至8任一项所述的经修饰的Prp43解旋酶,其与对应的野生型的Prp43解旋酶的氨基酸序列具有至少30%、至少40%、至少50%、 至少60%、至少70%、至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、至少99%或至少99.9%的同源性。
- 如权利要求1至8任一项所述的经修饰的Prp43解旋酶,其来源于Chaetomium thermophilum,优选地其与SEQ ID NO:1的氨基酸序列具有至少30%、至少40%、至少50%、至少60%、至少70%、至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、至少99%或至少99.9%的同源性。
- 如权利要求1至3任一项所述的经修饰的Prp43解旋酶,其来源于Chaetomium thermophilum,且所述引入的半胱氨酸残基或非天然氨基酸残基位于对应于SEQ ID NO:1的F181、P352、S619或N623中的任一个或多个位置。
- 如权利要求13所述的经修饰的Prp43解旋酶,其为经修饰的SEQ ID NO:1的T61-A764片段,且所述修饰选自F181C/N623C/C508S和P352C/S619C/C508S。
- 如权利要求1至14任一项所述的经修饰的Prp43解旋酶,其为寡聚体形式,包含一个或多个如权利要求1至12任一项所述的经修饰的Prp43解旋酶。
- 一种蛋白构建体,其包括如权利要求1至15任一项所述的经修饰的Prp43解旋酶,以及在所述Prp43解旋酶的C端或N端融合的辅助激活蛋白Paf1的G-Path结构域或含有G-Path结构域的Paf1片段。
- 如权利要求16所述的蛋白构建体,其中包含一个或多个所述经修饰的Prp43解旋酶。
- 如权利要求16或17所述的蛋白构建体,其中所述辅助激活蛋白Paf1是来源于Chaetomium thermophilum var.thermophilum、Thermothielavioides terrestris、Thermothelomyces thermophilus、Podospora anserina、Neurospora tetrasperma、Coniochaeta sp.、Monosporascus sp.、Hypoxylon sp.、Madurella mycetomatis或Coniochaeta pulveracea的Paf1。
- 如权利要求16或17所述的蛋白构建体,其中所述辅助激活蛋 白Paf1的氨基酸序列选自SEQ ID NO:16、SEQ ID NO:17、SEQ ID NO:18、SEQ ID NO:19、SEQ ID NO:20、SEQ ID NO:21、SEQ ID NO:22、SEQ ID NO:23、SEQ ID NO:24、或SEQ ID NO:25,或具有与SEQ ID NO:16、SEQ ID NO:17、SEQ ID NO:18、SEQ ID NO:19、SEQ ID NO:20、SEQ ID NO:21、SEQ ID NO:22、SEQ ID NO:23、SEQ ID NO:24或SEQ ID NO:25其中之一的氨基酸序列具有至少30%、至少40%、至少50%、至少60%、至少70%、至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、至少99%或至少99.9%的同源性的变体的氨基酸序列,并且所述辅助激活蛋白Paf1具有激活Prp43解旋酶的功能。
- 如权利要求16或17所述的蛋白构建体,其中所述Paf1的G-Path结构域为SEQ ID NO:16序列的K662-G742片段(SEQ ID NO:26)或与SEQ ID NO:26的氨基酸序列具有至少30%、至少40%、至少50%、至少60%、至少70%、至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、至少99%或至少99.9%的同源性的变体的氨基酸序列,并且所述变体具有激活Prp43解旋酶的功能。
- 如权利要求16所述的蛋白构建体,其中所述Prp43解旋酶为SEQ ID NO:1的T61-A764片段,并且在对应于SEQ ID NO:1的F181、P352、S619或N623中的任一个或两个以上的位置处具有引入的1个或多个半胱氨酸的插入或置换和/或非天然氨基酸的插入或置换,且所述辅助激活蛋白Paf1的氨基酸序列为SEQ ID NO:16。
- 如权利要求16所述的蛋白构建体,其中所述Prp43解旋酶为SEQ ID NO:1的T61-A764,并且还具有选自F181C/N623C/C508S和P352C/S619C/C508S的修饰,且所述Prp43解旋酶的C端与氨基酸序列为SEQ ID NO:26的多肽融合。
- 一种编码如权利要求1至15任一项所述的经修饰的Prp43解旋酶或如权利要求16至22任一项所述的蛋白构建体的核酸。
- 如权利要求23所述的核酸,其中所述核酸被包含在选自于质粒、 病毒或噬菌体的载体中。
- 一种包含如权利要求23所述核酸的表达载体。
- 如权利要求25所述的表达载体,其中所述表达载体选自于质粒、病毒或噬菌体。
- 如权利要求25或26所述的表达载体,其中所述表达载体进一步包含用于控制所述核酸的表达的调控元件。
- 如权利要求27所述的表达载体,其中所述调控元件是与所述核酸可操作地连接的启动子。
- 如权利要求28所述的表达载体,其中所述启动子选自T7、trc、lac、ara或λL。
- 包含如权利要求23或24所述的核酸或包含如权利要求25至28任一项所述表达载体的宿主细胞。
- 如权利要求30所述的宿主细胞,其为大肠杆菌(Escherichia coli.)。
- 一种制备如权利要求16至22任一项所述的蛋白构建体的方法,包括:提供SEQ ID NO:1的多肽或其变体和SEQ ID NO:26的多肽或其变体,在SEQ ID NO:1的多肽或其变体中引入至少一个半胱氨酸残基和/或至少一个非天然氨基酸,然后在所得多肽的C端或N端融合SEQ ID NO:26的多肽或其变体,形成所述蛋白构建体。
- 一种制备如权利要求1至15任一项所述的经修饰的Prp43解旋酶或如权利要求16至22任一项所述的蛋白构建体的方法,包括:包括培养如权利要求30或31所述的宿主细胞,并进行诱导表达,然后纯化所得表达产物。
- 一种控制多核苷酸分子移动的方法,包括将所述多核苷酸分子与如权利要求1至15任一项所述的经修饰的Prp43解旋酶或如权利要求16至22任一项所述的蛋白构建体接触。
- 如权利要求34所述的控制多核苷酸分子移动的方法,其中所述多核苷酸分子被控制穿过纳米孔,所述的纳米孔为跨膜孔。
- 如权利要求35所述的控制多核苷酸分子移动的方法,其中所述跨膜孔选自蛋白孔、固态孔或生物与固态杂交的孔,优选地所述的蛋白孔选 自于耻垢分枝杆菌孔蛋白A、耻垢分枝杆菌孔蛋白B、耻垢分枝杆菌孔蛋白C、耻垢分枝杆菌孔蛋白D、溶血素、胞溶素、白细胞介素、外膜孔蛋白F、外膜孔蛋白G、外膜磷脂酶A、WZA或奈瑟氏菌自转运脂蛋白。
- 一种表征目标多核苷酸的方法,所述的方法包括:(a)将目标多核苷酸与如权利要求1至15任一项所述的经修饰的Prp43解旋酶或如权利要求16至22任一项所述的蛋白构建体接触,使得所述Prp43解旋酶或蛋白构建体控制所述目标多核苷酸移动穿过一纳米孔;(b)获取目标多核苷酸中的核苷酸与所述纳米孔相互作用时的一个或多个特征,从而表征所述目标多核苷酸。
- 如权利要求37所述的表征目标多核苷酸的方法,其中所述方法进一步包括跨所述纳米孔施加电势差的步骤。
- 如权利要求37或38所述的表征目标多核苷酸的方法,其中所述方法使用一个或多个所述Prp43解旋酶或蛋白构建体。
- 如权利要求37或38所述的表征目标多核苷酸的方法,其中所述的纳米孔为跨膜孔,所述跨膜孔选自蛋白孔、固态孔或生物与固态杂交的孔,优选地所述的蛋白孔选自于耻垢分枝杆菌孔蛋白A、耻垢分枝杆菌孔蛋白B、耻垢分枝杆菌孔蛋白C、耻垢分枝杆菌孔蛋白D、溶血素、胞溶素、白细胞介素、外膜孔蛋白F、外膜孔蛋白G、外膜磷脂酶A、WZA或奈瑟氏菌自转运脂蛋白。
- 如权利要求1至15任一项所述的经修饰的Prp43解旋酶或如权利要求16至22任一项所述的蛋白构建体在表征目标多核苷酸或控制目标多核苷酸穿过孔的移动中的用途。
- 一种用于表征目标多核苷酸的分析装置,所述的分析装置包含一个或多个纳米孔、一个或多个如权利要求1至15任一项所述的经修饰的Prp43解旋酶或如权利要求16至22任一项所述的蛋白构建体、以及一个或多个容器。
- 如权利要求42所述的用于表征目标多核苷酸的分析装置,所述分析装置中还包括包含脂质双层的芯片,其中所述的纳米孔横跨脂质双层。
- 如权利要求42或43所述的用于表征目标多核苷酸的分析装置, 所述分析装置中还包括缓冲剂、PCR扩增试剂。
- 如权利要求42、43或44所述的用于表征目标多核苷酸的分析装置,所述的分析装置是试剂盒或传感器。
- 一种形成表征目标多核苷酸的传感器的方法,包括提供纳米孔,和在所述纳米孔和如权利要求1至15任一项所述的经修饰的Prp43解旋酶或如权利要求16至22任一项所述的蛋白构建体之间形成复合物。
Priority Applications (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US18/553,834 US20240368568A1 (en) | 2021-04-06 | 2021-04-06 | Modified prp43 helicase and use thereof |
| EP21935490.9A EP4299746A4 (en) | 2021-04-06 | 2021-04-06 | Modified prp43 helicase and use thereof |
| CN202180006254.2A CN115777019A (zh) | 2021-04-06 | 2021-04-06 | 经修饰的Prp43解旋酶及其用途 |
| PCT/CN2021/085609 WO2022213253A1 (zh) | 2021-04-06 | 2021-04-06 | 经修饰的Prp43解旋酶及其用途 |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CN2021/085609 WO2022213253A1 (zh) | 2021-04-06 | 2021-04-06 | 经修饰的Prp43解旋酶及其用途 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2022213253A1 true WO2022213253A1 (zh) | 2022-10-13 |
Family
ID=83545051
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2021/085609 Ceased WO2022213253A1 (zh) | 2021-04-06 | 2021-04-06 | 经修饰的Prp43解旋酶及其用途 |
Country Status (4)
| Country | Link |
|---|---|
| US (1) | US20240368568A1 (zh) |
| EP (1) | EP4299746A4 (zh) |
| CN (1) | CN115777019A (zh) |
| WO (1) | WO2022213253A1 (zh) |
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2024138701A1 (zh) * | 2022-12-30 | 2024-07-04 | 深圳华大生命科学研究院 | 一种解旋酶突变体及其制备方法和在高通量测序中的应用 |
| WO2024089270A3 (en) * | 2022-10-28 | 2024-07-18 | Oxford Nanopore Technologies Plc | Pore monomers and pores |
| WO2025067293A1 (zh) * | 2023-09-27 | 2025-04-03 | 北京齐碳科技有限公司 | 解旋酶及其应用 |
| WO2025138248A1 (zh) * | 2023-12-29 | 2025-07-03 | 深圳华大生命科学研究院 | Ski2-like解旋酶及其应用 |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN116334030B (zh) * | 2023-05-06 | 2024-01-16 | 深圳市梅丽纳米孔科技有限公司 | 一种经修饰的CfM HL4解旋酶及其应用 |
| CN119709690A (zh) * | 2023-09-27 | 2025-03-28 | 北京齐碳科技有限公司 | 一种用于纳米孔测序的酶修饰方法 |
| CN118256468B (zh) * | 2024-02-22 | 2025-08-26 | 北京普译生物科技有限公司 | 一种修饰的ToPif1解旋酶及其应用 |
| CN118126983B (zh) * | 2024-02-22 | 2025-09-02 | 北京普译生物科技有限公司 | 一种修饰的mpk2解旋酶及其应用 |
Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2013057495A2 (en) | 2011-10-21 | 2013-04-25 | Oxford Nanopore Technologies Limited | Enzyme method |
| US20150065354A1 (en) | 2011-12-29 | 2015-03-05 | Oxford Nanopore Technologies Limited | Method for characterising a polynucelotide by using a xpd helicase |
| US20150191709A1 (en) | 2012-07-19 | 2015-07-09 | Oxford Nanopore Technologies Limited | Modified helicases |
| US20160257942A1 (en) | 2013-10-18 | 2016-09-08 | Oxford Nanopore Technologies Ltd. | Modified helicases |
| US20180179500A1 (en) | 2014-10-07 | 2018-06-28 | Oxford Nanopore Technologies Ltd. | Modified enzymes |
| CN108676785A (zh) * | 2018-03-07 | 2018-10-19 | 天津市湖滨盘古基因科学发展有限公司 | 一种atp依赖型rna解旋酶dhx3突变蛋白及应用 |
| CN109266537A (zh) * | 2018-09-14 | 2019-01-25 | 首度生物科技(苏州)有限公司 | 使用单分子多次通过纳米孔达到精密测序的基因测序仪 |
| CN112147185A (zh) * | 2019-06-29 | 2020-12-29 | 清华大学 | 一种控制多肽穿过纳米孔速度的方法及其应用 |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20030119013A1 (en) * | 2001-04-23 | 2003-06-26 | Bo Jiang | Identification of essential genes of Aspergillus fumigatus and methods of use |
| KR102168813B1 (ko) * | 2013-03-08 | 2020-10-22 | 옥스포드 나노포어 테크놀로지즈 리미티드 | 효소 정지 방법 |
| GB201609220D0 (en) * | 2016-05-25 | 2016-07-06 | Oxford Nanopore Tech Ltd | Method |
-
2021
- 2021-04-06 CN CN202180006254.2A patent/CN115777019A/zh active Pending
- 2021-04-06 WO PCT/CN2021/085609 patent/WO2022213253A1/zh not_active Ceased
- 2021-04-06 EP EP21935490.9A patent/EP4299746A4/en not_active Withdrawn
- 2021-04-06 US US18/553,834 patent/US20240368568A1/en active Pending
Patent Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2013057495A2 (en) | 2011-10-21 | 2013-04-25 | Oxford Nanopore Technologies Limited | Enzyme method |
| US20150065354A1 (en) | 2011-12-29 | 2015-03-05 | Oxford Nanopore Technologies Limited | Method for characterising a polynucelotide by using a xpd helicase |
| US9617591B2 (en) | 2011-12-29 | 2017-04-11 | Oxford Nanopore Technologies Ltd. | Method for characterising a polynucleotide by using a XPD helicase |
| US20150191709A1 (en) | 2012-07-19 | 2015-07-09 | Oxford Nanopore Technologies Limited | Modified helicases |
| US20160257942A1 (en) | 2013-10-18 | 2016-09-08 | Oxford Nanopore Technologies Ltd. | Modified helicases |
| US20180179500A1 (en) | 2014-10-07 | 2018-06-28 | Oxford Nanopore Technologies Ltd. | Modified enzymes |
| CN108676785A (zh) * | 2018-03-07 | 2018-10-19 | 天津市湖滨盘古基因科学发展有限公司 | 一种atp依赖型rna解旋酶dhx3突变蛋白及应用 |
| CN109266537A (zh) * | 2018-09-14 | 2019-01-25 | 首度生物科技(苏州)有限公司 | 使用单分子多次通过纳米孔达到精密测序的基因测序仪 |
| CN112147185A (zh) * | 2019-06-29 | 2020-12-29 | 清华大学 | 一种控制多肽穿过纳米孔速度的方法及其应用 |
Non-Patent Citations (9)
| Title |
|---|
| F. AUSUBEL ET AL.: "Current protocols in Molecular biology", 1987, GREEN PUBLISHING AND WILEY INTERSCIENCE |
| LONG JIAFU, ZHOU HAO, CHEN FEILONG: "Research progress in structure and function of polymerase-associated factor 1 complex", SCIENCE CHINA: CHINESE BULLETIN OF LIFE SCIENCE = SCIENTIA SINICA VITAE, vol. 49, no. 9, 1 September 2019 (2019-09-01), pages 1143 - 1154, XP055976785, ISSN: 1674-7232, DOI: 10.1360/SSV-2019-0154 * |
| MARCEL J. TAUCHERT ET AL.: "Structural and functional analysis of the RNA helicase Prp43 from the thermophilic eukaryote Chaetomium thermophilum", ACTA CRYST., vol. F72, 2016, pages 112 - 120, XP072457883, DOI: 10.1107/S2053230X15024498 |
| MICHAEL FALLER ET AL.: "The Structure of a Mycobacterial Outer-Membrane Channel", SCIENCE, vol. 303, 2004, pages 1189, XP003029246, DOI: 10.1126/science.1094114 |
| NEEDLEMAN, S. B.WUNSCH, C. D., J. MOL. BIOL., vol. 48, 1970, pages 443 - 453 |
| SAMBROOKRUSSEL: "Molecular Cloning: A Laboratory Manual", 2001, COLD SPRING HARBOR LABORATORY PRESS |
| See also references of EP4299746A4 |
| TANAKA NAOKO, SCHWER BEATE: "Mutations in PRP43 That Uncouple RNA-Dependent NTPase Activity and Pre-mRNA Splicing Function", BIOCHEMISTRY, vol. 45, no. 20, 1 May 2006 (2006-05-01), pages 6510 - 6521, XP055976580, ISSN: 0006-2960, DOI: 10.1021/bi052656g * |
| TAUCHERT MARCEL J, FOURMANN JEAN-BAPTISTE, LÜHRMANN REINHARD, FICNER RALF: "Structural insights into the mechanism of the DEAH-box RNA helicase Prp43", ELIFE, vol. 6, 16 January 2017 (2017-01-16), XP055976581, DOI: 10.7554/eLife.21510 * |
Cited By (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2024089270A3 (en) * | 2022-10-28 | 2024-07-18 | Oxford Nanopore Technologies Plc | Pore monomers and pores |
| WO2024138701A1 (zh) * | 2022-12-30 | 2024-07-04 | 深圳华大生命科学研究院 | 一种解旋酶突变体及其制备方法和在高通量测序中的应用 |
| EP4644570A4 (en) * | 2022-12-30 | 2026-03-04 | Bgi Shenzhen | Helicase mutant, its preparation process and its use in high-throughput sequencing |
| WO2025067293A1 (zh) * | 2023-09-27 | 2025-04-03 | 北京齐碳科技有限公司 | 解旋酶及其应用 |
| WO2025138248A1 (zh) * | 2023-12-29 | 2025-07-03 | 深圳华大生命科学研究院 | Ski2-like解旋酶及其应用 |
Also Published As
| Publication number | Publication date |
|---|---|
| CN115777019A (zh) | 2023-03-10 |
| US20240368568A1 (en) | 2024-11-07 |
| EP4299746A4 (en) | 2024-06-19 |
| EP4299746A1 (en) | 2024-01-03 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2022213253A1 (zh) | 经修饰的Prp43解旋酶及其用途 | |
| US12258591B2 (en) | Modified helicases | |
| US12448646B2 (en) | SSB method | |
| JP6614972B2 (ja) | 修飾ヘリカーゼ | |
| CN112646019B (zh) | 突变胞溶素孔 | |
| EP3259351B1 (en) | Method for producing hetero-oligomeric pores | |
| JP6889701B2 (ja) | アルファ溶血素バリアント | |
| JP2022095668A (ja) | 変異体ポア | |
| WO2022126304A1 (zh) | 一种经修饰的解旋酶及其应用 | |
| CN120230736A (zh) | 用于表征生物分子的ns3解旋酶突变体及其用途 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21935490 Country of ref document: EP Kind code of ref document: A1 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2021935490 Country of ref document: EP |
|
| ENP | Entry into the national phase |
Ref document number: 2021935490 Country of ref document: EP Effective date: 20230926 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| WWW | Wipo information: withdrawn in national office |
Ref document number: 2021935490 Country of ref document: EP |




