WO2002022830A2 - Transglutaminase gene products - Google Patents

Transglutaminase gene products Download PDF

Info

Publication number
WO2002022830A2
WO2002022830A2 PCT/GB2001/004120 GB0104120W WO0222830A2 WO 2002022830 A2 WO2002022830 A2 WO 2002022830A2 GB 0104120 W GB0104120 W GB 0104120W WO 0222830 A2 WO0222830 A2 WO 0222830A2
Authority
WO
WIPO (PCT)
Prior art keywords
nucleotide sequence
polypeptide
sequence
gene
amino acid
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/GB2001/004120
Other languages
French (fr)
Other versions
WO2002022830A3 (en
Inventor
Daniel Peter Aeschlimann
Pascale Marie Grenard
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University College Cardiff Consultants Ltd
Cardiff University
Original Assignee
University College Cardiff Consultants Ltd
Cardiff University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from GB0022768A external-priority patent/GB0022768D0/en
Priority claimed from GB0111995A external-priority patent/GB0111995D0/en
Priority to DE60140552T priority Critical patent/DE60140552D1/en
Application filed by University College Cardiff Consultants Ltd, Cardiff University filed Critical University College Cardiff Consultants Ltd
Priority to AU2001287860A priority patent/AU2001287860A1/en
Priority to AT01967485T priority patent/ATE449180T1/en
Priority to EP01967485A priority patent/EP1317548B1/en
Priority to US10/380,533 priority patent/US7052890B2/en
Priority to DK01967485.2T priority patent/DK1317548T3/en
Publication of WO2002022830A2 publication Critical patent/WO2002022830A2/en
Publication of WO2002022830A3 publication Critical patent/WO2002022830A3/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1025Acyltransferases (2.3)
    • C12N9/104Aminoacyltransferases (2.3.2)
    • C12N9/1044Protein-glutamine gamma-glutamyltransferase (2.3.2.13), i.e. transglutaminase or factor XIII
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/53Immunoassay; Biospecific binding assay; Materials therefor
    • G01N33/564Immunoassay; Biospecific binding assay; Materials therefor for pre-existing immune complex or autoimmune disease, i.e. systemic lupus erythematosus, rheumatoid arthritis, multiple sclerosis, rheumatoid factors or complement components C1-C9
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/53Immunoassay; Biospecific binding assay; Materials therefor
    • G01N33/573Immunoassay; Biospecific binding assay; Materials therefor for enzymes or isoenzymes
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy

Definitions

  • the present invention relates to the identification of novel Ixansgluta inase enzymes TG Z and TG Y .
  • Transglutaminases are a family of structurally and functionally related enzymes that catalyze the post-translational modification of proteins via a Ca 2+ dependant transferase reaction between the ⁇ -carboxamide group of a peptide-bound glutamine residue and various primary amines. Most commonly, ⁇ -glutamyl- ⁇ -lysine cross links are formed in or between proteins by reaction with the ⁇ -amino group of lysine residues.
  • transglutaminases contain a central core domain containing enzymatic activity, and a N-terminal ⁇ -sandwich domain and two C-te ⁇ ninal ⁇ -barrel domains, which are thought to be involved in the regulation of enzyme activity and specificity.
  • transglutaminase genes Seven different transglutaminase genes have been characterised in higher vertebrates on the basis of their primary structure (Aeschlimann, D, and Paulsson, M (1994) Thromb. Haemostasis 71: 402-415 Aeschlimann et ah (1998) J Biol Chem 273, 3542).
  • Transglutaminases can be found throughout the body, but each transglutaminase is characterised by its own typical tissue distribution, although each may be present in a number of different tissue types often in combination with other transglutaminases.
  • Transglutaminase gene products have specific functions in the cross linking of particular proteins or tissue structures. For review see Aeschlimann .
  • transglutaminases have adopted additional functions such as the tissue transglutaminase (TG C ), which is involved in GTP-binding in receptor signalling, and band 4.2 protein which functions as a structural component of the cytoskeleton.
  • TG C tissue transglutaminase
  • band 4.2 protein which functions as a structural component of the cytoskeleton.
  • keratinocyte transglutaminase TG K
  • epidermal transglutaminase TGE
  • TG keratinocyte terminal differentiation and the cross-linking of structural proteins to form the cornified envelope.
  • the fourth enzyme TGc is expressed in skin primarily in the basal cell layer, and plays a role in the stabilisation of the dermo-epidermal junction.
  • the importance of proper cross-hnking of the cornified envelope is exemplified by the pathology seen in patients suffering from a severe form of the skin disease referred to as congenital ichthyosis, which has been linked to mutations in the gene encoding TG K .
  • transglutaminase enzymes appear to be encoded by a family of closely related genes. Alignment of these genes demonstrates that all members of the transglutaminase family exhibit a similar gene organisation, with remarkable conservation of intron distribution. Furthermore, phylogenetic analysis indicates that an early gene duplication event subsequently gave rise to two different transglutaminase lineages; one comprising TGc, TG E , and band 4.2 protein; the other, factor XHIa, TGK and possibly also TG P (Aeschlimann and Paulsson (1994) (supra)).
  • TGK and factor XHIa have been mapped to human chromosome 14ql l.2 and chromosome 6p24-25 respectively, whereas TGc and TG E have been mapped to chromosome 20ql l, and TG P has been mapped to chromosome 3p21-22.
  • the gene structure is remarkably conserved among all members of the transglutaminase gene family. Not only is the position of intron splice points highly conserved, but also the intron splice types. This similarity in gene structure and homology of the primary structure of the transglutaminases provides further support for the proposition that the different transglutaminase genes are derived from a common ancestral gene.
  • the inventors have previously isolated a cDNA encoding a novel member of the trahsglutaminase gene family TGx, from human foreskin keratinocytes (Aeschilmann et al (1998) J. Biol. Chem., 273, 3452-3460). Two related transcripts with an apparent size of 2.2 and 2.8 kb were obtained.
  • the deduced amino acid sequence for the full-length gene product encodes a protein with 720 amino acids and a molecular mass of 81kDa.
  • TGx is the product of a ⁇ 35kb gene located on chromosome 15, comprising 13 exons and 12 introns.
  • the intron splice sites were found to conform to the consensus for splice junctions in eukaryotes.
  • the transcription initiation site is localised to a point 159 nucleotides upstream of the initiator methionine and the likely polyadenylation site is localised ⁇ 600 nucleotides downstream of the stop codon.
  • the two mRNA isoforms are the result of alternative splicing of exon HI and give rise to 2 protein variants of TGx which comprise catalytic activity.
  • TGx is expressed predominately in epithelial cells, and most prominently during foetal development, in epidermis and in the female reproductive system.
  • the inventors have now localised the TGM5 gene to chromosome 15ql5 by fluorescent in situ hybridisation.
  • Band 4.2 protem has previously been mapped to this chromosomal region (Sung L. A. et al (1992) Blood 79: 2763-2770; Najfeld V. et al (1992) Am. J. Hum. Genet 50: 71-75) and has subsequently been assigned to position 15ql5.2 by expression mapping of the LGMD2A locus on chromosome 15 (Chiannikulchai N. et al (1995) Hum. Mol. Genet 4: 717-725).
  • the TGM7 derived mRNA (Fig. 6A and Fig. 6B) comprises an open reading frame of 2130 nucleotides and a polyadenylation signal (AATAAA) 158 nucleotides downstream of the termination codon (TGA).
  • the deduced protein for TG Z consists of 710 amino acids.
  • the deduced protein for TG Z from Fig. 6A has a molecular mass of 79, 908 Da and an isoelectric point of 6.7.
  • the deduced protein for TG Z from Fig. 6B has a molecular weight of 80,065 and an isoelectric point of 6.6.
  • the TGM6 full length transcript (Fig. 10A) comprises an open reading frame of 2109 nucleotides.
  • the deduced protein for the long form of TG Y consists of 708 amino acids and has a calculated molecular mass of 79, 466 Da and an isoelectric point of 6.9.
  • the transcript for the short form of TG Y (Fig. 10B) comprises an open reading frame of 1878 nucleotides and the deduced protein consists of 626 amino acids with a molecular mass of 70, 617 Da and an isoelectric point of 7.6.
  • the inventors calculated their amino acid similarity based upon sequence alignments, and calculated their evolutionary distances using different algorithms. All the algorithms used predicted a close relationship between TG X , TG Z , TG Y , TG E , band 4.2 protein and TG C , and factor XDIa and TG K , respectively.
  • the grouping of TG X , TG Z , TG Y , TG E , TG C , and band 4.2 protein in one subclass and factor XDIa and TG K in another is supported by the results of this analysis and by the gene structure and genomic organisation of the different transglutaminase genes.
  • the inventors have therefore determined the structure of the human TGM5 gene, and its flanking sequences, and have mapped the gene to the 15ql5 region of chromosome 15. Further, the inventors have determined that the human TGM5 gene comprises 13 exons separated by 12 introns spanning roughly 35kb, and that the structure of the TGM5 gene is identical to that of EPB42 (band 4.2 protein), TGM2 (TGc) and TGM3 (TG E ) genes. Southern blot analysis has also shown that TGM5 is a single copy gene in the haploid genome.
  • the inventors developed a method for detection and identification of transglutaminase gene products based on RT-PCR with degenerate primers and using this method have discovered the gene product of the TGM5 gene in kerarinocytes (Aeschlimann et al (1998) I. Biol. Chem. 273, 3452-3460). Using this method, the inventors have identified another new transglutaminase gene product in human foreskin keratinocytes and in prostate cacrinoma tissue which has been designated TG Z or transglustaminase type VT A full-length cDNA for this gene product was obtained by anchored PCR.
  • TGM7 TG 2
  • Fig. 5C The inventors have therefore determined that the transglutaminase genes, TGM5 (TGx), TGM7 (TGz) and EPB42 (band 4.2 protein) are positioned side by side within approximately 100 kb on chromosome 15.
  • mice homologues of these genes are similarly arranged on mouse chromosome 2.
  • the inventors have identified and determined the nucleotide and amino acid sequences as well as tissue distribution for the novel fransglutaminase gene products TGz and TG .
  • a nucleotide sequence comprising at least a portion of the nucleotide sequence of Fig. 6 A, Fig. 6B, Fig. 10A or Fig. 10B; a nucleotide sequence which hybridise to the nucleotide sequence of Fig. 6A, Fig. 6B, Fig. 10A or Fig. 10B; a nucleotide sequence which is degenerate to the nucleotide sequence of Fig. 6A, Fig. 6B, Fig. 10A or Fig. 10B; all of which nucleotide sequences encode a polyp eptide having transglutaminase activity.
  • the nucleotide sequence consists of the nucleotide sequence of Fig. 6A, Fig. 6B, Fig. lOA or Fig. 10B.
  • the first aspect of the present invention also provides a nucleotide sequence which hybridises under stringent conditions to the nucleotide sequences of Fig. 6A, Fig. 6B, Fig. 10A or Fig. 10B and which encodes a polypeptide having fransgluta inase activity.
  • the nucleotide sequence has at least 80%, more preferably 90%) sequence homology to the nucleotide sequence shown in Fig. 6A, Fig. 6B, Fig. 10A or Fig. 10B. Homology is preferably measured using the BLAST program.
  • the invention further provides a method of expressing a polypeptide comprising inserting a nucleotide sequence according to the first aspect of the present invention into a suitable host and expressing that nucleotide sequence in order to express a polypeptide having transglutaminase activity.
  • the invention also provides a vector comprising a nucleotide sequence according to the first aspect of the present invention.
  • polypeptide having an amino acid sequence comprising at least a portion of the amino acid sequence of Fig. 6A, Fig. 6B, Fig. 10A or Fig. 10B, wherein the polypeptide has transglutaminase activity.
  • the invention also provides a polypeptide sequence which is at least 90% identical to the amino acid sequence of Fig. 6A, Fig. 6B, Fig. 10A or Fig. 10B and which has transglutaminase activity.
  • the amino acid sequence of the polypeptide having fransglutaminase activity may differ from the amino acid sequence given in Fig. 6A, Fig. 6B, Fig. 10A or Fig. 10B by having the addition, deletion or substitution of some of the amino acid residues.
  • the polypeptide of the present invention only differs by about 1 to 20, more preferably 1 to 10 amino acid residues from the amino acid sequence given in Fig. 6A, Fig. 6B, Fig. 10A or Fig. 10B.
  • the invention also provides a composition comprising the polypeptide of the present invention for use in transamidation reactions on peptides and polypeptides.
  • the invention also provides a polypeptide comprising exons VD through to exon X of the sequence shown in Fig. 6A or Fig. 6B.
  • the position of the exons on the sequence shown in Fig. 6A or Fig. 6B can be dete ⁇ nined from Fig. 8 where intron splice sites are marked with arrow heads.
  • a polypeptide comprising exons D through to exon IV or exons X through to exon XD of the sequence show in Fig. 10A or Fig. 10B.
  • the positions of the exons on the sequence shown in Fig. 10A or Fig. 10B can be determined from Fig. 8.
  • composition comprising the polypeptide according to the present invention for use in the cross-linking of proteins.
  • a diagnostic method comprising detecting expression of the polypeptide according to the present invention in a subject or in cells derived from a subject.
  • the invention also provides an antibody directed against the polypeptide according to the present invention.
  • the antibody may be any antibody molecule capable of specifically binding the polypeptide including polyclonal or monoclonal antibodies or antigen binding fragments such as Fv, Fab, F(ab') 2 fragments and single chain Fv fragments.
  • the invention further provides a method of gene therapy comprising correcting mutations in a non wild type nucleotide sequence corresponding to the nucleotide sequence of Fig. 6 A, Fig. 6B, Fig. 10A or Fig 10B.
  • Such gene therapy methods can be performed by homologous recombination techniques or by using ribozymes to correct small sequence mutation. Suitable techniques are well known to those skilled in the art.
  • a method of diagnosis of autoimmune disease comprising taking a sample from a subject and testing that sample for the presence of a fransglutaminase encoded by the nucleotide sequence of Fig. 6A, Fig. 6B, Fig. 10A or Fig 10B or portions thereof.
  • the fransglutaminase is detected by using an antibody having affinity for the transglutaminase.
  • the invention also provides a competitive protein binding assay for the differential diagnosis of autoimmune diseases comprising the detection of antibodies against the transglutaminase encoded by the nucleotide sequence of Fig. 6A, Fig. 6B or Fig.lOA or Fig 10B, or portions thereof.
  • the protein binding assay comprises using exogenous transglutaminase TGz or TGy, or both, as a competitive antigen.
  • Fig. 1 is a representation of the genomic organisation of the human TGM5 gene.
  • the human TGM5 gene is represented with the exons numbered I to XDI indicated by solid boxes separated by the introns 1 to 12. The sizes of the introns and exons are given in bp (base pairs).
  • the 5'- and 3'- untranslated regions in exon 1 and XDI, respectively, are represented by hatched boxes with functional elements defining the transcript indicated. Additional sequence elements found in the TGM5 gene are indicated as follows: Alu, Alu 7SL derived retroposon; STS, sequence tagged site. Below the genomic map, a representation of the sequences present in the individual BAG clones is depicted.
  • Fig. 2 is a representation of the structure of the 5' untranslated region of the human TGM5 gene and mapping of the transcriptional start site.
  • A Primer extension analysis of poly (A + ) RNA isolated from primary human keratinocyte prior to (lane 1) or after (lane 2) culture in suspension for 12h. Extension products were separated on denaturing polyacrylamide gel alongside a Sanger dideoxynucleotide sequencing reaction of the appropriate genomic DNA fragment primed with the same oligonucleotide. The transcriptional start site is indicated by the arrow.
  • B Nucleotide sequence of the proximal 5' region of the TGM5 gene, 5' ends of mRNA from primary keratinocytes mapped by RACE are indicated by arrowheads. The major transcription start site identified by primer extension is highlighted with an asterisk (labelled +1). Consensus sequences for putative regulatory elements are underlined.
  • Fig. 3 is a representation of the structure of the 3' untranslated region of the human TGM5 gene. 3'-flanking sequence is shown with sequences homologous to known consensus sequences for 3' processing of transcripts (AATAAA, CAYTG and YGTGTTYY) underlined. The termination points of cDNA's isolated from human keratinocytes (Aeschlimann et al (1998) I. Biol Chem 273, 3453-3460) by 3' RACE are indicated by arrowheads. A pair of inverted long repeat sequences is highlighted in italics.
  • Fig. 4 is a southern blot analysis of human genomic DNA hybridised to genomic TG X probes. Human genomic DNA was digested with BamHl, Eco ⁇ I, and Hin ⁇ restriction enzymes and hybridised with short 32 P-labelled DNA fragments corresponding to intron 2 and flanking sequences of exon D and DI (left panel) and exon X (right panel), respectively. The migration positions of the HincRR DNA size markers is indicated on the left.
  • Fig. 5 shows the chromosomal localisation of the human TGM5 gene by fluorescence in situ hybridisation.
  • A representative picture of fluorescine-labelled genomic DNA of BAC-228(P20) (fluroescence, arrows) hybridised to metaphase spreads of human chromosome stained with propidium iodide.
  • B An ideogram of banded chromosome 15 showing the localisation of the fluorescent signal on 13 chromosomes.
  • Fig. 6 shows A. the nucleotide sequence and deduced amino acid sequence for human TG Z .
  • B an alternative nucleotide sequence and deduced amino acid sequence for human TG Z .
  • the initiation and termination codons as well as polyadenylation signal (AATAAA) are underlined.
  • Fig.7 is a representation of the different tissue expression patterns for TGx, band 4.2 protein TG Y and TGz in different fetal and mature human tissues.
  • Human tissue Northern dot blot normalised for average expression of 9 different housekeeping genes probed with a fragment corresponding to the C-terminal ⁇ -barrel domains of TGx (A), TGz (B), (C) TG Y and band 4.2 (D).
  • a diagram showing the type of poly (A) + RNA dotted onto the membrane is shown in panel E.
  • Fig. 8 is a comparison of the structure of the different human transglutaminase genes.
  • A. is an alignment of the nine characterised human gene products (TGx, TG , TG Z (shown in Fig. 6A), TGc, TG E , band 4.2, factor XDI a-subunit, TG K , TG P; ) is shown, with dashes indicating gaps inserted for optimal sequence alignment and underlined residues representing amino acids conserved in at least five gene products.
  • the sequences are arranged to reflect the transglutaminase domain structure, based on the crystal structure of factor XID a-subunit.
  • Known intron splice sites are marked by '.
  • B. is an alignment of the nine characterised human gene products (TGx, TG Y , TG Z (shown in Fig. 6B), TG C , TG E , band 4.2, factor XDI a-subunit, TG K , TG P ,) is shown, with dashes indicating gaps inserted for optimal sequence alignment and underlined residues representing amino acids conserved in at least five gene products.
  • the sequences are arranged to reflect the transglutaminase domain structure, based on the crystal structure of factor XDI a-subunit.
  • Known intron splice sites are marked by arrowheads.
  • Fig. 9 is a phylogenetic tree of the transglutaminase gene family and genomic organisation of the genes in man and in mouse. Sequences were aligned to maximise homology as shown in Fig. 8 except including sequences from different species as available: h, human; m, mouse; r, rat. Note, the mouse sequence for TG X 3 is at present incomplete and no information is available for the N-terminal domain.
  • panel A5 a hypothetical pedigree for the gene family is given that is consistent with the data on the sequence relationship of the individual gene products (A) as well as with the data on the gene structure and genomic organisation (B).
  • Phylogenetic trees based on the amino acid sequence homology of the gene products have been constructed using the NJ method (Saitou and Nei, 1987) of the PHYLDP software package for (1) the N-terminal ⁇ -sandwich domain (2) the catalytic core domain, (3) the C-terminal ⁇ -barrel domains, and (4) the entire gene products, (C). Shows the similarity of TGx to the other transglutaminase gene products.
  • the domain structure is based on the X-ray crystallo graphic structure of the factor XDI a-subunit dimer and inferred on the other gene products based upon the sequence alignment shown in Fig. 8. The numbers reflect % sequence identity.
  • Fig. 10 shows the nucleotide sequence and deduced amino acid sequence of TG Y .
  • A. Shows the nucleotide and deduced amino acid sequence for the long form of TG Y .
  • B. Shows the nucleotide sequence and deduced amino acid sequence for the short form of TG Y .
  • Fig. 11 is a schematic representation of the organisation of the identified fransglutaminase gene clusters in the human genome.
  • a unique insertion sequence of about 30 amino acids between the catalytic core domain and ⁇ -barrel domain 1 found in TG X was used as a template to design specific primers for the screening of a human genomic library.
  • the characterisation of several genes of the transglutaminase gene family showed that the positions of the introns has been highly conserved and a comparison of the TGx sequence to the sequences of the other transglutaminases indicated that this unique sequence is present within an exon, exon X (see Fig. 8, aa 460-503) in TG X .
  • a PCR reaction from human genomic DNA using oligonucleotides PI and P2 that match sequences at either end of this unique segment yielded a DNA fragment of expected size which was confirmed to be the correct product by sequencing (results not shown).
  • Screening of a human genomic DNA BAC library by PCR using these oglionucleotides revealed two positive clones, BAC-33(P5) and BAC-228(P20) that were subsequently shown by Southern blotting with different cDNA probes to contain sequences spanning at least exon D to exon X of the TGM5 gene (results not shown). Restriction analysis further indicated that each of the BAC clones contained substantially more than 50kb of human genomic DNA.
  • the 3'-unfranslated region was obtained by step-wise extension of the known sequence using direct sequencing of BAC plasmid DNA. Both BAC clones terminated short of exon I and all attempts at isolating clones spanning exon I by screening of BAC and PI libraries failed. Exon I and intron 1 sequences were finally derived by nested PCR from human genomic DNA using conditions optimised for long range genomic PCR.
  • TGM5 gene comprises approximately 35kb of genomic DNA and contains 13 exons and 12 introns (Fig. 1). All intron/exon splice sites conform to the known GT/AG donor/acceptor site rale and essentially to the consensus sequence proposed by Mount S.M (1982) Nucleic Acids Res. 10, 459-472. (Table I). A sequence homologous to the branch point consensus CTGAC (Keller E. B and Noon. W. A (1984) Proc. Natl. Acad. Sci. USA 81: 7417-7420) was found 24 to 44 nucleotides upstream of the 3' splice site in introns 1, 3-6, and 9-12.
  • the size of the introns varied considerably, ranging from 106bp to more than 6kb (Fig. 1, Table II).
  • the sequence obtained from the two different BAC clones matched with the exception of a deletion spanning the sequence from intron 6 to intron 8 in BAC-33(P5) (Fig. 1).
  • the isolation and sequencing of cDNAs encoding TGx and Northern blotting with TG X cDNA probes revealed expression of at least two differentially spliced mRNA transcripts for TGx in human keratinocytes. Solving the gene structure confirmed the short form of TGx to be the result of alternative splicing of exon ID as predicted.
  • a third isolated cDNA that differed also at the exon Dl/exon IV splice junction turned out to be the result of incomplete or absent splicing out of intron 3 as the sequence upstream of exon IV in the cDNA matched with the 3' sequence of intron 3.
  • Exon 3 encodes part of the N-terminal ⁇ -barrel domain of TGx and the absence of the sequence encoded by exon 3 is expected to result in major structural changes in at least this domain of the protein. Nevertheless, expression of TG X in 293 cells using the full-length cDNA resulted in synthesis of two polypeptides with a molecular weight consistent with the predicted products from the alternatively spliced transcripts (results not shown).
  • TGx promoter is a TATA-less promoter.
  • C/BP may bind to CAAT-box
  • NF1 nuclear factor I
  • USF upstream stimulatory factor
  • c-Myb is found in TATA-less proximal promoters of genes involved in hematopoiesis and often interacts with Ets-factors, and these sites may be operative in the expression of TGx in hematopoetic cells, e.g HEL cells API, Ets and SPl elements are typically found in keratinocyte-specific genes and may be involved in transcriptional regulation in keratinocytes.
  • HEL cells API, Ets and SPl elements are typically found in keratinocyte-specific genes and may be involved in transcriptional regulation in keratinocytes.
  • Several API sites are present within 2.5kb of the upstream sequence and could interact with the proximal API factor for activation. SPl sites are properly positioned upstream of the start points of the shorter transcripts raising the possibility that these could also be functional, though to a lesser degree.
  • exon XDI contained a consensus polyadenylation signal AATAAA-600bp downstream of the termination codon (Fig. 3). This is in good agreement with the size of the mRNA (2.8kb) encoding full-length TG X expressed in human keratinocytes as detected by Northern blotting considering the length of the coding sequence (2160bp.).
  • a CAYTG signal that binds to U4 snRNA which is identical for 4 out of 5 nucleotides is present in tandem in 3 copies 7 nucleotides downstream of the polyadenylation signal.
  • the TGM5 gene was subsequently localised to chromosome 15 by fluorescent in situ hybridisation of human metaphase chromosome spreads using genomic DNA derived from either BAC clone as a probe (Fig. 5A).
  • a comparison of the probe signal to the DAPI banding pattern localised the TGM5 gene to the 15ql5 region.
  • the localisation was subsequently refined by determining the distance of the fluorescent signal to the centromere as well as to either end of the chromosome on 13 copies of chromosome 15 and expressing it as a fractional distance of the total length of the chromosome.
  • TGM5 is part of a cluster of transglutaminase genes.
  • the EPB42 gene has previously been assigned to locus 15ql5.2 on chromosome 15. This raised the possibility that the EPB42 gene may be arranged in tandem with the TGM5 gene. Indeed, PCR with specific primers for sequences derived from the 5' and 3' of the EPB42 gene yielded products of appropriate size from both, BAC-33(P5) and BAC-228(P20) (results not shown) and sequencing confirmed the identity of the PCR products. Southern blotting of BAC plasmid DNA with cDNA probes comprising the 5' or 3' end of the EPB42 gene and subsequent comparison of the pattern of labelled restriction fragments with that of the TGM5 gene allowed us to map this locus in more detail.
  • the EPB42 gene and TGM5 are arranged in the same orientation being spaced apart by ⁇ 1 lkb (Fig. 5C) approximately 30% of which was sequenced to characterise the 3' and 5' flanking UTR of the TGM5 and EPB42 gene, respectively.
  • Fig. 5C ⁇ 1 lkb
  • This tgm.3 gene showed a best fit location for the segment defined by D2Mit447 proximal and D2Mit258 distal, with a highest LOD of 14.8 and 12.2 to D2Mit258 (78.0cM) and D2Mit338 (73.9cM), respectively.
  • the tgm2 gene showed a best fit location for the segment defined by D2Mitl39 proximal (86.0cM) and D2Mit225 distal (91.0cM), with a highest LOD of 17.0 to the anchor market D2Mit287, consistent with it's assigned locus 89.0cM from the centromere, ⁇ anda, ⁇ ., et al., (1999). Arch. Biochem. Biophys. 366, 151-156.
  • TG C , TG E which are more closely related to the TGM5, TGM7, and EPB42 genes than the other fransglutaminase genes based on amino acid sequence comparison and similarity in gene structure (Fig. 9) have been mapped to human chromosome 20qll (Gentile V. et al (1994) Genomics 20, 295-297; Kim I. G. et al (1994) I. Invest. Dermatol 103, 137-142).
  • the syntenic regions of the 15ql5 and 20qll locus in the mouse are present in a short segment, 2F1-G, of chromosome 2, which puts all five fransglutaminase genes in proximity (Fig. 9B).
  • the mouse homologue of band 4.2 protein has been mapped to this region of mouse chromosome 2 (White R. A. et al (1992) Nat. Genet 2, 80-83).
  • this clone was shown to contain the gene encoding the mouse homologue of TG Z .
  • Genomic sequences derived from this BAC clone and also cDNA sequences derived from cDNA prepared from mouse uterus showed that the mouse and human gene products are 85% identical on the nucleotide level.
  • TGM6 located approximately 45kb downstream of the TGM3 gene consistent with our hypothesis (Fig. 11).
  • Fig. 11 we screened a large number of cell lines for expression of a respective gene product by PCR.
  • a corresponding gene product, TG Y , or transglutaminase type VI could be identified in a small cell lung carcinoma cell line and a full-length cDNA was subsequently derived by anchored PCR.
  • a full-length cDNA sequence for TG Z was obtained by anchored PCR using oligo(dT)-Not I primed cD ⁇ A prepared from human foreskin keratinocytes, prostate carcinoma tissue and human carcinoma cell line PC3, essentially following the strategy previously described (Aeschlimann et al (1998) J Biol Chem 273: 3245-3460).
  • the oligo(dT)-Not I primer was used as the anchoring primer to obtain the 3' end of the cDNA.
  • 5' RACE was used to determine the 5' end of the cDNA. The obtained sequence information (Fig.
  • the deduced protein consists of 710 amino acids.
  • the cDNA and amino acid sequence in Fig. 6A was first determined and the deduced protein has a calculated molecular mass of 79,908 Da and an isoelectric point of 6.7.
  • the cDNA and amino acid sequence in Fig. 6B was then determined. This sequence differs by a few nucleotides and amino acids from the sequence given in Fig. 6A.
  • the protein deduced from the sequence given in Fig. 6B has a calculated molecular mass of 80,065 and an isoelectric point of 6.6.
  • a full-length cDNA sequence for TG Y was obtained by PCR using oligo(dT) primed cDNA prepared from the lung small cell carcinoma cell line H69, and using sequence specific primers based on the presumptive transcribed genomic sequence. 5' RACE was used to determine the 5' end of the cDNA.
  • the obtained sequence information for the long form of TG Y (Fig. 10A) contained an open reading frame of 2109 nucleotides.
  • the deduced protein for the long form of TG Y consists of 708 amino acids and has a calculated molecular mass of 79,466 Da and an isoelectric point of 6.9.
  • a shorter transcript was also isolated which apparently resulted from alternative splicing of the sequence encoded by exon X ⁇ .
  • the absence of exon XD results in a frame shift and thereby in premature teimination within exon XDI.
  • the obtained sequence information for the short form of TG Y (Fig. 10B) contained an open reading frame of 1878 nucleotides.
  • the deduced protein for the short form of TG Y consists of 626 amino acids and has a calculated molecular mass of 70,671 Da and an isoelectric point of 7.6.
  • the sequence alterations due to the splicing result in a short protein which terminates just after the first C-terminal ⁇ -barrel domain.
  • the ⁇ -barrel domains have been implicated in the regulation of enzyme-substrate interaction, and the lack of the second C-terminal ⁇ -barrel domain (see Fig. 8, d5) is likely to be of biological significance.
  • the catalytic mechanism of transglutaminases has been solved based on biochemical data available for several transglutaminases and the X-ray crystallographic structure of the factor XDI a-subunit dimer.
  • the reaction center is formed by the core domain and involves hydrogen-bonding of the active site Cys to a His and Asp residue to form a catalytic triad reminiscent of the Cys-His-Asn triad found in the papain family of cysteine proteases.
  • the residues comprising the catalytic triad are conserved in TG Y (Cys276, His335, Asp358) and TG Z (Cys227, His336, Asp359) (Fig. 8) and the core domain shows a high level of conservation as indicated by a sequence identity of about 50%> between these gene products and the other transglutaminases (Fig. 9).
  • a Tyr residue in barrel 1 domain of the a subunit of factor XDI is hydrogen-bonded to the active site Cys residue and it has been suggested that the glutamine substrate attacks from the direction of this bond to initiate the reaction based on analogy to the cysteine proteases.
  • TGx is expressed in a number of different cell types (Aeschlimann et al (1998) I. Biol. Chem. 273, 3452-3460). To obtain a more complete picture on the expression of TGx and the novel gene products, we performed a dot blot Northern blot analysis of more than 50 adult and fetal human tissues.
  • Band 4.2 protein was expressed at high level in bone marrow and fetal spleen and liver, consistent with its role in hematopoietic cells, and virtually undetectable in all other tissues, hi contrast, TGx, TG and TGz showed widespread expression at low level, with highest levels of TGx, TG Y and TGz mRNA present in the female reproductive system, in the central nervous system, and in testis, respectively (Fig. 7).
  • TGz is expressed in osteosarcoma cells (MG-63), dermal fibroblasts (TI6F, HCA2), erythroleukemia cells (HEL), in primary keratinocytes, mammary epithelium carcinoma cells (MCF7), HELA cells, skin, brain, heart, kidney, lung, pancreas, placenta, skeletal muscle, fetal liver, prostate and in prostate carcinoma tissue.
  • MG-63 osteosarcoma cells
  • TI6F, HCA2 dermal fibroblasts
  • HEL erythroleukemia cells
  • MCF7 mammary epithelium carcinoma cells
  • HELA cells skin, brain, heart, kidney, lung, pancreas, placenta, skeletal muscle, fetal liver, prostate and in prostate carcinoma tissue.
  • a similar analysis for TG Y revealed expression only in a lung small cell carcinoma cell line (H69) and extremely low levels of expression in tissues.
  • TG Z is expressed widely in cells and tissues and expression levels are not apparently affected by cellular differentiation, (i.e keratinocyte differentiation or fibroblast senescence).
  • TG Y expression was very restricted and expression was only found in H69 cell line. This cell line has characteristics of neuronal cells such as the expression of neuron-specific enolase and brain isozyme of creatine kinase which together with widespread expression in tissues of the nervous system suggests that TG Y expression may be specific to neuronal cells.
  • Transglutaminase action has been implicated in the formation of aberrant protein complexes in the central nervous system leading to nerve cell degeneration, e.g in Alzheimers and Huntington's disease. Based on its expression pattern, TG Y is a logical candidate to bring about the underlying fransglutaminase-related pathological changes.
  • Oligonucleotides were from Oligos. Etc. Inc. (Wilsonville, OR) or life technologies and restriction enzymes from Promega Corp. (Madison, WI).
  • a human BAC library established in a F-factor-based vector, pBeloBAC 11, and maintained in E. coli DH10B was screened by PCR (Genome Systems, Inc., St. Louis, MO).
  • a 147bp DNA fragment unique to TGx was amplified from lOOng of genomic DNA in lOO ⁇ l of lOmM Tris/HCl, pH 8.3, 50mM KC1 containing 2mM MgCl 2 , 0.2mM dNTPs using 2.5 units of Tag DNA polymerase (Fisher Scientific Corp.
  • PCR cycles were 45 sec at 94°C (denaturation), 2 min at 60°C (annealing), and 3 min at 72°C (elongation) for a total of 37 cycles, with the first cycle containing an extended denaturation period (6 min) during which the polymerase was added (hot start), and the last cycle contained an extended elongation period (10 min).
  • Two positive clones were identified, BAC-33(P5) and BAC-228(P20) (Genome Systems), and their identity verified by Southern blotting.
  • Plasmid DNA was prepared using a standard alkaline lysis protocol. 2 ⁇ g plasmid DNA was restricted with BamHL, EcoRI, and Spel and probed with a 3 P-labelled-500bp NcoVBspHl and ⁇ 600b ⁇ spHI/Nde ⁇ cDNA fragment of TG X , respectively, as described below.
  • KC1 containing 2mM MgCl 2 , 0.2mM dNTPs and 50pmol of the desired oligonucleotide primers.
  • the PCR cycles were 45 sec at 94°C (denaturation), 1 min at 60°C (annealing), and 5 min at 72°C (elongation).
  • a total of 32 cycles were carried out, with the first cycle comprising an extended denaturation period (6 min) during which the polymerase was added (hot start) and the last cycle comprised an extended elongation period (10 min).
  • oligonucleotides were used as upstream and downsfream primers, respectively, in the individual reactions: intron 2, 5'-GGACCACCTGCTTGTTCGCCGGGG,5'-AGGGGCTGGGGCTGTGATGGCGTG; intron S ⁇ '-ACCTCTTGAAAATCCACATCGACTCC S'-CAGTTCTTGCTGCCTTGGTAGATGAAGCC; intron 4,5'-GACAGTGAACCCCAGAGGCAGGAG, 5'-TCTGTGGCTGGGTCAGTCTGGAAGTGCA (P3); intron S ⁇ '-GCCTGCACTTCCAGACTGACCCAGCCACA, 5'-TCCAGTTTCCATTGAGCACCCCA; intron 6,5'-TGCTGGGTCTTTGCTGCCGTCATGTGC, 5'-TCCTTCTTCTTATTCCCCAAAATCCTGCC; intron 7, 5'-TAGATGAGTATTATGACAACACAGGCAGG, 5'-GCGTCCAGCACCTGCCAGCCTCC,-
  • PCR's were carried out with 1.25 units of Pfu Turbo DNA polymerase (Sfratagene) and 260ng genomic DNA in a total of lOO ⁇ l of supplied reaction buffer supplemented with 0.2mM dNTPs, 2 ⁇ l DMSO and 50 p ol primers.
  • the PCR cycles were 45 sec at 94°C (denaturation), 1 min at 68°C (annealing), and 2 min at 72°C (elongation).
  • a total of 37 cycles were made, with the first cycle containing an extended denaturation period (6 min) during which the polymerase was added (hot start), and the last cycle containing an extended elongation period (10 min).
  • Double stranded cDNA was prepared from poly(A + ) RNA of cultured normal human keratinocytes (Aeschilmann et al (1998) J. Biol Chem. 273, 3452-3460) with the Copy Kit (Invitrogen, San Diego, CA).
  • the cDNA was purified from nucleotides using the GlassMax DNA Isolation Kit (Life Technologies, Inc.) and tailed in the presence of 200 ⁇ M dCTP with 10 units of terminal deoxynucleotidyl transferase (Promega) for 30 min at 37°C to anchor the PCR at the 5'-end.
  • the PCR reaction was anchored by performing a total of 5 cycles of one-sided PCR at a lower annealing temperature (37°C) with the abridged anchor primer (Life Technologies, Inc.) only. Following transfer of 25% of this reaction at 94°C to a new tube containing abridged anchor primer and TG x -s ⁇ ecific primer P3 (see above), the first round of amplification was carried out for a total of 37 cycles under the conditions described above except for annealing which was carried out at 55°C.
  • Nested PCR was done with the universal amplification primer (Life Technologies, Inc.) and TG x -specific primer P4, 5'-TGAAGTACAGGGTGAGGTTGAAGG, as described above (annealing at 60°C) using 1.0 ⁇ l from the first round PCR.
  • Oligonucleotide P5 5'-CATGGTAGCTGCCTCCGGTTCCTG containing a 5'-infrared label (IRD 800) was purchased from MWG Biotech (Ebesberg, Germany).
  • Primer P5 (5.3pmol) was hybridised to l ⁇ g of poly (A + )RNA from primary keratinocytes (Aeschlimann et al (1998) J. Biol. Chem. 273, 3452 ' 3460) and reverse transcription performed with 200 units of Superscript D RNAse H reverse transcriptase (Life Technologies) in a total of 20 ⁇ l for 90 min at 42°C according to the manufacturer's instructions.
  • Enzyme was heat inactivated and primer extension products extracted with phenol chloroform, precipitated with ethanol, and then analysed on a 4.5% denaturing polyacrylamide gel adjacent to dideoxynucleotide chain termination sequencing reactions (Thermo Sequenase Cycle Sequencing Kit; Amersham) derived from a double-stranded genomic DNA fragment using the same primer.
  • Plasmid DNA from BAC clones was further purified for direct sequencing by digestion with 200 ⁇ g/ml of RNase A (Sigma, St. Louis, MO) for lh at 37°C and by subsequent micro-dialysis using Spectra/Por 2 membranes (Spectrum Medical Industries, Inc. Laguana Hills, CA). PCR produts were gel purified using the QIA quick Gel Extraction Kit (Qiagen, Inc. Chatsworth, CA) for sequencing.
  • Cycle sequencing was performed by the dideoxy chain termination method using the Cyclist Exo-P_ ⁇ DNA Sequencing Kit (Sfratagene, LaJolla, CA) and pre-cast 6% polyacrylamide gels with the CastAway Sequencing System (Sfratagene) or using the dRhodamine Terminator Cycle Sequencing Ready Reaction Kit (PE Biosystems) and an ABI 310 automated sequencer.
  • Probes were hybridised to the blot overnight at 65°C in 500 roM NaH 2 PO 4 , pH 7.5, containing I M ⁇ DTA and 7% SDS.
  • the membrane was washed at 65°C to a final stringency of 40mM NaH 2 PO 4 , pH 7.5, ImM ⁇ DTA, and 1% SDS, and the result developed by exposure of the membrane to BioMax MR film (Eastman Kodak, Rochester, NY).
  • DNA probes were prepared by random prime labelling of plasmid DNA of BAC-33(P5) and BAC-228(P20) with fluorescine-conjugated dUTP using the Prime-It Fluor Fluorescence Labelling Kit (Sfratagene). Probes were denatured at 75°C for 10 min in hybridisation buffer consisting of 50% formamide (v/v) and 10%o dextran sulphate (w/v) in 4x SSC and prehybridised at 42°C for 20 min to 0.2 ⁇ g/ml human competitor DNA (Sfratagene) to block repetitive DNA sequences.
  • hybridisation buffer consisting of 50% formamide (v/v) and 10%o dextran sulphate (w/v) in 4x SSC and prehybridised at 42°C for 20 min to 0.2 ⁇ g/ml human competitor DNA (Sfratagene) to block repetitive DNA sequences.
  • Probes were subsequently hybridised to the chromosome spreads at 37°C overnight, followed by washing to a final stringency of O.lx SSC at 60°C.
  • Spreads were mounted in phosphate-buffered glycerol containing 200 ng/ml propidium iodide to counterstain chromosomes.
  • Slides were examined by epifluorescence microscopy using a lOOx objective and images captured with a DC-330 CCD camera (DAGE-MTI, Inc. Michigan City, IN) using a LG-3 frame grabber board (Scion Corp. Frederick, MD) in a Mclhtosh 8500 workstation and a modified version of the NTH image 1.6 software (Scion Corp.).
  • Images representing fluorescine-labelling and propidium iodide staining of the same field were superimposed using Adobe Photoshop 3.0 (Adobe Systems, Inc. Mountain View, CA) to map the gene to a chromosomal region.
  • poly(A)+RNA was prepared from about 10 6 H69 cells (American Type Culture Collection, Rockville, MD) by oligo(dT)-cellulose column chromatography using the Micro-Fast Track Kit (Invifrogen, San Diego, CA) and recovered in 20 ⁇ l lOmM Tris HCI, pH 7.5.
  • the ⁇ oly(A)+RNA (5.0 ⁇ l) was reverse transcribed into DNA in a total volume of 20 ⁇ l using the cDNA Cycle Kit (Invitrogen) with l.O ⁇ l oligo(dT) primer (0.2 ⁇ g/ ⁇ l).
  • Overlapping fragments of TG Y were amplified by PCR using ohgonucleotides 5'-ATCAGAGTCACCAAGGTGGAC, 5'-AGAAACACATCGTCCTCTGCACACC (P6), 5 '-CAGGCTTTCCTCTCACCGCAAACAC, 5'-CGTACTTGACTGGCTTGTACCTGCC,
  • 5'-AGGTTGAGGCAGGATTAACTGAGGCCTC 5'-AGGTTGAGGCAGGATTAACTGAGGCCTC.
  • PCRs were carried out with 1.25 units of AmpliTaq Gold DNA polymerase (PE Biosystems) and 2.0 ⁇ l cDNA in a total of 50 ⁇ l of supplied reaction buffer supplemented with 2mM MgCI 2 , 0.2mM dNTPs and 25 pmol of the appropriate gene-specific primers.
  • a total of 40 PCR cycles were made, with an elevated annealing temperature of 65°C for the initial 5 cycles and an annealing temperature of 60°C for the remaining cycles.
  • the 5'-end of the cDNA was isolated by 5 '-RACE as described above with the exception of using the gene-specific ohgonucleotides P6, 5'-GATGTCTGGAACACAGCTTTGG, and
  • PCR-products were either directly sequenced or when desired, cloned by taking advantage of the 3' A-overhangs generated by Taq DNA polymerase using the Original TA-Cloning Kit (Invitrogen).
  • TG Z For cloning of TG Z , we used a series of degenerate and gene-specific ohgonucleotides to isolate overlapping DNA fragments, essentially following our previously described strategy. TG z -specific oligonucleoti.de primers were
  • the 5 '-end of the cDNA was isolated by 5 '-RACE as described above with the exception of using the gene-specific ohgonucleotides P ' 7, 5'-TGAAGCTCAGCCGGAGGTAGAAG, and 5 '-GACAGACTCAAGCCGCAAGGTTG.
  • RNA Master Blot containing poly(A) + mRNA of 50 different tissues was obtained from Clonetech Laboratories, fric (Palo Alto, CA). 32 P-labeled probes were prepared by random prime labelling of DNA fragments of the different transglutaminase gene products using the Multiprime DNA Labelling System (Amersham, Int., Amersham, UK). DNA fragments of 500-700bp compromising the 3 '-end of TG X , TGz, and band 4.2 protein, were generated by restriction with Pst I and Ace I, Nco I and Not I (exon XD and XDI), and Xho I, respectively.
  • the cD ⁇ A encoding human band 4.2 protein was kindly provided by Dr Carl M. Cohen, Boston, MA.
  • a ⁇ 220bp 32 P-labeled fragment of TG Y was generated by PCR using ohgonucleotides 5'-CAGCCTCAGTCACCGCCATCCGC and 5'-GATACTTGTAGAGGTCAGTGATG. Hybridization was performed under the conditions recommended by the manufacturer.
  • the labeled membrane was exposed to BioMax MR film (Eastman Kodak) and films developed after 15 to 24hr for first exposure and 3 to 5 days for second exposure.
  • TGY and TG Z from different tissues cD ⁇ A from various cell lines and human tissue was prepared as previously described.
  • a panel of cD ⁇ As from human tissue (Multiple Tissue cD ⁇ A Panel I) were also obtained from Clonetech Laboratories.
  • a 365 or 287bp fragment of TG Z was amplified by PCR using ohgonucleotides 5'-TGGGCAAGGCGCTGAGAGTCCATG and 5'-GCTGGAGGGCGGGTCTCAGGGAGC or
  • the 100 radiation hybrid (RH) clones of the T31 mouse/hamster RH panel (McCarthy et ah, (1997), Genome Res., 7, 1153-1161) (Research Genetics, Huntsville, AL) were screened by PCR.
  • a 139bp fragment of the tgm5 gene was amplified with primers5'-TGAGGACTGTGTGCTGACCTTG (f) and 5'-TCCTGTGTCTGGCCTAGGG (r), a 149bp fragment of the epb42 gene with primers 5'-CAGGAGGAGTAAGGGGAATTGG (f) and 5'-TGCAGGCTACTGGAATCCACG (r), a 400b ⁇ fragment of tgm7 with primers 5'-GGGAGTGGCCTCATCAATGG (f) and 5'-CCTTGACCTCACTGCTGCTGA (r), a -600bp fragment with tgm3 with primers 5'-TCGGTGGCAGCCTCAAGATTG (f) and 5'-AGACATCAATGGGCAGGCATGG (r), and 655bp and 232bp fragments of tgm2 with primers 5'-TTGGGGAGCTGGAGCAAC (f) and 5'-ATCCAGGACTCCACCC
  • PCRs were carried out in a GeneAmp 9600 ther acycler with 0.035 units/ ⁇ l AmpliTaq Gold polymerase in standard reaction buffer containing 2mM MgCl 2 , 0.2mM dNTPs, 0.4 ⁇ M of each primer and 2.5 ng/ ⁇ l genomic DNA in a total reaction volume of 25 ⁇ l.
  • PCR conditions were: polymerase activation for lOmin at 95°C, annealing at 60°C for 45sec, extension at 72°C for lmin and denaturation at 94°C for 30sec for 35 cycles with a final extension of 3.5min at 72°C.
  • PCR reactions were analyzed by agarose gel electrophoresis using 1% or 1.5% gels. The hybrid cell panel was analyzed at least twice in each case to exclude PCT related errors. The data was submitted to the Jackson Laboratory Radiation Hybrid Database for analysis and mapped relative to known genomic markers
  • Table I Splice donor and acceptor sequences in the human TGM5 gene. Residues consistent with the splice site consensus sequence (MAG/GTRAG and YAG/G) are underlined.
  • Table III Apparent polymorphisms in the cDNA and genomic DNA sequences for TGx. The positions with nucleotide and amino acid variations are underlined.

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Immunology (AREA)
  • Chemical & Material Sciences (AREA)
  • Molecular Biology (AREA)
  • Hematology (AREA)
  • Biomedical Technology (AREA)
  • Urology & Nephrology (AREA)
  • General Health & Medical Sciences (AREA)
  • Microbiology (AREA)
  • Biotechnology (AREA)
  • Biochemistry (AREA)
  • Medicinal Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Genetics & Genomics (AREA)
  • Food Science & Technology (AREA)
  • Cell Biology (AREA)
  • General Physics & Mathematics (AREA)
  • Pathology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Analytical Chemistry (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Organic Chemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Rehabilitation Therapy (AREA)
  • Rheumatology (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention provides a nucleotide sequence comprising at least a portion of the nucleotide sequence of Fig. 10A, Fig. 6B or Fig. 10A or Fig. 10B; nucleotides which hybridise to the nucleotide sequences of Fig. 6A, Fig. 6B or Fig. 10A or Fig. 10B; nucleotides which are degenerate to the nucleotide sequences of Fig. 6A, Fig. 6B or Fig. 10A or Fig. 10B; all of which nucleotides encode a polypeptide having transglutaminase activity.

Description

Transglutaminase Gene Products
The present invention relates to the identification of novel Ixansgluta inase enzymes TGZ and TGY.
Transglutaminases are a family of structurally and functionally related enzymes that catalyze the post-translational modification of proteins via a Ca2+ dependant transferase reaction between the γ-carboxamide group of a peptide-bound glutamine residue and various primary amines. Most commonly, γ-glutamyl-ε-lysine cross links are formed in or between proteins by reaction with the ε-amino group of lysine residues. Analysis of the three-dimensional structure of the a-subunit of factor XIII showed that transglutaminases contain a central core domain containing enzymatic activity, and a N-terminal β-sandwich domain and two C-teπninal β-barrel domains, which are thought to be involved in the regulation of enzyme activity and specificity.
Seven different transglutaminase genes have been characterised in higher vertebrates on the basis of their primary structure (Aeschlimann, D, and Paulsson, M (1994) Thromb. Haemostasis 71: 402-415 Aeschlimann et ah (1998) J Biol Chem 273, 3542). Transglutaminases can be found throughout the body, but each transglutaminase is characterised by its own typical tissue distribution, although each may be present in a number of different tissue types often in combination with other transglutaminases. Transglutaminase gene products have specific functions in the cross linking of particular proteins or tissue structures. For review see Aeschlimann.and Paulsson (1994) (supra) and Aeschlimann and Tholmazy (2000) Connective Tissue Res. 41, 1-27. For example, factor Xllla stabilises the fibrin clot in haemostasis, whereas prostate transglutaminase (TGp)1 is involved in semen coagulation. Other transglutaminases have adopted additional functions such as the tissue transglutaminase (TGC), which is involved in GTP-binding in receptor signalling, and band 4.2 protein which functions as a structural component of the cytoskeleton. Four transglutaminases have been shown to be expressed during the different stages of epidermal growth and differentiation. Three of these, keratinocyte transglutaminase (TGK), epidermal transglutaminase (TGE) and TG , are associated with keratinocyte terminal differentiation and the cross-linking of structural proteins to form the cornified envelope. The fourth enzyme TGc, is expressed in skin primarily in the basal cell layer, and plays a role in the stabilisation of the dermo-epidermal junction. The importance of proper cross-hnking of the cornified envelope is exemplified by the pathology seen in patients suffering from a severe form of the skin disease referred to as congenital ichthyosis, which has been linked to mutations in the gene encoding TGK .
All transglutaminase enzymes appear to be encoded by a family of closely related genes. Alignment of these genes demonstrates that all members of the transglutaminase family exhibit a similar gene organisation, with remarkable conservation of intron distribution. Furthermore, phylogenetic analysis indicates that an early gene duplication event subsequently gave rise to two different transglutaminase lineages; one comprising TGc, TGE, and band 4.2 protein; the other, factor XHIa, TGK and possibly also TGP (Aeschlimann and Paulsson (1994) (supra)). The genes encoding TGK and factor XHIa have been mapped to human chromosome 14ql l.2 and chromosome 6p24-25 respectively, whereas TGc and TGE have been mapped to chromosome 20ql l, and TGP has been mapped to chromosome 3p21-22.
Comparison of the structure of the individual transglutaminase genes shows that they may be divided into two subclasses, wherein the genes encoding TGC, TGE, TGP and band 4.2 protein comprise 13 exons and 12 introns, and the genes encoding factor XHIa and TGK contain two extra exons. Exon IX of the former group is separated into two exons (X and XT) in TGK and factor XDIa, and the amino-terminal extensions of TGK and factor XDIa comprise an additional exon. However, except for the acquisition of an additional intron and the recruitment of an exon by the genes encoding factor X__Ha and TGK, the gene structure is remarkably conserved among all members of the transglutaminase gene family. Not only is the position of intron splice points highly conserved, but also the intron splice types. This similarity in gene structure and homology of the primary structure of the transglutaminases provides further support for the proposition that the different transglutaminase genes are derived from a common ancestral gene. The inventors have previously isolated a cDNA encoding a novel member of the trahsglutaminase gene family TGx, from human foreskin keratinocytes (Aeschilmann et al (1998) J. Biol. Chem., 273, 3452-3460). Two related transcripts with an apparent size of 2.2 and 2.8 kb were obtained. The deduced amino acid sequence for the full-length gene product encodes a protein with 720 amino acids and a molecular mass of 81kDa. A sequence comparison of TGx to the other members of the transglutaminase gene family revealed that the domain structure and the residues required for enzymatic activity and Ca2+ binding are conserved and show an overall sequence identity of about 35%, with the highest similarity being found within the enzyme's catalytic domain.
The inventors subsequently determined that TGx is the product of a ~35kb gene located on chromosome 15, comprising 13 exons and 12 introns. The intron splice sites were found to conform to the consensus for splice junctions in eukaryotes. The transcription initiation site is localised to a point 159 nucleotides upstream of the initiator methionine and the likely polyadenylation site is localised ~600 nucleotides downstream of the stop codon. The two mRNA isoforms are the result of alternative splicing of exon HI and give rise to 2 protein variants of TGx which comprise catalytic activity. TGx is expressed predominately in epithelial cells, and most prominently during foetal development, in epidermis and in the female reproductive system.
The inventors have now localised the TGM5 gene to chromosome 15ql5 by fluorescent in situ hybridisation. Band 4.2 protem has previously been mapped to this chromosomal region (Sung L. A. et al (1992) Blood 79: 2763-2770; Najfeld V. et al (1992) Am. J. Hum. Genet 50: 71-75) and has subsequently been assigned to position 15ql5.2 by expression mapping of the LGMD2A locus on chromosome 15 (Chiannikulchai N. et al (1995) Hum. Mol. Genet 4: 717-725). A short sequence encompassing the left arm of one of the YAC clones (926G102) used for expression mapping matched with the sequence of intron 12 of the TGM5 gene placing the genes encoding TGx and band 4.2 protein in close proximity on chromosome 15 (Fig. 5C). PCR with specific primers for 5' (exon I) or 3' (exon X_H) sequences of band 4.2 protein as well as southern blot analysis revealed that the BAC clones containing the TGM5 gene also contained the EPB42 gene and that the 2 genes are arranged in tandem. Further analysis by the inventors has recently led to the identification of two novel transglutaminase genes TGM7 and TGM6 which encode the proteins TGZ and TGY respectively. Alternative mRNA sequences of the TGM7 gene are given in Fig. 6A and Fig. 6B. The TGM7 derived mRNA (Fig. 6A and Fig. 6B) comprises an open reading frame of 2130 nucleotides and a polyadenylation signal (AATAAA) 158 nucleotides downstream of the termination codon (TGA). The deduced protein for TGZ consists of 710 amino acids. The deduced protein for TGZ from Fig. 6A has a molecular mass of 79, 908 Da and an isoelectric point of 6.7. The deduced protein for TGZ from Fig. 6B has a molecular weight of 80,065 and an isoelectric point of 6.6.
The TGM6 full length transcript (Fig. 10A) comprises an open reading frame of 2109 nucleotides. The deduced protein for the long form of TGY consists of 708 amino acids and has a calculated molecular mass of 79, 466 Da and an isoelectric point of 6.9. The transcript for the short form of TGY (Fig. 10B) comprises an open reading frame of 1878 nucleotides and the deduced protein consists of 626 amino acids with a molecular mass of 70, 617 Da and an isoelectric point of 7.6.
To analyse the relationship between the different transglutaminase genes, the inventors calculated their amino acid similarity based upon sequence alignments, and calculated their evolutionary distances using different algorithms. All the algorithms used predicted a close relationship between TGX, TGZ, TGY, TGE, band 4.2 protein and TGC, and factor XDIa and TGK, respectively. The grouping of TGX, TGZ, TGY, TGE, TGC, and band 4.2 protein in one subclass and factor XDIa and TGK in another is supported by the results of this analysis and by the gene structure and genomic organisation of the different transglutaminase genes.
The inventors have therefore determined the structure of the human TGM5 gene, and its flanking sequences, and have mapped the gene to the 15ql5 region of chromosome 15. Further, the inventors have determined that the human TGM5 gene comprises 13 exons separated by 12 introns spanning roughly 35kb, and that the structure of the TGM5 gene is identical to that of EPB42 (band 4.2 protein), TGM2 (TGc) and TGM3 (TGE) genes. Southern blot analysis has also shown that TGM5 is a single copy gene in the haploid genome. The inventors developed a method for detection and identification of transglutaminase gene products based on RT-PCR with degenerate primers and using this method have discovered the gene product of the TGM5 gene in kerarinocytes (Aeschlimann et al (1998) I. Biol. Chem. 273, 3452-3460). Using this method, the inventors have identified another new transglutaminase gene product in human foreskin keratinocytes and in prostate cacrinoma tissue which has been designated TGZ or transglustaminase type VT A full-length cDNA for this gene product was obtained by anchored PCR. Long range genomic PCR was used comprising different combinations of primers designed from the flanking sequences of the TGM5 - EPB42 gene sequence and the TGZ cDNA sequence to explore whether the gene encoding TG2 (TGM7) was present in close proximity to the other two transglutaminase genes. This placed the TGM7 gene approximately 9kb upstream of the TGM5 gene and demonstrated that the genes are arranged in tandem fashion (Fig. 5C). The inventors have therefore determined that the transglutaminase genes, TGM5 (TGx), TGM7 (TGz) and EPB42 (band 4.2 protein) are positioned side by side within approximately 100 kb on chromosome 15. It has also been found that the mouse homologues of these genes are similarly arranged on mouse chromosome 2. Finally, the inventors have identified and determined the nucleotide and amino acid sequences as well as tissue distribution for the novel fransglutaminase gene products TGz and TG .
According to a first aspect of the invention there is provided a nucleotide sequence comprising at least a portion of the nucleotide sequence of Fig. 6 A, Fig. 6B, Fig. 10A or Fig. 10B; a nucleotide sequence which hybridise to the nucleotide sequence of Fig. 6A, Fig. 6B, Fig. 10A or Fig. 10B; a nucleotide sequence which is degenerate to the nucleotide sequence of Fig. 6A, Fig. 6B, Fig. 10A or Fig. 10B; all of which nucleotide sequences encode a polyp eptide having transglutaminase activity.
Preferably the nucleotide sequence consists of the nucleotide sequence of Fig. 6A, Fig. 6B, Fig. lOA or Fig. 10B. The first aspect of the present invention also provides a nucleotide sequence which hybridises under stringent conditions to the nucleotide sequences of Fig. 6A, Fig. 6B, Fig. 10A or Fig. 10B and which encodes a polypeptide having fransgluta inase activity. Preferably the nucleotide sequence has at least 80%, more preferably 90%) sequence homology to the nucleotide sequence shown in Fig. 6A, Fig. 6B, Fig. 10A or Fig. 10B. Homology is preferably measured using the BLAST program.
The invention further provides a method of expressing a polypeptide comprising inserting a nucleotide sequence according to the first aspect of the present invention into a suitable host and expressing that nucleotide sequence in order to express a polypeptide having transglutaminase activity.
The invention also provides a vector comprising a nucleotide sequence according to the first aspect of the present invention.
According to another aspect of the invention there is provided a polypeptide having an amino acid sequence comprising at least a portion of the amino acid sequence of Fig. 6A, Fig. 6B, Fig. 10A or Fig. 10B, wherein the polypeptide has transglutaminase activity.
The invention also provides a polypeptide sequence which is at least 90% identical to the amino acid sequence of Fig. 6A, Fig. 6B, Fig. 10A or Fig. 10B and which has transglutaminase activity. The amino acid sequence of the polypeptide having fransglutaminase activity may differ from the amino acid sequence given in Fig. 6A, Fig. 6B, Fig. 10A or Fig. 10B by having the addition, deletion or substitution of some of the amino acid residues. Preferably the polypeptide of the present invention only differs by about 1 to 20, more preferably 1 to 10 amino acid residues from the amino acid sequence given in Fig. 6A, Fig. 6B, Fig. 10A or Fig. 10B.
The invention also provides a composition comprising the polypeptide of the present invention for use in transamidation reactions on peptides and polypeptides. The invention also provides a polypeptide comprising exons VD through to exon X of the sequence shown in Fig. 6A or Fig. 6B. The position of the exons on the sequence shown in Fig. 6A or Fig. 6B can be deteπnined from Fig. 8 where intron splice sites are marked with arrow heads.
According to a further aspect of the invention, there is provided a polypeptide comprising exons D through to exon IV or exons X through to exon XD of the sequence show in Fig. 10A or Fig. 10B. As indicated above, the positions of the exons on the sequence shown in Fig. 10A or Fig. 10B can be determined from Fig. 8.
According to another aspect of the invention there is provided a composition comprising the polypeptide according to the present invention for use in the cross-linking of proteins.
According to a further aspect of the invention there is provided a diagnostic method comprising detecting expression of the polypeptide according to the present invention in a subject or in cells derived from a subject.
The invention also provides an antibody directed against the polypeptide according to the present invention. The antibody may be any antibody molecule capable of specifically binding the polypeptide including polyclonal or monoclonal antibodies or antigen binding fragments such as Fv, Fab, F(ab')2 fragments and single chain Fv fragments.
The invention further provides a method of gene therapy comprising correcting mutations in a non wild type nucleotide sequence corresponding to the nucleotide sequence of Fig. 6 A, Fig. 6B, Fig. 10A or Fig 10B. Such gene therapy methods can be performed by homologous recombination techniques or by using ribozymes to correct small sequence mutation. Suitable techniques are well known to those skilled in the art. In accordance with a further aspect of the invention there is provided a method of diagnosis of autoimmune disease comprising taking a sample from a subject and testing that sample for the presence of a fransglutaminase encoded by the nucleotide sequence of Fig. 6A, Fig. 6B, Fig. 10A or Fig 10B or portions thereof. Preferably the fransglutaminase is detected by using an antibody having affinity for the transglutaminase.
The invention also provides a competitive protein binding assay for the differential diagnosis of autoimmune diseases comprising the detection of antibodies against the transglutaminase encoded by the nucleotide sequence of Fig. 6A, Fig. 6B or Fig.lOA or Fig 10B, or portions thereof.
Preferably the protein binding assay comprises using exogenous transglutaminase TGz or TGy, or both, as a competitive antigen.
The invention will now be described with reference to the accompanying Figures 1 to 11, in which:
Fig. 1 is a representation of the genomic organisation of the human TGM5 gene. The human TGM5 gene is represented with the exons numbered I to XDI indicated by solid boxes separated by the introns 1 to 12. The sizes of the introns and exons are given in bp (base pairs). The 5'- and 3'- untranslated regions in exon 1 and XDI, respectively, are represented by hatched boxes with functional elements defining the transcript indicated. Additional sequence elements found in the TGM5 gene are indicated as follows: Alu, Alu 7SL derived retroposon; STS, sequence tagged site. Below the genomic map, a representation of the sequences present in the individual BAG clones is depicted.
Fig. 2 is a representation of the structure of the 5' untranslated region of the human TGM5 gene and mapping of the transcriptional start site. A. Primer extension analysis of poly (A+) RNA isolated from primary human keratinocyte prior to (lane 1) or after (lane 2) culture in suspension for 12h. Extension products were separated on denaturing polyacrylamide gel alongside a Sanger dideoxynucleotide sequencing reaction of the appropriate genomic DNA fragment primed with the same oligonucleotide. The transcriptional start site is indicated by the arrow. B. Nucleotide sequence of the proximal 5' region of the TGM5 gene, 5' ends of mRNA from primary keratinocytes mapped by RACE are indicated by arrowheads. The major transcription start site identified by primer extension is highlighted with an asterisk (labelled +1). Consensus sequences for putative regulatory elements are underlined.
Fig. 3 is a representation of the structure of the 3' untranslated region of the human TGM5 gene. 3'-flanking sequence is shown with sequences homologous to known consensus sequences for 3' processing of transcripts (AATAAA, CAYTG and YGTGTTYY) underlined. The termination points of cDNA's isolated from human keratinocytes (Aeschlimann et al (1998) I. Biol Chem 273, 3453-3460) by 3' RACE are indicated by arrowheads. A pair of inverted long repeat sequences is highlighted in italics.
Fig. 4 is a southern blot analysis of human genomic DNA hybridised to genomic TGX probes. Human genomic DNA was digested with BamHl, EcoΕI, and Hinά restriction enzymes and hybridised with short 32P-labelled DNA fragments corresponding to intron 2 and flanking sequences of exon D and DI (left panel) and exon X (right panel), respectively. The migration positions of the HincRR DNA size markers is indicated on the left.
Fig. 5 shows the chromosomal localisation of the human TGM5 gene by fluorescence in situ hybridisation. A. representative picture of fluorescine-labelled genomic DNA of BAC-228(P20) (fluroescence, arrows) hybridised to metaphase spreads of human chromosome stained with propidium iodide. B. An ideogram of banded chromosome 15 showing the localisation of the fluorescent signal on 13 chromosomes. C. Is a schematic map of the respective locus showing the organisation of the genes encoding TGX (TGM5), band 4.2 protein (EPB4.2) and TGZ (TGM7) as well as other genes [mitochondrial ATPase subunit D pseudogene; D-type cyclin-interacting protein 1 (DIP1); EST Genbank AA457639, AA457640)], LI repetitive element and genetic markers.
Fig. 6 shows A. the nucleotide sequence and deduced amino acid sequence for human TGZ. B. an alternative nucleotide sequence and deduced amino acid sequence for human TGZ. The initiation and termination codons as well as polyadenylation signal (AATAAA) are underlined.
Fig.7 is a representation of the different tissue expression patterns for TGx, band 4.2 protein TGY and TGz in different fetal and mature human tissues. Human tissue Northern dot blot normalised for average expression of 9 different housekeeping genes probed with a fragment corresponding to the C-terminal β-barrel domains of TGx (A), TGz (B), (C) TGY and band 4.2 (D). A diagram showing the type of poly (A)+ RNA dotted onto the membrane is shown in panel E.
Fig. 8 is a comparison of the structure of the different human transglutaminase genes. A. is an alignment of the nine characterised human gene products (TGx, TG , TGZ (shown in Fig. 6A), TGc, TGE, band 4.2, factor XDI a-subunit, TGK, TGP;) is shown, with dashes indicating gaps inserted for optimal sequence alignment and underlined residues representing amino acids conserved in at least five gene products. The sequences are arranged to reflect the transglutaminase domain structure, based on the crystal structure of factor XID a-subunit. N-terminal propeptide domain (dl), β-sandwich domain (d2), catalytic core domain (d3) and β-barrel domains 1 (d4) and 2 (d5) (from top to bottom). Known intron splice sites are marked by '. B. is an alignment of the nine characterised human gene products (TGx, TGY, TGZ (shown in Fig. 6B), TGC, TGE, band 4.2, factor XDI a-subunit, TGK, TGP,) is shown, with dashes indicating gaps inserted for optimal sequence alignment and underlined residues representing amino acids conserved in at least five gene products. The sequences are arranged to reflect the transglutaminase domain structure, based on the crystal structure of factor XDI a-subunit. N-terminal propeptide domain (dl), β-sandwich domain (d2), catalytic core domain (d3) and β-barrel domains 1 (d4) and 2 (d5) (from top to bottom). Known intron splice sites are marked by arrowheads.
Fig. 9 is a phylogenetic tree of the transglutaminase gene family and genomic organisation of the genes in man and in mouse. Sequences were aligned to maximise homology as shown in Fig. 8 except including sequences from different species as available: h, human; m, mouse; r, rat. Note, the mouse sequence for TGX 3 is at present incomplete and no information is available for the N-terminal domain. In panel A5, a hypothetical pedigree for the gene family is given that is consistent with the data on the sequence relationship of the individual gene products (A) as well as with the data on the gene structure and genomic organisation (B). Phylogenetic trees based on the amino acid sequence homology of the gene products have been constructed using the NJ method (Saitou and Nei, 1987) of the PHYLDP software package for (1) the N-terminal β-sandwich domain (2) the catalytic core domain, (3) the C-terminal β-barrel domains, and (4) the entire gene products, (C). Shows the similarity of TGx to the other transglutaminase gene products. The domain structure is based on the X-ray crystallo graphic structure of the factor XDI a-subunit dimer and inferred on the other gene products based upon the sequence alignment shown in Fig. 8. The numbers reflect % sequence identity.
Fig. 10 shows the nucleotide sequence and deduced amino acid sequence of TGY. A. Shows the nucleotide and deduced amino acid sequence for the long form of TGY. B. Shows the nucleotide sequence and deduced amino acid sequence for the short form of TGY.
Fig. 11 is a schematic representation of the organisation of the identified fransglutaminase gene clusters in the human genome.
Isolation and Determination of the Structure of the Human TGM5 Gene.
A unique insertion sequence of about 30 amino acids between the catalytic core domain and β-barrel domain 1 found in TGX was used as a template to design specific primers for the screening of a human genomic library. The characterisation of several genes of the transglutaminase gene family showed that the positions of the introns has been highly conserved and a comparison of the TGx sequence to the sequences of the other transglutaminases indicated that this unique sequence is present within an exon, exon X (see Fig. 8, aa 460-503) in TGX. A PCR reaction from human genomic DNA using oligonucleotides PI and P2 that match sequences at either end of this unique segment yielded a DNA fragment of expected size which was confirmed to be the correct product by sequencing (results not shown). Screening of a human genomic DNA BAC library by PCR using these oglionucleotides revealed two positive clones, BAC-33(P5) and BAC-228(P20) that were subsequently shown by Southern blotting with different cDNA probes to contain sequences spanning at least exon D to exon X of the TGM5 gene (results not shown). Restriction analysis further indicated that each of the BAC clones contained substantially more than 50kb of human genomic DNA.
The similarity in the gene structure of the different transglutaminase genes prompted us to approach the characterisation of introns by PCR amplification using oligonucleotide primers corresponding to the flanking exon sequences at the presumptive exon/intron boundaries. All intron/exon boundaries were sequenced from the PCR products obtained in at least two independent PCR reactions, where applicable from both BAC clones, to exclude mutations introduced by Taq DNA polymerase, and the results compared. When sequences of PCR products comprising adjacent introns had no overlap, the intervening sequence (exon sequence) was determined by direct sequencing from isolated BAC plasmid DNA to confirm the absence of additional introns. Similarly, the 3'-unfranslated region was obtained by step-wise extension of the known sequence using direct sequencing of BAC plasmid DNA. Both BAC clones terminated short of exon I and all attempts at isolating clones spanning exon I by screening of BAC and PI libraries failed. Exon I and intron 1 sequences were finally derived by nested PCR from human genomic DNA using conditions optimised for long range genomic PCR.
We established that the TGM5 gene comprises approximately 35kb of genomic DNA and contains 13 exons and 12 introns (Fig. 1). All intron/exon splice sites conform to the known GT/AG donor/acceptor site rale and essentially to the consensus sequence proposed by Mount S.M (1982) Nucleic Acids Res. 10, 459-472. (Table I). A sequence homologous to the branch point consensus CTGAC (Keller E. B and Noon. W. A (1984) Proc. Natl. Acad. Sci. USA 81: 7417-7420) was found 24 to 44 nucleotides upstream of the 3' splice site in introns 1, 3-6, and 9-12. The size of the introns varied considerably, ranging from 106bp to more than 6kb (Fig. 1, Table II). The sequence obtained from the two different BAC clones matched with the exception of a deletion spanning the sequence from intron 6 to intron 8 in BAC-33(P5) (Fig. 1).
Further, we also resequenced the entire coding sequence of TGx and found 3 point mutations as compared to the previously reported cDNA sequence (Aeschlimann. et al (1998) J. Biol Chem 273 3452-3460) One of the nucleotide exchanges is silent, the other two result in an amino acid exchange (Table ID). The first two mutations were found in both BAC clones, the third was only present in BAC-228(P20) due to the deletion in the other BAC clone. These differences may result from sequence polymorphisms in the human gene pool as there was no ambiguity of the cDNA-derived sequence in this position determined from multiple independently amplified PCR products. However, the fact that a serine and alanine residue are changed into a proline and glycine residue that constitute the conserved amino acid in these positions in the transglutaminase protein family (see Fig. 8, aa 67 and aa 352 in TGX) suggested that these may have been PCR-related mutations in the cDNA sequence. To clarify this issue, we have prepared cDNA from human foreskin keratinocytes from different individuals, amplified full-length cDNA with high fidelity DNA polymerase, and sequenced the respective portions of the cloned cDNAs. The data confirmed that allelic variants exist with differences in these positions (Table ID).
The isolation and sequencing of cDNAs encoding TGx and Northern blotting with TGX cDNA probes revealed expression of at least two differentially spliced mRNA transcripts for TGx in human keratinocytes. Solving the gene structure confirmed the short form of TGx to be the result of alternative splicing of exon ID as predicted. A third isolated cDNA that differed also at the exon Dl/exon IV splice junction turned out to be the result of incomplete or absent splicing out of intron 3 as the sequence upstream of exon IV in the cDNA matched with the 3' sequence of intron 3. Exon 3 encodes part of the N-terminal β-barrel domain of TGx and the absence of the sequence encoded by exon 3 is expected to result in major structural changes in at least this domain of the protein. Nevertheless, expression of TGX in 293 cells using the full-length cDNA resulted in synthesis of two polypeptides with a molecular weight consistent with the predicted products from the alternatively spliced transcripts (results not shown).
Initially, 5' RACE was used to determine the 5' end of TGX cDNAs. Transcripts starting 77,96 and 157 nucleotides upstream of the initiator ATG were isolated in addition to the previously described shorter transcript (Fig. 3B, arrowheads). All of these transcripts were recovered repeatedly in independent experiments. Finally, primer extension experiments located the major transcription initiation site used in keratinocytes 157 nucleotides upstream of the translation start codon (Fig. 3). The proximal promoter region was analysed for potential binding sites of transcription factors using Matϊnspector (Genomatix, Munich Germany) and GCG (Genetics Computer Group, Inc., Madison, Wisconsin) software packages. No classical TATA-box sequence was found but a number of other potential transcription factor binding sites could be identified (Fig. 3b), suggesting that TGx promoter is a TATA-less promoter. Interaction of C/BP (may bind to CAAT-box), nuclear factor I (NF1) and upstream stimulatory factor (USF) to form a core proximal promoter has been demonstrated in a number of TATA-less genes. c-Myb is found in TATA-less proximal promoters of genes involved in hematopoiesis and often interacts with Ets-factors, and these sites may be operative in the expression of TGx in hematopoetic cells, e.g HEL cells API, Ets and SPl elements are typically found in keratinocyte-specific genes and may be involved in transcriptional regulation in keratinocytes. Several API sites are present within 2.5kb of the upstream sequence and could interact with the proximal API factor for activation. SPl sites are properly positioned upstream of the start points of the shorter transcripts raising the possibility that these could also be functional, though to a lesser degree.
The last exon, exon XDI, contained a consensus polyadenylation signal AATAAA-600bp downstream of the termination codon (Fig. 3). This is in good agreement with the size of the mRNA (2.8kb) encoding full-length TGX expressed in human keratinocytes as detected by Northern blotting considering the length of the coding sequence (2160bp.). A CAYTG signal that binds to U4 snRNA which is identical for 4 out of 5 nucleotides is present in tandem in 3 copies 7 nucleotides downstream of the polyadenylation signal. A close match (YCTGTTYY) of another consensus sequence YGTGTTYY that is found in many eukaryotic transcripts and provides a signal for efficient 3' processing is present 46 nucleotides downsfream of the polyadenylation signal. However, we have previously reported that all cDNAs isolated by RT-PCR with an oligo(dT) oligonucleotide from human keratinocytes ended within 9 to 34 nucleotides downstream of the pentanucleotide ATAAA at position 2169. It has been shown that this pentanucleoditde functions as a polyadenylation signal and these shorter transcripts are selectively enhanced by PCR amplification because of the smaller size of the PCR product. Chromosomal Localisation of the TGM5 Gene.
To address the genomic organisation and identify the chromosomal localisation of the TGM5 gene or genes in the human genome, we performed Southern blot analysis of human genomic DNA cut with Bamt L, EcoRI and HindHf restriction enzymes using probes derived from intron 2 as well as from the sequence encoded by exon X that is unique to TGx (Fig.4). Bands of 4.5, 6.0,10.5, 4.3, 9.3 and 2.6kb were revealed with the respective probes. The simple pattern of restriction fragments hybridising with the probes indicated that the haploid human genome contains only one TGM5 gene.
The TGM5 gene was subsequently localised to chromosome 15 by fluorescent in situ hybridisation of human metaphase chromosome spreads using genomic DNA derived from either BAC clone as a probe (Fig. 5A). A comparison of the probe signal to the DAPI banding pattern localised the TGM5 gene to the 15ql5 region. The localisation was subsequently refined by determining the distance of the fluorescent signal to the centromere as well as to either end of the chromosome on 13 copies of chromosome 15 and expressing it as a fractional distance of the total length of the chromosome. These measurements placed the TGM5 gene close to the centre of the 15ql5 region, i.e. to the 15ql5.2 locus (Fig. 5B).
TGM5 is part of a cluster of transglutaminase genes.
The EPB42 gene has previously been assigned to locus 15ql5.2 on chromosome 15. This raised the possibility that the EPB42 gene may be arranged in tandem with the TGM5 gene. Indeed, PCR with specific primers for sequences derived from the 5' and 3' of the EPB42 gene yielded products of appropriate size from both, BAC-33(P5) and BAC-228(P20) (results not shown) and sequencing confirmed the identity of the PCR products. Southern blotting of BAC plasmid DNA with cDNA probes comprising the 5' or 3' end of the EPB42 gene and subsequent comparison of the pattern of labelled restriction fragments with that of the TGM5 gene allowed us to map this locus in more detail. The EPB42 gene and TGM5 are arranged in the same orientation being spaced apart by ~1 lkb (Fig. 5C) approximately 30% of which was sequenced to characterise the 3' and 5' flanking UTR of the TGM5 and EPB42 gene, respectively. To analyse the relationship between the most closely homologous genes of the fransglutaminase gene family (TGM7, TGM5 and EPB42 on human chromosome 15ql5.2 and TGM2 and TGM3 on chromosome 20qll/12), we mapped the respective mouse genes using radiation hybrid mapping. All genes mapped to the distal part of mouse chromosome 2. The genes for tg7?.7, tgm5, and epb42 showed a best fit location for the segment defined by D2Mitl04 proximal and D2Mit305 (66.9cM) and an LOD of >20 to D2Ertd616e (69.0cM). This is in good agreement with the assigned locus 67.5cM distal from the centromere, White, R.A., et al., (1992) Nat. Genet. 2, 80-83. This tgm.3 gene showed a best fit location for the segment defined by D2Mit447 proximal and D2Mit258 distal, with a highest LOD of 14.8 and 12.2 to D2Mit258 (78.0cM) and D2Mit338 (73.9cM), respectively. The tgm2 gene showed a best fit location for the segment defined by D2Mitl39 proximal (86.0cM) and D2Mit225 distal (91.0cM), with a highest LOD of 17.0 to the anchor market D2Mit287, consistent with it's assigned locus 89.0cM from the centromere, Νanda, Ν., et al., (1999). Arch. Biochem. Biophys. 366, 151-156.
We developed a method for detection and identification of transglutaminase gene products based on RT-PCR with degenerate primers and using this method discovered the gene product of the TGM5 gene in keratinocytes. Using this same method, we have identified another new transglutaminase gene product in human foreskin keratinocytes and in prostate carcinoma tissue which we designated TGZ or fransglutaminase type VD. A full-length cDΝA for this gene product was obtained by anchored PCR (see below). We used long range genomic PCR with different combinations of primers designed from the flanking sequences of the TGM5 - EPB42 gene segment and the TGZ cDΝA sequence to explore whether the gene encoding TGZ, TGM7, was present in close proximity to the other two fransglutaminase genes. This placed the TGM7 gene approximately 9kb upstream of the TGM5 gene and demonstrated that the genes are arranged in tandem fashion (Fig. 5C). The 5' UTR of the TGM5 gene was sequenced (Fig. 2). The genes encoding TGC, TGE, which are more closely related to the TGM5, TGM7, and EPB42 genes than the other fransglutaminase genes based on amino acid sequence comparison and similarity in gene structure (Fig. 9) have been mapped to human chromosome 20qll (Gentile V. et al (1994) Genomics 20, 295-297; Kim I. G. et al (1994) I. Invest. Dermatol 103, 137-142). The syntenic regions of the 15ql5 and 20qll locus in the mouse are present in a short segment, 2F1-G, of chromosome 2, which puts all five fransglutaminase genes in proximity (Fig. 9B). The mouse homologue of band 4.2 protein has been mapped to this region of mouse chromosome 2 (White R. A. et al (1992) Nat. Genet 2, 80-83). We have isolated a BAC clone containing the gene encoding the mouse homologue of TGX and shown that the tgm5 gene is located next to the epb42 gene in tandem fashion, similar to the organisation in the human genome. Furthermore, this clone was shown to contain the gene encoding the mouse homologue of TGZ. Genomic sequences derived from this BAC clone and also cDNA sequences derived from cDNA prepared from mouse uterus showed that the mouse and human gene products are 85% identical on the nucleotide level. To analyse the relationship between the transglutaminase genes in more detail, we calculated the amino acid similarity (Fig. 9C) based on the sequence alignment shown in Fig. 8 and calculated evolutionary distances using different algorithms (Fig. 9A). All algorithms predicted a close relationship between TGx and TGE, and band 4.2 protein and TGc, raising the possibility that a single transglutaminase gene initially locally duplicated to generate a cluster of 3 genes, followed by a duplication of a larger segment of the chromosomal region, gave rise to the organisation of the genes in mouse. In humans these chromosomal regions were apparently redistributed to two different chromosomes. This hypothesis led us to spectulate on the existence of an additional gene on human chromosome 20qll. Careful analysis of the chromosomal sequences of this locus derived by the human genome project revealed the presence of a candidate gene, TGM6, located approximately 45kb downstream of the TGM3 gene consistent with our hypothesis (Fig. 11). To confirm that this is in fact a functional gene and not a pseudogene, we screened a large number of cell lines for expression of a respective gene product by PCR. A corresponding gene product, TGY, or transglutaminase type VI could be identified in a small cell lung carcinoma cell line and a full-length cDNA was subsequently derived by anchored PCR.
Determination of cDNA and amino acid sequences of TGM6 (TGy) and TGM7 (TGZ) gene products
A full-length cDNA sequence for TGZ was obtained by anchored PCR using oligo(dT)-Not I primed cDΝA prepared from human foreskin keratinocytes, prostate carcinoma tissue and human carcinoma cell line PC3, essentially following the strategy previously described (Aeschlimann et al (1998) J Biol Chem 273: 3245-3460). The oligo(dT)-Not I primer was used as the anchoring primer to obtain the 3' end of the cDNA. 5' RACE was used to determine the 5' end of the cDNA. The obtained sequence information (Fig. 6) contained an open reading frame 2130 nucleotides and a polyadenylation signal (AATAAA) 158 nucleotides downstream of the termination codon (TGA). The deduced protein consists of 710 amino acids. The cDNA and amino acid sequence in Fig. 6A was first determined and the deduced protein has a calculated molecular mass of 79,908 Da and an isoelectric point of 6.7. The cDNA and amino acid sequence in Fig. 6B was then determined. This sequence differs by a few nucleotides and amino acids from the sequence given in Fig. 6A. The protein deduced from the sequence given in Fig. 6B has a calculated molecular mass of 80,065 and an isoelectric point of 6.6. A number of aberrantly spliced gene products were isolated which lacked part of exon IX (5 'end) or retained the whole or part of intron 11. These products are unlikely to be of physiological significance but may point out that splicing of certain introns in this gene is a difficult and inefficient process.
A full-length cDNA sequence for TGY was obtained by PCR using oligo(dT) primed cDNA prepared from the lung small cell carcinoma cell line H69, and using sequence specific primers based on the presumptive transcribed genomic sequence. 5' RACE was used to determine the 5' end of the cDNA. The obtained sequence information for the long form of TGY (Fig. 10A) contained an open reading frame of 2109 nucleotides. The deduced protein for the long form of TGY consists of 708 amino acids and has a calculated molecular mass of 79,466 Da and an isoelectric point of 6.9. A shorter transcript was also isolated which apparently resulted from alternative splicing of the sequence encoded by exon Xπ. The absence of exon XD results in a frame shift and thereby in premature teimination within exon XDI. The obtained sequence information for the short form of TGY (Fig. 10B) contained an open reading frame of 1878 nucleotides. The deduced protein for the short form of TGY consists of 626 amino acids and has a calculated molecular mass of 70,671 Da and an isoelectric point of 7.6. The sequence alterations due to the splicing result in a short protein which terminates just after the first C-terminal β-barrel domain. The β-barrel domains have been implicated in the regulation of enzyme-substrate interaction, and the lack of the second C-terminal β-barrel domain (see Fig. 8, d5) is likely to be of biological significance. The catalytic mechanism of transglutaminases has been solved based on biochemical data available for several transglutaminases and the X-ray crystallographic structure of the factor XDI a-subunit dimer. The reaction center is formed by the core domain and involves hydrogen-bonding of the active site Cys to a His and Asp residue to form a catalytic triad reminiscent of the Cys-His-Asn triad found in the papain family of cysteine proteases. The residues comprising the catalytic triad are conserved in TGY (Cys276, His335, Asp358) and TGZ (Cys227, His336, Asp359) (Fig. 8) and the core domain shows a high level of conservation as indicated by a sequence identity of about 50%> between these gene products and the other transglutaminases (Fig. 9). A Tyr residue in barrel 1 domain of the a subunit of factor XDI is hydrogen-bonded to the active site Cys residue and it has been suggested that the glutamine substrate attacks from the direction of this bond to initiate the reaction based on analogy to the cysteine proteases. In TGY, the Tyr residue is conserved (Tyr 540) while in TGZ the Tyr residue has been replaced by His538 similar to TGX (Fig. 8). This is expected to be a conservative change which is supported by our data demonstrating that recombinant TGX from 293 cells has transglutaminase activity. Crystallization experiments with factor XDIa further indicated that 4 residues are involved in binding of Ca2+-ion, including the main chain carbonyl of Ala457 and the side chain carboxyl groups of Asp438, Glu485, and Glu490. All three acidic residues are conserved in TGY and in TGZ (Fig. 8). None of the residues critical to enzyme function are affected by the alternative splicing of TGY. Based on the preservation of critical residues for enzyme function and domain folding and the extensive overall similarity of the TGY isoforms and TGZ to the other members of the fransglutaminase protein family, it can be predicted that the characterized cDNAs are encoding active transglutaminases.
Tissue Expression Patterns for TG , TGy and TGZ.
We have previously shown that TGx is expressed in a number of different cell types (Aeschlimann et al (1998) I. Biol. Chem. 273, 3452-3460).. To obtain a more complete picture on the expression of TGx and the novel gene products, we performed a dot blot Northern blot analysis of more than 50 adult and fetal human tissues. Band 4.2 protein was expressed at high level in bone marrow and fetal spleen and liver, consistent with its role in hematopoietic cells, and virtually undetectable in all other tissues, hi contrast, TGx, TG and TGz showed widespread expression at low level, with highest levels of TGx, TGY and TGz mRNA present in the female reproductive system, in the central nervous system, and in testis, respectively (Fig. 7).
RT-PCR analysis on human cell lines and tissues shows that TGz is expressed in osteosarcoma cells (MG-63), dermal fibroblasts (TI6F, HCA2), erythroleukemia cells (HEL), in primary keratinocytes, mammary epithelium carcinoma cells (MCF7), HELA cells, skin, brain, heart, kidney, lung, pancreas, placenta, skeletal muscle, fetal liver, prostate and in prostate carcinoma tissue. A similar analysis for TGY revealed expression only in a lung small cell carcinoma cell line (H69) and extremely low levels of expression in tissues.
In conclusion, TGZ is expressed widely in cells and tissues and expression levels are not apparently affected by cellular differentiation, (i.e keratinocyte differentiation or fibroblast senescence). TGY expression, on the other hand, was very restricted and expression was only found in H69 cell line. This cell line has characteristics of neuronal cells such as the expression of neuron-specific enolase and brain isozyme of creatine kinase which together with widespread expression in tissues of the nervous system suggests that TGY expression may be specific to neuronal cells. Transglutaminase action has been implicated in the formation of aberrant protein complexes in the central nervous system leading to nerve cell degeneration, e.g in Alzheimers and Huntington's disease. Based on its expression pattern, TGY is a logical candidate to bring about the underlying fransglutaminase-related pathological changes.
Reagents
Oligonucleotides were from Oligos. Etc. Inc. (Wilsonville, OR) or life technologies and restriction enzymes from Promega Corp. (Madison, WI).
Genomic Library Screening
A human BAC library established in a F-factor-based vector, pBeloBAC 11, and maintained in E. coli DH10B was screened by PCR (Genome Systems, Inc., St. Louis, MO). A 147bp DNA fragment unique to TGx was amplified from lOOng of genomic DNA in lOOμl of lOmM Tris/HCl, pH 8.3, 50mM KC1 containing 2mM MgCl2, 0.2mM dNTPs using 2.5 units of Tag DNA polymerase (Fisher Scientific Corp. Pittsburgh, PA) and 50pmol of upstream primer Pl,5'- CCACATGTTGCAGAAGCTGAAGGCTAGAAGC and downsfream primer P2, 5*-CCACATGTCCACATCACTGGGTCGAAGGGAAGG. PCR cycles were 45 sec at 94°C (denaturation), 2 min at 60°C (annealing), and 3 min at 72°C (elongation) for a total of 37 cycles, with the first cycle containing an extended denaturation period (6 min) during which the polymerase was added (hot start), and the last cycle contained an extended elongation period (10 min). Two positive clones were identified, BAC-33(P5) and BAC-228(P20) (Genome Systems), and their identity verified by Southern blotting. Plasmid DNA was prepared using a standard alkaline lysis protocol. 2μg plasmid DNA was restricted with BamHL, EcoRI, and Spel and probed with a 3 P-labelled-500bp NcoVBspHl and ~600bρ spHI/Ndeϊ cDNA fragment of TGX, respectively, as described below.
Amplification of TGM5 Intron Sequences
PCRs were carried out with 2.5 units of Tαq DNA polymerase (Fisher Scientific) and
100-200ng of plasmid DNA from BAC clones in lOOμl of lOmM Tris/HCl, pH 8.3 50mM
KC1 containing 2mM MgCl2, 0.2mM dNTPs and 50pmol of the desired oligonucleotide primers. The PCR cycles were 45 sec at 94°C (denaturation), 1 min at 60°C (annealing), and 5 min at 72°C (elongation). A total of 32 cycles were carried out, with the first cycle comprising an extended denaturation period (6 min) during which the polymerase was added (hot start) and the last cycle comprised an extended elongation period (10 min). The following oligonucleotides were used as upstream and downsfream primers, respectively, in the individual reactions: intron 2, 5'-GGACCACCTGCTTGTTCGCCGGGG,5'-AGGGGCTGGGGCTGTGATGGCGTG; intron S^'-ACCTCTTGAAAATCCACATCGACTCC S'-CAGTTCTTGCTGCCTTGGTAGATGAAGCC; intron 4,5'-GACAGTGAACCCCAGAGGCAGGAG, 5'-TCTGTGGCTGGGTCAGTCTGGAAGTGCA (P3); intron S^'-GCCTGCACTTCCAGACTGACCCAGCCACA, 5'-TCCAGTTTCCATTGAGCACCCCA; intron 6,5'-TGCTGGGTCTTTGCTGCCGTCATGTGC, 5'-TCCTTCTTCTTATTCCCCAAAATCCTGCC; intron 7, 5'-TAGATGAGTATTATGACAACACAGGCAGG, 5'-GCGTCCAGCACCTGCCAGCCTCC,- intron 8,5'-TGAGTGCTGGATGGCCCGGAAGG, 5'-CCCGCTCGTCACTCTGGATGCTC; intron 9, 5'-TTCACCAGGACACGAGTTCTGTTGGCA, P2 (see above); intron 10, PI (see above), 5'-TCAGGACTGCTTTTCTCTTCACCC; intron 11, 5'-ACCCCTGCAAAATCTCCTATTCCC, 5'-AATATCACCTGTATGGAGAGTGGCTGG; intron 12, 5'-TTGAGGACTGTGTGCTGACTGTGGM 5'-AATGATGCTTGCTTGGTGTTGGGG.
PCR's were carried out with 1.25 units of Pfu Turbo DNA polymerase (Sfratagene) and 260ng genomic DNA in a total of lOOμl of supplied reaction buffer supplemented with 0.2mM dNTPs, 2μl DMSO and 50 p ol primers. The PCR cycles were 45 sec at 94°C (denaturation), 1 min at 68°C (annealing), and 2 min at 72°C (elongation). A total of 37 cycles were made, with the first cycle containing an extended denaturation period (6 min) during which the polymerase was added (hot start), and the last cycle containing an extended elongation period (10 min).
Rapid Amplification of 5'-mRNA End
A modified RACE protocol was used to determine the transcription start site and obtain additional sequence information of exon I. Double stranded cDNA was prepared from poly(A+) RNA of cultured normal human keratinocytes (Aeschilmann et al (1998) J. Biol Chem. 273, 3452-3460) with the Copy Kit (Invitrogen, San Diego, CA). The cDNA was purified from nucleotides using the GlassMax DNA Isolation Kit (Life Technologies, Inc.) and tailed in the presence of 200μM dCTP with 10 units of terminal deoxynucleotidyl transferase (Promega) for 30 min at 37°C to anchor the PCR at the 5'-end. The PCR reaction was anchored by performing a total of 5 cycles of one-sided PCR at a lower annealing temperature (37°C) with the abridged anchor primer (Life Technologies, Inc.) only. Following transfer of 25% of this reaction at 94°C to a new tube containing abridged anchor primer and TGx-sρecific primer P3 (see above), the first round of amplification was carried out for a total of 37 cycles under the conditions described above except for annealing which was carried out at 55°C. Nested PCR was done with the universal amplification primer (Life Technologies, Inc.) and TGx-specific primer P4, 5'-TGAAGTACAGGGTGAGGTTGAAGG, as described above (annealing at 60°C) using 1.0 μl from the first round PCR.
Primer Extension Analysis
Oligonucleotide P5 5'-CATGGTAGCTGCCTCCGGTTCCTG containing a 5'-infrared label (IRD 800) was purchased from MWG Biotech (Ebesberg, Germany). Primer P5 (5.3pmol) was hybridised to lμg of poly (A+)RNA from primary keratinocytes (Aeschlimann et al (1998) J. Biol. Chem. 273, 3452'3460) and reverse transcription performed with 200 units of Superscript D RNAse H reverse transcriptase (Life Technologies) in a total of 20μl for 90 min at 42°C according to the manufacturer's instructions. Enzyme was heat inactivated and primer extension products extracted with phenol chloroform, precipitated with ethanol, and then analysed on a 4.5% denaturing polyacrylamide gel adjacent to dideoxynucleotide chain termination sequencing reactions (Thermo Sequenase Cycle Sequencing Kit; Amersham) derived from a double-stranded genomic DNA fragment using the same primer.
DNA Preparation and Sequencing
Plasmid DNA from BAC clones was further purified for direct sequencing by digestion with 200μg/ml of RNase A (Sigma, St. Louis, MO) for lh at 37°C and by subsequent micro-dialysis using Spectra/Por 2 membranes (Spectrum Medical Industries, Inc. Laguana Hills, CA). PCR produts were gel purified using the QIA quick Gel Extraction Kit (Qiagen, Inc. Chatsworth, CA) for sequencing. Cycle sequencing was performed by the dideoxy chain termination method using the Cyclist Exo-P_ ω DNA Sequencing Kit (Sfratagene, LaJolla, CA) and pre-cast 6% polyacrylamide gels with the CastAway Sequencing System (Sfratagene) or using the dRhodamine Terminator Cycle Sequencing Ready Reaction Kit (PE Biosystems) and an ABI 310 automated sequencer.
Southern Blotting
18μg human genomic DNA was digested with BamΗI, EcoRI, and Hind restriction enzymes, separated in a 0.8%> agarose gel and transferred to a Zeta-probe membrane (Bio-Rad, Labs. Hercules, CA). The gel was calibrated using the Lambda DNA/HindΩI markers (Promega). 32P-labelled probes were prepared by random prime labelling using the Multiprime DNA Labelling System (Amersham, Int. Amersham, UK) and PCR products corresponding to intron 2, intron 12, and exon X (see above) as DNA templates. Probes were hybridised to the blot overnight at 65°C in 500 roM NaH2PO4, pH 7.5, containing I M ΕDTA and 7% SDS. The membrane was washed at 65°C to a final stringency of 40mM NaH2PO4, pH 7.5, ImM ΕDTA, and 1% SDS, and the result developed by exposure of the membrane to BioMax MR film (Eastman Kodak, Rochester, NY). Chromosomal Localisation
Human peripheral blood lymphocytes were used to prepare metaphase chromosome spreads (Bebbington C. R. and Hentschel, C. C. G. (1987) in DNA cloning (Volume AT) 184-188, 1RL Press, Oxford UK). Cells were cultured in PB-Max Karyotyping medium (Gibco, BRL, Gaithersburg, MD) for 72h, and synchronised by culture in the presence of 10"7M amethophterin (Fluka) for another 24h. Cells were released from the mitotic block by extensive washing and subsequent culture in the above medium containing 10"5M thymidine for 5h. Cells were subsequently arrested in metaphase by addition of colcemid to a final concentration of O.lμg/ml (Gibco BRL). Harvested cells were incubated in 0.075M KCP for 25 min at 37°C, fixed in methanol/acetic acid (3:1) solution, and chromosome spreads prepared by dropping the cells onto the glass slides. After air drying, chromosomes were treated with lOOμg/ml of RNase A in 2x SSC for lh at 37°C, denatured in 70% (v/v) formamide in 2x SSC for 3 min at 75°C, and dehydrated in a graded ethanol series. DNA probes were prepared by random prime labelling of plasmid DNA of BAC-33(P5) and BAC-228(P20) with fluorescine-conjugated dUTP using the Prime-It Fluor Fluorescence Labelling Kit (Sfratagene). Probes were denatured at 75°C for 10 min in hybridisation buffer consisting of 50% formamide (v/v) and 10%o dextran sulphate (w/v) in 4x SSC and prehybridised at 42°C for 20 min to 0.2 μg/ml human competitor DNA (Sfratagene) to block repetitive DNA sequences. Probes were subsequently hybridised to the chromosome spreads at 37°C overnight, followed by washing to a final stringency of O.lx SSC at 60°C. Spreads were mounted in phosphate-buffered glycerol containing 200 ng/ml propidium iodide to counterstain chromosomes. Slides were examined by epifluorescence microscopy using a lOOx objective and images captured with a DC-330 CCD camera (DAGE-MTI, Inc. Michigan City, IN) using a LG-3 frame grabber board (Scion Corp. Frederick, MD) in a Mclhtosh 8500 workstation and a modified version of the NTH image 1.6 software (Scion Corp.). Images representing fluorescine-labelling and propidium iodide staining of the same field were superimposed using Adobe Photoshop 3.0 (Adobe Systems, Inc. Mountain View, CA) to map the gene to a chromosomal region.
Cloning of Novel Transglutaminase Gene Products by Anchored PCR
For cloning of TGY, poly(A)+RNA was prepared from about 106 H69 cells (American Type Culture Collection, Rockville, MD) by oligo(dT)-cellulose column chromatography using the Micro-Fast Track Kit (Invifrogen, San Diego, CA) and recovered in 20μl lOmM Tris HCI, pH 7.5. The ρoly(A)+RNA (5.0μl) was reverse transcribed into DNA in a total volume of 20μl using the cDNA Cycle Kit (Invitrogen) with l.Oμl oligo(dT) primer (0.2μg/μl). Overlapping fragments of TGY were amplified by PCR using ohgonucleotides 5'-ATCAGAGTCACCAAGGTGGAC, 5'-AGAAACACATCGTCCTCTGCACACC (P6), 5 '-CAGGCTTTCCTCTCACCGCAAACAC, 5'-CGTACTTGACTGGCTTGTACCTGCC,
5'-TCTACGTCACCAGGGTCATCAGTGC, 5'-GCCTGTTCACCGCCTTGCTGT, 5'-CATCACTGACCTCTACAAGTATCC, 5'-ACGGCGTGGGATTCATGCAGG, 5'-CATCCTCTATACCCGCAAGCC, and
5'-AGGTTGAGGCAGGATTAACTGAGGCCTC. PCRs were carried out with 1.25 units of AmpliTaq Gold DNA polymerase (PE Biosystems) and 2.0μl cDNA in a total of 50μl of supplied reaction buffer supplemented with 2mM MgCI2, 0.2mM dNTPs and 25 pmol of the appropriate gene-specific primers. A total of 40 PCR cycles were made, with an elevated annealing temperature of 65°C for the initial 5 cycles and an annealing temperature of 60°C for the remaining cycles. The 5'-end of the cDNA was isolated by 5 '-RACE as described above with the exception of using the gene-specific ohgonucleotides P6, 5'-GATGTCTGGAACACAGCTTTGG, and
5'-TCACAGTCCAGGGCTCTGCTCAG. The PCR-products were either directly sequenced or when desired, cloned by taking advantage of the 3' A-overhangs generated by Taq DNA polymerase using the Original TA-Cloning Kit (Invitrogen).
For cloning of TGZ, we used a series of degenerate and gene-specific ohgonucleotides to isolate overlapping DNA fragments, essentially following our previously described strategy. TGz-specific oligonucleoti.de primers were
5 '-CAACCTTGCGGCTTGAGTCTGTCG, 5'-CAGCAGCTCTGACGGCTTGGGTC
(P7), 5'-ATCACCTTTGTGGCTGAGACCG,
5'-CAAGGGTTAAAAAGTAGGATGAAAGTTC,
5'-CACAGTGTGACTTACCCGCTG, 5'-CATACACCACGTCGTTCCGCTG,
5'-CTTAAAGAACCCGGCCAAAGACTG,
5 '-CGATGGTCAAGTTCCTATCCAXGTTG, 5 '-TGTTGTTTCCAATTTCCGTTCCGC,
5'-TCTGGCACCCTCTGGATACGCAG, 5'-CTTAGGGATCAGCCAGCGCAGC, 5'-GCGGATGAACCTGGACTTTGG, 5'-GGGTGACATGGACTCTCAGCG, 5'--TGGGCAAGGCGCTGAGAGTCCATG, 5'-GCTGGAGGGCGGGTCTCAGGGAGC, and 5'-AGGACAGAGGTGGAGCCAAGACGACATAGCC. Preparation of cDNA from human foreskin keratinocytes and prostate carcinoma tissue has been described previously. The PCRs were performed under the conditions described above or for PCR with degenerate primers as described previously. Nested PCRs were done by replacing the cDNA with l.Oμl from the first PCR reaction. The 5 '-end of the cDNA was isolated by 5 '-RACE as described above with the exception of using the gene-specific ohgonucleotides P'7, 5'-TGAAGCTCAGCCGGAGGTAGAAG, and 5 '-GACAGACTCAAGCCGCAAGGTTG.
Northern Hybridization
A human RNA Master Blot containing poly(A)+ mRNA of 50 different tissues was obtained from Clonetech Laboratories, fric (Palo Alto, CA). 32P-labeled probes were prepared by random prime labelling of DNA fragments of the different transglutaminase gene products using the Multiprime DNA Labelling System (Amersham, Int., Amersham, UK). DNA fragments of 500-700bp compromising the 3 '-end of TGX, TGz, and band 4.2 protein, were generated by restriction with Pst I and Ace I, Nco I and Not I (exon XD and XDI), and Xho I, respectively. The cDΝA encoding human band 4.2 protein (Korsgren et al. 1990) was kindly provided by Dr Carl M. Cohen, Boston, MA. A ~ 220bp 32P-labeled fragment of TGY was generated by PCR using ohgonucleotides 5'-CAGCCTCAGTCACCGCCATCCGC and 5'-GATACTTGTAGAGGTCAGTGATG. Hybridization was performed under the conditions recommended by the manufacturer. The labeled membrane was exposed to BioMax MR film (Eastman Kodak) and films developed after 15 to 24hr for first exposure and 3 to 5 days for second exposure.
Amplification of TGY and TGZ from different tissues cDΝA from various cell lines and human tissue was prepared as previously described. A panel of cDΝAs from human tissue (Multiple Tissue cDΝA Panel I) were also obtained from Clonetech Laboratories. A 365 or 287bp fragment of TGZ was amplified by PCR using ohgonucleotides 5'-TGGGCAAGGCGCTGAGAGTCCATG and 5'-GCTGGAGGGCGGGTCTCAGGGAGC or
5'-AGGACAGAGGTGGAGCCAAGACGACATAGCC, respectively, with an annealing temperature of 60°C. A 218 or 170bp fragment of TGY was amplified by PCR using ohgonucleotides 5'-CAGCCTCAGTCACCGCCATCCGC and
5'-GATACTTGTAGAGGTCAGTGATG or 5'-GTGAAGGACTGTGCGCTGATG and 5'-CGGGAAGTGAGGGCTTACAAG, respectively, and identical conditions as above.
Mapping of Transglutaminase Genes in Mouse Genome
The 100 radiation hybrid (RH) clones of the T31 mouse/hamster RH panel (McCarthy et ah, (1997), Genome Res., 7, 1153-1161) (Research Genetics, Huntsville, AL) were screened by PCR. A 139bp fragment of the tgm5 gene was amplified with primers5'-TGAGGACTGTGTGCTGACCTTG (f) and 5'-TCCTGTGTCTGGCCTAGGG (r), a 149bp fragment of the epb42 gene with primers 5'-CAGGAGGAGTAAGGGGAATTGG (f) and 5'-TGCAGGCTACTGGAATCCACG (r), a 400bρ fragment of tgm7 with primers 5'-GGGAGTGGCCTCATCAATGG (f) and 5'-CCTTGACCTCACTGCTGCTGA (r), a -600bp fragment with tgm3 with primers 5'-TCGGTGGCAGCCTCAAGATTG (f) and 5'-AGACATCAATGGGCAGGCATGG (r), and 655bp and 232bp fragments of tgm2 with primers 5'-TTGGGGAGCTGGAGAGCAAC (f) and 5'-ATCCAGGACTCCACCCAGCA (r) and primers 5'-(GCGGCCGCTAGT)CCACATTGCAGGGCTCCTGACT (f) and 5'-GCTAGCCTGTGCTCACCATGAGG (r), respectively. PCRs were carried out in a GeneAmp 9600 ther acycler with 0.035 units/μl AmpliTaq Gold polymerase in standard reaction buffer containing 2mM MgCl2, 0.2mM dNTPs, 0.4μM of each primer and 2.5 ng/μl genomic DNA in a total reaction volume of 25 μl. PCR conditions were: polymerase activation for lOmin at 95°C, annealing at 60°C for 45sec, extension at 72°C for lmin and denaturation at 94°C for 30sec for 35 cycles with a final extension of 3.5min at 72°C. PCR reactions were analyzed by agarose gel electrophoresis using 1% or 1.5% gels. The hybrid cell panel was analyzed at least twice in each case to exclude PCT related errors. The data was submitted to the Jackson Laboratory Radiation Hybrid Database for analysis and mapped relative to known genomic markers
(http://www.jax.org/resources/documents/cmdata.rhmap). Table I. Splice donor and acceptor sequences in the human TGM5 gene. Residues consistent with the splice site consensus sequence (MAG/GTRAG and YAG/G) are underlined.
Intron number Donor sequence Acceptor sequence
M A Q G L E V A
1 GCTACCATGGCCCAAGgtagqqaaaqcccctgtqqccactqqaqtt ttttgtctaaccctggctgccccattgcagGGCTAGAAGTGGC
F V V E T G P L P D 2 TTCGTGGTTGAAACTGgtaagaaccccagctgqctcacaggggctg tggagggcctcagctctacttccctcctagGACCGCTGCCAGA
N P C P E D A V Y 3 AATCCCTGGTGCCCAGgtaaqgctgqgtgcccaqgcggtgcctcct tgcttcgtgccctcccactctggttcctagAGGATGCTGTCTA N Y G Q F E D K I . TGGAACTATGGACAGgtgaqtctcagccctgcttatggcccatcc tgccttccctctctgcctctccccccgaa^TTTGAAGACAAAAT
V V C A M I N S N D
5 GTGGTGTGTGCCATGqtgaqqtccctggcgtqcccqgggaggagg ctcacacttctctatatggcttctcttcagATCAACAGCAATGA
A V M C T V M R C L 6 GCCGTCATGTGCACAGgtagqaggtagaaaggacctcacaaaaagg acaggtgatttttttgtgccctttttgcagTGATGAGGTGTCT
K D T I W N F H V 7 AAGGATACTATCTGqtgaqaaacaacctctcaacctatttctag caacgtctcccttggctctgtttgatacagGAACTTCCATGTCTG
Q E M S N G V Y C C CAGGAGATGAGCAACGgtgaggctctccagaagaaaggcaggcccc gcccaccgaggctcccctgttctccttcagGCGTCTACTGCTG
Y K Y E E G S Q E
9 TACAAGTATGAAGAAGgttagtaagcaagccagccctactcagagc cagctggtgctgtgctctccccaacttcagGATCCCTCCAGGA
L S P K E A K T Y P
10 CTCTCTCCTAAAGAAGqtacqcatqtgcacagtttgtgtacgcaga tctcaccccatccttgtgttctttctttagCAAAGACCTACCC
S I T I N V L G A A 11 AGCATCACGATTAATgtagacaggagtcctgcaaatggcttgtgg taattctcctt ccctcctggtctgtttagGTTCTAGGAGCAGC
Q Q K- V F L G V L K 12 CAGCAGAAAGTCTTqtaagtgctgcaagtgctcagccttctcct ttttctgacatgctccattctctgttgcagCCTTGGAGTCCTCAA
Figure imgf000029_0001
Table II. Intron sizes and splice types in the human TGM5 gene. Sizes of introns are estimated to be within about a lOObp unless indicated to be sequenced entirely.
Intron number Splice type Size Method
1 1 6,300bp PCR 2 1 102bp Sequencing 3 1 3,300bp PCR 4 0 2,900bp PCR 5 0 600bp PCR 6 1 ~l l,800bp PCR and Restriction Analysis
7 2 l,600bp PCR
8 1 106bp Sequencing
9 1 2,900bp PCR
10 1 545bp Sequencing
11 0 HOObp PCR
12 2 209bp Sequencing
Table III. Apparent polymorphisms in the cDNA and genomic DNA sequences for TGx. The positions with nucleotide and amino acid variations are underlined.
Residue cDNA (a) Gene cDNA (b)
67 S TCA P CCA P CCA
220 Y TAC Y TAT Y TAC
352 A GCA G GGA A GCA
a. Aeschlimann et al, 1998 b. additional sequence variant isolated in this work

Claims

0 Claims
1. A nucleotide sequence comprising at least a portion of the nucleotide sequence of Fig. 6A or Fig. 6B; a nucleotide sequence which hybridise to the nucleotide sequence of Fig. 6 A or Fig. 6B; a nucleotide sequence which is degenerate to the nucleotide sequence of Fig. 6A or Fig. 6B; all of which nucleotide sequences encode a polypeptide having fransglutaminase activity.
2. A nucleotide sequence according to claim 1 consisting of the nucleotide sequence of Fig. 6 A or Fig. 6B.
3. A nucleotide sequence which hybridises under stringent conditions to the nucleotide sequence of Fig. 6A or Fig. 6B and which encodes a polypeptide having transglutaminase activity.
4. A method of expressing a polypeptide comprising inserting a nucleotide sequence according to any preceding claim into a suitable host and expressing that nucleotide sequence in order to express a polypeptide having fransglutaminase activity.
5. A vector comprising a nucleotide sequence according to any one of claims 1 to 3.
6. A polypeptide having an amino acid sequence comprising at least a portion of the amino acid sequence of Fig. 6A or Fig. 6B and which has fransglutaminase activity.
7. A polypeptide according to claim 6 which is at least 90% identical to the amino acid sequence of Fig. 6A or Fig. 6B and which encodes a polypeptide having fransglutaminase activity.
8. A polypeptide according to claim 6 or 7 where the amino acid sequence differs from that given in Fig. 6A or Fig. 6B by about 1 to 20 amino acid additions, deletions or substitutions.
9. A polypeptide according to any one of claims 6 to 8 comprising exon VD through to " exon X of the sequence shown in Fig. 6A or Fig. 6B.
10. A composition comprising a polypeptide according to any one of claims 6 to 9 suitable for use in cross-linking proteins.
11. A composition comprising a polypeptide according to any one of claims 6 to 9 suitable for use in a transamidation reaction on peptides and polypeptides.
12. A diagnostic method comprising detecting expression of a polypeptide according to any one of claims 6 to 9 in a subject or in cells derived from a subject.
13. An antibody directed against a polypeptide according to any one of claims 6 to 9.
14. A method of gene therapy comprising correcting mutations in a non wild type nucleotide sequence corresponding to a nucleotide sequence of Fig. 6 A or Fig. 6B.
15. A nucleotide sequence comprising at least a portion of the nucleotide sequence of Fig. 10A or Fig. 10B; a nucleotide sequence which hybridise to the nucleotide sequence of Fig. 10A or Fig. 10B; a nucleotide sequence which is degenerate to the nucleotide sequence of Fig. 10A or Fig. 10B; all of which nucleotide sequences encode a polypeptide having fransglutaminase activity.
16. A nucleotide sequence according to claim 15 consisting of the nucleotide sequence of Fig. lOA or Fig. 10B.
17. A nucleotide sequence which hybridises under stringent conditions to the nucleotide sequence of Fig. 10A or Fig. 10B and which encodes a polypeptide having transglutaminase activity.
18. A method of expressing a polypeptide comprising inserting a nucleotide sequence J according to any one of claims 15 to 17 into a suitable host and expressing that nucleotide sequence in order to express a polypeptide having fransglutaminase activity.
19. A vector comprising a nucleotide sequence according to any one of claims 15 to 17.
20. A polypeptide having an amino acid sequence comprising at least a portion of the amino acid sequences of Fig. 10A or Fig. 10B and which has transglutaminase activity.
21. A polypeptide according to claim 20 which is at least 90% identical to the amino acid sequences of Fig. 10A or Fig. 10B and which encodes a polypeptide having transglutaminase activity.-
22. A polypeptide according to claim 20 or 21 wherein the amino acid sequence differs from that given in Fig. 10A or Fig. 10B by about 1 to 20 amino acid additions, deletions or substitutions.
23. A polypeptide according to any one of claims 20 to 22 comprising exons D through to exon IV of the sequence shown in Fig. 10A or Fig. 10B.
24. A polypeptide according to any one of claims 20 to 22 comprising exon X through to exon XD of the sequence shown in Fig. 10A or Fig. 10B.
25. A composition comprising a polypeptide according to any one of claims 20 to 24 suitable for use in cross-lir_king proteins.
26. A composition comprising a polypeptide according to any one of claims 20 to 24 suitable for use in a transamidation reaction on peptides and polypeptides.
27. A diagnostic method comprising detecting expression of a polypeptide according to any one of claims 20 to 24 in a subject or in cells derived from a subject.
28. An antibody directed against a polypeptide according to any one of claims 20 to 24.
29. A method of gene therapy comprising correcting mutations in a non-wild type nucleotide sequence corresponding to the nucleotide sequence of Fig. 10A or Fig. 10B.
30. A method of diagnosis of autoimmune disease comprising taking a sample from a subject and testing that sample for the presence of a fransglutaminase encoded by the nucleotide sequences of Fig. 6A, Fig. 6B, Fig. 10A or Fig. 10B, or portions thereof.
31. A method according to claim 30 wherein the autoimmune disease to be diagnosed is selected from Addison's disease, Al haemolytic anaemia, Al thrombocytopenic purpura, Al thyroid diseases, atrophic gastritis - pernicious anaemia, Chron's disease, colitis ulcerosa, Goodpasture syndrome, IgA nephropathy or IgA glomerulonephritis, myasthenia gravis, partial lipodysfrophy, polymyositis, primary biliary cirrhosis, primary sclerosing cholangitis, progressive systemic sclerosis, recurrent pericarditis, relapsing polychondritis, rheumatoid arthritis, rheumatism, sarcoidosis, Sjδgren's syndrome, SLE, splenic atrophy, type I (insulin-dependent) diabetes mellitus, diabetis mellitus, Wegener granulomatosis, ulcerative colitis, vasculitis (both systemic and cutaneous), vitiligo.
32. A competitive protein binding assay for the differential diagnosis of autoimmune diseases comprising the detection of antibodies against the fransglutaminase encoded by the nucleotide sequences of Fig.6 A, Fig. 6B, Fig.lOA or Fig. 10B, or portions thereof.
33. A competitive protein binding assay according to claim 32 comprising non-endogenous fransglutaminase TGZ or TGY, or both, as a competitive antigen.
34. Competitive protein binding assay according to claim 33, wherein the binding assay is a competitive immunoassay selected from RIA, EIA ELIS A, LiA and FiA.
PCT/GB2001/004120 2000-09-15 2001-09-14 Transglutaminase gene products Ceased WO2002022830A2 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
DK01967485.2T DK1317548T3 (en) 2000-09-15 2001-09-14 Transglutaminase gene products
US10/380,533 US7052890B2 (en) 2000-09-15 2001-09-14 Transglutaminase gene products
DE60140552T DE60140552D1 (en) 2000-09-15 2001-09-14 TRANSGLUTAMINASE GENE PRODUCTS
AU2001287860A AU2001287860A1 (en) 2000-09-15 2001-09-14 Transglutaminase gene products
AT01967485T ATE449180T1 (en) 2000-09-15 2001-09-14 TRANSGLUTAMINASE GENE PRODUCTS
EP01967485A EP1317548B1 (en) 2000-09-15 2001-09-14 Transglutaminase gene products

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
GB0022768.6 2000-09-15
GB0022768A GB0022768D0 (en) 2000-09-15 2000-09-15 Transglutaminase gene products
GB0111995A GB0111995D0 (en) 2001-05-16 2001-05-16 Transglutaminase gene products
GB0111995.7 2001-05-16

Publications (2)

Publication Number Publication Date
WO2002022830A2 true WO2002022830A2 (en) 2002-03-21
WO2002022830A3 WO2002022830A3 (en) 2002-10-24

Family

ID=26245015

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB2001/004120 Ceased WO2002022830A2 (en) 2000-09-15 2001-09-14 Transglutaminase gene products

Country Status (7)

Country Link
US (1) US7052890B2 (en)
EP (1) EP1317548B1 (en)
AT (1) ATE449180T1 (en)
AU (1) AU2001287860A1 (en)
DE (1) DE60140552D1 (en)
DK (1) DK1317548T3 (en)
WO (1) WO2002022830A2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6887695B2 (en) * 2001-03-28 2005-05-03 Zymogenetics, Inc. Transglutaminase ztg2
EP1978364A1 (en) 2007-04-06 2008-10-08 N-Zyme BioTec GmbH Transglutaminase 6 as a diagnostic indicator of autoimmune diseases

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130109107A1 (en) * 2010-05-25 2013-05-02 The Regents Of The University Of Colorado Diagnosis and treatment of autoimmune disease
WO2025235822A1 (en) * 2024-05-10 2025-11-13 Gmp Biotechnology Limited Therapeutics for cancer using tgfb2 and tgm6

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5726051A (en) 1992-11-03 1998-03-10 Oklahoma Medical Research Foundation Transglutaminase gene

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5470957A (en) * 1993-10-01 1995-11-28 President And Fellows Of Harvard College Immunoinhibitors of factor XIII
US6114119A (en) * 1997-08-29 2000-09-05 Wisconsin Alumni Research Foundation Transglutaminase and gene encoding same

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5726051A (en) 1992-11-03 1998-03-10 Oklahoma Medical Research Foundation Transglutaminase gene

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6887695B2 (en) * 2001-03-28 2005-05-03 Zymogenetics, Inc. Transglutaminase ztg2
EP1978364A1 (en) 2007-04-06 2008-10-08 N-Zyme BioTec GmbH Transglutaminase 6 as a diagnostic indicator of autoimmune diseases
WO2008122432A1 (en) * 2007-04-06 2008-10-16 Zedira Gmbh Transglutaminase 6 as a diagnostic indicator of autoimmune diseases

Also Published As

Publication number Publication date
DE60140552D1 (en) 2009-12-31
US7052890B2 (en) 2006-05-30
AU2001287860A1 (en) 2002-03-26
DK1317548T3 (en) 2010-03-08
WO2002022830A3 (en) 2002-10-24
US20040072186A1 (en) 2004-04-15
EP1317548B1 (en) 2009-11-18
EP1317548A2 (en) 2003-06-11
ATE449180T1 (en) 2009-12-15

Similar Documents

Publication Publication Date Title
Grenard et al. Evolution of transglutaminase genes: identification of a transglutaminase gene cluster on human chromosome 15q15: structure of the gene encoding transglutaminase X and a novel gene family member, transglutaminase Z
US10392610B2 (en) Pregnancy-associated plasma protein-A2 (PAPP-A2) polypeptides
Park et al. Suprabasin, a novel epidermal differentiation marker and potential cornified envelope precursor
JP2003519463A (en) TCL-1b gene, protein, methods related thereto and substances related thereto
JP2001299373A (en) DNA segment encoding erbB-3 polypeptide, antibodies and bioassay for detecting said polypeptide
SK112996A3 (en) Firoblast growth factor-10
Chen et al. Norrie disease gene: characterization of deletions and possible function
Nomoto et al. Expression of phospholipases γ1, β1, and δ1 in primary human colon carcinomas and colon carcinoma cell lines
US7052890B2 (en) Transglutaminase gene products
CN101440128A (en) Differentially expressed protein in human glioma and uses thereof
AU684930B2 (en) DNA encoding CAI resistance proteins and uses thereof
EP0748376A1 (en) Rna editing enzyme and methods of use thereof
Lener et al. Molecular cloning, gene structure and expression profile of mouse C1 inhibitor
Raju et al. Molecular Cloning and Biochemical Characterization of Bovine Spleen Myristoyl CoA: ProteinN-Myristoyltransferase
Wang et al. Bovine dopamine. beta.-hydroxylase, primary structure determined by cDNA cloning and amino acid sequencing
JPH11183A (en) Polynucleotide and polypeptide sequences of rat cathepsin K
Mueller et al. The murine ortholog of matrix metalloproteinase 19: its cloning, gene organization, and expression
CA2232810A1 (en) Novel compounds
JPH1175874A (en) New compound
JPWO2005019472A1 (en) Method for detecting synoviolin activity regulating action
US7566455B1 (en) E6AP-binding proteins
JP2002186492A (en) Human rce1
US6790936B1 (en) CAI resistance proteins and uses thereof
EP1279733A1 (en) Nucleic acids and polypeptides involved in the predisposition to infection to human papilloma virus, to epidermodysplasia verruciformis and/or to psoriasis
Cammas et al. Correlation of the exon/intron organization to the conserved domains of the mouse transcriptional corepressor TIF1β

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PH PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PH PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

WWE Wipo information: entry into national phase

Ref document number: 2001967485

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 2001967485

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWE Wipo information: entry into national phase

Ref document number: 10380533

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: JP