ABCDEFGHIJKLMNOPQRSTUVW
1
ACIDsummary namepfam summary#seed#full#structures#species#architecturesave. domain lengthave. id% in full alignmentav.coverage% of a seq. by domain#Uniprot#RP15#RP35#RP55#RP75creation dateversion numberother DBcategory in the DB
2
PF00001
7tm_17 transmembrane receptor (rhodopsin family)This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins [1].631275666506281078254.91966.532661472491550446101798130592######24PrositeFamily
3
PF00002
7tm_27 transmembrane receptor (Secretin family)This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs). They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97 (Swiss:P48960); calcium-independent receptors for latrotoxin (such as Swiss:O94910), and brain-specific angiogenesis inhibitors (such as Swiss:O14514) amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins (e.g. Swiss:P83119). Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling [1].2730397848541112228.82324.8247972415996382252430797######27PrositeFamily
4
PF00003
7tm_37 transmembrane sweet-taste receptor of 3 GCPRThis is a domain of seven transmembrane regions that forms the C-terminus of some subclass 3 G-coupled-protein receptors. It is often associated with a downstream cysteine-rich linker domain, NCD3G Pfam:PF07562, which is the human sweet-taste receptor, and the N-terminal domain, ANF_receptor Pfam:PF01094. The seven TM regions assemble in such a way as to produce a docking pocket into which such molecules as cyclamate and lactisole have been found to bind and consequently confer the taste of sweetness [1].6162057075581286244.12630.6131871344772571618320861######25PrositeDomain
5
PF00004
AAAATPase family associated with various cellular activities (AAA)AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes [2].207182497236294392269129.92521.375445213093185687160221243904######32PrositeDomain
6
PF00005
ABC_tranABC transporterABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporters are composed of two copies of this domain and two copies of a transmembrane domain Pfam:PF00664. These four domains may belong to a single polypeptide as in Swiss:P13569, or belong in different polypeptide chains.55838816119791503055148.52636.6834895311031253890498322381428870######30PrositeDomain
7
PF00006
ATP-synt_abATP synthase alpha/beta family, nucleotide-binding domainThis entry includes the ATP synthase alpha and beta subunits, the ATP synthase associated with flagella and the termination factor Rho.40942395201689872902133543.022242696706211744158468037######28PrositeDomain
8
PF00007
Cys_knotCystine-knot domainThe family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation.244182353754495.92633.498790346121831174224######25
Published_alignment enriched with PDOC00234 members.
Domain
9
PF00008
EGFEGF-like domainThere is no clear separation between noise and signal. Pfam:PF00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.671549254096161050831.8408.462424963344457674120681157842######30
Swissprot_feature_table
Domain
10
PF00009
GTP_EFTUElongation factor Tu GTP binding domainThis domain contains a P-loop motif, also found in several other families such as Pfam:PF00071, Pfam:PF00025 and Pfam:PF00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel domains.142956066949117821224.72735.43452895155274601290201145733######30PrositeDomain
11
PF00010
HLHHelix-loop-helix DNA-binding domainNULL14181103145146569253.42912.9513484211476325386316384806######29UnknownDomain
12
PF00011
HSP20Hsp20/alpha crystallin familyNot only do small heat-shock-proteins occur in eukaryotes and prokaryotes but they have also now been shown to occur in cyanobacterial phages as well as their bacterial hosts [2].2531174400741423097.42253.6973695773160562918043776######24PrositeDomain
13
PF00012
HSP70Hsp70 proteinHsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase domain and the carboxyl terminus is the substrate binding region.27543794468954645407.82980.7617954710726271034881073912######23PrositeFamily
14
PF00013
KH_1KH domainKH motifs bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.783109086179883084965.12322.52216351158394114684850120318######32
Published_alignment
Domain
15
PF00014
Kunitz_BPTIKunitz/Bovine pancreatic trypsin inhibitor domainIndicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure. Certain family members are similar to the tick anticoagulant peptide (TAP, Swiss:P17726). This is a highly selective inhibitor of factor Xa in the blood coagulation pathways [1]. TAP molecules are highly dipolar [2], and are arranged to form a twisted two- stranded antiparallel beta-sheet followed by an alpha helix [1].9923880321540141653.13614.35446907097113111973424673######26PrositeDomain
16
PF00015
MCPsignalMethyl-accepting chemotaxis protein (MCP) signalling domainThis domain is thought to transduce the signal to CheA since it is highly conserved in very diverse MCPs.955646383748873168.73129.6829789170452732357728108096######24
Blast MCP1_ECOLI/361-421
Family
17
PF00016
RuBisCO_largeRibulose bisphosphate carboxylase large chain, catalytic domainThe C-terminal domain of RuBisCO large chain is the catalytic domain adopting a TIM barrel fold.162894659159350248.43763.67181204426145728574596######23PrositeDomain
18
PF00017
SH2SH2 domainNULL5259304847549104378.52813.73986857704178104420160346######27
Swissprot_feature_table
Domain
19
PF00018
SH3_1SH3 domainSH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organisation. First described in the Src cytoplasmic tyrosine kinase Swiss:P12931. The structure is a partly opened beta barrel.55929697671443242547.6296.6914612311658281866681793857######31PrositeDomain
20
PF00019
TGF_betaTransforming growth factor beta like domainNULL2631230925149010298.83525.72605714834092945712628######23PrositeDomain
21
PF00020
TNFR_c6TNFR/NGFR cysteine-rich regionNULL4891294031143516539.82818.952340513283528913512915######21
Swissprot_feature_table
Domain
22
PF00021
UPAR_LY6u-PAR/Ly-6 domainThis extracellular disulphide bond rich domain is related to Pfam:PF00087.275051933035274.62149.568176391130735055064######24PrositeDomain
23
PF00022
ActinActinNULL243159714011607413323.52983.21757726283140662451532517######22PrositeFamily
24
PF00023
AnkAnkyrin repeatAnkyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin- actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities. Repeats 13-24 are especially active, with known sites of interaction for the Na/K ATPase, Cl/HCO(3) anion exchanger, voltage-gated sodium channel, clathrin heavy chain and L1 family cell adhesion molecules. The ANK repeats are found to form a contiguous spiral stack such that ion transporters like the anion exchanger associate in a large central cavity formed by the ANK repeat spiral, while clathrin and cell adhesion molecules associate with specific regions outside this cavity [2][3].1062285094792344448433.5314.18527436448126402252330270######33
Swissprot_feature_table
Repeat
25
PF00024
PAN_1PAN domainThe PAN domain [1] contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge the links the N and C termini of the domain. The domain is found in diverse proteins, in some they mediate protein-protein interactions, in others they mediate protein-carbohydrate interactions.8611531134122172580.11618.032027041696066985312388######29Patthy LDomain
26
PF00025
ArfADP-ribosylation factor familyPfam combines a number of different Prosite families together20286932631781443157.33574.08509155528127632255629822######24SwissprotDomain
27
PF00026
AspEukaryotic aspartyl proteaseAspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (Pfam:PF00077), which are much smaller and appear to be homologous to a single domain of the eukaryotic asp proteases.242425119241566398284.22368.31406525235121201905124960######26
Overington enriched
Family
28
PF00027
cNMP_bindingCyclic nucleotide-binding domainThis domain sensor domain can bind cAMP, cGMP, c-di-GMP, oxygen and 2-oxoglutarate (Matilla et. al., FEMS Microbiology Reviews, fuab043, 45, 2021, 1. https://doi.org/10.1093/femsre/fuab043).210871737318105121988.22019.4273176150803894879702128031######32PrositeDomain
29
PF00028
CadherinCadherin domainNULL55278836786929236093.12545.264555613460687224209664284697######20
Swissprot_feature_table
Domain
30
PF00029
ConnexinConnexinConnexin proteins form gap-junctions between cells. They carry four transmembrane regions, hence why this family now includes Connexin_CCC, which represented the second pair of TMs.176751119425843203.54067.4615435514222557837717######22PrositeDomain
31
PF00030
CrystallBeta/Gamma crystallinThe alignment comprises two Greek key motifs since the similarity between them is very low.861976617045610481.33145.6234555150858891477220372######22
Swissprot_feature_table
Domain
32
PF00031
CystatinCystatin domainVery diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. All members except Swiss:Q03196 and Swiss:Q10993 are found. Pfam:PF00666 are related to this family but have not been included.3465021346588290.21845.1911960925222549156730######24PrositeDomain
33
PF00032
Cytochrom_B_CCytochrome b(C-terminal)/b6/petDNULL1564475208350155105.13528.48201107662217944927382######20PrositeDomain
34
PF00033
Cytochrome_BCytochrome b/b6/petBNULL84593204352872181.44548.75257634722229046197482######22PrositeDomain
35
PF00034
Cytochrom_CCytochrome cThe Pfam entry does not include all Prosite members. The cytochrome 556 and cytochrome c' families are not included. All these are now in a new clan together. The C-terminus of DUF989, Pfam:PF06181, has now been merged into this family.5329176573590638098.61832.571402993899135452963053607######24PrositeDomain
36
PF00035
dsrmDouble-stranded RNA binding motifSequences gathered for seed by HMM_iterative_training Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localisation of at least five different mRNAs in the early Drosophila embryo. Also by interferon-induced protein kinase in humans, which is part of the cellular response to dsRNA.7635979137835744664.92618.42853934905141162940343094######29
Published_alignment
Domain
37
PF00036
EF-hand_1EF handThe EF-hands can be divided into two classes: signalling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typically undergo a calcium-dependent conformational change which opens a target binding site. The latter group is represented by calbindin D9k and do not undergo calcium dependent conformational changes.56412785432156153528.7308.7426421238154511019713354######35PrositeDomain
38
PF00037
Fer44Fe-4S binding domainSuperfamily includes proteins containing domains which bind to iron-sulfur clusters. Members include bacterial ferredoxins, various dehydrogenases, and various reductases. Structure of the domain is an alpha-antiparallel beta sandwich.528314252617257102323.4398.041525066012175913073648694######30PrositeDomain
39
PF00038
FilamentIntermediate filament proteinNULL262197384505176275.5325839126192162371587222111######24PrositeCoiled-coil
40
PF00039
fn1Fibronectin type I domainNULL57122537025215638.54215.69185613851785796412263######21
Swissprot_feature_table
Domain
41
PF00040
fn2Fibronectin type II domainNULL25695298131039441.6484.915739720211768429610######22PrositeDomain
42
PF00041
fn3Fibronectin type III domainNULL9842288365234071058784.82024.0866621238649113655305543435202######24
Swissprot_feature_table
Domain
43
PF00042
GlobinGlobinNULL731196731053665140109.12240.3138232211150021045516204######25
Structure_superposition
Domain
44
PF00043
GST_CGlutathione S-transferase, C-terminal domainGST conjugates reduced glutathione to a variety of targets including S-crystallin from squid, the eukaryotic elongation factor 1-gamma, the HSP26 family of stress-related proteins and auxin-regulated proteins in plants. Stringent starvation proteins in E. coli are also included in the alignment but are not known to have GST activity. The glutathione molecule binds in a cleft between N and C-terminal domains. The catalytically important residues are proposed to reside in the N-terminal domain [1]. In plants, GSTs are encoded by a large gene family (48 GST genes in Arabidopsis) and can be divided into the phi, tau, theta, zeta, and lambda classes [2].3332925758413045596.51734.631099824405143882851948363######28OveringtonDomain
45
PF00044
Gp_dh_NGlyceraldehyde 3-phosphate dehydrogenase, NAD binding domainGAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossmann NAD(P) binding fold.7418775767865799101.84229.0382487250586181819530276######27OveringtonDomain
46
PF00045
HemopexinHemopexinHemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases (TIMPs).762805319262327844.92725.1948274265478052075428921######22SMARTRepeat
47
PF00046
HomeodomainHomeodomainNULL146105144285148789856.33314.19189058164564151882539108758######32UnknownDomain
48
PF00047
igImmunoglobulin domainMembers of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involved in protein-protein and protein-ligand interactions.3422324345513204183.41610.5138375224558571487321692######28Bateman ADomain
49
PF00048
IL8Small cytokines (intecrine/chemokine), interleukin-8 likeIncludes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds.31092444602805258.82550.6817271525228463289314######23
Overington enriched
Domain
50
PF00049
InsulinInsulin/IGF/Relaxin familySuperfamily includes insulins; relaxins; insulin-like growth factor; and bombyxin. All are secreted regulatory hormones. Disulfide rich, all-alpha fold. Alignment includes B chain, linker (which is processed out of the final product), and A chain.28357817994152569.93153.646926411102125023554######21
Overington enriched
Domain
51
PF00050
Kazal_1Kazal-type serine protease inhibitor domainUsually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tandem arrays. Small alpha+beta fold containing three disulphides. Alignment also includes a single domain from transporters in the OATP/PGT family Swiss:P46721.2580339570843052.72816.66175081565300963478466######24PrositeDomain
52
PF00051
KringleKringle domainKringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta.2313699244525570784021.523071181839771000313724######21
Swissprot_feature_table
Domain
53
PF00052
Laminin_BLaminin B (Domain IV)NULL212740435441533132.9288.65117051123258156757631######21
Swissprot_feature_table
Domain
54
PF00053
Laminin_EGFLaminin EGF domainThis family is like Pfam:PF00008 but has 8 conserved cysteines instead of six.7211497265540404149.53018.54183606162303623288373117318######27
Swissprot_feature_table
Domain
55
PF00054
Laminin_G_1Laminin G domainNULL2114185544872356131.12413.0222419181544011069214478######26
Swissprot_feature_table
Domain
56
PF00055
Laminin_NLaminin N-terminal (Domain VI)NULL1198854224837632223215.56140231136275767768984######20
Swissprot_feature_table
Domain
57
PF00056
Ldh_1_Nlactate/malate dehydrogenase, NAD binding domainL-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the family. Malate dehydrogenases catalyse the interconversion of malate to oxaloacetate. The enzyme participates in the citric acid cycle. L-lactate dehydrogenase is also found as a lens crystallin in bird and crocodile eyes. N-terminus (this family) is a Rossmann NAD-binding fold. C-terminus is an unusual alpha+beta fold.24188099917889145138.83042.1169814279685071671226422######26
Overington enriched
Domain
58
PF00057
Ldl_recept_aLow-density lipoprotein receptor domain class ANULL331386772224985081384115.832169112195545836104088141124######21
Swissprot_feature_table
Repeat
59
PF00058
Ldl_recept_bLow-density lipoprotein receptor repeat class BThis domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure [2].1961764159483307441.73114.33987296823167544504862087######20Swiss-ProtRepeat
60
PF00059
Lectin_CLectin C-type domainThis family includes both long and short form C-type536524111589442238107.32129.7610465011628230605093366933######24
Swissprot_feature_table
Domain
61
PF00060
Lig_chanLigand-gated ion channelThis family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors.43215431387959330256.92330.9135057396988831733722711######29
Blastp NMZ1_HUMAN
Family
62
PF00061
LipocalinLipocalin / cytosolic fatty-acid binding protein familyLipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. The family also encompasses the enzyme prostaglandin D synthase (EC:5.3.99.2). Alignment subsumes both the lipocalin and fatty acid binding protein signatures from PROSITE. This is supported on structural and functional grounds. The structure is an eight-stranded beta barrel.15592091143500981291870.27165561021278667789282######26
Prosite and HMM_iterative_training
Domain
63
PF00062
LysC-type lysozyme/alpha-lactalbumin familyAlpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes that cleave the peptidoglycan of bacterial cell walls. Structure is a multi-domain, mixed alpha and beta fold, containing four conserved disulfide bonds.122181143733738115.93666.1435325670915032232######23
Overington and HMM_iterative_training
Domain
64
PF00063
Myosin_headMyosin head (motor domain)NULL164783530615601422505.63140.8828727221172933667248855######24
Blastp MYSA_HUMAN/1-840
Domain
65
PF00064
NeurNeuraminidaseNeuraminidases cleave sialic acid residues from glycoproteins. Belong to the sialidase family - but this alignment does not generalise to the other sialidases. Structure is a 6-sheet beta propeller.337501321318.74872.0610011243474851######21
Overington and HMM_iterative_training
Repeat
66
PF00066
NotchLNR domainThe LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch related proteins. The structure of the domain has been determined by NMR [1] and was shown to contain three disulphide bonds and coordinate a calcium ion. Three repeats are also found in the PAPP-A peptidase [2].104769755541165535.6414.83130121226253358477789######20
Swissprot_feature_table
Domain
67
PF00067
p450Cytochrome P450Cytochrome P450s are haem-thiolate proteins [6] involved in the oxidative degradation of various compounds. They are particularly well known for their role in the degradation of environmental toxins and mutagens. They can be divided into 4 classes, according to the method by which electrons from NAD(P)H are delivered to the catalytic site. Sequence conservation is relatively low within the family - there are only 3 absolutely conserved residues - but their general topography and structural fold are highly conserved. The conserved core is composed of a coil termed the 'meander', a four-helix bundle, helices J and K, and two sets of beta-sheets. These constitute the haem-binding loop (with an absolutely conserved cysteine that serves as the 5th ligand for the haem iron), the proton-transfer groove and the absolutely conserved EXXR motif in helix K. While prokaryotic P450s are soluble proteins, most eukaryotic P450s are associated with microsomal membranes. their general enzymatic function is to catalyse regiospecific and stereospecific oxidation of non-activated hydrocarbons at physiological temperatures [6].50213489217550382142340.71778.984291273177298434178969256957######25
Overington and HMM_iterative_training
Domain
68
PF00068
Phospholip_A2_1Phospholipase A2Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with two core disulfide-linked helices and a calcium-binding loop. This alignment represents the major family of PLA2s. A second minor family, defined by the honeybee venom PLA2 PDB:1POC and related sequences from Gila monsters (Heloderma), is not recognised. This minor family conserves the core helix pair but is substantially different elsewhere. The PROSITE pattern PA2_HIS, specific to the first core helix, recognises both families.111298252546231105.63558.056921503101722963063######22
Overington and HMM_iterative_training
Domain
69
PF00069
PkinaseProtein kinase domainNULL386364386176745913393241.42136.711154676112033280166517261703019######28UnknownDomain
70
PF00070
Pyr_redoxPyridine nucleotide-disulphide oxidoreductaseThis family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain.903821752695166.62316.59991138237373497######30PrositeDomain
71
PF00071
RasRas familyIncludes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See Pfam:PF00009 Pfam:PF00025, Pfam:PF00063. As regards Rab GTPases, these are important regulators of vesicle formation, motility and fusion. They share a fold in common with all Ras GTPases: this is a six-stranded beta-sheet surrounded by five alpha-helices [1].60116397181919461584150.82956.08203376224105013491449120446######25SwissprotDomain
72
PF00072
Response_regResponse regulator receiver domainThis domain receives the signal from the sensor partner in bacterial two-component systems. It is usually found N-terminal to a DNA binding effector domain.5239137479082959932112.32529.26176699347801188590406863715567######27ProdomDomain
73
PF00073
Rhvpicornavirus capsid proteinCAUTION: This alignment is very weak. It can not be generated by clustalw. If a representative set is used for a seed, many so-called members are not recognised. The family should probably be split up into sub-families. Capsid proteins of picornaviruses. Picornaviruses are non-enveloped plus-strand ssRNA animal viruses with icosahedral capsids. They include rhinovirus (common cold) and poliovirus. Common structure is an 8-stranded beta sandwich. Variations (one or two extra strands) occur.492951690195611391510.26110539282282294294######23
Overington and HMM_iterative_training
Domain
74
PF00074
RnaseAPancreatic ribonucleaseRibonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices.140187669419416114.12769.8308212945812621863######23
Overington and HMM_iterative_training
Domain
75
PF00075
RNase_HRNase HRNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers.23186505747358888135.52430.459394610793154772173629183######27
Swissprot; SCOP and HMM_iterative_training
Domain
76
PF00076
RRM_1RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain)The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins (Swiss:P05455) have an N terminal rrm which is included in the seed. There is a second region towards the C terminus that has some features characteristic of a rrm but does not appear to have the important structural core of a rrm. The LA proteins (Swiss:P05455) are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.6837016613493091380167.42322.5860742260121151111282223384008######25
Published_alignment
Domain
77
PF00077
RVPRetroviral aspartyl proteaseSingle domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein; usually pol, more rarely gag. Retroviral proteases appear to be homologous to a single domain of the two-domain eukaryotic aspartyl proteases such as pepsins, cathepsins, and renins (Pfam:PF00026).82204178239041697.22112.423529501464195222772574######23Eddy SRDomain
78
PF00078
RVT_1Reverse transcriptase (RNA-dependent DNA polymerase)A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, retroviruses, group II introns, bacterial msDNAs, hepadnaviruses, and caulimoviruses.6910068993846085615170.61525.356325104477178101104293123620######30
Published_alignment and HMM_iterative_training
Domain
79
PF00079
SerpinSerpin (serine protease inhibitor)Structure is a multi-domain fold containing a bundle of helices and a beta sandwich.113202214981889177302.32483.7543699360885761650221884######23
Overington and HMM_iterative_training
Domain
80
PF00080
Sod_CuCopper/zinc superoxide dismutase (SODC)superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to hydrogen peroxide and molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in the human SOD1 gene cause familial amyotrophic lateral sclerosis (Lou Gehrig's disease). Structure is an eight-stranded beta sandwich, similar to the immunoglobulin fold.39883298693710116130.73158.572532016214033742411304######23
Overington and HMM_iterative_training
Domain
81
PF00081
Sod_Fe_NIron/manganese superoxide dismutases, alpha-hairpin domainsuperoxide dismutases (SODs) catalyse the conversion of superoxide radicals to hydrogen peroxide and molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the Mn/Fe-binding family is one. In humans, there is a cytoplasmic Cu/Zn SOD, and a mitochondrial Mn/Fe SOD. N-terminal domain is a long alpha antiparallel hairpin. A small fragment of YTRE_LEPBI matches well - sequencing error?201179052173257783.54137.949862177056591130819175######25
Overington and HMM_iterative_training
Domain
82
PF00082
Peptidase_S8Subtilase familySubtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see Pfam:PF00089). Structure is an alpha/beta fold containing a 7-stranded parallel beta sheet, order 2314567.4462667659710623363032044.221917749445299005844489086######25OveringtonDomain
83
PF00083
Sugar_trSugar (and other) transporterNULL3312981542584511913601779.513020411795052278100978161171######27
Prosite hmmls-iteration
Family
84
PF00084
SushiSushi repeat (SCR repeat)NULL32132661716572248356.72630.82215470170813881496551134575######23
Swissprot_feature_table
Domain
85
PF00085
ThioredoxinThioredoxinThioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active centre disulfide bond. Some members with only the active site are not separated from the noise.34807366769088970101.22341.25219851143923747370868106116######23PrositeDomain
86
PF00086
Thyroglobulin_1Thyroglobulin type-1 repeatThyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation [2]. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 and 5 with 6.165139464547258966.43019.4424890252651861080514328######21
Swissprot_feature_table
Domain
87
PF00087
Toxin_TOLIPSnake toxin and toxin-like proteinThis family predominantly includes venomous neurotoxins and cytotoxins from snakes, but also structurally similar (non-snake) toxin-like proteins (TOLIPs) such as Lymphocyte antigen 6D and Ly6/PLAUR domain-containing protein. Snake toxins are short proteins with a compact, disulphide-rich structure. TOLIPs have similar structural features (abundance of spaced cysteine residues, a high frequency of charge residues, a signal peptide for secretion and a compact structure) but, are not associated with a venom gland or poisonous function. They are endogenous animal proteins that are not restricted to poisonous animals [1].462367216872.73155.86187889192495647######24OveringtonDomain
88
PF00088
TrefoilTrefoil (P-type) domainNULL19244625350424742.9336.347265891161536294654######21
Swissprot_feature_table
Domain
89
PF00089
TrypsinTrypsinNULL7076462327828332023204.62553.415698113899311986120684736######29
SCOP and Prosite
Domain
90
PF00090
TSP_1Thrombospondin type 1 domainNULL2463967219583343749.53410.7610463614309242795174766704######22
Published_alignment
Domain
91
PF00091
TubulinTubulin/FtsZ family, GTPase domainThis family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial cell division. It is part of a ring in the middle of the dividing cell that is required for constriction of cell membrane and cell envelope to yield two daughter cells. FtsZ and tubulin are GTPases. FtsZ can polymerise into tubes, sheets, and rings in vitro and is ubiquitous in eubacteria and archaea. Tubulin is the major component of microtubules.873177227018702367177.93642.931207206070149122748440045######28PrositeDomain
92
PF00092
VWAvon Willebrand factor type A domainNULL1276347635162913269171.61929.481461619608239575198674201######31ProdomDomain
93
PF00093
VWCvon Willebrand factor type C domainThe high cutoff was used to prevent overlap with Pfam:PF00094.1922367746479458.63411.9336084260862261607022580######21
Published_alignment
Domain
94
PF00094
VWDvon Willebrand factor type D domainSwiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.4022690315792293144.42218.9837435384983661821823276######28DotterDomain
95
PF00095
WAPWAP-type (Whey Acidic Protein) 'four-disulfide core'WAP belongs to the group of Elafin or elastase-specific inhibitors.41995522747751944.53514.91161101995340374419750######24
Swissprot_feature_table
Domain
96
PF00096
zf-C2H2Zinc finger, C2H2 typeThe C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [2].159120191458616481418223.14120.4517833521555773628018229601209189######29Boehm SDomain
97
PF00097
zf-C3HC4Zinc finger, C3HC4 type (RING finger)The C3HC4 type zinc-finger (RING finger) is a cysteine-rich domain of 40 to 60 residues that coordinates two zinc ions, and has the consensus sequence: C-X2-C-X(9-39)-C-X(1-3)-H-X(2-3)-C-X2-C-X(4-48)-C-X2-C where X is any amino acid [1]. Many proteins containing a RING finger play a key role in the ubiquitination pathway [2].3534770371614132643.6327.03500994425100631882225347######28
Swissprot_feature_table
Domain
98
PF00098
zf-CCHCZinc knuckleThe zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involved in eukaryotic gene regulation, such as C. elegans GLH-1. Structure is an 18-residue zinc finger.7962734881680221317.7435.4520323815514342585215466850######26
Overington and HMM_iterative_training
Domain
99
PF00100
Zona_pellucidaZona pellucida-like domainNULL3371263431476571230.91738.1422076214949721025613166######26
Swissprot_feature_table
Family
100
PF00101
RuBisCO_smallRibulose bisphosphate carboxylase, small chainNULL8221753638192798.23657.227919334120720783327######23SwissprotDomain