1
Last modified: 9/6/2022 14:41:19 (auto-updated)
2
USE | Do NOT use | Notes | Example
3
terminase, small subunit | TerS | Sisi_1
4
terminase | If there are not two obvious large and small terminase genes in the same genome, just assign the function "terminase". | TM4_4
5
terminase, large subunit | TerL | Sisi_2
6
terminase, large subunit (ATPase domain) | Only applicable to Cluster AY genomes (8-21-18), AT genomes (2-28-2020), and DT genomes (7-4-20). AS genomes appear to have a gene 1 with some alignment to the large subunit, but it is unclear if the domains are intact. (10-21-19, 2-21-2020) | Also applies to Cluster GD genomes. | Auxilium_gp2
7
terminase, large subunit (nuclease domain) | Only applicable to Cluster AY genomes (8-21-18), AT genomes (2-28-2020), and DT genomes (7-4-20). AS genomes appear to have a gene 1 with some alignment to the large subunit, but it is unclear if the domains are intact. (10-21-19, 2-21-2020) | Also applies to Cluster GD genomes. | Auxilium_gp3
8
DNA packaging ATPase protein | For Tectiviridae only. | Badulia_12
9
10
portal protein | head to tail connector | TM4_5
11
12
13
scaffolding protein | Scaffold | D29_gp16
14
capsid maturation protease | Sometimes the CMP hits to ClpP proteases. If so, look for a serine-type endopeptidase activity. | Langerak_gp4
15
major capsid protein | capsid | Sisi_6
16
major capsid pentamer protein | Rosebush gp16 | experimental evidence
17
major capsid hexamer protein | Rosebush gp15 | experimental evidence
18
capsid accessory protein | Patience_gp4 | experimental evidence
19
minor capsid protein | Patience_gp22 | experimental evidence
20
Hypothetical Protein | MuF-like minor capsid protein | 6/16/21
21
capsid decoration protein, LamD-like | Turuncu_32
22
capsid decoration protein | head decoration protein | Look for the 7 beta strands and 2 alpha helices of PDB's 1CSE_A (lambda). | Alice_107, Alice_232 | https://www.nature.com/articles/nsb0300_230.pdf
23
capsid maturation protease | We are no longer using "capsid morphogenesis protein". | Schubert_gp3
24
major capsid and protease fusion protein | In this case, the scaffolding function is also part of the fusion, but we don't explicitly write it in the function name. | Cluster AN Arthrobacter phages, EE Microbacterium phages
25
head fiber protein | Briton15_18
26
head-to-tail adaptor | We are no longer calling "head-to-tail connector" or "head-to-tail connector complex protein" (3-6-19). | Must have an HHPRED alignment to one of the following crystal structures: SPP1 15 (5A21 chain C or D in the macromolecular complex), OR to one of the following crystal structures: HK97 gp6 or Bacillus protein yqbG. | Please see the portal and head-to-tail connector case study at the links provided. Note: SPP1 gp17 and 17.1 are NOT h-t connectors (they are the tail terminator and major tail subunit). | https://seaphages.org/meetings/33/
27
head-to-tail stopper | Must have an HHPRED alignment to one of the following crystal structures: SPP1 16 (5A21 chain E or F in the macromolecular complex). | Please see the portal and head-to-tail connector case study at the links provided. Note: SPP1 gp17 and 17.1 are NOT h-t connectors (they are the tail terminator and major tail subunit). | https://seaphages.org/meetings/33/
28
tail terminator | Must have an HHPRED alignment to one of the following: SPP1 17 (5A21 chain G in the macromolecular complex) or Lambda U (3FZ2, chains A through F). | Please see the portal and head-to-tail connector case study at the links provided. Note: SPP1 gp17 and 17.1 are NOT h-t connectors (they are the tail terminator and major tail subunit). | https://seaphages.org/meetings/33/
29
major tail protein | major tail subunit | Sisi_11
30
tail assembly chaperone | Tail scaffolding protein | Evidence is needed to call TAC: please see the Bioinformatics Guide for what evidence is needed. | TM4_15; 16 | https://seaphagesbioinformatics.helpdocsonline.com/article-54
31
Hypothetical Protein | tail assembly chaperone | Do not call TAC when there is NO evidence. | Cluster EA1
32
tape measure protein | Tape Measure, tmp, tapemeasure | TM4_17
33
minor tail protein | tail fiber-like protein, collagen-like, glycine rich | If you have significant hits to either collagen-like or glycine-rich proteins, and they are in the syntenic region of minor tail proteins, you can call them minor tail proteins. | Sisi_15-18, Nebs_gp4
34
minor tail protein, D-ala-D-ala carboxypeptidase | Must include "minor tail" as part of the functional assignment. | Sisi_19
35
tail sheath protein | Found in contractile tailed phages. | Alice_120
36
tail fiber
37
tailspike protein | Tailspike has triple beta coils. Make sure you are matching the spike part of the protein and not the N-terminal tail tip binding domain. | Turuncu_23
38
tail needle protein | Zeta1847_gp20
39
baseplate J protein | Alice_133
40
tail tube protein | Found in contractile tailed phages. | RosiePosie_19
41
baseplate wedge protein | RosiePosie_38
42
43
Hypothetical Protein | HK97_gp10 | The Hendrix lab (studiers of HK97 and capsid construction) never identified a function for this gene.
44
lysin A | LysA, endolysin A | Only appropriate for Mycobacteriophages, or for Actino phage in which you can identify a lysin B. | If not a Mycobacteriophage, must have a lysin B; otherwise it is endolysin. | Sisi_30
45
lysin A, protease M15 domain | Some Gordonia phages have lysin A split into two genes. Make sure to label each domain. | If not a Mycobacteriophage, must have a lysin B; otherwise it is "endolysin, protease M15 domain".
46
lysin A, protease M23 domain | If not a Mycobacteriophage, must have a lysin B; otherwise it is "endolysin, protease M23 domain".
47
lysin A, protease C39 domain | Some Gordonia phages have lysin A split into two genes. Make sure to label each domain. | If not a Mycobacteriophage, must have a lysin B; otherwise it is "endolysin, protease C39 domain".
48
lysin A, glycosyl hydrolase domain | Some Gordonia phages have lysin A split into two genes. Make sure to label each domain. | If not a Mycobacteriophage, must have a lysin B; otherwise it is "endolysin, glycosyl hydrolase domain".
49
lysin A, L-Ala-D-Glu peptidase domain | If not a Mycobacteriophage, must have a lysin B; otherwise it is "endolysin, L-Ala-D-Glu peptidase domain".
50
lysin A, N-acetylmuramoyl-L-alanine amidase domain | If not a Mycobacteriophage, must have a lysin B; otherwise it is "endolysin, N-acetylmuramoyl-L-alanine amidase domain".
51
lysin A, protease domain | If not obviously M15 or C39. | Some actinobacteriophages (not yet seen in the Mycobacteriophages) have lysin A split into two genes. Make sure to label each domain. | If not a Mycobacteriophage, must have a lysin B; otherwise it is "endolysin, protease domain".
52
53
lysin B | LysB, endolysin B | Sisi_31
54
endolysin | Some Arthrobacter and Streptomyces phages have a single endolysin with domains not found in the Mycobacteriophages (like the CHAP domain). Use "endolysin" rather than lysin A if the phage does not infect Mycobacterium and no lysin B can be identified.
55
endolysin, N-acetylmuramoyl-L-alanine amidase domain
56
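The naming rule stated in the lysin A and endolysin rows above can be summarized as a small decision function. This is only an illustrative sketch: the function name and its parameters are hypothetical and not part of any SEA-PHAGES tool, and it assumes the annotator has already determined the host and whether a lysin B is present in the genome.

```python
from typing import Optional


def lysin_function_name(
    infects_mycobacterium: bool,
    has_lysin_b: bool,
    domain: Optional[str] = None,
) -> str:
    """Suggest a function name following the table notes (hypothetical helper)."""
    # "lysin A" applies to Mycobacteriophages, or to other phages in which
    # a lysin B can be identified; otherwise the table says "endolysin".
    base = "lysin A" if (infects_mycobacterium or has_lysin_b) else "endolysin"
    # Split genes are labeled per domain, e.g. "lysin A, protease M15 domain".
    return f"{base}, {domain} domain" if domain else base


print(lysin_function_name(True, False))
print(lysin_function_name(False, False, "protease M15"))
```

The domain string is passed through unchanged, so it should be one of the domain names listed in the table.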
57
58
holin | Evidence needed to call a holin can include biochemical data (1), sequence similarity to genes with biochemical data (2), at least 2 transmembrane domains found and the gene being adjacent to the endolysins (3), conserved domain hits (4), and the absence of additional transmembrane domains in the area. The literature suggests that some phages have more than one holin; for now, when we see multiple possibilities for a holin gene, let's call them membrane proteins. | D29_11
59
serine integrase | Bxb1_35
60
tyrosine integrase | Sisi_43
61
tyrosine homologous recombinase | For phages that have an apparent integrase, but none of the rest of the machinery to be temperate is identifiable in the genome (like an immunity repressor, attP, excise, etc.). | Settecandela_38
62
serine homologous recombinase
63
ParA-like dsDNA partitioning protein | RedRock_37
64
ParB-like dsDNA partitioning protein | Do not label anything a ParB or as having a ParB partitioning domain without the presence of a ParA partner in the genome. | RedRock_38
65
ParB-like nuclease domain | Does not have to have a ParA partner.
66
ParB-N-terminal-like domain methyltransferase | At present (7-5-22), this is an orpham. It is a big gene (because it contains the methyltransferase). Do not confuse with the hits to the DNA binding portion of the ParB. | Evaa_gp2
67
RepA-like replication initiator | Rachaly_36
68
immunity repressor | Repressor | Likely to have an HHPred match to C1 protein in lambda. | Sisi_45
69
Imm-like superinfection immunity protein | This is not this phage's immunity repressor. | This is a Pfam hit to T4's superinfection immunity protein; significance was an e-value of 10e-14. | Niza_72
70
immunity repressor (Cluster A) | To be used when there is a second immunity repressor that is NOT associated with the phage's own immunity cassette (system). | Both examples are of non-Cluster A phages that contain a Cluster A repressor, but the Cluster designation in the function name should be the cluster where it came from. | LRRHood_44, SamScheppers_83
71
excise | Excisionase, Xis, only one per phage; check to see if CRO | Do not call a protein excise unless you can identify the integrase and the immunity repressor in the phage. A more general "helix-turn-helix DNA binding protein" might be more appropriate if you can't distinguish otherwise. | BPs_34
72
recombination directionality factor | RDF | RedRock_58
73
Cro (control of repressor's operator) | Do not call a protein Cro unless you can identify the integrase, excise, and the immunity repressor in the phage. | Che9c_47
74
antirepressor | Sisi_47
75
DnaE-like DNA polymerase III (alpha) | Spud_203
76
DNA polymerase I | Luchador_50
77
DNA polymerase III sliding clamp (Beta) | Corndog_84
78
DnaC-like helicase loader | Alice_189
79
helicase loader | Samman98_70
80
DNA helicase | ATP-dependent helicase | Chah_54
81
DnaB-like dsDNA helicase | RedRock_68
82
RepA-like helicase | Sour_52
83
DNA primase | Spud_199
84
DNA primase/helicase | Make sure it has both parts. | Schubert_31
85
DNA primase/polymerase | Make sure it has both parts. | Rosebush_54
86
DNA primase/polymerase/helicase | Make sure it has all three parts. | GreenHearts_47
87
DnaQ-like (DNA polymerase III subunit) | DnaQ is the exonuclease of Pol III (epsilon subunit). | Sisi_35
88
nucleotidyl transferase | Spud_3
89
FIC domain nucleotidyl transferase | MUST contain the HPFxxGNGR motif. | Bradissa_34
90
polynucleotide kinase | pnk | Spud_250
91
Lsr2-like DNA bridging protein | Lsr2 | Omega_61
92
RecA-like DNA recombinase | Spud_205
93
RecB-like exonuclease/helicase | If both a helicase and a nuclease domain are present, the RecB label should be used. | RedRock_72
94
host nuclease inhibitor | Laroye_83
95
VRR-Nuc domain protein | Zeina_89
96
Cas4 exonuclease | This family of exonucleases is similar to the exonuclease domain of RecB. The Cas4 label should be used if the gene includes only the exonuclease region. If the gene also includes a helicase domain, the RecB label should be used. Cas4 family nucleases tend to have alignments to the crystal structures 4R5Q_A and 41C1_A and to the PD-(D/E)XK nuclease superfamily (PF12705.7, among others). | BiteSize_gp54
97
Ku-like dsDNA break-binding protein | Ku | Omega_206
98
MRE11 double-strand break endo/exonuclease
99
DprA-like DNA processing chain A | CherryBlossom_70
100
DprA-like ssDNA binding protein