ABCDEFGHIJKLMNOPQRST
1
Group MembersNameFunction
2
1Mona HögbergHiSeq -> assembly
3
2Bodil CronholmHiSeq -> trimming -> assembly
4
3Muhammad KashifMiSeq -> trimming -> assembly
5
4Tania TajrinMiSeq -> assembly
6
7
Dataset 1G5 HiSeq dataDataset 2G5 MiSeq data
8
No trimmingNo trimming
9
SpadesIDBA-UDRaySpadesIDBA-UDRay
10
Assembly time11m80m143m19Assembly time14m0m45
11
Number of readsNumber of reads202058
12
Number of contigsNumber of contigs
13
Number of contigs >1kbNumber of contigs >1kb
14
Total assembly sizeTotal assembly size
15
Total assembly size >1kbTotal assembly size >1kb
16
Largest contig270072021642469Largest contig
17
N508636552813712N50750
18
G+C%G+C%
19
Number of ORFsNumber of ORFs
20
CompletenessCompleteness
21
# of contaminant contigs# of contaminant contigs
22
23
24
Trimming with TrimmomaticTrimming with Trimmomatic
25
SpadesIDBA-UDRaySpadesIDBA-UDRay
26
Assembly time1:090:122:14Assembly time1m570m293m18
27
Number of reads (merged + unmerged pairs)101855101855101855Number of reads (merged + unmerged pairs)2 x 1010292 x 1010292 x 101029
28
Number of contigs767991Number of contigs227210588
29
Number of contigs >1kb475050Number of contigs >1kb527461
30
Total assembly size310889308118212290Total assembly size425634392106398473
31
Total assembly size >1kb291377287902184235Total assembly size >1kb346336324985260119
32
Largest contig424692629918700Largest contig431991420320816
33
N501370492734303N501406762247048
34
G+C%40.3740.4239.86G+C%40.6940.6540.57
35
Number of ORFs471419616Number of ORFs277545588
36
Completeness0.11390.11360.0919Completeness0.17050.14870.1323
37
# of contaminant contigs# of contaminant contigs
38
39
40
Optional assembly optimization
41
Dataset 3N21 MiSeq data
42
reads treatment
43
assembly program and parameters
44
Assembly time
45
Number of reads
46
Number of contigs
47
Total assembly size
48
Largest contig
49
N50
50
G+C%
51
Number of ORFs
52
Completeness
53
# of contaminant contigs
54
55
Total assembly size177148
56
Largest contig24892
57
N5011747
58
G+C%48.94
59
Number of ORFs321
60
Completeness0.0212
61
Markers found2/139
62
# of contaminant contigs
63
16S identity> AY861966.1.1230 Archaea;Euryarchaeota;Thermoplasmata;South African Goldmine Gp(SAGMEG);uncultured euryarchaeote
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100