CSE 180: summary
Yana Safonova
Sequencing data
Illumina
Long-read technologies (PacBio, Nanopore)
Short reads with long-range information (10x, Hi-C)
Whole Genome Sequencing
Whole Exome Sequencing
RNA-Seq
Targeted sequencing
Read assembly
Alignment to reference
De novo gene prediction
Differential gene expression
Haplotype assembly
Population studies / comparative genomics
Metagenomics
Genomics
Transcriptomics
Rep-seq: a special type of RNA-seq data
B and T cells produce special types of receptors that are not encoded in the original (germline) genome
Antibodies are agents of the adaptive immune system
[Figure: V, D, and J gene segments of the germline locus on chr 14]
[Figure: an antibody gene assembled from V, D, and J segments of the chr 14 locus]
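The V, D, and J labels above can be read as a recipe: an antibody gene takes one segment of each kind from the germline locus and joins them. What follows is a minimal Python sketch of that idea, not a model of the real process. The segment names and sequences in `GERMLINE` are hypothetical toy values, and the sketch assumes clean end-to-end joining, ignoring the random insertions and deletions that occur at segment junctions in real recombination.

```python
import random

# Hypothetical toy germline segments; real V, D, and J genes are much longer.
GERMLINE = {
    "V": {"V1": "CAGGTGCAG", "V2": "GAGGTGCAG", "V3": "CAGGTCACC"},
    "D": {"D1": "GGTATA", "D2": "AGGATA", "D3": "GTGGCT"},
    "J": {"J1": "TGGGGC", "J2": "TGGGGA"},
}


def recombine(rng):
    """Pick one V, one D, and one J segment and join them into one gene."""
    names = []
    parts = []
    for kind in ("V", "D", "J"):
        name = rng.choice(sorted(GERMLINE[kind]))
        names.append(name)
        parts.append(GERMLINE[kind][name])
    return tuple(names), "".join(parts)


def count_combinations():
    """Number of distinct V-D-J choices available in the toy locus."""
    total = 1
    for segments in GERMLINE.values():
        total *= len(segments)
    return total


labels, gene = recombine(random.Random(0))
print(labels, gene)
print(count_combinations())  # 3 * 3 * 2 = 18
```

Even this toy locus yields 18 distinct genes from 8 segments, which illustrates why the resulting receptors cannot be read directly from the germline genome.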
Antibodies are subject to rapid evolution
The immune system mutates and amplifies a binding antibody
The mutation rate in antibody genes is 3-4 orders of magnitude higher than in the rest of the genome
One antibody = one antigen
An antibody repertoire is a set of clonal lineages
An antibody repertoire is a set of unknown clonal lineages
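Because the lineages are unknown, they have to be inferred from the sequenced reads. The sketch below shows one common heuristic, offered as an assumption and not as the method used in this course: reads are grouped together if they share the same V gene, J gene, and CDR3 length, and their CDR3 sequences are linked by a chain of near-identical neighbors. The function name `group_into_lineages`, the `max_mismatches` threshold, and the example reads are all hypothetical.

```python
from collections import defaultdict


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def group_into_lineages(reads, max_mismatches=1):
    """Group reads into putative clonal lineages (single-linkage heuristic).

    reads: list of (v_gene, j_gene, cdr3) tuples.
    Returns a sorted list of lineages, each a list of read indices.
    """
    parent = list(range(len(reads)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Only reads with the same V gene, J gene, and CDR3 length are compared.
    buckets = defaultdict(list)
    for idx, (v_gene, j_gene, cdr3) in enumerate(reads):
        buckets[(v_gene, j_gene, len(cdr3))].append(idx)

    # Link any two reads in a bucket whose CDR3s are close enough.
    for members in buckets.values():
        for pos, a in enumerate(members):
            for b in members[pos + 1:]:
                if hamming(reads[a][2], reads[b][2]) <= max_mismatches:
                    parent[find(a)] = find(b)

    lineages = defaultdict(list)
    for idx in range(len(reads)):
        lineages[find(idx)].append(idx)
    return sorted(lineages.values())


# Hypothetical example reads: (V gene, J gene, CDR3 amino acid sequence).
example_reads = [
    ("V1", "J1", "CARDYW"),
    ("V1", "J1", "CARDFW"),  # one mismatch from read 0
    ("V1", "J1", "CARGFW"),  # one mismatch from read 1
    ("V2", "J1", "CARDYW"),  # different V gene
    ("V1", "J1", "CTTTTW"),  # same genes, distant CDR3
]
print(group_into_lineages(example_reads))  # [[0, 1, 2], [3], [4]]
```

Single linkage places reads 0 and 2 in the same lineage even though they differ at two positions, because read 1 connects them. This mirrors how mutations accumulate step by step within a lineage, but it can also merge unrelated sequences if the threshold is too loose.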
Applications of Rep-seq data
Healthy individual
Flu vaccination
HIV donor
Proteomics
Large sequencing projects
Genomics
Transcriptomics
Metagenomics (microbiome)
Repertoire sequencing
Proteomics
Phylogenetics
HIV
Protein folding
Protein binding
Bioinformatics is not limited to the analysis of sequences
karyotyping
finding non-chromosomal DNAs