1 of 20

CSE 180: summary

Yana Safonova

2 of 20

Sequencing data

Illumina

Long-read technologies (PacBio, Nanopore)

Short reads with long-range information (10x, Hi-C)

Whole Genome Sequencing

Whole Exome Sequencing

RNA-Seq

Targeted sequencing

4 of 20

Read assembly

Alignment to reference

De novo gene prediction

Differential gene expression

Haplotype assembly

5 of 20

Read assembly

Population studies / comparative genomics

Metagenomics

Alignment to reference

De novo gene prediction

Differential gene expression

Haplotype assembly

Genomics

Transcriptomics

6 of 20

Rep-seq: special type of RNA-seq data

B and T cells produce special types of receptors that are not encoded in the original (germline) genome

7 of 20

Antibodies are agents of the adaptive immune system

  • Antibodies are proteins that bind to an antigen and cause its neutralization

  • Antibodies are not encoded in the genome directly; they are the result of somatic recombination of the genome

[Figure: V, D, and J gene segments on chr 14]

8 of 20

Antibodies are agents of the adaptive immune system

  • Diversity of antibody genes is extremely high
  • Set of produced antibodies (antibody repertoire) is unique for an individual

[Figure: V, D, and J gene segments on chr 14, joined into an antibody gene]
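
The recombination idea on this slide can be sketched in a few lines of Python. This is a toy model under stated assumptions, not a description of the real locus: the segment sequences, the segment counts, and the names `V_SEGMENTS`, `D_SEGMENTS`, `J_SEGMENTS`, `recombine`, and `count_combinations` are all hypothetical. It only shows that one V, one D, and one J segment are chosen and joined, and that the number of possible choices multiplies.

```python
import random

# Hypothetical toy segments; real V, D, and J segments are much longer
# and far more numerous.
V_SEGMENTS = ["CAGGTGCAG", "GAGGTGCAG", "CAGGTCCAG"]
D_SEGMENTS = ["GGTATA", "GGGTAT", "AGCAGC", "GACTAC"]
J_SEGMENTS = ["ACTGGTTC", "ACTACTGG"]


def recombine(rng):
    """Pick one V, one D, and one J segment and join them in that order."""
    v = rng.choice(V_SEGMENTS)
    d = rng.choice(D_SEGMENTS)
    j = rng.choice(J_SEGMENTS)
    return v, d, j, v + d + j


def count_combinations():
    """Number of distinct V-D-J choices available in this toy locus."""
    return len(V_SEGMENTS) * len(D_SEGMENTS) * len(J_SEGMENTS)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs give the same choice
    v, d, j, gene = recombine(rng)
    print("V:", v, "D:", d, "J:", j)
    print("antibody gene:", gene)
    print("possible V-D-J combinations:", count_combinations())
```

Even this tiny example yields 3 x 4 x 2 = 24 distinct combinations, which hints at why the diversity of antibody genes is so high.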

9 of 20

Antibodies are subject to fast evolution

10 of 20

The immune system mutates and amplifies a binding antibody

The mutation rate in antibody genes is 3-4 orders of magnitude higher than in the rest of the genome
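
As a rough illustration of mutation acting on an antibody gene, the sketch below applies independent point substitutions to a sequence. It is a simplification: the per-base `rate`, the example sequence, and the function name `mutate` are assumptions made for illustration, and the slides give only the relative rate, not an absolute one.

```python
import random

BASES = "ACGT"


def mutate(sequence, rate, rng):
    """Substitute each base independently with probability `rate`."""
    mutated = []
    for base in sequence:
        if rng.random() < rate:
            # Replace the base with one of the three other bases.
            mutated.append(rng.choice([b for b in BASES if b != base]))
        else:
            mutated.append(base)
    return "".join(mutated)


if __name__ == "__main__":
    rng = random.Random(0)
    original = "CAGGTGCAGGGTATAACTGGTTC"  # hypothetical antibody gene
    # The rate here is an arbitrary illustrative value, not a measured one.
    print(original)
    print(mutate(original, 0.1, rng))
```

Calling `mutate` repeatedly on its own output gives a chain of increasingly diverged sequences, which is the raw material for the clonal lineages discussed on the following slides.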

11 of 20

One antibody = one antigen

12 of 20

Antibody repertoire is a set of clonal lineages

13 of 20

Antibody repertoire is a set of unknown clonal lineages
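
Because the lineages are unknown, they have to be inferred from the sequences themselves. One simple way to approximate this, shown here only as a sketch and not as the method used in the course, is single-linkage clustering by Hamming distance. The threshold `max_distance`, the equal-length requirement, and the names `hamming` and `cluster_lineages` are assumptions.

```python
def hamming(a, b):
    """Number of mismatching positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


def cluster_lineages(sequences, max_distance):
    """Group sequences connected by chains of pairs within `max_distance`."""
    parent = list(range(len(sequences)))

    def find(i):
        # Follow parent links to the cluster representative (with path halving).
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Merge the clusters of every pair of sequences that are close enough.
    for i in range(len(sequences)):
        for j in range(i + 1, len(sequences)):
            if hamming(sequences[i], sequences[j]) <= max_distance:
                parent[find(i)] = find(j)

    groups = {}
    for i, seq in enumerate(sequences):
        groups.setdefault(find(i), []).append(seq)
    # Largest candidate lineage first.
    return sorted(groups.values(), key=len, reverse=True)


if __name__ == "__main__":
    reads = ["AAAA", "AAAT", "AATT", "GGGG", "GGGC"]  # toy sequences
    for lineage in cluster_lineages(reads, max_distance=1):
        print(lineage)
```

The pairwise loop is quadratic in the number of sequences, so this form is only practical for small toy inputs.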

14 of 20

Application of Rep-seq data

Healthy individual

Flu vaccination

HIV donor

15 of 20

Proteomics

  • Quantitative proteomics
  • Protein sequencing
  • Drug design

16 of 20

Large sequencing projects

Genomics

Transcriptomics

Metagenomics (microbiome)

Repertoire sequencing

Proteomics

17 of 20

Phylogenetics

HIV

18 of 20

Protein folding

19 of 20

Protein binding

20 of 20

Bioinformatics is not limited to the analysis of sequences

karyotyping

finding non-chromosomal DNA