1 of 66

GENOMICS-ENABLED PRECISION MEDICINE FOR CANCER AND DISEASE

2 of 66

Why precision medicine is precise

  • Have more studies?
  • Hypothesis is better?
  • Technology is more up-to-dated?
  • Experiment is more precise?
  • Data is larger?
  • Analysis is more mature?

3 of 66

Hypothesis driven V.S Data driven

Hypothesis driven

Data driven

(Deductive reasoning)

(Inductive reasoning)

General theory

Testable hypothesis

Empirical test

Confirmation/rejection

Empirical observation

Detectable pattern

Tentative hypothesis

General theory

4 of 66

Hypothesis driven V.S Data driven

Hypothesis driven

Data driven

Prospective hypothesis

Need

Not necessary

Precision

Depends

Higher

Targets

Normally single

Numerous

Novel discoveries

Normally few

Numerous

Multi-discipline

Not necessary

Need

Regulation/mechanism

Part

Much complete

5 of 66

Example for hypothesis driven study

Lung cancer

Obtain sample

Hypothesis

Experiment

Discovery

Publication

Numerous times

6 of 66

Example for data driven study

Lung cancer

Obtain sample

Experiment

Confirmation

Publication

Data mining

Experiment based on data

7 of 66

Down syndrome

How can we detect the disorder from gene level?

8 of 66

Omics

  • Omics aims at the collective characterization and quantification of pools of biological molecules that translate into the structure, function, and dynamics of an organism or organisms.

9 of 66

DNA to proteins

10 of 66

11 of 66

12 of 66

Basic procedures of omics study

13 of 66

Genomics

  • Genomics – DNA-Seq
    • Mutation (SNP)
    • Protein-DNA interactions
  • Transcriptomics – RNA-Seq
    • Differential expression
    • Non-coding RNA
    • Splice variants
    • Protein-RNA interactions

14 of 66

Replicates

  • Avoid bias
  • Produce reliable results
  • Make statistics reasonable

15 of 66

Biological Replicates VS technical replicates

  • Biological replicates
    • Experiments from different samples but same conditions
    • Avoid the special or individual cases
  • Technical replicates
    • Doing multiple times of experiments from the identical samples
    • Avoid the technical influences.

16 of 66

Instrument

17 of 66

Alzheimer’s disease

  • Risk gene – APOE-e4
  • APOE-e4 can be inherited from the parents
  • These genes, which are estimated to account for 1% or less of Alzheimer's cases

Do we have an efficient way to detect more novel mutations of Alzheimer’s disease?

18 of 66

DNA-Seq

  • DNA sequencing is the process of determining the nucleic acid sequence – the order of nucleotides in DNA.

19 of 66

20 of 66

Read depth and length

  • Read depth
    • The amount of DNA/RNA reads produced from sequencers
  • Read length
    • How many nucleotides are included in a read

21 of 66

Example

GeneA

GeneA

ATACGTCATCCGCGCTATCGAATATTTTACGACCGCGATGAGGA

Reference Sequence

GTCATCCACGCTATCGAATATTTTACGACC

GTCATCCGCGCTATCGAATATTTTACGACC

Lung cancer

Normal

22 of 66

The way to detect mutation

23 of 66

The way to detect mutation

24 of 66

Mutations of Cancer

  • Tumor-suppressor genes BRCA1 and BRCA2
  • HER2-positive breast cancer

25 of 66

Mechanism of BRCA2 and BRCA1

26 of 66

HER2 mutations

27 of 66

Normal cell V.S HER2+ cell

28 of 66

Genome-wide association study (GWAS)

  • GWA studies typically focus on associations between single-nucleotide polymorphisms (SNPs) and traits like major human diseases

29 of 66

Genome-wide association study (GWAS)

30 of 66

Shortages of DNA-Seq

  • The related genes of the mutations are still unclear
  • The influenced functions or pathways of the mutated genes are unknown
  • Silent mutations - Change a codon into a mutant codon that specifies exactly the same amino acid.
  • Gene expression cannot be measured

31 of 66

RNA-seq

  • RNA-Seq can reveal the quantity of RNAs in biological sample at a given moment and condition

32 of 66

33 of 66

Applications of Rna-Seq

  • Alternative gene spliced transcripts
  • Gene fusion
  • Non-coding RNA
  • Quantification comparison

34 of 66

Splice event

35 of 66

Gene fusion

36 of 66

Philadelphia chromosome – CML and BCR-ABL

37 of 66

Quantification comparison

38 of 66

Differential expression analysis

39 of 66

Clustering heatmap

40 of 66

41 of 66

Non-coding RNA

  • RNA molecule that is not translated into a protein
  • tRNA, rRNA

42 of 66

RNA-Seq is the efficient approach

43 of 66

Bacteria – small RNA (sRNA)

  • Length = 50-500 nts
  • Highly structured and containing several stem-loops.
  • Mechanism
    • Bind to protein targets and modify the function of the bound protein.
    • Interact with mRNA targets and regulate gene expression by binding to complementary mRNA and blocking translation.
    • Unmask or block the ribosome-binding site

44 of 66

45 of 66

Functions of sRNA

  • Stress response
  • Regulation of outer membrane proteins
  • Virulence
  • Biofilm formation
  • Antibiotic resistance

46 of 66

Example of sRNAs in Bordetella pertussis

  • MgSO4 is known to repress the expression of the virulence factors of B. pertussis that are coordinately regulated by the bvg locus
  • BvgAS is a two-component system, when BvgS is active, BvgA is phosphorylated (BvgA~P), and virulence-activated genes are expressed

47 of 66

Micro RNA (miRNA)

  • Length: ~22 nts
  • miRNA forms base-pairing with complementary mRNA to achieve RNA silencing
    • Cleavage of the mRNA
    • Destabilization of the mRNA through shortening of its poly(A) tail
    • Less efficient translation of the mRNA
  • Related diseases
    • Chronic lymphocytic leukemia
    • Heart disease
    • DNA repair
    • Kidney disease
    • Nervous – Stroke, Alzheimer’s disease
    • Obesity
    • Hemostatsis

48 of 66

49 of 66

50 of 66

51 of 66

Circular RNA (circRNA)

52 of 66

53 of 66

Small interfering RNA (siRNA)

Main differences

  • miRNA
    • Internal
    • Can interact with not 100% complementary sequences
  • siRNA
    • External
    • Only can interact with 100% complementary sequences

54 of 66

First siRNA treatment for amyloidosis

55 of 66

Long non-coding RNA (lncRNA)

  • Length is ~200 nts
  • Currently, only small proportion of lncRNAs are proved having biological functions.

56 of 66

57 of 66

DNA-Seq

DNA

RNA

RNA-Seq

58 of 66

Why we need single cell

  • A typical human cell consists
    • ~2 x 3.3 billion base pairs of DNA
    • ~600 million bases of mRNA
  • Every cell is different, the differences are lost in bulk cells.
  • Studying heterogenicity of the samples from diseases, tissues or organisms.
  • Understanding the potential risk of specific diseases deeply.

59 of 66

Personalized medicine

  • One of the most important goal for personalized medicine is to understand the risk of getting diseases.
  • Only using single cell can really reflect the risk of a person for getting a specific disease.

60 of 66

Bulk cells V.S Single cell

61 of 66

The issue of bulk cells

  • Bulk cells can reflect the most representative situation but will ignore the samples which may be few but important.

Ignore

62 of 66

Cell differentiation

63 of 66

Acute Myeloid Leukemia (AML)

  • AML contains many subtypes. The commonly found types are at least 8 types.

64 of 66

Progress

65 of 66

Academic research

RNA-Seq

DNA-Seq

66 of 66

Academic research

Genomics

Transcriptomics