Analysis of Single-Cell T-Cell Receptor Sequencing
Kelly Street
Assistant Professor
Division of Biostatistics
Jan 12, 2023
What is a T cell?
T Cell Structure
Why should we be interested?
Motivation
Motivation
Motivation
Motivation
Motivation
Application: Renal Cell Carcinoma
Figure 1. Single-cell profiling of clear cell renal cell carcinoma
T Cell Exhaustion
Figure 3. CD8+ T cell trajectory analysis reveals increased terminal exhaustion with advancing disease stage
Trajectory Analysis
Figure 3. CD8+ T cell trajectory analysis reveals increased terminal exhaustion with advancing disease stage
TCR Sequencing
Figure 4. TCR analysis reveals lower diversity in terminally exhausted T cells
T Cell Structure
cellranger vdj
filtered_contig_annotations.csv
TCR Sequencing
barcode is_cell contig_id high_confidence length chain v_gene
TCGGGACAGACTAAGT-4 True TCGGGACAGACTAAGT-1_c.. True 826 TRB TRBV5-6
TCGGGACAGACTAAGT-4 True TCGGGACAGACTAAGT-1_c.. True 697 TRA TRAV6
TCGGGACAGACTAAGT-4 True TCGGGACAGACTAAGT-1_c.. True 695 TRA TRAV1-2
CTGATAGTCTTGTACT-4 True CTGATAGTCTTGTACT-1_c.. True 707 TRB TRBV6-6
CTGATAGTCTTGTACT-4 True CTGATAGTCTTGTACT-1_c.. True 851 TRA TRAV6
d_gene j_gene c_gene full_length productive cdr3 cdr3_nt reads
None TRBJ1-1 TRBC1 True True CASSPGDTEAFF TGTGCCAGCAGCCCGGGCGA.. 58796
None TRAJ34 TRAC True True CALDGPGNTDKLIF TGTGCTCTAGACGGCCCAGG.. 44152
None TRAJ33 TRAC True None None None 32494
TRBD2 TRBJ2-7 TRBC2 True True CASSYSTVYEQYF TGTGCCAGCAGTTACTCTAC.. 54812
None TRAJ39 None False None None None 478
umis raw_clonotype_id raw_consensus_id type sample
12 clonotype9 clonotype9_consensus_1 TCR S11_N
5 clonotype9 clonotype9_consensus_2 TCR S11_N
5 clonotype9 None TCR S11_N
9 clonotype1 clonotype1_consensus_2 TCR S11_N
17 clonotype1 None TCR S11_N
Cells can be ambiguous
chain productive cdr3
TRB True CASDIGGESYNELTF
TRA True CIPRFAFGPLTVF
Cells can be ambiguous
chain productive cdr3
TRB True CASDIGGESYNELTF
TRA True CIPRFAFGPLTVF
✔
Cells can be ambiguous
chain productive cdr3
TRB True CASDIGGESYNELTF
TRA True CIPRFAFGPLTVF
✔
chain productive cdr3
TRB True CSADISGSSYNEQFF
TRA True CIVRVAFGQNFVF
TRA None CAPSFSGNTPLVF
✔
Cells can be ambiguous
chain productive cdr3
TRB True CASDIGGESYNELTF
TRA True CIPRFAFGPLTVF
✔
chain productive cdr3
TRB True CSADISGSSYNEQFF
TRA True CIVRVAFGQNFVF
TRA None CAPSFSGNTPLVF
✔
chain productive cdr3
TRB True CSADISGSSYNEQFF
TRA True CIVRVAFGQNFVF
TRA True CAPSFSGNTPLVF
?
Cells can be ambiguous
chain productive cdr3
TRB True CASDIGGESYNELTF
TRA True CIPRFAFGPLTVF
✔
chain productive cdr3
TRB True CSADISGSSYNEQFF
TRA True CIVRVAFGQNFVF
TRA None CAPSFSGNTPLVF
✔
chain productive cdr3
TRB True CSADISGSSYNEQFF
TRA True CIVRVAFGQNFVF
TRA True CAPSFSGNTPLVF
?
chain productive cdr3
TRB True CASSQEAGTSGNTIYF
TRB None CSAMVRPSGNTNKLTF
?
Cells can be ambiguous
chain productive cdr3
TRB True CASDIGGESYNELTF
TRA True CIPRFAFGPLTVF
✔
chain productive cdr3
TRB True CSADISGSSYNEQFF
TRA True CIVRVAFGQNFVF
TRA None CAPSFSGNTPLVF
✔
chain productive cdr3
TRB True CSADISGSSYNEQFF
TRA True CIVRVAFGQNFVF
TRA True CAPSFSGNTPLVF
?
chain productive cdr3
TRB True CASSQEAGTSGNTIYF
TRB None CSAMVRPSGNTNKLTF
?
chain productive cdr3
TRB True CASSWGLGTEAFF
IGL None None
TRA None None
TRA True CALSGRGEGGSEKLVF
TRA True CAGLDTGTASKLTF
IGH None None
Multi None None
IGK None None
?
# alpha chains
# beta chains
Cells can be ambiguous
Motivation
Cells can be ambiguous
chain productive cdr3
TRB True CASDIGGESYNELTF
TRA True CIPRFAFGPLTVF
chain productive cdr3
TRB True CSADISGSSYNEQFF
TRA True CIVRVAFGQNFVF
TRA None CAPSFSGNTPLVF
chain productive cdr3
TRB True CSADISGSSYNEQFF
TRA True CIVRVAFGQNFVF
TRA True CAPSFSGNTPLVF
chain productive cdr3
TRB True CASSQEAGTSGNTIYF
TRB None CSAMVRPSGNTNKLTF
chain productive cdr3
TRB True CASSWGLGTEAFF
IGL None None
TRA None None
TRA True CALSGRGEGGSEKLVF
TRA True CAGLDTGTASKLTF
IGH None None
Multi None None
IGK None None
Cells can be ambiguous
chain productive cdr3
TRB True CASDIGGESYNELTF
TRA True CIPRFAFGPLTVF
chain productive cdr3
TRB True CSADISGSSYNEQFF
TRA True CIVRVAFGQNFVF
TRA None CAPSFSGNTPLVF
chain productive cdr3
TRB True CSADISGSSYNEQFF
TRA True CIVRVAFGQNFVF
TRA True CAPSFSGNTPLVF
chain productive cdr3
TRB True CASSQEAGTSGNTIYF
TRB None CSAMVRPSGNTNKLTF
chain productive cdr3
TRB True CASSWGLGTEAFF
IGL None None
TRA None None
TRA True CALSGRGEGGSEKLVF
TRA True CAGLDTGTASKLTF
IGH None None
Multi None None
IGK None None
Clonotype Identification
Ambiguous clonotype
Unique clonotype
beta
alpha
Do these cells share a clonotype?
Clonotype Identification
beta
alpha
common beta
common alpha
Do these cells share a clonotype?
Clonotype Identification
beta
alpha
common beta
common alpha
Do these cells share a clonotype?
Clonotype Identification
beta
alpha
common beta
common alpha
Do these cells share a clonotype?
Clonotype Identification
beta
alpha
common beta
common alpha
TCR analysis
Clonotype information was derived from the Cell Ranger V(D)J Annotation pipeline v2.1.0. All assembled contigs were filtered to retain only those that were assigned a raw clonotype ID and categorized as being both full length and productive. Cell barcodes were then filtered to retain only those with two contigs, categorized as one TCR alpha chain and one TCR beta chain. Each clonotype was assigned a unique identifier, consisting of the predicted amino acid sequences of the CDR3 regions of these two chains, which was used to match clonotypes across samples.
Option 1: Remove ambiguity
Option 2: Everything’s a clonotype
beta
alpha
common beta
common alpha
Clonotype 1:
Clonotype 2:
Option 2: Everything’s a clonotype
beta
alpha
common beta
common alpha
Clonotype 1:
Clonotype 2:
Option 3: Big Tent clonotypes
Clonotype Identification
Clonotype 1:
Clonotype 2:
Ambiguous:
beta
alpha
common beta
common alpha
Proportional Assignment (E-M)
66.6% Clonotype 1
33.3% Clonotype 2
Clonotype 1:
Clonotype 2:
beta
alpha
common beta
common alpha
multi-mapped reads : gene transcripts :: ambiguous T cells : clonotypes
Proportional Assignment (E-M)
read
transcripts
T cell
clonotypes
50 - 50
multi-mapped reads : gene transcripts :: ambiguous T cells : clonotypes
80 - 20
Proportional Assignment (E-M)
read
transcripts
T cell
clonotypes
multi-mapped reads : gene transcripts :: ambiguous T cells : clonotypes
Proportional Assignment (E-M)
read
transcripts
T cell
clonotypes
100 - 00
Same Data, More Cells
Same Data, More Cells
Unique Counts
log1p Unique Counts
E-M Counts
log E-M Counts
(currently) limited:
Quantifying (alpha) Diversity
Improved Signal
Figure 4. TCR analysis reveals lower diversity in terminally exhausted T cells
Improved Signal
Unique
E-M
Simulation
Bioconductor Package
Mercedeh Movassagh
Jill Lundell
David Braun
Mingzhi Ye
The Team
Bonus Content:
Quantifying Plasticity
in Cancer
Kelly Street
Assistant Professor
Division of Biostatistics
Jan 12, 2023
What about DNA methylation?
Plasticity
Shen S, Clairambault J. Cell plasticity in cancer cell populations. F1000Res. 2020 Jun 22;9:F1000 Faculty Rev-635. doi: 10.12688/f1000research.24803.1. PMID: 32595946; PMCID: PMC7309415.
Methylation Variability
Expression Variability
Expression Variability
Expression Variability
Plasticity
GSE97693
Plasticity
GSE81861
Plasticity
GSE201348
David Braun
Plasticity
HCRN
Plasticity
GSE97693
DepMap “essential” genes
Plasticity
GSE97693
Kimberly Siegmund
Darryl Shibata
The Team