ComPPI: a cellular compartment-specific database for protein–protein interaction network analysis
Veres et al., NAR Database Issue, 2015
Assumptions
Manually curated data integration and generation of localization and interaction confidence scores
Data Sources
“Manual” curation
Manual
Localization Tree
Cytosol
Nucleus
Mitochondrion
Secretory-Pathway
Membrane
Extracellular
Localization Score
$\varphi_{LocX} = V_{res}\, p_{LocX}$
Weights:
experimental: $p_{LocX} = 0.8$
predicted: $p_{LocX} = 0.7$
unknown: $p_{LocX} = 0.6$
$\varphi_{LocX}$: probability in location X
$V_{res}$: number of localization entries
$p_{LocX}$: evidence weight
$\varphi_{LocX} = 1 - \left((1 - p_{LocX})^{V_{res}} \cdot \ldots\right)$
[Figure: compartments shown are Nucleus, Cytoplasm, Membrane, Extracellular]
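The expanded localization score can be sketched in a few lines of Python. This is a minimal sketch, not ComPPI's implementation: it assumes the elided factor in the formula is the analogous term for each of the other evidence types, and the `entry_counts` input layout and the function name are hypothetical.

```python
from math import prod

# Evidence-type weights quoted on the slide.
EVIDENCE_WEIGHTS = {"experimental": 0.8, "predicted": 0.7, "unknown": 0.6}


def localization_score(entry_counts):
    """Score for one protein in one compartment.

    entry_counts maps an evidence type to the number of localization
    entries of that type (hypothetical layout, not ComPPI's schema).
    Assumption: every evidence type contributes a factor (1 - p) ** n.
    """
    # Probability that no entry is correct, treating entries as independent.
    miss = prod(
        (1.0 - EVIDENCE_WEIGHTS[kind]) ** count
        for kind, count in entry_counts.items()
    )
    return 1.0 - miss


# One experimental and one predicted entry: 1 - 0.2 * 0.3, about 0.94.
print(round(localization_score({"experimental": 1, "predicted": 1}), 2))
```

With no entries at all the score is 0, and each additional entry can only raise it, which matches the intuition that more independent evidence means higher confidence.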
Interaction Score
[Figure: compartments shown are Nucleus, Cytoplasm, Membrane, Extracellular]
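The slide does not spell out the interaction score formula. The sketch below assumes, based on one reading of the ComPPI paper rather than anything stated here, that the two proteins' localization scores are multiplied per compartment and the per-compartment products are then combined by probabilistic disjunction. The function name and input layout are hypothetical.

```python
from math import prod


def interaction_score(loc_a, loc_b):
    """Combine two proteins' localization scores into one interaction score.

    loc_a, loc_b: dict mapping compartment name -> localization score in [0, 1].
    Assumption: per-compartment product, then probabilistic disjunction.
    """
    # Only compartments annotated for both proteins can contribute.
    shared = set(loc_a) & set(loc_b)
    per_compartment = [loc_a[c] * loc_b[c] for c in shared]
    # Probabilistic disjunction: 1 - product of the complements.
    return 1.0 - prod(1.0 - s for s in per_compartment)


a = {"Nucleus": 0.8, "Cytoplasm": 0.5}
b = {"Nucleus": 0.5, "Membrane": 0.9}
print(round(interaction_score(a, b), 2))  # only Nucleus is shared: 0.8 * 0.5
```

Under this assumption, a pair with no shared compartment scores 0, which is what makes the score usable as a filter.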
So what does this provide?
“(i) the filtration of localization-based biologically unlikely interactions—where the two interacting proteins have no common localization
(ii) the prediction of possible new localizations and localization-based biological functions”
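Point (i) amounts to a set-intersection filter. A minimal sketch, assuming interactions are given as protein-name pairs and localizations as a dict of compartment sets; the protein names and data layout are made up for illustration.

```python
def shares_compartment(locs_a, locs_b):
    """True if the two proteins have at least one compartment in common."""
    return bool(set(locs_a) & set(locs_b))


def filter_interactions(interactions, localizations):
    """Keep only interactions whose partners share a compartment.

    interactions: iterable of (protein_a, protein_b) pairs.
    localizations: dict protein -> collection of compartment names.
    Proteins without any localization data are treated as sharing nothing.
    """
    kept = []
    for a, b in interactions:
        if shares_compartment(localizations.get(a, ()), localizations.get(b, ())):
            kept.append((a, b))
    return kept


# Hypothetical toy data.
localizations = {
    "P1": {"Nucleus", "Cytosol"},
    "P2": {"Nucleus"},
    "P3": {"Extracellular"},
}
interactions = [("P1", "P2"), ("P1", "P3"), ("P2", "P4")]
print(filter_interactions(interactions, localizations))
```

Dropping pairs with missing localization data is a design choice of this sketch; a real pipeline might prefer to keep them and flag them as unscored.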
Evidence Weights??
Optimization constraints:
experimental > predicted AND experimental > unknown
“maximizes the number of high confidence interactions in the positive control data set (HPA) and simultaneously maximizes the number of low confidence interactions in the ComPPI data set not containing HPA data”
Weights:
experimental: $p_{LocX} = 0.8$
predicted: $p_{LocX} = 0.7$
unknown: $p_{LocX} = 0.6$
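The constrained search can be illustrated with a small grid enumeration. This is only a sketch: the grid resolution and function names are assumptions, and the `objective` callable is a placeholder for the paper's HPA-based criterion, which is not reproduced here.

```python
from itertools import product


def candidate_weights(steps=10):
    """Enumerate (experimental, predicted, unknown) weight triples on a grid.

    Only triples satisfying the slide's constraints are kept:
    experimental > predicted and experimental > unknown.
    """
    grid = [i / steps for i in range(1, steps)]  # 0.1, 0.2, ..., 0.9 by default
    return [
        (exp, pred, unk)
        for exp, pred, unk in product(grid, repeat=3)
        if exp > pred and exp > unk
    ]


def best_weights(objective, steps=10):
    """Return the admissible triple that maximizes a user-supplied objective."""
    return max(candidate_weights(steps), key=objective)


# The reported weights are one admissible point on this grid.
print((0.8, 0.7, 0.6) in candidate_weights())
```

Note that the constraints say nothing about the order of predicted and unknown, so triples with unknown above predicted are also admissible.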
Does it work?
Crotonase
“crotonase was shown to be overexpressed and localized in the cytosol in hepatocarcinoma cells, where it contributes to lymphatic metastasis”
Concerns
Other Concerns
What happens when GO adds new cellular component terms? New proteins in UniProt?
What if we wanted to add mouse, rat, something else? How hard would it be?
Other sources