1 of 11

Functional association analysis

on protein sumoylation and RNA binding

Xiaotong Yao

Mentors: Rebecca Bish, Christine Vogel

16th Jan 2014

2 of 11

Outline

  • Introduction
  • Progress
  • Discussion

3 of 11

Introduction--SUMO

  • Small Ubiquitin-like MOdifier

a post-translational modification

  • signature C-terminal di-glycine (GG) motif

conjugated to lysine residues of target proteins

  • involved in various biological processes

e.g. protein stability, nuclear-cytoplasmic transport, transcriptional regulation

Flotho, A., & Melchior, F. (2013). Sumoylation: A Regulatory Protein Modification in Health and Disease. Annual Review of Biochemistry, 82(1).

4 of 11

Introduction

Our hypothesis is

RNA binding proteins tend to be sumoylated more often than average

  • Becky’s inspiration from RBP research
  • essential for yeast & human cells
  • no prior integrative study

We plan to answer this question

  • statistically: enough evidence?
  • evolutionarily: human & yeast?

5 of 11

Progress--get data

list of sumo targets in yeast

6 of 11

Progress--get data

list of RNA binding proteins in yeast from Gene Ontology

or from Superfamily

which turned out not to be a very good choice

7 of 11

Progress--hypergeometric test

Model: draw k balls at random, without replacement, from an urn (the total proteome) containing

m white balls (RNA binding proteins) and n black balls (all other proteins);

k is the size of the list of interest (here, the SUMO targets),

and q is the number of white balls among the k drawn

(the overlap between SUMO targets and RBPs).

H0: No association between RNA binding and sumoylation; any overlap is due to chance, and the overlap size X follows a hypergeometric distribution with parameters (m, n, k).

HA: RNA binding proteins and SUMO targets overlap more often than expected by chance, i.e. the observed q lies in the upper tail, P(X >= q).
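
The urn model above can be written out as a short calculation. This is a minimal sketch using only the Python standard library; the function name hypergeom_upper_tail is hypothetical, the arguments follow the slide's symbols (q, m, n, k), and it is not necessarily the tool used to produce the results in this talk.

```python
from math import comb


def hypergeom_upper_tail(q, m, n, k):
    """Return P(X >= q) for X ~ Hypergeometric(m white, n black, k drawn).

    q: observed overlap (white balls among the k drawn)
    m: number of white balls (e.g. RNA binding proteins)
    n: number of black balls (all other proteins)
    k: number of balls drawn (e.g. SUMO targets)
    """
    total = comb(m + n, k)      # all ways to draw k balls from the urn
    lowest = max(q, 0)          # overlap cannot be negative
    highest = min(m, k)         # overlap cannot exceed either list
    # math.comb returns 0 when asked for more black balls than exist,
    # so impossible draws contribute nothing to the sum.
    favorable = sum(
        comb(m, i) * comb(n, k - i) for i in range(lowest, highest + 1)
    )
    return favorable / total


# Toy urn: 5 white, 5 black, draw 4, observe at least 3 white.
p_toy = hypergeom_upper_tail(q=3, m=5, n=5, k=4)
print(p_toy)
```

Exact integer arithmetic is used until the final division, so very small tail probabilities are not lost to rounding in intermediate steps.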

8 of 11

Progress--Results

  • non-redundant lists of SUMO targets: 555 in yeast, 1861 in human
  • RNA binding protein lists from Gene Ontology: 561 in yeast, 969 in human
  • proteome sizes used: 5820 in yeast, 20597 in human
  • hypergeometric p-values: <10^-6 in yeast, <10^-80 in human
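
For scale, the overlap expected under H0 is the mean of the hypergeometric distribution, k * m / (m + n). The sketch below computes it from the list and proteome sizes on this slide; the function name expected_overlap is hypothetical, and the observed overlaps are not given here, so none is assumed.

```python
def expected_overlap(m, k, proteome_size):
    """Mean of the hypergeometric distribution: k draws, m white balls,
    proteome_size balls in total (m + n on the test slide)."""
    return k * m / proteome_size


# Sizes taken from the results slide.
yeast = expected_overlap(m=561, k=555, proteome_size=5820)
human = expected_overlap(m=969, k=1861, proteome_size=20597)

print(f"expected overlap under H0, yeast: {yeast:.1f}")
print(f"expected overlap under H0, human: {human:.1f}")
```

An observed overlap well above these values is what drives the small p-values.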

9 of 11

Discussion

Current results indicate

  • significant enrichment of RNA binding proteins in the SUMO target list, in both organisms

Potential confounding factors in the current results

  • heterogeneous SUMO target data
  • incomplete SUMO data

10 of 11

Discussion--misleading factors

11 of 11

Discussion

Future work

  • test whether any other function is enriched among SUMO targets
  • check whether the SUMO target consensus motif appears in RBPs
  • hypothesize why RBPs are preferentially sumoylated