1 of 43

Structure and cross link proteomics

2 of 43

Amino acids

3 of 43

Charged Amino acids

4 of 43

Polar Amino acids

5 of 43

Non-polar amino acids

6 of 43

Hydrophobic and hydrophilic

7 of 43

8 of 43

Special cases

Polar

Non-polar

Non-polar

Cysteine can form disulfide bond which is important for protein folding

Glycine is the smallest amino acid. Due to its size, it is rare to be replaced by other amino acids. Thus, it is usually located at constant position.

Proline is a circular amino acid. Due to its special formation, it affects protein structure. It can terminate the alpha helix.

9 of 43

Amino acid, peptide and protein

10 of 43

Secondary structure of protein

11 of 43

Tertiary structure of protein

  • Three dimensional shape of a protein
  • The tertiary structure will have a single polypeptide chain "backbone" with one or more protein secondary structures, the protein domains.
  • Amino acid side chains may interact and bond in a number of ways.

12 of 43

Protein domain

  • A region of the protein's polypeptide chain that is self-stabilizing
  • Folds independently from the rest.
  • Many proteins consist of several domains
  • One domain may appear in a variety of different proteins
  • Domains often form functional units

13 of 43

14 of 43

15 of 43

Protein shape and protein function

  • Protein shape is normally related to its function -> family

16 of 43

Contact sites of proteins

  • If the locations of amino acids of a protein or between two proteins are close, they may have an interaction and may assist to form the protein conformation.

Structure stability

Antibody-antigen interaction

Protein complex

Protein-protein interaction

2nd structure formation

17 of 43

Protein-protein interaction (PPI)

  • The interactions include
    • Electrostatic forces
    • Hydrogen bonding
    • Hydrophobic effect
  • Shape plays a key role

18 of 43

The ways to study protein structure or PPIs

19 of 43

X-ray and NMR

  • The conventional experimental methods to study protein structures or know how protein-protein interactions perform are X-ray and NMR.
  • X-ray
    • Useful for large structures: Not limited by size or atomic weight
    • High atomic resolution
    • The sample must be crystallizable
  • NMR
    • Dynamic technique
    • Non-destructive and non-invasive
    • Can provide unique insights into dynamics and intramolecular interactions
    • Size limited
    • Noise level is high
    • Low atomic resolution

They can only study one protein at once

20 of 43

Cross-linking proteomics

  • Cross-linking proteomics is a technique used in proteomics research to study protein-protein interactions and protein structures.
  • It can systematically study protein structure and interaction by checking the contact site of the residues.
  • It involves the use of cross-linking reagents to covalently link interacting or neighboring proteins within a complex or protein structure.
  • By cross-linked protein complexes, cross-linking proteomics provides valuable information about protein-protein interactions, the architecture of protein complexes, and the arrangement of protein domains. It can help elucidate protein interaction networks, validate protein-protein interaction models, and contribute to understanding the three-dimensional structures of proteins and protein complexes.

21 of 43

Cross linker

  • Crosslinkers, also known as bifunctional crosslinkers, are reagents that contain two or more reactive groups which covalently attach via a spacer to functional groups on proteins or other biomolecules.
  • Three types of crosslinkers are available: homobifunctional crosslinkers, heterobifunctional crosslinkers, and photoreactive crosslinkers.
  • Homobifunctional crosslinkers
    • Have identical reactive groups, primarily amine-to-amine or sulfhydryl-to-sulfhydryl. They are typically used to form intramolecular crosslinks or to prepare polymers from monomers.
  • Heterobifunctional crosslinkers
    • Have different reactive groups such as amine-to-sulfhydryl, carboxyl-to-amine, or sulfhydryl-to-carboxyl. They are useful for preparing conjugates between two different biomolecules.
  • Photoreactive crosslinkers
    • Part of heterobifunctional crosslinkers, it reacts with nucleophiles or form C-H insertion sites after exposure to UV light.

22 of 43

A crosslinker has certain characteristics

functionality

length

MS feature

number of reactive groups

type of reactive groups

homo-

functional

hetero-

functional

bi-

functional

tri-

functional

nonzero

zero

non-cleavable

MS-cleavable

23 of 43

A common crosslinker example 1: disuccinimidyl suberate (DSS)

a single carbon chain known as �‘a spacer arm’ or ‘a linker region’

11.4 Ångström (Å)

24 of 43

A common crosslinker example 2: disuccinimidyl sulfoxide (DSSO)

10.3 Å

25 of 43

A common crosslinker example 3: NHS-ester/diazirine

12.5 Å

NHS-ester

NHS-diazirine

26 of 43

UV cross linking

  • UV cross-linking is a technique used in molecular biology and protein-protein interaction studies. Here are some advantages and disadvantages of UV cross-linking:
  • Advantages of UV cross-linking:
    • Simplicity: UV cross-linking is a relatively simple and straightforward technique, requiring minimal specialized equipment.
    • Speed: UV cross-linking reactions can be performed rapidly, allowing for efficient experimental workflows.
    • In situ cross-linking: UV cross-linking can be performed directly on cells or tissues, preserving the native context of protein interactions.
    • Non-specific cross-linking: UV cross-linking can capture both specific and non-specific protein interactions.
    • Preservation of weak interactions
  • Disadvantages of UV cross-linking:
    • Non-specificity: leading to false-positive interactions.
    • Limited distance constraints: UV cross-linking primarily captures proximal interactions within a short distance.
    • Protein damage
    • Low cross-linking efficiency
    • Experimental optimization: UV cross-linking conditions, such as UV intensity, exposure time, and cross-linking reagent concentration, may require optimization for each specific application to achieve optimal results.

27 of 43

Structural information is lost during typical shotgun proteomics experiments

MS/MS spectra

proteins

peptides

No interacted peptides are remained

28 of 43

Chemical crosslinking makes spatially close residues remain together

chemical crosslinking combined with mass spectrometry

XL-MS, CX-MS or CLMS

crosslinked proteins

proteins

crosslink products

MS/MS spectra

29 of 43

Obtained peptide mixture is more complex

single or

ordinary peptide

30 of 43

Obtained peptide mixture is rather more complex

31 of 43

Obtained peptide mixture is more complex

the most abundant

e.g. strong cation exchange �or size exclusion chromatography to enrich crosslinked peptides

crosslink products

single or

ordinary peptide

32 of 43

Obtained peptide mixture is more complex

monolinked peptide or

deadend or

Type 0

crosslink products

e.g.�DSS mass shift ∼156 Da

33 of 43

Obtained peptide mixture is more complex

looplinked peptide or

cyclic peptide or

intra-peptide or

Type 1

crosslink products

e.g.�DSS mass shift:∼138Da

34 of 43

Obtained peptide mixture is more complex

crosslinked peptide or

inter-peptide

Type 2

crosslink products

35 of 43

Obtained peptide mixture is more complex

high order crosslink or

multiple modifications

Type 3

crosslink products

Type 2, 1

Type 2, 2

36 of 43

Specialized and automated software are required �to analyze complex crosslink samples

MS/MS data set

ID XL-peptides Spectrum title Score�1 ASEGKK-DKERR SpecA 126�2 ASEGKK-DKERR SpecB 84

3 ASEGKK-ADKLEEQIR SpecC 32

Cross-linked peptide-to-spectrum match (CSM)

pLink

xQuest

StavroX

SIM-XL

Xi

Kojak

Xilmass

MeroX

XlinkX

Search engines

XL search database

Non-cleavable: � purified protein samples

MS-cleavable: � proteome-wide/large scale

37 of 43

Workflow of cross linking proteomics

38 of 43

Identification of non-cleavable or MS-cleavable crosslinkers

39 of 43

Identification of non-cleavable or MS-cleavable crosslinkers

40 of 43

Identification of non-cleavable or MS-cleavable crosslinkers

41 of 43

Example

42 of 43

Structural modeling

predicting structure

protein-protein docking

43 of 43

Visualization

List of identified crosslinks

A1-K60

K50-K75

K60-K95

K62-K95

2D

    • ProXL
    • xiNet
    • xVis

3D

    • PyMol
    • UCSF Chimera
    • xWalk
  • A crosslinker imposes a certain distance constraint

  • Euclidean distance between two central carbon (Cα−Cα)