1 of 482

Introduction: from image to inference

  • Chris Rorden
    • fMRI strengths and limitations.
    • fMRI signal: tiny, slow, hidden in noise.
    • fMRI processing: a sample experiment.
    • fMRI anatomy: stereotaxic space.

1

Taylor

John

Chris

Alex

Natalie

Samaneh

Roger

2 of 482

A brief history of Nobel prizes for MRI

1937 - Isidor I Rabi, a professor at Columbia University observed the quantum phenomenon dubbed nuclear magnetic resonance (NMR). He recognized that the atomic nuclei show their presence by absorbing or emitting radio waves when exposed to a sufficiently strong magnetic field.

1952 Felix Bloch and Edward Purcell win Nobel for nuclear magnetic resonance (NMR)

1973 Paul Lauterbur uses gradients to encode space, creating 2D MRI ex-vivo images

1970’s Sir Peter Mansfield develops phase and frequency encoding, pioneers echo planar imaging. Makes MRI viable.

3 of 482

A brief history of MRI

1977/1978 - Raymond Damadian builds the first human MRI scanner by hand.

1993 - Functional MR imaging of the brain is introduced.

Damadian with 1st human MRI: Indomitable

4 of 482

No wonder Dr. Freeman Cope, an NMR collaborator of Dr. Damadian's on other NMR studies, told Dr. Damadian, "Your cancer paper contains a second discovery. No one has ever reported a comparison study of the healthy tissues. There are major differences here as well."

5 of 482

Sir Peter Mansfield (with Lauterbur), 2003 Nobel Prize Development of MRI Obituary, Feb 9, 2017

6 of 482

MRI as Art

7 of 482

The Many Flavors of MRI

8 of 482

MRI in the Media

Scary examples from House, Grey’s Anatomy, Terminator Genesys, etc.

9 of 482

Modern neuroscience

  • Different tools exist for inferring brain function.
  • No single tool dominates, as each has limitations.
  • This course focuses on fMRI.

9

Temporal resolution

good (millisecond)

poor (months)

good

(neuron)

poor

(whole brain)

scr

erp

fmri

pet

tms

nirs

lesions

eeg

iap

Spatial resolution

Brain Imaging Methods

10 of 482

Modern neuroscience techniques

11 of 482

Single Cell Recording (SCR)

  • Directly measure neural activity
    • Exquisite timing/spatial information
  • Example
    • Cells in monkey V1 have brief response starting ~50 ms after visual stimuli is displayed.
  • Limitations
    • Invasive: Hard to infer human cognition from animals
    • Poor field of view: only sample small brain region

11

12 of 482

fMRI signal sluggish

  • Unlike single cell recordings (SCR), in fMRI there is a huge delay between activity and signal change.
  • Response in visual cortex shows peak response ~5s after visual stimuli.
  • Thus, fMRI is an indirect measure of brain activity

12

0 6 12 18 24

2

1

0

Time (seconds)

13 of 482

What is the fMRI signal?

  • fMRI is ‘Blood Oxygenation Level Dependent’ measure (BOLD).
    • Capillaries dilate
    • Increased oxygenated versus deoxygenated blood.
  • At first oxygen in blood dips (local consumption)
  • Brain regions become oxygen rich after activity.
  • Very indirect measure.

13

Basal State

Stimulated State

  • Dilation
  • Increased flow
  • More Hbr02 relative to Hbr
  • Increased Signal

14 of 482

fMRI Processing

  • We must heavily process fMRI data to extract a signal.
  • The signal in the raw fMRI data is influenced by many factors other than brain activity.
  • We need to filter the data to remove these artifacts.
  • We will examine why each of these steps is used.

Processing Steps

    • Motion Correct
  1. Spatial
  2. Intensity
    • Physiological Noise Removal
    • Temporal Filtering
    • Temporal Slice Time Correct
    • Spatial Smoothing
    • Normalize
    • Statistics

14

15 of 482

Let’s conduct a study

  • Anatomical Hypothesis: lesion studies suggest location for motor-hand areas (upside-down Omega shaped fold in motor cortex)
  • Ask person to tap finger while in MRI scanner – predict contralateral activity in motor hand area..

15

M1: movement

S1: sensation

16 of 482

Data Collection

  • Participant Lies in scanner.
  • Watch computer screen through mirror.
  • Computer shows flashing arrows (up, left, right).
  • Tap left/right finger after seeing left/right arrow, rest when up arrow is shown.
  • Collect 302 3D volumes of data, one every 1.92s (total time = ~10min).

16

17 of 482

Raw Data

  • The scanner reconstructs 302 3D volumes.
    • Each volume = 64x64x36 voxels
    • Each voxel is 3x3x3.6mm.

  • We need to process this raw data to detect task-related changes.

17

18 of 482

Motion Correction

  • Unfortunately, people move their heads a little during scanning.
  • We need to process the data to create motion-stabilized images.
  • Otherwise, we will not be looking at the same brain area over time.

18

19 of 482

Spatial smoothing

  • Each voxel is noisy
  • By blurring the image, we can get a more stable signal (neighbors show similar effects, noise spikes attenuated).

19

20 of 482

Predicted fMRI signal

  • We need to generate a statistical model.
  • We convolve expected brain activity with hemodynamic response to get predicted signal.
    • Since the HRF is sluggish, convolution blurs the rapid neural signal across many seconds.
    • Note that predicted signal at any moment is often influenced by several previous events.

20

Predicted fMRI signal

=

Event Onset and

Neural Signal

HRF

21 of 482

Predicted fMRI signal

  • We generate predictions for neural responses for the left and right arrows across our dataset.
  • Statistics will identify which areas show this pattern of activity.
  • Several possible statistical contrasts (crucial to inference):
    1. Activity correlated with left arrows: visual cortex, bilateral motor.
    2. More activity for left than right arrows: contralateral motor.

21

22 of 482

fMRI signal change is tiny, noise is high

  • Right motor cortex is brighter after movement of left hand.
  • Note signal increases from ~12950 to ~13100, only 1.2%!
    • And this is after all of our complicated processing to reduce noise.

22

23 of 482

Coordinates - normalization

23

  • Different people’s brains look different
  • ‘Normalizing’ adjusts overall size and orientation

Raw Images

Normalized Images

24 of 482

Why normalize?

  • Benefits of warping individuals brains to stereotaxic space
    • Universal description for anatomical location
    • Allows other to replicate findings
    • Allows between-subject analysis: crucial for inference that effects generalize across humanity.

24

Group analysis

Subject1

Subject2

Subject3

Subject4

Subject5

25 of 482

Goals for this course

  • fMRI is notoriously difficult technique
    • Sluggish signal
    • Poor signal/noise
    • Must find meaningful statistical contrasts

  • This seminar reveals how to
    • Devise meaningful contrasts
    • Maximize signal, minimize noise
    • Control for statistical errors

25

26 of 482

Reporting findings

  • How do we describe anatomy to others?
    • We could use anatomical names, but often hard to identify.
    • We could use Brodmann’s Areas, but this requires histology – not suitable for invivo research.
  • Both show large between-subject variability.
  • Requires anatomical coordinate system.

26

27 of 482

Ambiguous Coordinates

  • Human brain rotated relative to spine
  • Ambiguous coordinates
    • Dorsal/ventral
    • Rostral/caudal
  • Unambiguous coordinates
    • Head/Foot
    • Superior/Inferior
    • Anterior/Posterior

27

R

C

R

C

R

C

V

D

V

D

V

D

Rat

Human

28 of 482

Anatomy – Common Terms

  • Radiological convention: Left on right side
  • Neurological convention: Left on left side

28

Posterior <> Anterior

Posterior <> Anterior

Inferior <> Superior

lateral < medial > lateral

sagittal

coronal

axial

29 of 482

Oblique Slices

  • Slices that are not cut parallel to an orthogonal plane are called ‘oblique’.

29

Ax

Cor

Oblique

  • Thickness of gray matter impossible to determine using any 2D view.
  • Tissue appears thinner if cut is perpendicular to slice, thicker if cut is angled.

30 of 482

Brain Coordinates

  • On Earth: North, South, East and West.
    • 0˚N/S explicitly defined by spheres rotation (equator).
    • 0˚E/W arbitrary (Greenwich by convention).
  • For brain: Left/Right, Sup./Inf., Ant./Post.
    • Origin of L/R explicitly defined (brain symmetry)
    • Origin of S/I and A/P arbitrary.

30

31 of 482

Coordinates - Talairach

  • Anterior Commissure (AC) is the origin for neuroscience.
    • We measure distance from AC
      • 57x-67x0 means ‘right posterior middle’.
      • Three values: left-right, posterior-anterior, ventral-dorsal

31

32 of 482

Coordinates - Talairach

  • Axis for axial plane is defined by anterior commissure (AC) and posterior commissure (PC).
  • Both are small regions that are clear to see on most scans.

32

PC

AC

Y-

Y+

Z+

Z-

33 of 482

Recognizing the cortical lobes

  • NB Insula (hidden) is also a lobe.

33

34 of 482

The major sulci

  • Postcentral easy to find: becomes intraparietal. Precentral easy to find- attached to superior frontal. Between these is the Central (Rolandic).

34

Interhemispheric (Longitudinal) fissure

Sylvian (lateral) fissure

35 of 482

Major sulci

  • You can usually find the central suclus’ motor hand area (omega shape on axial slice)

35

36 of 482

Gyri and sulci

  • Naming of most gyri (ridges) and sulci (valleys) follows simple pattern of position (superior, middle, inferior) and lobe name.

36

37 of 482

Subcortical structures

  • With a bit of practice, it is easy to find subcortical structures such as caudate nucleus, globus pallidus, putamen.

37

www.mricro.com/anatomy/home.html

38 of 482

Image Center/Width

  • How do we view an image that has higher resolution than our computer screen?
  • Panning changes the ‘image center’.
    • We will not see some of the image.
  • Zooming changes the ‘image width’.
    • We may lose details.

38

Pan

Zoom

39 of 482

Intensity Center/Width (Brightness/Contrast)

  • Computers show 256 shades of gray, most scans have thousands.
  • Adjust brightness ‘window center’
    • E.G. range -64..124 makes muscles gray, 114..302 shows kidneys
    • C/W 30/188 vs C/W 208/188
  • Adjust contrast ‘window width’
    • E.G. range -64..124 shows muscles, �-400..596 shows full range.
    • C/W 30/188 vs C/W 98/996
  • CT intensity is calibrated (kidneys always ~208 Hounsfield units)
  • MRI intensity values vary from scan to scan.

39

Pan

Zoom

40 of 482

Brain research prerequisites?

What skills do you need to do MRI research?

  • Probability and Statistics
  • Computer Programming
  • Linear Algebra
  • Magnetic Resonance Imaging
  • Neurophysiology and biophysics
  • Signal and image processing

40

41 of 482

Viewing images with MRIcroGL

41

Click on the image to move crosshair position

Right-drag over region to adjust intensity range

File/OpenTemplates to view a different image.

Window/Render to see volume rendering.

View/FlipLR changes between radiological and neurological view

x1 Zoom Scale

Intensity range �>49=black >133=white

Crosshair at 43x15x16 Talairach: intensity of 109

L,A,S remind us of left, anterior and superior direction

42 of 482

Basic MRI Safety

43 of 482

Magnetism

  • All substances have some form of magnetism.
    • The degree of magnetism exhibited depends on the atoms that form the material.
  • Magnetic susceptibility: the ability of a substance to become magnetized.
    • Ferromagnetic substances have large magnetic susceptibility (i.e. iron). These substances can be easily magnetized and will become a magnet itself.
  • All magnets have a North and a South pole.
  • All magnets have a “fringe” field which extends to the surrounding area.

44 of 482

Bioeffects

  • Burn Hazards are caused by damaged hardware or by electrical currents produced in conductive loops of material.
    • It is important to not let the patient cross their legs or intertwine their fingers during the exam because it creates a closed circuit and can increase the heating in the subject

  • Localized heating is caused by RF energy absorption to a volume of tissue. FDA limits the amount of absorption we can give to the subject.

45 of 482

Specific Absorption Rate (SAR)

  • The rate the subject absorbs the RF energy is described in terms of Specific Absorption Rate (SAR), measured in watts/kg.

  • SAR is calculated by the patient’s weight, height and the expected increase in body temperature for each imaging pulse sequence.
  • scan.

46 of 482

Forces in the MR Environment

  • There are two types of effects the magnet will have on Ferromagnetic substances

    • Translation:
      • The “Missile Effect”

    • Rotation/Torque:
      • The “Rotational Effect”

47 of 482

MRI Safety - Projectiles

  • Projectile effects of metal objects seriously compromise safety.
  • This potential harm cannot be over emphasized.
    • For example, paper clips can travel at a velocity of 40mph @ 3T.
    • Larger objects travel at a higher velocity and may be fatal.

  • *If you question the ferromagnetic qualities of your equipment please ask the MRI technologist *

48 of 482

Items that can be Damaged by the magnet

  • The magnetic field can seriously damage or impair the following personal items:
    • Cameras
    • Watches
    • Credit /Bank cards
    • Hearing Aids
    • Hair Accessories, Belt Buckles, Shoes
    • I-pods
    • Memory Sticks / USB Drives

Careful screening is a must!

49 of 482

What can you take into the magnet??

  • Some items may contain metal and be safe.
  • Examples of non-ferrous metals are:
    • Brass
    • Aluminum
    • Plastic
  • If you have any questions we can check an object with a hand held magnet
  • If item is implanted in body must have information card on implant. Check www.mrisafety.com.

50 of 482

Noise

  • The MR scanner can produce very high acoustic noise levels (up to 130 dB).

51 of 482

Pregnancy considerations

MR imaging may be used in pregnant women if other non-ionizing forms of diagnostic imaging are inadequate or if the examination provides important information that would otherwise require exposure to ionizing radiation (e.g., fluoroscopy, CT, etc.). It is recommended that pregnant patients be informed that, to date, there has been no indication that the use of clinical MR during pregnancy has produced deleterious effects. However, as noted by the FDA, the safety of MR during pregnancy has not been proved.

  • -SMRI Safety Committee

52 of 482

Tatoos

  • There have been some documented cases of unusual sensation or tingling from a tattoo site during the procedure to receiving burns or raised skin at the site.  
  • If the subject complains of any unusual sensation during the exam the study will be immediately stopped.
  • If the area becomes red or irritated a hand cold compresses will be applied (these can be found in the bathroom storage container at the center)

53 of 482

MRI Physics 1: Image Acquisition

  • Chris Rorden & Paul Morgan
    • Magnetic Resonance
    • Radio frequency absorption
    • Relaxation: Radio frequency emission
    • Gradients
    • Increases signal/noise: antenna selection, field strength.

  • Excellent references:
    • www.cis.rit.edu/htbooks/mri/
    • www.mikepuddephat.com/Page/1603/Principles-of-magnetic-resonance-imaging

53

54 of 482

Anatomy of an atom

  • Atoms are the building blocks of our world.
  • Composed of 3 components
    • Electrons: tiny negatively charged particles.
    • Protons: heavy positively charged particles.
    • Neutrons: heavy particles without charge.
  • Electrons are often thought of like planets: tiny objects distantly orbiting a massive core.
  • Neutrons and protons form the dense nucleus of an atom.

54

55 of 482

Atomic Nuclei

  • Nuclei are composed of protons and neutrons.
  • Protons determine the element.
  • Neutrons determines the isotope
    • Hydrogen: always one proton
      • Usually no neutrons (‘Proton’ 1H ); Rarely one (Deuterium 2H) or two (Tritium 3H : radioactive, spontaneous decay).
    • Helium: always two protons
      • Usually two neutrons (4He); Rarely only one (3He).

55

4He

1H

3He

2H

3H

56 of 482

Nuclear Magnetic Resonance

  • Felix Block and Edward Purcell
    • 1946: the nuclei of some elements absorb and re-emit radio frequency energy when in magnetic field
    • 1952: Nobel prize in physics
  • Atoms with odd number of protons/neutrons spin in a magnetic field
    • Nuclear: properties of nuclei of atoms
    • Magnetic: magnetic field required
    • Resonance: interaction between magnetic field and radio frequency
  • Analogy: atoms precess in a magnetic field like tops spin in gravitational field

56

57 of 482

Resonant Frequency of Nuclei

  • We will focus on 1H
    • Most abundant in body (63% of atoms).
    • Elements with even numbers of neutrons and protons have no spin, so we can not image them (4He, 12C).
    • 23Na and 31P are relatively abundant, so can be imaged.
  • Larmor frequency varies for isotope:
    • 1H = 42.58 Mhz/T
    • 2H = 6.54 Mhz/T
    • 3H = 45.41 Mhz/T
    • 13C = 10.7 Mhz/T
    • 19F = 40.1 Mhz/T
    • 31P = 17.7 Mhz/T
  • Therefore, by sending in a RF pulse at a specific frequency we can selectively energize 1H.

57

Field Strength (Tesla)

298

40

1.0

7.0

1H Larmor Frequency (MHz)

At 1.5T, f = 63.76 MHz

At 3.0T, f = 127.7 MHz

At 7.0T, f = 298.0 MHz

58 of 482

Radiofrequency Pulses

  • Align
    • Nuclei align to static magnetic field.
    • Nuclei precesses at Larmor frequency.
  • Flip
    • We transmit a radiofrequency (RF) pulse at the Larmor frequency will be absorbed.
    • Degree of spin tilted (flipped).
  • Relax
    • Nuclei re-aligns to to static field.
    • As spin returns to low energy state it emits a RF signal at the Larmor frequency.
    • We can measure this signal with an antenna.

58

59 of 482

Electromagnetic Spectrum

  • MRI signals are in the same range as FM radio and TV (30-300MHz).
  • Unlike X-rays, MRI is non-ionizing radiation.
  • Specific absorption rate (SAR):
    • Absorbed RF warms tissue
    • Increases ~ with square of field strength.
    • FDA limits SAR, and is a limiting factor for some protocols
    • For head, limit is 3 W/kg averaged over 10 minutes.
      • Problem for light individuals (children)
      • Tune sequence, or alternate between high and low energy sequences

59

Ionizing Radiation

Breaks Bonds

Non-Ionizing Radiation

Heating

Excites Electrons

Excites Nuclei

60 of 482

Making a spatial image

  • To create spatial images, we need a way to cause different locations in the scanner to generate different signals.
  • To do this, we apply gradients.
  • Gradients make the magnetic field slightly stronger at one location compared to another.
  • Lauterbur: first MRI: 2003 Nobel Prize.

60

61 of 482

Slice Selection Gradient

  • Gradients: field stronger at one location compared to another.
  • Larmor frequency different along this dimension.
  • RF pulse only energizes slice where field strength matches Larmor frequency.

61

127 Mhz

RF pulse

Larmor Freq

126 Mhz

127 Mhz

128 Mhz

62 of 482

Slice Selection Gradient

Gradual gradients select thick slices, steep gradients select thinner slices.

  • Rapid changes in field strength (dB/dt) induce electrical currents.
  • For MRI, FDA limits dB/dt - some protocols elicit peripheral nerve stimulation (mild tingling, muscle twitches).
  • Transcranial Magnetic Stimulation (TMS) intentionally employs extreme dB/dt to stimulate brain activity.

Position of gradient determines which 2D slice is selected.

62

Field Strength

Z Position

Field Strength

Z Position

Field Strength

Z Position

Field Strength

Z Position

63 of 482

Phase encoding gradient

  • Orthogonal gradient between RF pulse and readout
  • This adjusts the phase along this dimension.
  • Analogy: Phase encoding is like timezones. Clocks in different zones will have different phases.

63

64 of 482

Frequency encoding gradient

  • Apply final orthogonal gradient when we wish to acquire image.
  • Slice will emit signal at Lamour frequency, e.g. lines at higher fields will have higher frequency signals.
  • Aka ‘Readout gradient’.

64

RF emission

65 of 482

Reconstruction

  • Raw MRI data is 2D fourier image
  • Modern scanners automatically convert k-space to image space (reconstruction).
  • Images with even number of rows/columns faster to reconstruct
    • This is why most image matrices are a power of 2 (64,128,256)
    • Modern scanners allow other values (e.g. 96) but not primes (97)

65

Reconstruction

66 of 482

MRI scanner anatomy

  • Superconducting magnet generates static field.
    • Always on: only quench field in emergency.
    • niobium titanium wire.
  • Coils allow us to
    • Make static field homogenous (shims: metal [permanent] and solenoids [adjustable])
    • Briefly adjust magnetic field (gradients: solenoid coils)
    • Transmit, receive RF signal (body and head RF coils: antennas)

66

Magnet

Body Coil

Gradients

Permanent Shims (16)

67 of 482

MRI terminology

  • Orientation: typically coronal, sagittal or axial, can be in-between these (oblique)
  • Matrix Size:
    • Voxels in each dimension
  • Field of view:
    • Spatial extent of each dimension.
  • Resolution:
    • FOV/Matrix size.

67

Axial Orientation

64x64 Matrix

192x192mm FOV

3x3mm Resolution

Sagittal Orientation

256x256 Matrix

256x256mm FOV

1x1mm Resolution

68 of 482

Volumes

  • 3D volumes are composed of stacks of 2D slices, like a loaf of bread.
  • Each slice has a thickness.
    • Thicker slices have more hydrogen, so more signal
      • Volume of 1x1x1mm voxel is 1mm3
      • Volume of 1x1x2mm voxel is 2mm3
    • Thinner slices provide higher resolution.
  • Optional: gap between slices.
    • Reduces RF interference
    • Allows fewer slices to cover whole brain.

68

1mm Gap

2mm Thick

3mm

69 of 482

EPI

  • In conventional MRI, we collect one line of our matrix with each RF pulse.
  • So a 64x64 matrix with a TR of 2s will be generated in 128s.
  • Problem: this is unacceptable if the object changes rapidly:
    • Heart motion.
    • Brain activity.
  • Echo Planar Imaging (EPI): By rapidly applying the frequency gradient, we can collect a 2D slice with a single RF pulse.
    • Benefit: Collect entire 2D slice with each TR
    • Disadvantage: spatial warping and signal dropout due to slow encoding (spatial processing lecture).

69

EPI

Multishot

n.b. 4x4 matrix shown

70 of 482

Signal to Noise

  • Signal To noise is given by the formula

Signal = V√N

  • Where V is the volume and N is the number of samples averaged (referred to as ‘Nex’, as in ‘number of excitations’).
  • For example, to get the same SNR as a single 3x3x3mm scan (27mm3) we would need to collect 12 2x2x2mm (8mm3) scans or 768 1x1x1mm (1mm3) scans.

70

71 of 482

Signal to Noise: Antennas

71

Head coil

Surface coil

  • The MRI antenna is called a coil.
  • We use different coils for different body parts.
  • For brains, the most common antenna is the head coil, which is a volume coil: it shows the whole brain.
  • We can also use a surface coil: it gives great signal for a small field of view.

72 of 482

Parallel Imaging (SENSE, iPat)

  • Parallel imaging: use multiple surface coils to generate a volume image.
    • Dramatically reduces spatial distortion and increases signal.
    • Optionally, you can acquire images more rapidly by only collecting a portion of k-space.
      • Reduces spatial distortion and increases speed of acquisition. Some loss in signal.
      • E.G. SENSE R=2 collects half of the lines.

72

8-channel

array

73 of 482

Parallel Imaging (SENSE, iPat)

  • Increasing SENSE reduction factor decreases acquisition time and spatial distortion, but high values lead to reduced signal.

73

Effects of SENSE factor (R) on EPI

R=1

R=2

R=3

74 of 482

Signal and Field Strength

74

Outside magnetic field

  • Spins randomly oriented

In magnetic field:

    • Spins align parallel or anti-parallel to magnetic field.
    • At room temperature, ~4 parts per million more protons per Tesla align with versus against field.
    • As field strength increases, there is a bigger energy difference between parallel and anti-parallel alignment (faster rotation = more energy).
      • A larger proportion will align parallel to field.
      • More energy will be released as nuclei align.
      • MR signal increases with square of field strength.

75 of 482

Signal and Field Strength

  • 1.5-3.0T typical clinical; 3.0T typical research
  • NbTi max ~11.7T (~9.4T at atmospheric pressure)
  • NbSn max ~22T (brittle, hard to manufacture)
  • Increasing field strength yields
    • Faster Larmor frequency
    • Larger ratio of nuclei aligned
    • More signal as nuclei realign
    • T1 increases;T2 decreases (contrast lecture)
    • Signal: square of field strength
    • Noise: ~linear with field strength
    • In theory, 3T twice SNR of 1.5T (Less in practice)
    • Benefits: better SNR
    • Costs: Artifacts, Money, SAR, wavelength effects, auditory noise

75

76 of 482

Magnetic attraction

  • MRI field measured in Tesla.
    • 10,000 Gauss per Tesla
    • Compass needle moved by Earths field ~0.5 Gauss
    • 3T MRI ~30,000 Gauss
  • Force in magnetic field (1.8T, unit dynes)
    • Water: -22
    • Copper: -2.6
    • Copper Chloride: +280
    • Iron: +400,000
  • Diamagnetic material repelled (e.g. H2O).
  • Paramagnetic material attracted (e.g. CuCl, Gd).
  • Ferromagnetic material strongly attracted (Fe).
    • Even without magnetic field, magnetic moments aligned
    • Dangerous near MRI scanner

76

77 of 482

MRI Physics 2: Contrasts and Protocols

  • Chris Rorden, Paul Morgan
  • Types of contrast: Protocols
    • Static: T1, T2, PD
    • Endogenous: T2* BOLD (‘fMRI’), DW
    • Exogenous: Gadolinium Perfusion
    • Motion: ASL

77

www.fmrib.ox.ac.uk/~karla/

www.hull.ac.uk/mri/lectures/gpl_page.html

www.cis.rit.edu/htbooks/mri/chap-8/chap-8.htm

www.e-mri.org/cours/Module_7_Sequences/gre6_en.html

78 of 482

Contrast in XRays

  • Xrays (and CT) have a single contrast mechanism: how well does tissue attenuate rays.
    • Air ~transparent
    • bone ~opaque
    • soft tissue ~translucent
    • The only way to influence Xray contrast is to change tissue. E.G. injection of radio-opaque Gd into bloodstream.

78

Analogy: overhead projector ~ Xray

CT: reconstructed from series of Xrays

79 of 482

MR Contrast – a definition

  • Contrasts influence the brightness of a voxel.
    • E.G. water (CSF) dark in a T1-weighted scan, bright in a T2 scan.
  • Unlike X-rays, with MR we can manipulate many contrast mechanisms. This provides tremendous versatility.
  • Types of MR contrasts:
    1. Static Contrast: Sensitive to relaxation properties of the spins (T1, T2)
    2. Endogenous Contrast: Contrast that depends on intrinsic property of tissue (e.g. fMRI BOLD)
    3. Exogenous contrast: Contrast that requires a foreign substance (e.g. Gadolinium)
    4. Motion contrast: Sensitive to movement of spins through space (e.g. perfusion).

79

T1

T2

80 of 482

Anatomy of an MRI scan

  • Place object in strong static magnetic field, then.
    1. Transmit Radio frequency pulse: atoms absorb energy
    2. Wait
    3. Listen to Radio Frequency emission due to relaxation
    4. Wait, Goto 1
  • Time between set 1 and 3 is our Echo Time (TE)
  • Time between step 1 being repeated is our Repetition Time (TR).
  • TR and TE influence image contrast.

Time

TR

TE

81 of 482

T1 and T2 definitions

  • After RF is absorbed, emitted RF signal decays with time.
  • Two simultaneous but independent reasons for signal loss: recovery and dephasing.
  • T1-Relaxation: Recovery
    • Recovery of longitudinal orientation.
    • ‘T1 time’ refers to interval where 63% of longitudinal magnetization is recovered.
  • T2-Relaxation: Dephasing
    • Loss of transverse magnetization.
    • ‘T2 time’ refers to interval where only 37% of original transverse magnetization is present.

81

82 of 482

Contrast: T1 and T2 Effects

82

  • T1 effects measure recovery of longitudinal magnetization.
  • T2 refers to decay of transverse magnetization.
  • T1 and T2 vary for different tissues. For example, hemoglobin has very different T1/T2 than CSF. This difference causes these tissue to have different image contrast.
  • T1 is primarily influenced by TR, T2 by TE.

CSF: Long T2

Blood: Medium T2

CSF: Long T1

Blood: Short T1

Magnetization

TR (ms)

Signal

TE (ms)

T1 1.5T

T2 1.5T

Gray Matter: Short T2

GM: Medium T1

83 of 482

TR and T1: saturation

83

  • Consider very short TR:
    • Fat: rapid recovery, each RF pulse will generate strong signal.
    • Water: slow recovery, little net magnetization to tip. Later pulses generate little signal

T1 effects explain why we discard the first few fMRI scans: the signal has not saturated, so these scans show more T1 than subsequent images.

Before first pulse:�1H in all tissue �strongly magnetized.

CSF

Fat

After several rapid pulses: CSF has little net magnetization, so these tissue will not generate much signal.

84 of 482

T2 Relaxation

  • After RF absorption ends, protons begin to release energy
    • Emission at Larmor frequency.
    • Emissions amplitude decays over time.
    • Different tissues show different rates of decay.
    • ‘Free Induction Decay’ (FID).
    • Analogy: tuning fork – initially loud, quieter over time, always at resonant frequency.
  • Strongest signal immediately after transmission.
    • Most signal with short TE.
    • Why not always use short TE?

84

85 of 482

TE and T2 contrast

  • Signals from all tissue decays with time.
  • Signal decays faster in some tissues relative to others.
  • Optimal contrast between tissue when they emit relatively different signals.

85

Optimal GM/WM contrast

86 of 482

Optimal contrast

  • Optimal TE will depend on which tissues you wish to contrast
    • Gray matter �vs White matter
    • CSF �vs Gray matter

86

Signal

TE (s)

0

.2

87 of 482

T2: Dephasing

  • RF pulse sets phase.
    • Initially, everything in phase: maximum signal.
    • Signals gradually dephase = signal is reduced.
    • Some tissue shows more rapid dephasing than other tissue.

87

Time

CSF

Fat

88 of 482

T1 and T2 contrasts

  • Every scan is influenced by both T1 and T2.
  • However, by adjusting TE and TR we can determine which effect dominates:
    • T1-weighted images use short TE and short TR.
      • Fat bright (fast recovery), water dark (slow recovery)
    • T2-weighted images use long TE and long TR: they are dominated by the T2
      • Fat dark (rapid dephasing), water bright (slow dephasing).
    • Proton density images use short TE and long TR: reflect hydrogen concentration. A mixture of T1 and T2

88

T1

T2

89 of 482

T2 vs T2*

  • T2 only one reason for dephasing:
    • Pure T2 dephasing is intrinsic to sample (e.g. different T2 of CSF and fat).
    • T2* dephasing includes true T2 as well as field inhomogeneity (T2m) and tissue susceptibility (T2ms).
      • Due to these artifacts, Larmor frequency varies between locations.
  • T2* leads to rapid loss of signal: images with long TE will have little coherent signal.

89

0.2

Signal

TE (s)

0

0

1

T2

T2*

90 of 482

Susceptibility artifacts

  • Magnet fields interact with material.
  • Ferromagnetic (iron, nickel, cobalt)
    • Strongly attracted: dramatically increases magnetic field.
    • all steel has Iron (FE), but not all steel is ferromagnetic (try putting a magnet on a austenitic stainless steel fridge).
  • Paramagnetic (Gd)
    • Weakly attracted: slightly increases field.
  • Diamagnetic (H2O)
    • Weakly repelled: slightly decreases field.

90

Field Strength

Field strength increases near some tissues, decreases around others

91 of 482

Tissue Susceptibility

  • Due to spin-spin interactions, hydrogen’s resonance frequency differs between materials.
    • E.G. hydrogen in water and fat resonate at slightly different frequencies (~220 Hz; 1.5T).
      • Macroscopically: These effects can lead spatial distortion (e.g. ‘fat shift’ relative to water) and signal dropout.
      • Microscopically: field gradients at boundaries of different tissues causes dephasing and signal loss.

91

92 of 482

Field Inhomogeneity Artifacts

  • When we put an object (like someone’s head) inside a magnet, the field becomes non-uniform.
  • When the field is inhomogeneous, we will get artifacts: resonance frequency will vary across image.
  • Prior to our first scans, the scanner is ‘shimmed’ to make the field as uniform as possible.
  • Shimming is difficult near tissue boundaries that have very different density (e.g., sinuses have air-bone-soft tissue boundaries).
  • Shimming artifacts more intense at higher fields.

92

Fieldmap showing

inhomogeneity

fMRI image

93 of 482

Spin Echo Sequence

  • Spin echo sequences apply a 180º refocusing pulse halfway between initial 90º pulse and measurement.
  • This pulse eliminates phase differences due to artifacts, allowing measurement of recovery of true T2.
  • Spin echo dramatically increases signal.

93

Signal

Time

0

1

T2

T2*

0.5 TE

0.5 TE

Actual Signal

94 of 482

Analogy for Spin Echo

  • Consider two clocks.
    • Clock 1: minute hand takes 240 minutes to make a revolution.
    • Clock 2: minute hand takes 60 minutes to make a revolution.
  • Simultaneously,set both clocks to read 12:00. (~send in 90º RF pulse).
  • Wait precisely 20min
    • Minute hands now differ: out of phase.
  • Reverse direction of each clock (~send in 180º RF pulse).
  • Wait precisely 20min
    • Minute hands now identical: both read noon.
    • They are briefly back in phase

94

Minute hand rotation

0

160º

20min

20min

95 of 482

BOLD effect

  • Deoxyhemoglobin (Hbr) acts as contrast agent
  • Frequency spread causes signal loss over time
  • Effect increases with delay (TE = echo time)

95

  • But, overall signal reduces with TE.
  • Optimal BOLD TE ~60ms for 1.5T, ~30ms at 3T.

Fera et al. (2004) J MRI 19, 19-26

0.2

TE (s)

0

96 of 482

Optimal fMRI scans

  • More observations with shorter TR, but slightly less signal per observation (due to T1 effects and temporal autocorrelation).
  • When you have a single anatomical region of interest use the fewest slices required for a very short TR.
  • For exploratory group study, use a scan that covers whole brain with minimal spatial distortion (for good normalization).
    • Typical 3T: 3x3x3mm 64x64 matrix, 36 slices, SENSE r=2, TE=32ms, TR= 2000ms
    • Typical 1.5T: 3x3x3mm 64x64 matrix, 36 slices, TE=60ms, TR= 3500ms.

96

  • Shorter TR yields better SNR
  • Diminishing returns
  • G.H. Glover (1999) On Signal to Noise Ratio Tradeoffs in fMRI

97 of 482

Diffusion Imaging

  • Diffusion imaging is an endogenous contrast.
  • Apply two gradients sequentially with opposite polarity.
  • Stationary tissue will be both dephased and rephased, while spins that have moved will be dephased.
  • Sensitive to acute stroke (DWI, see lesion lecture)
  • Multiple directions can measure white matter integrity (diffusion tensor imaging, see DTI lecture)

97

water diffuses faster in unconstrained ventricles than in white matter

98 of 482

Gadolinium Enhancement

  • Gd Perfusion scans are an example of an exogenous contrast.
    • intravenously-injected.
  • Gd not detected by MRI (1H).
    • Gd has an effect on surrounding 1H.
    • Gd shortens T1, T2, T2* of surrounding tissue.
    • makes vessels, highly vascular tissues, and areas of blood leakage appear brighter.
  • Very rare side effect: allergic reaction.
  • Gd can help measure perfusion.
    • Useful for clinical studies: how much blood is getting to a region, how long does it take to get there?

98

99 of 482

Arterial Spin Labelling

  • ASL is an example of a motion contrast
  • IMAGEperfusion = IMAGEuninverted – IMAGEinverted
  • Perfusion is useful for clinical studies: how much blood is getting to a region, how long does it take to get there?

99

White matter = low perfusion

Gray matter = high perfusion

100 of 482

Time of Flight

  • ToF is a motion contrast.
  • In T1 scans, motion of blood between slices can cause artifacts.
  • ToF intentionally magnifies flow artifacts.
  • Several Protocols of ToF, E.G:
    1. Use very short TR, so signal in slice is saturated. External spins flowing into slice have full magnetization.
    2. Conduct a Spin Echo Scan: 90º and 180º inversion pulses applied to different slices. Only nuclei that travel between slices show coherent signal.

100

Saturated Spins

Unsaturated Spins

SLICE

Flow

101 of 482

Advanced Physics Notes

  • T1/T2 vary based on tissue type and scanner field strength.

101

T1(ms)

Gray

White

CSF

Blood

Fat

1.5T

1000

710

4000

1435

300

3.0T

1331

832

4000

15840

380

T2(ms)

Gray

White

CSF

Blood

Fat

1.5T

100

80

2200

150

165

3.0T

85

70

2200

80

133

From Atlas ISBN-10: 0781720362

102 of 482

Advanced Physics Notes

  • Optimum flip angle (‘Ernst angle’) may not be 90 degrees. Ernst angle determined by TR, and T1 of tissue.

102

Figure shows optimal flip angles for TI=1400ms, which is a good choice for the brain at 3T (Mark Cohen)

For fMRI see PMID 21073963

103 of 482

Advanced Physics Notes

  • Field strength influences T1 and T2: influences TR/TE
    • Higher Field = Faster T2 decay: Typically, TE decreases as field increases = faster imaging.
    • Higher Field = Slower T1 recovery: TR increases with field strength. Influences T1 contrast: e.g. time of flight improves dramatically with field strength.
    • N.B. Different tissues have different relative T1/T2 changes with field strength.

1.5T Scanner

3.0T Scanner

Signal

TE (s)

0

.2

Magnetization

0

6

TR (s)

100%

100%

T2

T1

104 of 482

Designing a behavioral experiment

  • Chris Rorden
  • Designing fMRI studies
    • fMRI signal is sluggish and additive.
    • Efficient designs maximize predictable changes in HRF.
    • Efficient designs are often very predictable
      • Participant may anticipate events.
      • Techniques for balancing efficiency and psychological validity.

104

105 of 482

The BOLD timecourse

Hemodynamic response function

    • The HRF is very sluggish
      • Delay between brain activity and changes in fMRI images (~5s).
    • The HRF is additive
      • Doing a task twice causes about twice as much change as doing it once.

105

0 6 12 18 24

Image Brightness

Time (seconds)

106 of 482

BOLD effects are additive

  • Three stimuli presented rapidly result in almost 3 times the signal of a single stimuli (e.g. Dale & Buckner, 1997).
  • Crucial finding for experimental design.
  • Note there are limits to this additivity effect, but the basic point is that more stimuli generate more signal (see Birn et al. 2001)

106

Assumed (linear) responses

107 of 482

Temporal Properties of fMRI Signal

  • We predict the HRF by convolving the neural signal by the HRF.
  • We want to maximize the amount of predictable variability.

107

Convolved Response

=

Neural Signal

HRF

108 of 482

Comparing predictable HRF

Consider 3 paradigms:

    • Fixed ISI: one stimuli every 16 seconds.
      • Inefficient
    • Fixed ISI: one stimuli every 4 seconds.
      • Insanely inefficient: virtually no task-related variability
    • Block design: cluster five stimuli in 8 seconds, pause 12 seconds, repeat.
      • Very efficient.
      • Cluster of events is additive. Note peak amplitude is x3 the 16s design.

108

109 of 482

Block Designs

  • Block designs are optimal (aka ‘Epoch’, ‘Box Car’).
    • Present trials as rapidly as possible for ~12..30 sec
    • Summation maximizes additive effect of HRF
    • Consider experiment:
      • Three conditions, each condition repeated 14 times (once every 900ms)
      • Press left index finger when you see
      • Press right index finger when you see
      • Do nothing when you see

109

110 of 482

Block design limitations

  • While block designs offer statistical power, they are very predictable.
    • E.G. our participants will know they will press the same finger 14 times in a row.

  • Many tasks not suitable for block design
    • E.G. Novelty detection, memory, etc.
    • Your can not post-hoc sort data from block designs, e.g. Konishi, et al., 2000 examine correct rejection vs hits on episodic memory task.

110

111 of 482

Block designs

  • Strengths
    • Optimal design for detect (where)
  • Weaknesses
    • Poor for estimation (when)
    • Predictable
    • Limited number of psychological questions
  • Notes
    • Optimal block length ~12s
    • Avoid long blocks (>40s):
      • Reduced signal variability
      • Low frequency signal will be hard to distinguish from low frequency signals such as drift in MRI signal.

111

Estimation

Detection

112 of 482

Event related designs

  • Much less power than block designs.
    • Simply randomizing trial order of our block design, the typical event related design has one quarter the efficiency.
    • Here, we ran 50 iterations and selected the most efficient event related design.
      • Still half as efficient as the block design.
      • Note this design is not very random: runs of same condition make it efficient.

112

113 of 482

Permuted Blocks

  • Permuted block designs (Liu, 2004):
    • Start with block design
    • Transpose fixed number of trials
  • Can offer a balance of power and predictability.

113

Block after 10 permutations

114 of 482

Jittered Inter-Stimulus Interval

  • Dale et al. suggest using exponential distribution for inter-trial intervals.
  • Exponential Distribution:
    • Many trials have short duration
    • A few trials have long duration
    • Efficient because jittering makes events block-like

114

1 condition, fixed ISI = little variability

1 condition, exponential ISI = more variability

115 of 482

Should you use variable ISIs?

  • In practice, variable ISIs often reduce power.
  • Most experiments have more than one condition, so fixed ISI designs also have temporal variability.
  • Unless you are looking at low-level processes (e.g. early vision), trials must be separated by a couple seconds.
  • For multi-condition studies, the minimum time between trials is crucial.
    • People are faster to respond to fixed ISI than variable ISI
    • Therefore, fixed ISI are often more powerful
    • However, variable ISI may help us reconstruct the true shape of the HRF measured.

115

116 of 482

Tips

  • For event related designs – helpful if TR is either variable or a not evenly divisible by the interstimulus interval.
  • Allows you to accurately estimate whether conditions influence the latency of response.

116

TR divisible by ISI

TR not divisible by ISI

117 of 482

Generate your own experiments…

  • Set the TR (time per volume)
  • Set the number of volumes
  • Set minimum ISI – this will be time between trials for block designs.
  • Set the mean ISI – this will be the average time between trials for event related designs.
  • Set the number of conditions.
  • You can select permutations for block designs, or iterations for event related designs to trade of predictability and power.
  • Measures approximate power (variance). Value relative (for given number of volumes and TR).
  • Drop-down menu selects design
    1. Block
    2. Permuted Block
    3. Fixed ISI Event
    4. Exponential ISI Event
    5. Random ISI Event

117

118 of 482

Response Suppression Designs

  • Show two stimuli in rapid succession.
  • See if a brain region can discriminate if these stimuli are the same or different.
  • Classically, regions show adaptation – less time to process same information twice in a row.
  • E.G. Fusiform gyrus shows more response to two photos of different faces than identical faces: this region can discriminate between these stimuli.

118

119 of 482

Sparse fMRI

  • Standard fMRI acquires data continuously.
    • Loud noises can make it difficult to examine auditory stimuli.
  • Sparse imaging includes a delay between each fMRI volume, so stimuli can be presented while scanner is silent.

119

Time (sec)

0

10

Continuous

Time (sec)

0

10

Sparse

120 of 482

General guidelines (Nichols et al)

  1. If possible, use block design
    • Keep blocks <40s (temporal processing lecture describes why)
  2. Limit number of conditions
    • Pairwise comparisons far apart in time may be confounded by low frequency noise.
  3. Randomize order of events that are close to each other in time.
  4. Randomize ISI between events that need to be distinguished.
  5. Run as many people as possible for as long as possible.
  6. Have testable anatomical prediction

120

121 of 482

Statistics – Modelling Your Data

    • Modelling data:
      • Signal, Error and Covariates
      • Parametric Statistics
    • Thresholding Results:
      • Statistical power and statistical errors
      • The multiple comparison problem
      • Familywise error and Bonferroni Thresholding
      • Permutation Thresholding
      • False Discovery Rate Thresholding
      • Implications: null results uninterpretable

121

122 of 482

Calculating statistics

  • Consider analysis of task where someone moves hand for 12s, rests for 12s.
  • We can model (predict) the hemodynamic response.
  • Our statistics will test how well our model predicts the observed data.
  • Statistics: does model predict substantial amount of variability in observed data.
  • Does this brain area change brightness when we do the task?
    • Top panel: model very good predictor for observed data (very little error)
    • Lower panel: mediocre predictor

122

123 of 482

General Linear Model

  • The observed data is composed of a signal that is predicted by our model and unexplained noise.

123

Observed Data

Amplitude (solve for)

Design Model

Noise (Error, unexplained variance)

Y = αM + ε

cf Boynton et al., 1996

124 of 482

What is your model?

  • Model is predicted effect.
  • Consider Block design experiment:
    • Three conditions, each for 11.2sec
      1. Press left index finger when you see
      2. Press right index finger when you see
      3. Do nothing when you see

124

Intensity

125 of 482

FSL/SPM display of model

  • Analysis programs display model as grid.
  • Each column is regressor
    • e.g. left / right arrows.
  • Each row is a volume of data
    • for within-subject fMRI = time
  • Brightness of row is model’s predicted intensity.

125

Intensity

Time

126 of 482

Statistical Contrasts

  • fMRI inference based on contrast.
  • Consider study with left arrow and right arrow as regressors
    1. [1 0] identifies activation correlated with left arrows: we could expect visual and motor effects.
    2. [1 –1] identifies regions that show more response to left arrows than right arrows. Visual effects should be similar, so should select contralateral motoric.
  • Choice of contrasts crucial to inference.

126

127 of 482

Statistical Contrasts

  • t-Test is one tailed, F-test is two-tailed.
    • T-test: [1 –1] mutually exclusive of [-1 1]: left>right vs right>left.
    • F-test: [1 –1] = [-1 1]: difference between left and right.
  • Choice of test crucial to inference.

127

128 of 482

How many regressors?

  • We collected data during a block design, where the participant completed 3 tasks
    • Left hand movement
    • Right hand movement
    • Rest
  • We are only interested in the brain areas involved with Left hand movement.
  • Should we include uninteresting right hand movement as a regressor in our statistical model?
    • I.E. Is a [1] analysis the same as a [1 0]?
    • Is a [1 0] analysis identical, better, worse or different from a [1] analysis?

128

=?

129 of 482

Meaningful regressors decrease error

  • Meaningful regressors can explain some of the variability.
  • Adding a meaningful regressor can reduce the unexplained noise from our contrast.

129

130 of 482

Meaningful regressors decrease error

  • Consider voxel that responds strongly to left movement and weakly to right movement.
  • Note that observed and left are identical for both models.
  • Yet, ratio left/error changes.
  • Inclusion of right movements can describe variability that would otherwise be counted as error.

130

131 of 482

Correlated regressors decrease signal

  • Each explanatory variable (EV) should be independent.
  • If two EVs are highly correlated, each will predict the others’ variability.
  • Example: consider a task where someone is asked to press a button after they see a left arrow stimuli.
    • Responses happen ~0.5-1.5s after arrow onset (with some variability).
    • These two events are highly correlated.
    • We will have little power if we add both stimuli onset and button press times to our model.

131

132 of 482

Single factor…

  • Consider a test to see how well height predicts weight.

132

Explained VarianceUnexplained Variance

t =

Small t-score

height only weakly predicts weight

High t-score

height strongly predicts weight

Weight

Height

133 of 482

Adding a second factor…

  • How does an additional factor influence our test?
  • E.G. We can add waist diameter as a regressor.
  • Does this regressor influence the t-test regarding how well height predicts weight?
  • Consider ratio of cyan to green.

133

Increased t

Waist explains portion of weight not predicted by height.

Decreased t

Waist explains portion of weight predicted by height.

Weight

Height

Waist

134 of 482

Regressors and statistics

  • Our analysis identifies three classes of variability:
    1. Signal: Predicted effect of interest
    2. Noise (aka Error): Unexplained variance
    3. Covariates: Predicted effects that are not relevant.
      • E.G. Regressors with a weight of zero
  • Statistical significance is the ratio:

  • Covariates will
    • Improve sensitivity if they reduce error (explain otherwise unexplained variance).
    • Reduce sensitivity if they reduce signal (explain variance that is also predicted by our effect of interest).

134

Signal�Noise

t =

135 of 482

Summary

  • Regressors should be orthogonal
    • Each regressor describes independent variance.
    • Variance should not be explained by more than one regressor.
  • N.B. Adding a regressor reduces our degrees of freedom. This is a big issue for many techniques. In low-level fMRI statistics we have hundreds of DoF, so not much of a concern.
  • E.G. we will see that including temporal derivatives as regressors tend to help event related designs (temporal processing lecture).

135

136 of 482

Group Analysis

  • We test inferences about the general population
  • Conduct time course analysis on many people.
  • Identify which patterns are consistent across group.

136

Group analysis

Subject1

Subject2

Subject3

Subject4

Subject5

137 of 482

Statistical thresholding example

  • Erythropoietin (EPO) doping in athletes
    • EPO improves endurance ~ 10%
    • Races won ~1%
    • Without testing, athletes forced to dope
    • Dangers: Carcinogenic and can cause heart-attacks
    • Therefore: Measure haematocrit
    • Problem: threshold to expel

137

hematocrit

30%

50%

If we set the threshold too low, we will accuse innocent people (high rate of false alarms).

If threshold set too high, we fail to detect dopers (high rate of misses).

hematocrit

30%

50%

138 of 482

Statistical thresholding

  • We need to choose a threshold that balances the benefits of finding effects with the cost of making false alarms.
  • α is our statistical threshold: it measures our chance of Type I error.
    • A 5% alpha level (Z>1.64) means only 1/20 chance of false alarm (p < 0.05).
    • A 1% alpha level (Z>2.3) means only 1/100 chance of false alarm (p< 0.01).

138

Z>0.5

Z>1

Z>2

Z>4

Z>8

Fewer peaks survive as we apply a more stringent threshold.

139 of 482

Liberal, conservative, power

  • With noisy data, we will make mistakes.
  • Statistics allows us to
    • Estimate our confidence
    • Bias the type of mistake we make (e.g. we can decide whether we will tend to make false alarms or misses)
  • We can be liberal: avoiding misses
    • airport weapons detection (X-ray often leads to innocent cases being opened).
  • We can be conservative: avoiding false alarms.
    • criminal conviction: avoid sending innocent people to jail.

139

Type II error

Correct rejection

Accept Ho

Hit (‘Power’)

Type I error

Reject Ho

Ho false

Ho true

Decision

Reality

  • ‘Power’ refers to our chance of detecting real effects. There are four ways we can influence power….

140 of 482

Concrete Example

  • Consider a drug candidate to treat cancer.
  • Drug either does or does not treat cancer.
  • Null hypothesis (Ho): Drug has no effect.
  • Hit: We correctly detect that the drug works!
  • Correct rejection: We correctly determine the drug has no effect.
  • Type I error: We claim drug treats cancer when it does not.
  • Type II error: We decide drug is not effective when it is (miss).

140

Type II error

Correct rejection

Accept Ho

Hit (‘Power’)

Type I error

Reject Ho

Ho false

Ho true

Decision

Reality

  • ‘Power’ refers to our chance of detecting real effects. There are four ways we can influence power….

141 of 482

1.) Alpha and Power

  • By making alpha less strict, we can increase power.�(e.g. p < 0.05 instead of 0.01)
  • In other words, we can become more liberal
  • However, we increase the chance of a Type I error!

141

Power

Type I Error

α

Control Condition (e.g. non-dopers)

Experimental Condition (e.g. dopers)

Hits (dopers expelled)

False alarms (innocent athletes expelled)

142 of 482

2.) Effect Size and Power

  • Power will increase if the effect size increases. (e.g. higher dose of drug, 3T instead of 1.5T MRI).
  • Unfortunately, effect sizes are often small and fixed.

142

143 of 482

3.) Variability and Power

  • Reducing variability increases the relative effect size.
  • Most measures of brain activity noisy.
  • N.B. Statistically what is important is ratio of effect size to variability, SNR.

143

144 of 482

4.) Sample Size

  • A final way to increase our power is to collect more data.
  • We can sample a person’s brain activity on many similar trials.
  • We can test more people.
  • The disadvantage is time and money.
  • Increasing the sample size is often our only option for increasing statistical power.

144

145 of 482

Multiple Comparison Problem

  • Olympics: 10500 athletes
    • At 1% α and all innocent we will wrongly accuse ~105
  • Neuroimaging: gray matter 900cc, fMRI voxels 27mm3, >33333 tests
    • At 1% α, we will make ~333 false alarms.
  • Many tests means many chances to make mistakes: the multiple comparison problem.
  • Probability of at least one error is 1- (1- α)C: familywise error (FWE).

145

146 of 482

Bonferroni Correction

  • Bonferroni Correction: controls FWE.
  • For example: if we conduct 10 tests, and want a 5% chance of any errors, we will adjust our threshold to be p < 0.005 (0.05/10).
  • Benefits: Controls for FWE.
  • Problem: Very conservative = very little chance of detecting real effects = low power.

146

147 of 482

Random Field Theory

  • We spatially smooth our data – peaks due to noise should be attenuated by neighbors.
    • Worsley et al, HBM 4:58-73, 1995.
  • RFT uses resolution elements (resels) instead of voxels.
    • If we smooth our data with 8mm FWHM, then resel size is 8mm.
  • SPM uses RFT for FWE correction: only requires statistical map, smoothness and cluster size threshold.
    • Euler characteristic: unsmoothed noise will have high peaks but few clusters, smoothed data will be have lower peaks but show clustering.
  • RFT has many unchecked assumptions (Nichols)
  • Works best for heavily smoothed data (x3 voxel size)

147

148 of 482

Permutation Thresholding

  • Gold standard to control FWE.
  • Alternative hypothesis (H1): Label ‘Group 1’ and ‘Group 2’ mean something.
  • Null Hypothesis (H0): Labels are meaningless.
  • If Ho true, randomly scrambled orders will generate similar t-values as observed.
  • Alternative H0 suggests observed t-values unusually high.

  • Method:
    1. Permute (randomize) labels
    2. Compute maximum T-score for entire 3D volume
    3. Repeat steps 1 and 2 at least 1000 times.
    4. Find 5th Percentile max T.
    5. Any voxel in our observed dataset that exceeds this threshold has only 5% probability of being noise.

148

Max T

Percentile

0 100

0

5

5%

T= 3.9

149 of 482

Permutation Thresholding

  • Observed, max T = 4.1

149

Group 1

Group 2

  1. Permutation 1, max T = 3.2�
  2. Permutation 2, max T = 2.9�
  3. Permutation 3, max T = 3.3

  • Permutation 4, max T = 2.8

  • Permutation 5, max T = 3.5

...

1000. Permutation 1000, max T = 3.1

H1 : Young (solid) participants have more gray matter than old (dotted).

H0 : Label (young versus old) does not influence gray matter.

150 of 482

Permutation Thresholding

  • Permutation Thresholding offers the same protection against false alarms as Bonferroni.
  • Typically, much more powerful than Bonferroni.
  • Implementations include SnPM, FSL’s randomise, NPM.
  • Disadvantage: computing 1000 permutations means it takes x1000 times longer than typical analysis!
  • Permutation thresholding has been used (Nichols et al) to validate other techniques: Bonferroni conservative,
  • Random Fields only accurate with high DF and heavily smoothed.

150

151 of 482

False Discovery Rate

  • Traditional statistics attempts to control the False Alarm rate.
  • ‘False Discovery Rate’ controls the ratio of false alarms to hits.
  • It often provides much more power than Bonferroni correction.
  • Some issues with neuroimaging. Solution is ‘topological FDR’ which requires a priori cutoff.

151

  • As a thought experiment, consider an Olympics where no one doped on EPO: population would show normally distributed hematocrit levels:
  • However, distribution looks different if 20% dope. The shape of the distribution demonstrates there are different groups here:

Hematocrit Z-Scores: normal distribution

Hematocrit Z-Scores: bimodal/skewed distribution

152 of 482

Controlling for multiple comparisons

  • Bonferroni correction
    • We will often fail to find real results.
  • RFT correction
    • Typically less conservative than Bonferroni.
    • Requires large DF and broad smoothing.
  • Permutation Thresholding
    • Offers same inference as Bonferroni correction.
    • Typically much less conservative than Bonferroni.
    • Computationally very slow
  • FDR correction
    • At FDR of .05, about 5% of ‘activated’ voxels will be false alarms.
    • If signal is only tiny proportion of data, FDR will be similar to Bonferroni.

152

153 of 482

Alternatives to voxelwise analysis

  • Conventional fMRI statistics compute one statistical comparison per voxel.
    • Advantage: can discover effects anywhere in brain.
    • Disadvantage: low statistical power due to multiple comparisons.
  • Small Volume Comparison: Only test a small proportion of voxels. (Still have to adjust for RFT).
  • Region of Interest: Pool data across anatomical region for single statistical test.

153

SVC

ROI

Voxels

Example: how many comparisons on this slice?

    • Voxelwise: 1600
    • SVC: 57
    • ROI: 1

154 of 482

ROI analysis

  • In voxelwise analysis, we conduct an independent test for every voxel
    • Each voxel is noisy
    • Huge number of tests, so severe penalty for multiple comparisons
  • Alternative: pool data from region of interest.
    • Averaging across meaningful region should reduce noise.
    • One test per region, so FWE adjustment less severe.
  • Region must be selected independently of statistical contrast! (‘voodoo correlations’)
    • Anatomically predefined
    • Defined based on previous localizer session
    • Selected based on combination of conditions you will contrast.

154

M1: movement

S1: sensation

155 of 482

Inference from fMRI statistics

  • fMRI studies have very low power.
    • Correction for multiple comparisons
    • Poor signal to noise
    • Variability in functional anatomy between people.
  • Null results impossible to interpret. (Hard to say an area is not involved with task).
  • Between group inference require significant interactions (positive effect in one group and null result in other is not a demonstration that groups differ).

155

156 of 482

Spatial Processing

  • Chris Rorden
    • Spatial Registration
      • Motion correction
      • Coregistration
      • Normalization
    • Interpolation
    • Spatial Smoothing
    • Advanced notes:
      • Spatial distortions of EPI scans
      • Image intensity distortions

156

157 of 482

Why spatially register data?

  • Statistics computed individually for voxels.
  • Only meaningful if voxel examines same region across images.
  • Therefore, images must be in spatially registered with each other.

157

158 of 482

Spatial Registration

We use spatial registration to align images

      • Motion correction (realignment) adjusts for an individual’s head movements.
      • Coregistration aligns two images of different modalities from the same individual.
      • Normalization aligns images from different people.

158

159 of 482

Within-subject registration

  • Within subject registrations
    • Assumption: same individual, so there should be a good linear solution.

159

Motion correction

Coregistration

Registration of the fMRI scans (across time)

Registration across modality (e.g. T2* and T1 image)

160 of 482

Rigid Body Transforms

160

Translation

Rotation

  • By measuring and correcting for translations and rotations, we can adjust for an object’s movement in an image.
  • 6 Parameters: translation and rotation, each in 3 dimensions.

161 of 482

Motion Correction

  • Motion correction aligns all in time series.
  • Translations and rotations only
    • 6 parameters (X,Y,Z; yaw, pitch, roll)
    • rigid body registration
    • Assumes brain size and shape identical across images.

161

162 of 482

Motion correction cost function

  • Motion correction uses least squares to check if images are a good match (aka minimum sum of squares).
  • Smaller difference2 = better match (‘least squares’).
  • Notes about difference2
    • Any number squared is positive: does not care about direction of error.
    • Large error penalized much more than small error (e.g. 22=4, 32=9), whereas using absolute value as cost function would not be so biased between large and small errors.
  • Iterative: moves image a bit at a time until match is worse.

162

Image 1

Image 2

Difference

Difference²

|Difference|

163 of 482

Local Minima

  • Search algorithm is iterative:
    • Adjust image a little bit.
    • Test cost function
    • Repeat until cost function does not get better.
  • Problem: local minima
  • Solution: ensure starting estimate is accurate
    • For motion correction: only small head movements
    • For coregistration: both images start in good alignment
    • For normalization: similar origin and rotation as template

163

Value of Cost Function

Local Minimum

Global Minimum

Translation in X (mm)

Starting Estimate

0

10

164 of 482

Coregistration

  • Coregistration is more complicated than motion correction
    • Rigid body not enough:
      • Size differs between images (must rescale: zooms).
      • fMRI scans often have spatial distortion not seen in other scans (must skew: shears).
    • Least squares cost function will fail: relative contrast of gray matter, white matter, CSF and air differences between images.

164

165 of 482

Affine Transforms (aka linear, geometric)

165

Translation

Rotation

Zoom

Shear

12 Parameters: translation, rotation, zoom and shear each in 3 dimensions.

166 of 482

166

Translation

Rotation

Zoom

Shear

167 of 482

Coregistration

  • Coregistration is used to align images of different modalities from the same individual

167

  • Uses ‘mutual information cost function’: Note unaligned images have messy joint histograms.
  • Uses entropy reduction instead of variance reduction as cost function.

168 of 482

Coregistration

  • Note original joint histogram (prior to coregistration) is noiser than the final joint histogram (after coregistration).
  • Used within individual, so linear transforms should be sufficient.
  • Typically 12 parameters (translation, rotation, zooms, shear each in 3 dimensions).
  • Though note that different MRI sequences create different non-linear distortions.

168

T1 reference

Aligned T2

169 of 482

Between-subject: Normalization

  • Normalization: align images from different people (align everyone to a template image)

169

Subject 2

Subject 1

Template

Average activation

Normalization

170 of 482

Why normalize?

  • Stereotaxic coordinates analogous to longitude
    • Universal description for anatomical location
    • Allows others to replicate findings
    • Allows between-subject analysis: crucial for inference that effects generalize across humanity.

170

Group analysis

Subject1

Subject2

Subject3

Subject4

Subject5

171 of 482

Normalization

  • Normalization attempts to register scans from different people.
  • We align each person’s brain to a template.
    • Template often created from multiple people (so it is fairly average in shape, size, etc).
    • We typically use template that is in the same modality as the image we want to normalize
      • Therefore, variance cost function.
  • If different groups use similar templates, they can talk in common coordinates.

171

Popular MNI Template�based on T1-weighted scans �from 152 individuals.

172 of 482

Common templates

  • SPM uses modality specific template
    • MNI T1 template, plus custom templates
  • By default, FSL uses MNI T1 template for all modalities
    • Requires intra-modal cost functions (e.g. mutual information)

172

T1 T2* PET

173 of 482

Templates

  • Our coordinate system using anterior commissure as origin developed by Talairach and Tournoux (1988) who created atlas based on histological slices from one 69-year old woman.
    • Single brain may not be representative
    • No MRI scans from this woman
  • Modern templates are generated from a group of MRI scans that are approximately aligned to images from the Montreal Neurological Institute.
    • MNI space slightly different from T&T atlas (larger in every dimension).
    • Therefore, proper to refer to “Talairach and Tornoux” coordinates and “MNI space”.

173

174 of 482

Affine Transforms

  • Co-linear points remain co-linear after any affine transform.
  • Transform influences entire image.

174

175 of 482

Spatial Processing

  • Non-linear transforms can match features that could not aligned with affine transforms.
  • SPM uses basis functions.

175

176 of 482

Nonlinear functions and normalization

Linear Only

176

Linear + Nonlinear

Scans from 5 people: nonlinear helps alignment

177 of 482

Regularization

177

Medium Regularization

Heavy Regularization

Light Regularization

178 of 482

Regularization

  • Regularization is a parameter that you can adjust that influences non-linear normalization

178

http://www.fmri.ox.ac.uk/fsl/fnirt/

Medium Regularization

Little Regularization

179 of 482

Affect of Regularization on Normalization

180 of 482

Advanced Normalization

  • Affine transforms (e.g. FSL’s FLIRT): 12 degrees of freedom
    • Translation, Rotation, Scaling, Shear *3 dimensions
  • Nonlinear basis functions (SPM5) thousands of DoF
  • Diffeomorphic algorithms (SPM8’s DARTEL, ANT) millions DoF
    • Not yet suggested for fMRI (due to spatial distortions/dropout)

180

www.fil.ion.ucl.ac.uk/spm/course/

www.pubmed.com/19195496/

Affine

template

DARTEL

template

181 of 482

Advanced Normalization

  • SPM’s unified normalization-segmentation uses information about different tissue types.
  • Ensures that normalization is driven by brain tissue (gray matter) not scalp.
  • Can be combined with Diffeomorphic.
  • See VBM lecture.

181

182 of 482

An example from AFNI

182

183 of 482

Alternatives

  • SPM and FSL normalize overall brain shape.
  • Individual sulci largely ignored.
  • What are different normalization strategies?
    • Sulci are crucial for some tasks (Herschl’s gyrus and hearing)
    • Sulci are formed by experience.
    • Perhaps less so for others (e.g. Amunts et. al 2004 with Broca’s variability)

183

Probability (%) of region being Brodmann Area 44 based on histology. Note that the location of this brain region varies across people.

184 of 482

Sulcal matching

  • Normalization conducted on smoothed images.
    • Not precisely matching sulci (avoid local distortion).
    • Sulcal location still varies between people after normalization, especially in parietal lobe.

184

Copyright Pierre Fillard, INRIA http://www-sop.inria.fr/asclepios/projects/UCLA/

185 of 482

The Brain with No Folds

186 of 482

Alternatives

  • SPM/FSL normalization will roughly match orientation and shape of head.
    • Good if function is localized to proportional part of brain
    • Poor if function is localized to specific sulci (e.g. early visual area V1 tied to calcarine fissure).
  • Alternatively, use sulci as cost function (Goebel et al., 2006).
    • Image below: mean sulcal position for 12 people after standard normalization followed by sucal registration.
    • Note: This technique improves sulcal alignment, but distorts cortical size.

186

Individual Sulci Image

Individual Sulci Map

Sulcal maps of group following standard normalization

Sulcal maps of group following using sulci as cost function

187 of 482

No perfect solution

  • Analogy: creating 2D map of spherical earth
    • Mercator projection distorts area but preserves navigation rhumb lines and local shape.
    • Gall-Peters projection preserves area but has severe shape distortions.
  • Different normalization methods have different priorities.
    • SPM/FSL attempt to preserve overall volume and avoid local distortions.
    • Diffeomorphic normalization aligns sulci better but at the cost of local distortion.
    • Brain Voyager and flat map techniques use sulci as the cost function, and can provide very good alignment of the major sulci, but cause local distortions and tend to disregard finer and more variable sulci.

187

188 of 482

Interpolation

Each lower image rotated 12º.

Left looks jagged, right looks smooth.

Different reslicing interpolation.

188

189 of 482

1D Interpolation

  • How do we estimate values that occur between discrete samples?
  • Three popular methods:
    1. Nearest neighbor (‘box’)
    2. Linear (‘tent’, ‘triangle’)
    3. Spline/Sinc

189

Weather analogy: it was 25º at 9am, and 31º at 12am, what would you estimate the temperature was at 10am?

190 of 482

2D Interpolation

  • How do we estimate values that occur between discrete samples?
  • Three popular methods:
    1. Nearest neighbor (‘box’)
    2. Linear (‘tent’, ‘triangle’)
    3. Spline/Sinc

190

191 of 482

Nearest Neighbor Interpolation

  • Nearest neighbor interpolation is simple.
    • Always based on single sample.
    • Quick to compute.
    • Generally a lousy estimate for continuous data.

191

1D Box Interpolation

Nearest sample

2D Box Interpolation

Nearest sample

3D Box Interpolation

Nearest sample

192 of 482

Linear Interpolation

  • For neuroimaging we usually use linear interpolation.
    • Much more accurate than nearest neighbor.
    • There is some loss of high frequencies.
    • Since we spatially smooth data after spatial registration, we will lose high frequencies eventually.

192

1D Linear Interpolation

Weighted mean of 2 samples

2D Bilinear Interpolation

Weighted mean of 4 samples

3D Trilinear Interpolation

Weighted mean of 8 samples

193 of 482

Sinc Interpolation

  • Sinc interpolation retains more high frequency information.
    • Computation very time consuming (in theory, infinite extent)
      • FSL: Windowing option limits extent
      • SPM: Splines are used for rapid approximation
  • Not necessary if you will heavily blur your data with a broad smoothing kernel.

193

Windowed 2D Sinc Function

Asymmetric

1D Sinc Function

Symmetric

194 of 482

Resampling

  • Reslicing always loses information.
    • Nearest neighbor has severe artifacts
    • Linear interpolation gets blurry.
    • Windowed sinc functions often show subtle ringing near edges.

194

Source

1x10°

2x10°

Nearest Neighbor

36x10°

Linear

Sinc�(Lanczos)

  • Cumulative: consecutive reslicing leads to more dramatic artifacts.
  • Solution: whenever possible, apply transforms to image matrix (lossless) rather than reslicing data (lossy).
  • Solution: use sinc interpolation if high frequencies important.

195 of 482

Palette indexed images

  • Nearest Neighbor
    • Never use with MRI scans (or any continuous data).
    • Useful for atlases that use indexed palette.
      • E.G. For Brodmann Atlas, a region at the border of areas 37 and 39 should not be labelled ‘area 38’

195

Source

Zoomed �(Linear Interpolation)

Continuous

Palette

n.b. errors at edges

196 of 482

Interpolation summary

  • Nearest Neighbor
    • Paletted images
  • Linear interpolation
    • Default option
  • Sinc/Spline interpolation
    • When edges are critical

196

1D kernels

2D kernels

Nearest Neighbor

Linear

Sinc

197 of 482

Interpolation versus smoothing

  • Interpolation kernel is always 100% at 0 and 0% at all other integers.
  • This means that interpolated estimates always cross through control points (observations).
  • Smoothing kernels blur an observation with its neighbors thereby lowering intensity maxima.

197

Interpolation

Smoothing

Kernel

Observations

198 of 482

Spatial Smoothing

  • Each voxel is noisy. However, neighbors tend to show similar effect. Smoothing results in a more stable signal.
  • Smoothed data more normal – fits our assumptions. Also, allows RFT thresholding (statistics lecture). Minimizes sulcal variability between individuals (group analysis)

198

=

Gaussian Smoothing

199 of 482

How much to smooth - FWHM

  • Smoothing is a form of convolution: the output intensity based on weighted-influence of neighbors.
  • The most popular kernel is the gaussian function (a normal distribution).
  • The ‘full width half maximum’ adjusts the amount of gaussian smoothing.
  • FWHM is a measure of dispersion (like standard deviation, standard error or variance)
  • Large FWHMs lead to more blurry images.
  • For fMRI, we typically use a FWHM that is ~x2..x3 our original resolution (e.g. 8mm for 3x3x3mm data).
  • However, the FWHM tunes the size of region we will be best able to detect.
    • E.G. If you want to look for a brain region that is around 10mm diameter, use a 10mm FWHM.

199

Dispersion Differs

200 of 482

Smoothing Limits Inference

  • Consider a study that observes ‘increased’ activation for strong versus weak motor movements.
  • After smoothing we can not distinguish between:
    • Recruitment of more neighboring neurons (bar on right)
    • Increased activation of the same population of neurons (bar on left)

200

2D

1D

Example: note that after smoothing broad low contrast looks line looks like focused high contrast line.

None 10pixel 20pixel

Smoothing Amount

201 of 482

Smoothing Alternatives

  • Gaussian smoothing is great for ‘normal’ (Gaussian) noise: lots of small errors, very few outliers.

  • Gaussian poor for spike noise
    • Outlier contaminates neighbors
  • Alternatives if your data has spikes:
    • Median filters
    • FSL’s SUSAN

201

Gaussian

Median Filter

Gaussian Noise

Gaussian Smooth

Spike Noise

202 of 482

FSL SUSAN: preserve edges, reduce noise

203 of 482

Spatial unwarping

  • Objects distort magnetic field strength
  • Shim attempts to create homogeneous field
    • Impossible for regions with tissue interfaces: e.g. bone, air (sinuses) and brain all near orbital frontal cortex.
  • Errors in shim (inhomogeneity) lead to spatial distortion & signal dropout.
  • We can measure field homogeneity.
  • Fieldmap can unwarp images (FSL’s B0 unwarping, SPM’s FieldMap).
  • We can recover shape, but not lost signal.

203

204 of 482

Intensity unwarping

  • Motion correction creates a spatially stabilized image.
  • However, head motion also changes image intensity – some regions of the brain will appear brighter/darker.
    • SPM: EPI unwarping corrects for brightness changes (right)
    • FSL: You can add motion parameters to statistical model (FEAT stats page).
  • Problem: We will lose statistical power if head motion is task related, e.g. pitch head every time we press a button

204

Above: motion related image intensity changes.

205 of 482

Image Matrices

  • As noted, reslicing data to a grid will lose information.
  • A matrix with 12 numbers can store the consequences of an infinite number of affine transforms without requiring any reslicing.
  • Each NIfTI image records its own transformation matrix.
  • Understanding matrix math is useful for neuroimaging (e.g. it is the ‘Mat’ in SPM’s Matlab).

205

Example NIfTI Affine transform viewed with MRIcron.

206 of 482

Vectors

    • Vectors have a direction and a length
      • The 2D vector [1,0] points East and has a length of 1
      • The 2D vector [0,1] points North and has a length of 1
      • The 2D vector [1,2] points North-East and has a length of 2.23
    • 2D: two values [x,y], 3D: three values [x,y,z]

206

1, 0

0, 1

1, 2

207 of 482

2D Rotation matrix

  • 2D rotation matrix as two vectors (horizontal and vertical)
    • Can be described by four numbers

207

1, 0

0, 1

1, 0

0, 2

.7,-.7

.7, .7

Original

Zoom: ‘Scale’

Rotation

1, 0

1, 1

Shear

-1, 0

0, 1

-Zoom: ‘Flip’

208 of 482

2D Transformation matrix

  • Transformation matrix is a rotation matrix plus translation values (encodes x,y origin of vectors)

n.b. for computations we use 9 values, final row is always 0 0 1 (must have as many rows as columns).

208

1, 0, 0

0, 1, 0

0, 0, 1

Original

1, 0, 1

0, 1, 0

0, 0, 1

2, 0, .2

0, 1, .2

0, 0, 1

Translated+Zoom

Translated

All affine transforms can be combined into a single matrix

209 of 482

3D Matrices are 4x4

  • We can generate 4x4 matrices that will allow us to work with 3D images.
  • A 4x4 matrix, but the last row always ‘0 0 0 1’

fx = (x*i)+(y*j)+(z*k)+l

fy = (x*m)+(y*n)+(z*o)+p

fz = (x*q)+(y*r)+(z*s)+t

209

i

j

k

m

n

o

l

p

q

r

s

t

0

0

0

1

210 of 482

Matrices and 3D space

  • 3D matrices work just like 2D matrices
    • The identity matrix still has 1’s along the diagonal
    • Translations are the values in the final column (constants)
    • Zooms are done by scaling values of a row.
    • Shears are values added to the relevant orthogonal value
    • Rotations use sine/cosine in dimensions of plane.
    • Our twelve numbers can store all of the possible rotations, shears, translations and scaling.
      • Simply multiply previous matrix with our transform.
      • ‘Not commutative’: order crucial: to undo operations we need to put opposite values in reverse order (e.g. 30° rotation followed by X shear 1.2 can be undone by X shear of -1.2 followed by -30° rotation.

210

211 of 482

Transform Matrix and Filtering Demo

211

Select your operation: shear, zoom, rotate and translate.

Order crucial: e.g. 30° rotate: X shear 1.2: -30 ° rotate: X shear -1.2 does not return to source

Add up to 8 successive transforms

Choose an interpolation or smoothing filter.

Filter Kernel

Transformation Matrix

Settings for Current Transform

Source Image

Transformed Image

Specify Additional Transforms

212 of 482

212

Transformed

Source

Amount

Function

Select function: gaussian blur, perspective, intensity gradient, gaussian noise, median filter, fisheye.

‘Combine’ overlays two images (mean, difference, absolute difference, difference2).

Select amount, e.g. blur with 10-pixel FWHM.

Save results to create new source images.

E.G. save gaussian noise, clean up with blur.

E.G. Save blur, use combine for difference with source – result is edge map (unsharp mask)

‘Rotation’ illustrates consequence of successive reslicing (e.g. linear vs sinc filter).

Transformed 2D image and 1D cross-section shown in the right column

213 of 482

Temporal Processing

  • Chris Rorden
  • Temporal Processing can reduce error in our model
    • Slice Time Correction
    • Temporal Autocorrelation
    • High and low pass temporal filtering
    • Temporal Derivatives

213

214 of 482

Slice Timing Problem

  • 3D EPI volume composed of a stack of 2D slices.
  • Each slice acquired at a different timepoint.

214

Slice 4

Slice 3

Slice 2

Slice 1

Time

Slice 4

Slice 3

Slice 2

Slice 1

Slice 4

Slice 3

Slice 2

Slice 1

Slice 4

Slice 3

Slice 2

Slice 1

Slice 4

Slice 3

Slice 2

Slice 1

TR

215 of 482

The slice timing problem

  • Each 2D slice like a photograph.
  • Each 2D slice within a 3D volume taken at different time.
  • Hemodynamic response changes with time.
  • Therefore, we need to adjust for slice timing differences.

215

216 of 482

Why slice time correct?

  • Consider 3D volumes collected as ascending axial slices
    • For each volume, we see inferior slices before superior slices

216

Statistics assume all slices are seen simultaneously…

Time

Time

217 of 482

Slice timing correction

  • Timing of early slices weighted with later image of same slice
  • Timing of late slices is balanced with previous image of same slice
  • Result: each volume represents single point in time
  • Typically, volume corrected to mean volume image time (estimate time of middle slice in volume)

217

Timef

218 of 482

Should we slice time correct?

  • If fast acquisition (TR < 2sec)
    • Very little time difference between slices
    • Therefore, STC will have little influence
  • If we acquire images slowly
    • We only rarely see a particular slice
    • Therefore, STC interpolation will not be very accurate.
  • General guideline: not required for block designs, helpful for event related designs (Sladky et al., 2011; PMID: 21757015)

218

With long TRs, STC can be inaccurate – e.g. miss HRF peak

219 of 482

Creating 3D volumes from 2D EPI

  • Slices can be either collected sequentially interleaved order.
  • Interleaving reduces RF interference between slices.
  • Unfortunately, any head movements will cause worse ‘spin history’ effects for interleaved slices than sequential. (TR will be shorter for regions that were previously in another slice).

219

Sequential

Interleaved

Time

Time

220 of 482

Possible Slice Ordering

  • Ascending
    • 1,2,3,4…n
  • Descending
    • n...4,3,2,1
  • Interleaved Ascending (all vendors except Siemens with even number of slices)
    • 1,3,5…n-1 2,4,6…n
  • Interleaved Ascending (Siemens with even number of slices)
    • 2,4,6…n, 1,3,5…n-1

220

221 of 482

Reference Slice Example

  • Example: 4 Ascending Slices, TR=2000, Reference Slice=2
  • Event 1 occurred 3900ms after acquisition began
  • After slice timing we events appear to occur 500ms earlier
  • So Event 1 should have a reported onset of 3400ms

221

Slice 4

Slice 3

Slice 2

Slice 1

Slice 4

Slice 3

Slice 2

Slice 1

Slice 4

Slice 3

Slice 2

Slice 1

Slice 4

Slice 3

Slice 2

Slice 1

Event Onset Time

Apparent Onset Time

ScreenEvent

222 of 482

Reference Slice

  • We usually start experiment with pulse that signals start of acquisition of first slice.
  • If we time events from this moment, we want our reference slice to be the first in the slice order (1 for ascending or interleaved, n for descending, 2 for Siemens interleaved with odd number of slices).
  • If we make every image look like the middle slice, we need to adjust our event times

222

Time

Slice 4

Slice 3

Slice 2

Slice 1

Slice 4

Slice 3

Slice 2

Slice 1

Slice 4

Slice 3

Slice 2

Slice 1

Slice 4

Slice 3

Slice 2

Slice 1

ScreenEvent

Event Onset Time

223 of 482

Sequential vs Interleaved Volumes

  • Interleaving reduces interference between slices. Unfortunately, any head movements will cause worse ‘spin history’ effects for interleaved slices than sequential. (TR will be shorter for regions that were previously in another slice).
  • Therefore, it is often useful to have ~20% gap between slices: reduces interference for sequential acquisitions, no spin-history artefacts for small head movements in inteleaved acqusitions.

223

Slice 4

Slice 3

Slice 2

Slice 1

Slice 4

Slice 3

Slice 2

Slice 1

Slice 4

Slice 3

Slice 2

Slice 1

Still

Head Motion

Sequential

Interleaved

Still

Head Motion

imaging.mrc-cbu.cam.ac.uk/imaging/CommonArtefacts#spinhistory

224 of 482

Autocorrelated Data

  • Scans are not independent observations - they are temporally autocorrelated (HRF is sluggish)
  • Solutions for temporal autocorrelation
    • FSL: Uses “pre-whitening” is sensitive, but can be biased if K misestimated
    • SPM99: Temporally smooth the data with a known autocorrelation that swamps any intrinsic autocorrelation. Robust, but less sensitive
    • SPM2: restrict K to highpass filter, and estimate residual autocorrelation
  • For more details, see Rik Henson’s page� www.mrc-cbu.cam.ac.uk/Imaging/Common/rikSPM-GLM.ppt

224

225 of 482

Signal Intensity Drift

  • Successive images change brightness.
    • If uncorrected, this drift will reduce statistical power (e.g. blue line in upper image has both task related signal and error from signal drift).
  • Simple correction is ‘global scaling’ (FSL = intensity normalization’, SPM8 = ‘global intensity normalisation’)
    • Make each 3D image have same mean intensity.
    • Problem:
      • If a large portion of the brain shows task related activity, global scaling will reduce related activity and add task-related noise to unrelated brain areas
      • Example: lower panel, where 30% of brain has related activity. Will selectively decrease signal (that is consistent across related voxels) and not reduce noise (which is not).
  • Never use this correction for fMRI!
  • Next slides: temporal filters can preserve BOLD signal while eliminating lower-frequency drift.

225

Data with drift – images get brighter

Corrected with global scaling

see NeuroImage 13, 1193–1206 (2001)

226 of 482

Spectral power of fMRI signal

  • Our raw fMRI data includes
    • Task related frequencies: our signal
      • Block design: fundamental period is twice the duration of block, plus higher frequency harmonics.
        • Below: 15s blocks show peaks at 30 and 15s duration
      • Event related designs:
        • HRF has a frequency with a fundamental period ~20s, harmonics will include higher frequencies.
    • Unrelated frequencies
      • Low frequency scanner drift
      • Aliased physiological artifacts
        • cardiac, respiration

226

227 of 482

High Pass Filter

  • We should always apply a high pass filter (HPF).
    • Eliminate very slow signal changes.
    • Attenuate Scanner drift and other noise.
  • HPF selectively removes low frequencies
  • What value should we use for HPF?
  • Block designs:
    • Our fundamental frequency will be duration of blocks.
      • For 12s-long blocks, frequency is 24s (period for on-off cycle). We would therefore apply a 48-s high pass filter.
  • Event related designs:
    • 100s filter is typical.

227

High Pass Filter

228 of 482

Low Pass Filter

  • A low pass filter (LPF) can eliminate high frequency noise.
  • Event related designs have high frequency information, so low pass filters will reduce signal.
  • In theory, block designs can benefit.
  • In practice, LPF rarely used
    • Most of the MRI noise is in the low frequencies
    • Most high frequency noise (heart, breathing) too high for our sampling rate.

228

Low Pass Filter

229 of 482

Temporal Filtering

  • Nyquist theorem: One can only detect frequencies with a period slower than twice the sampling rate.
  • Aliasing: High frequency information can appear to be lower frequency
    • E.G. wheels appear to spin backwards on TV
  • For fMRI, the TR is our sampling rate (~2sec for whole brain).
    • High frequency noise can include cardiac (~1 Hz) respiration (~0.25 Hz)

229

c

Example: Sample exactly once per cycle, and signal appears constant

Example: Sample 1.5 times per cycle, and you will infer a lower frequency (aliasing)

230 of 482

Physiological Noise

  • Respiration causes head motion
  • Some brain regions show cardiac-related pulsation.
  • What to do about physiological noise?
    1. Ignore
    2. Monitor pulse/respiration during scanning, then retrospectively correct images. (Deckers et al. 2006)
    3. Acquire scans faster than the Nyquist frequency(TR <0.5sec), e.g. Anand et al. 2005
    4. The whole brain's fMRI signal fluctuates with physiological (respiratory) cycle. Therefore, one approach is to model this effect as a regressor in your analysis (Birn, 2006; though global scaling problem).

230

231 of 482

Hemodynamic Response Function (HRF)

  • SPM models HRF using double gamma function: intensity increase followed by undershoot.
  • By default, FSL uses a single gamma function: intensity increase.

231

232 of 482

HRF variability

  • The temporal properties of the HRF vary between people and brain areas (Aguirre et al. 1998).
  • Our statistics uses a generic estimate for the HRF.
  • If our subject’s HRF differs from this canonical model, we will lose statistical power.
  • The common solution is to model both the canonical HRF and its temporal derivative.

232

Time (seconds)

Image Brightness

233 of 482

Temporal Derivative

  • Temporal Derivative is the rate of change in the convolved HRF.
  • TD is to HRF as acceleration is to speed.
  • By adding TD to statistical model, we allow some variability in individual HRF to be removed from model.

233

Time (sec)

0 5 10 15

  • HRF

-TD

234 of 482

How does the TD work?

  • Consider individual with slightly slow HRF (green line).
  • The canonical (red) HRF is not a great match, so the model’s fit will not be strong.
  • The TD (blue) predicts most of the discrepancy between the canonical and observed HRF.
    • Adding the TD as a regressor will remove the TD’s effect from the observed data. The result (add blue to green) will allow a better fit of the canonical HRF.

234

  • TD usually a nuisance variable in our analysis
    • Reduces noise by explaining some variability.
  • You can analyze TD and use HRF as regressor:
    • Analyze HRF: magnitude inference
    • Analyze TD: latency inference
  • NB: TD can be detrimental to block designs.

235 of 482

Alternatives to TD

  • Another approach is to directly tune the HRF.
    • By default, FSL uses a single gamma function for convolution
    • Alternatively, you can design more accurate convolutions (e.g. FSL’s FLOBs, right). Note that some of these options can make all your statistics two-tailed.

235

236 of 482

Temporal Filter Demo

236

Note filters have roll-off, e.g. 12 Hz low pass will only partially attenuate 20 Hz signal.

Use left sliders to define input frequencies.

E.G. 3, 59 and 4 Hz

Right slider sets filter threshold

E.G. 12 Hz

Select filter type: low pass, high pass, notch or band pass.

237 of 482

Detecting Subtle Changes in Structure

  • Christian Gaser and Chris Rorden
    • Voxel Based Morphometry
      • Segmentation – identifying gray and white matter
      • Modulation- adjusting for normalization’s spatial distortions.
    • Diffusion Tensor Imaging
      • Measuring white matter integrity
      • Tractography and analysis.

Many images inspired or created by Christian Gaser.

237

238 of 482

Voxel Based Morphometry

  • Most lectures in course focus on functional MRI.
  • However, anatomical scans can also help us infer brain function.
    • Do people with chronic epilepsy show brain atrophy?
    • Which brain regions atrophy with age?
    • Do people with good spatial memory (taxi drivers) have different anatomy than other people?
  • Voxel based morphometry is a tool to relate gray and white matter concentration with medical history and behavior

238

239 of 482

Morphometry

  • Morphometry examines the shape, volume and integrity of structures.
  • Classically, morphometry was conducted by manually segmenting a few regions of interest.
  • Voxel based morphometry conducts an independent statistical comparison for each voxel in the brain.

239

240 of 482

Voxel Based Morphometry

  • VBM has some advantages over manual tracing:
    • Automated: fast and not subject to individual bias.
    • Able to examine regions that are not anatomically well defined.
    • Able to see the whole brain
    • Normalization compensates for overall differences in brain volume, which can add variance to manual tracing of un-normalized images.

240

241 of 482

VBM disadvantages

  • VBM has clear disadvantages
    • Crucially depends on accurate normalization.
    • Low power: gray matter random fields are very heterogeneous (individual patterns of sulcal folding registration is always poor.
    • Crucially depends on a priori probability maps.
    • Assumes normal gray-white contrast. E.G. Focal Cortical Dysplasia is a challenge (blurring)
    • Looks for differences in volume, can be disrupted if shape of brain is different: problem for developmental disorders

241

242 of 482

Segmentation

  • Start with high quality MRI scan
  • Classify tissue types (gray matter in this example)

242

243 of 482

Partitioning Tissue Types

  • VBM segments image into three tissue types: gray matter, white matter and CSF.
    • Typically T1 scans (spatial resolution, gray-white contrast).
    • Only three tissue types: will not cope with large lesions.
    • Probability map: each voxel has a 0..100% chance of being one of the 3 tissue types.

243

T1

white matter

gray matter

CSF

244 of 482

Intensity based segmentation

  • White matter appears bright on T1 scans
  • Therefore, bright threshold selects white matter.
  • Problem: What is the appropriate threshold?

244

  • Automatic routines threshold continuous images to have 2,3,4.. Discrete intensity levels.
  • Examples: Otsu’s Method, Isodata, Fuzzy C-means, etc.

245 of 482

Partial Volume Effects

  • MRI and computer screens segment images into discrete voxels/pixels.
  • However, voxels/pixels at tissue boundaries contain a mixture to tissue types.
  • Binary segmentation will lose this information.
  • Solution: Compute continuous image that maps proportion of tissue in each voxel.

245

246 of 482

Segmentation I: Image Intensity

First pass limitations:

  • Scans are noisy: image intensity grainy.
  • Intensity is ambiguous: Fat on scalp similar brightness to WM, muscles on scalp similar to GM.
  • Partial volume problem: a voxel that is filled with 50% WM and 50% CSF will have same brightness as GM.

246

Air

WM

GM

CSF

Image brightness

frequency

Initial estimate for GM

Based on image brightness

p=0.95

p=0.05

247 of 482

Homogeneity correction crucial

  • Field inhomogeneity will disrupt intensity based segmentation.
  • Bias correction required.

247

no

correction

T1

WM

GM

Estimate

248 of 482

Segmentation II: Voxel location

  • Maximization of a posteriori probability: Bayesian approach (expectation maximization)
  • Analogy:
    • We know that last year there were 248 of 365 days with rain in Norway (p=0.68)
    • the conditional (or posterior) probability for rain in Bergen will be p > 0.5
  • Tissue probability maps
    • For each voxel, shows probability of tissue type for a large group of individuals.
    • E.G. voxel may have historically had 25% chance of being gray matter, 45% white matter and 30% CSF

248

T1

white

gray

CSF

249 of 482

Segmentation overview

249

Intensity based estimate for GM

p=0.95

p=0.95

p=0.90

p=0.01

Final result

a priori GM map

p=0.95

p=0.01

Source Image

250 of 482

Normalization is crucial

  • Poor normalization has two problems
    • Image will not be registered with a priori map = poor segmentation.
    • Images from different people will not be registered: we will compare different brain areas.
    • Paradox: good segmentation requires good normalization, but good normalization could benefit from good segmentation (we want to normalize gray matter to gray matter TPM).
  • Custom template and tissue priors are useful
    • Accounts for characteristics of your scanner.
    • Accounts for characteristics of your population (e.g. age).
    • Age specific templates available for young (PMID 12203688) and old (PMID 15955500) populations
    • Easy to generate PMID 18424084
    • Must be independent of your analysis:
      • Either formed from combination of both groups (control+experimental) or from independent control group.

250

251 of 482

Two step segmentation

251

segmentation II

customized template

MNI template

segmentation I

norma-

lization

segmentation II

Step I:

Creation of customized template

segmentation I

norma-

lization

Step II:

Optimized segmentation

averaging

252 of 482

Image cleanup

252

segmented

mask

masked

T1

Heuristic: Gray matter always thick band near white matter. Dilate (inflate) white matter for mask.

253 of 482

SPM5+ segmentation

  • Unified segmentation (SPM5+)
  • Iterated steps of segmentation estimation, bias correction and warping
  • Impact
  • Warping of prior images during segmentation makes segmentation more independent from size, position, and shape of prior images
  • much slower than SPM2

253

40 iterations

segmentation

40 iterations

bias correction

20 iterations

warping

no significant change of estimate

significant change of estimate

254 of 482

SPM8 new segmentation

  • SPM8’s default segmentation similar to SPM5
  • There is a hidden new segmentation
    • From Graphics window, Choose Tasks/SPM/Tools/NewSegment
    • Allow multiple channels, for example combine T1 and T2 for
      • better tissue classification
    • Includes more tissue types (gray, white, CSF, bone, other soft tissue, air.
      • Not as finicky with regards to starting estimate

254

255 of 482

Building a better template: data

  • 25 participants scanned
    • High resolution: 0.85mm3 vs 1mm3
    • Large FOV: 320x320 matrix vs 256x256 matrix
    • Both T1 and T2-weighted images for each person (with same orientation and coverage)
    • Uses revolutionary 3D SPACE sequence that provides high resolution
    • T2 has opposite contrasts for CSF (bright) and bone (dark)

255

T1

T2

256 of 482

Building a better template: normalization

  • SPM8 affine coregistration to MNI space.
  • ANTS diffeomorphic normalization used to generate mean image (25 Core i7 * 70 hours)
  • Mean image normalized to MNI space using SPM8 ‘new seg’.

256

Subj1

Subj2

Mean

257 of 482

Building a better template: segmentation

  • Crude tissue probability map (TPM) created:
    • Default newseg TPM for brain case
    • Neck region generated from mean image
  • Refined TPM created:
    • Each individual’s T1+T2 scans segmented using newseg
    • TPM created from mean segmentation
    • TPM manually edited to remove errors
    • TPM smoothed by 2mm FWHM
    • Repeat previous steps four times.

257

TPM : CSF

258 of 482

Building a better template: results

  • Our new template accurately segments tissues
  • Includes neck regions
  • Works well, even if only provided with a T1 scan
    • We no longer need a CT scan for bone, or even a T2 scan…

258

Single subject segmentation

259 of 482

Voxel Based Morphometry

  • We can statistically analyze gray matter atrophy

259

Epilepsy

260 of 482

Segmentation Problem

  • If someone has atrophy, normalization will stretch gray matter to make brain match healthy template.
  • This will reduce ability to detect differences

260

Normalization will squish this region

Normalization will stretch this region

261 of 482

Image Modulation

    • Analogy: as we blow up a balloon, the surface becomes thinner. Likewise, as we expand a brain area it’s volume is reduced.

261

Source

Template

Modulated

Without modulation

262 of 482

Image Modulation

  • Optimized Segmentation can adjust for distortions caused during normalization.
  • Areas that had to be stretched are assumed to have less volume than areas that were compressed.
    • Corrects for changes in volume induced by nonlinear normalization
    • Multiplies voxel intensities by a modulation matrix derived from the normalization step
    • Allows us to make inferences about volume, instead of concentration.

262

263 of 482

Modulation: size vs shape

  • Williams Syndrome is due to chromosome 6 deletions
    • Associated with behavioral symptoms: outgoing, relatively good with language, impaired with calculations.
  • Manual morphometry shows differences in shape and size: small superior parietal lobe, shallow intraparietal sulcus and small/flat corpus callosum.
  • Eckert et al. 2006 note VBM better at detecting SPL and CC differences when images are modulated.
  • However, modulated images also detected differences in brain stem and ventral frontal cortex that may be artifacts of the shape differences (e.g. deformations)
  • They suggest caution interpreting modulated gray matter findings when there are gross shape and size differences between groups.
  • With modulation, size and shape can interact.

263

264 of 482

Modulation is optional and controversial

  • Modulation will smooth images, specificity will decrease
  • Alternatively, you can covary overall brain volume by including GM or GM+WM as nuisance regressor.

264

Example showing danger of modulation. This image comes from an elderly participant, with relatively large ventricles. Normalization adjusts ventricle size, but the deformations are spatially smooth, so tissue near the ventricles (e.g. caudate) are also being spatially compressed.

[Deformations exaggerated for exposition]

265 of 482

DBM

  • Deformation-based Morphometry examines absolute displacements.
  • E.G. Mean differences (mapping from an average female to male brain).

265

266 of 482

VBM and developmental syndromes

  • Williams Syndrome
    • Developmental syndrome: Chromosome 7
    • Manual Morphology shows
      • 8-18% decrease in posterior GM/WM
        • Most consistent finding is reduced intra-parietal sulcus depth and superior parietal lobe volume (see figure)
        • Relatively preserved frontal GM/WM
        • Creates unique shape
    • Unique spatial distribution of gross volume loss influences VBM results depending on whether modulation is used

266

Eckert et al. 2006

Control WS

267 of 482

Modulation and shape

  • Shape differences influence modulated data.
  • Deformation Based Morphometry can identify shape/gross volumetric differences.

267

Eckert et al., 2006

268 of 482

Cortical Thickness

  • New methods can complement VBM.
  • Freesurfer’s cortical thickness is powerful tool.
  • Requires very good T1 scans.

268

Modulated VBM

Freesurfer

Age-related declines in

gray matter volume and cortical thickness

269 of 482

VBM comments

  • Longitudinal VBM:
    • Sensitive way to detect atrophy through time. Using the same individual reduces variability.
  • Cross sectional studies
    • Can compare two distinct populations
    • Can also examine atrophy through time, though will require more people than longitudinal VBM.
  • VBM findings are first step in understanding structural changes.
    • www.tina-vision.net/docs/memos/2003-011.pdf
    • Bookstein, 2001
    • Davatzikos, 2004

269

270 of 482

Diffusion Weighted Imaging

  • T1/T2 scans do not show acute injury. Diffusion weighted scans do.
  • DW scans identify areas of permanent injury
  • Measures random motion of water molecules.
    • In ventricles, CSF is unconstrained, so high velocity diffusion
    • In brain tissue, CSF more constrained, so less diffusion.

270

T2

DW

Core clinical tool: Stroke does not appear on T1 and T2 scans for the day, but does on DWI

271 of 482

Diffusion Tensor Imaging (DTI)

  • DTI is an extension of DWI that allows us to measure direction of motion.
  • DTI allows us to measure both the velocity and preferred direction of diffusion
    • In gray matter, diffusion is isotropic (similar in all directions)
    • In white matter, diffusion is anisotropic (prefers motion along fibers).

271

Unconstrained

(fast)

Constrained

(slow)

272 of 482

DTI

  • The amount of diffusion occurring in one pixel of a MR image is termed the Apparent Diffusion Coefficient (ADC) or Mean Diffusivity (MD).
  • The non-uniformity of diffusion with direction is usually described by the term Fractional Anisotropy (FA).

272

MD differs

FA differs

273 of 482

What is a tensor?

  • A tensor is composed of three vectors.
    • Think of a vector like an arrow in 3D space – it points in a direction and has a length.
  • The first vector is the longest – it points along the principle axis.
  • The second and third vectors are orthogonal to the first.

273

Sphere: V1=V2=V3

Football:

V1>V2

V1>V3

V3 = V2

???:

V1>V2>V3

274 of 482

Diffusion Tensor Imaging

  • To create a tensor, we need to collect multiple directions.
  • Typically 30-100 directions.
  • More directions offer a better estimate of optimal tensor.

274

275 of 482

DTI

MD

275

FA

Principle Tensor Vector

Red: left/right

Green: posterior/anterior

Blue: inferior/superior

276 of 482

Tractography

  • DTI can be used for tractography.
  • This can identify whether pathways are abnormal.

276

277 of 482

DTI limitations

  • DTI has problems where multiple fibers are in one voxel.
    • Are fibers ‘kissing’ or ‘crossing’?
    • FA is reduced in voxels where fibers are not parallel with each other.

277

278 of 482

Deterministic Tractography

  • Once we have normalized scans, we can measure how much white matter is in particular brain regions.
  • Allows morphometry, just like VBM except based on DTI
  • FSL’s TBSS is a popular tool.
  • Disadvantage: good for subtle differences. After big injury tracts not aligned/present.

278

279 of 482

Diffusion Spatial Distortion

All EPI scans are spatially distorted. We can acquire data where some volumes have reversed phase encoding: all volumes distorted same magnitude, but opposite direction. TOPUP can infer undistorted shape.

Gradient Vector Direction

280 of 482

Diffusion Image Undistortion

  • Images distorted by phase encoding
  • Images distorted by eddy currents
  • Careful acquisition allows TOPUP/Eddy to model both and correct.

281 of 482

Probabilistic Tractography

  • Undistort, Identify fibers, estimate connections between regions.

281

282 of 482

Probabilistic Tractography

  • Tractography to predict stroke outcome.
  • Better than TBSS for large injuries

282

283 of 482

Catastrophic structural changes

  • Chris Rorden
  • Voxelwise Lesion Symptom Mapping
    • Motivation: strengths and limitations.
    • Should we examine acute or chronic injury?
    • Visualizing injury.
    • Mapping brain injury.
    • Normalization lesion maps.
    • Lesion mapping statistics.

“George Miller coined the term ‘cognitive neuroscience’…we already knew that neuropsychology was not what we had in mind…the bankruptcy and intellectual impoverishment of that idea seemed self evident.”

-Michael S. Gazzaniga, 2000

283

284 of 482

Thought experiment

  • What brain injury leads to visual field injury?

284

285 of 482

Thought experiment

  • To validate method, lets ask a question we know the answer to.
  • Every neurological text book tells us the brain areas associated with visual field defects.
  • We know calcarine sulcus injury causes field cuts.

286 of 482

Mapping Lesions

  • We can create an overlay plot of damaged region.
  • For example: here are the lesion maps for 36 people with visual field defects:

286

287 of 482

The problem with overlay plots

  • Overlay plots are misleading:
    • Highlight areas involved with task (good)
    • Highlight areas commonly damaged (bad)
  • Brain damage is not random: some brain areas more vulnerable. Overlay plots highlight these areas of common damage.
    • Car analogy: some parts fail more often than others.
  • Solution: collect data from patients with similar injury but without target deficit.

287

288 of 482

Value of control data

  • Solution: collect data from patients with similar injury but without target deficit:

288

289 of 482

Statistical plots

  • We can use statistics to identify areas that reliably predict deficit
  • E.G. Damage that results in visual field cuts

289

290 of 482

Catastrophic change across time

290

“Bathtub curve”: e.g. birth/manufacturing defects, vs worn out.

Obesity epidemic and changes in smoking, blood pressure, etc. are changing this a bit...

291 of 482

Lesion symptom mapping

  • Lesion symptom mapping infers brain function by observing consequences of brain injury.
  • Historically, one of the most influential tools in understanding brain function
    • Language: Broca’s and Wernicke’s Area
    • Memory: anterograde amnesia
    • Vision: Prosopagnosia, achromatopsia
    • Emotions, Motor Control, etc.

291

292 of 482

Lesion mapping

  • Perhaps lesion mapping was historically important because it was the only tool.
  • New tools address some of the weaknesses.

Disadvantages:

    • Poor spatial precision: lesions are messy: location and extent influenced by vasculature, not functional areas.
    • Poor temporal precision: lesions are permanent.
      • Sequence of information processing difficult to assess.
    • Dilemma of acute vs chronic lesion mapping.

292

293 of 482

Acute vs chronic lesion mapping

  • Acute lesion mapping:
    • Initially after a stroke, one sees widespread dysfunction. Intact areas are disrupted as they depend on damaged areas.
    • Chronically, the brain is plastic. So it is difficult to infer what a brain region used to do.
  • Ideally, we would examine both acute and chronic effects.
    • Acute injury is more clinically relevant.
    • Chronic deficits: more stable and identifies functions that can not be compensated.

293

294 of 482

Diaschisis

Diaschisis is a sudden loss (or change) of function in a portion of the brain connected to a distant, but damaged, brain area. The site of the originally damaged area and of the diaschisis are connected to each other by neurons.

295 of 482

Lesion mapping

Advantages:

    • Stronger inference:
      • Activation techniques (fMRI, ERP): is area involved with task?
      • Disruption techniques (TMS, lesions): is area required by task?
    • Understand function of tightly coupled network.
      • Activation techniques like listening to whole orchestra: hard to identify differential contribution of musicians.
        • Example: highly connected visual attention network tends to be activated in concert, even though lesions suggest different functions.
      • Lesion method: How does music change if individual stops playing?
    • Clinically relevant:
      • Can we predict recovery or best treatment?

295

296 of 482

Stroke location not random

  • Driven by vasculature
  • Two types: blockage vs bleed

Watershed Regions →

Watershed Regions →

297 of 482

CT versus MRI scans

  • CT
    • Clinically crucial:
      • Detect acute hemorrhage
      • Can be conducted when MRI contraindicated
    • Limited research potential
      • Exposes individual to radiation
        • Difficult to collect control data
        • Typically very thick slices, hard to normalize
      • Little contrast between gray and white matter

297

  • MRI
    • Different contrasts (T1,T2, DWI)
    • No radiation, so we can collect thin slices if we have time.

298 of 482

Conventional MRI scans

  • T1 (anatomical): fast to acquire, excellent structural detail (e.g. white and gray matter).
  • T2 (pathological): slower to acquire, therefore usually lower resolution than T1. Excellent for finding lesions.

298

T2

Air

T1

T2

T1

CSF

Bone

Air

CSF

WM

GM

GM

WM

Fat

edema

Air

299 of 482

Lesion mapping: T1 vs T2

  • T1 scans offer good spatial resolution.
  • T2 scans better for identifying extent of injury, but poor spatial resolution.
  • Solutions:
    1. Acquire chronic T1 (>8 weeks)
    2. Acquire both T1 and T2, use T2 to guide mapping on T1.
    3. Acquire T2, map on normalized iconic brain (requires expert lesion mapper).
    4. Acquire high resolution T2 image, use for both mapping and normalization (e.g. 1x1x1mm T2 ~9min). Requires latest generation MRI.
  • Note: Many clinicians like FLAIR as it attenuates CSF. Lesion signal similar to T2. Normalization tricky (thick slices, no standard template).

T1

299

T2

FLAIR

300 of 482

Imaging acute stroke

  • T1/T2 MRI and x-rays can not visualize hyperacute ischemic strokes.
    • Acute: Subtle low signal on T1, often difficult to see, and high signal (hyperintense) on spin density and/or T2-weighted and proton density-weighted images starting 8 h after onset. Mass effect maximal at 24 h, sometimes starting 2 h after onset.
    • Subacute (1 wk or older): Low signal on T1, high signal on T2-weighted images. Follows vascular distribution. Revascularization and blood-brain barrier breakdown may cause enhancement with contrast agents.
    • Old (several weeks to years): Low signal on T1, high signal on T2. Mass effect disappears after 1 mo. Loss of tissue with large infarcts. Parenchymal enhancement fades after several months.

www.strokecenter.org/education/ct-mri_criteria/

300

www.med.harvard.edu/AANLIB/

T2

acute

+3days

CT

301 of 482

Imaging Hyperacute Stroke

T1/T2 scans do not show acute injury.

Diffusion and Perfusion weighted scans show acute injury:

    • Diffusion images show permanent injury. Perhaps good predictor of eventual recovery.
    • Perfusion scans show functional injury. Best correlate of acute behavior.
    • Difference between DWI and PWI is tissue that might survive.
      • Diaschisis: regions connected to damaged areas show acute hypoperfusion and dysfunction.
      • Hypoperfused regions may have enough collateral blood supply to survive but not function correctly (misery perfusion).

301

T2

DW

302 of 482

Perfusion imaging

Allows us to measure perfusion

    • Static images can detect stenosis and aneurysms (MRA)
    • Dynamic images can measure perfusion (PWI)
      • Measure latency – acute latency appears to be strong predictor of functional deficits.
      • Measure volume
    • Perfusion imaging uses either Gadolinium or blood as contrast agent.
      • Gd offers strong signal. However, only a few boluses can be used and requires medical team in case of (very rare) anaphylaxis.
      • Arterial Spin Labelling can be conducted continuously (CASL). Good CASL requires good hardware.

302

303 of 482

DTI in stroke

  • Diffusion tensor imaging is an extension of Diffusion Weighted Imaging.
  • DTI allows us to examine integrity and direction of fiber tracts.
  • This will allow us to examine disconnection syndromes (see Catani).
  • Analysis of DTI still in infancy.

303

v

DTI - stroke

Healthy

304 of 482

fMRI in stroke

  • fMRI analysis of stroke difficult.
  • Hemodynamic response often disrupted.
    • Misery perfusion: system always compensating for reduced blood flow, so no dynamic ability to increase.
    • Luxury perfusion: destroyed tissue no longer requires blood, so regulation not required for surviving tissue.

304

305 of 482

Summary

  • Modality of scanning depends on age of lesion.
  • Hyperacute imaging will require PWI/DWI.
  • Older injuries seen on T1 and T2.
  • Different modalities provide different information: can we combine information across modalities?
  • Our analysis should be based on individuals with similar delay between injury and observation.

305

306 of 482

Lesion analysis

  • fMRI – many low resolution volumes per individual
  • Typical stages (SPM):
    1. Slice Time Correct
    2. Motion correct
    3. Normalize
    4. Smooth (spatial, temporal)
    5. Statistics

  • Lesions – one high quality scan per indivdiual
  • Typical stages:
    1. Map Lesion
    2. Normalize
    3. Statistics

306

307 of 482

Map lesion

  • We need to map the region of brain injury.
  • We can use MRIcron to draw the location of the lesion.
  • Note that our high-resolution T1 scans may not show full extent of injury (must refer to other scans).

307

308 of 482

Normalization

  • Normalization adjusts size and shape of brain.
  • Aligns brains to ‘template’ image in stereotaxic space.
  • Allows us to compare brains between individuals.

308

309 of 482

Normalization Transforms

309

Shear

Translation

Rotation

Zoom

Linear

Non-linear

When combined, Can have relatively local influence

Global influence

310 of 482

Lesions disrupt normalization

  • Normalization works by adjusting the image orientation until ‘difference’ with template is minimized
  • However, region of injury appears different in image and template
  • Therefore, normalization will attempt to warp lesioned region.
  • One solution is to use ‘masked’ normalization (Brett et al., 2001), however SPM5 and later are very robust (Crinion et al., 2007)

Image Template Variance

310

nonlinear normalization can ‘shrink’ lesion volume.

311 of 482

Lesions disrupt Normalization

  • Template does not have injury.
  • If injury visible on scan it will disrupt normalization.
  • For hyper acute scans: normalize using T2 or T1 scan (as injury not visible yet) and apply to DWI/PWI

312 of 482

Normalizing Stroke Scans

  • Strokes typically unilateral.
  • Brain roughly symmetrical
  • Trick: Create ‘healed’ brain that matches template. Estimate warping based on this ‘healthy’ brain.

313 of 482

Lesion analysis

Two classes of analysis:

    • Binomial analysis: if behavior falls into two mutually exclusive groups (e.g. Broca’s Aphasia, No Broca’s Aphasia).
      • Traditional test: Fisher-exact or Chi-squared test
      • Alternative test: Liebermeister quasi-exact test
    • Continuous analysis: if data is not binomial. (e.g. number of words starting with ‘b’ spontaneously reported in two minutes).
      • Traditional test: t-test
      • Alternative test: Brunner-Munzel test.

313

314 of 482

Binomial Data

  • Visual neglect: ask patients for find each letter ‘A’ in cluttered display (60 possible).
  • People who detect < 55 are considered to have a deficit.

314

Deficit

Control

315 of 482

Statistics

  • Our goal is to see if brain anatomy predicts behavior.
  • We collect data scans and behavioral data from many stroke patients.
  • Consider 24 patients – half with deficits.
  • Statistics will identify regions that predict deficit.

12 people w. cancellation deficit

315

Statistical

12 people w/o cancellation deficit

316 of 482

T-test lesion analysis.

  • A t-test requires two groups and one continuous variable.
  • The VLSM t-test is orthogonal to t-tests used for fMRI/VBM:
    • fMRI/VBM t-tests:
      • Deficit defines two groups.
      • Voxel intensity provides continuous variable.
    • VLSM
      • Voxel intensity (lesion/no lesion) defines two groups.
      • Behavioral performance provides continuous variable.
  • Note VLSM group size varies from voxel-to-voxel.
  • Statistical tests provide optimal power both groups have the same number of observations (balanced).
    • Therefore, VLSM power fluctuates across voxels
    • We can not make inferences of voxels that are rarely damaged or always damaged (also true for binomial tests).

316

317 of 482

Neuropsychological Data

  • Visual neglect: ask patients for find each letter ‘A’ in cluttered display (60 possible).
  • Continuous measure is number of ‘A’s detected.
  • Performance on neuropsychological tasks is rarely normal.
    • Skewed distribution: many at ceiling

317

Performance

318 of 482

Advanced VLSM statistics

  • Lesion volume always correlates with deficits.
    • Small lesions unlikely to knock out entire functional region.
    • Large lesions knock out more regions.
  • Therefore, previous tests can not distinguish between equipotentiality and localized function.
  • Logistic regression can covary out lesion volume: is location still a good predictor independent of volume? (Karnath et al., 2004).
  • As lesion volume correlates well with deficits, LR analysis offers poor statistical power but strong inference.

318

319 of 482

Problems with lesion-behavior inference

  • Group lesion studies may have biases based on vasculature (Nachev)
    • Lesions not random, typically middle cerebral artery territory
      • Small lesions: red zone
      • Medium lesions: green zone
      • Large regions: blue zone
    • Note that VLSM of a task that relies on frontal cortex may incidentally detect posterior area as well.
  • Single patient studies may simply sample outliers to find ‘double dissociations’ (Plunkett)

319

Behavior A

Impaired

Intact

Impaired

Intact

Behavior B

320 of 482

Statistical thresholding

  • The bad news:
    • Typical fMRI study uses 3x3x3mm voxels.
    • Typical VLSM study uses 1x1x1 voxels.
    • x27 the number of voxels.
    • Bonferroni correction leads to exceptionally low power.
  • The good news:
    • Lesions are large, contiguous across individuals, therefore less within-subject variability than fMRI.
    • Ideally suited for Permutation Thresholding (Frank et al. 1997; Kimberg et al. in press).
  • Solution: use either FDR or permutation thresholds.
    • My NPM provides both.
  • Alternative: region of interest analysis

320

321 of 482

VLSM applied to surgery

  • Lesion analysis typically used to assess brain function.
  • Technique can also refine neurosurgery, e.g. epilepsy.
      • Sustained uncontrolled seizures are a problem:
        • Seizures lead to more seizures (vicious circle)
        • Leads to brain damage and cognitive decline
        • Impairs lifestyle and job options (driving car, tractor, etc.)
      • If drugs do not stop seizures, surgery can be used.
      • Surgery attempts to remove origin of seizures.
        • Current surgery attempts to remove hippocampus and amygdala
        • Fails to stop seizures in around 30% of patients
      • Can we map lesions to identify brain regions crucial to stopping seizures?

321

322 of 482

Improving neurosurgery

322

  • VLSM predicts lesion-symptom associations.
  • Often used to ask ‘which strokes lead to a particular symptom’.
  • However, we can also ask ‘which brain surgery leads to best recovery.’
  • Consider epilepsy surgery:
  • Current surgery attemps to remove hippocampus.
  • Large anterior portion of temporal lobe removed.
  • Some variability in which regions are removed.

323 of 482

Overlay: regions typically removed

323

1

33

-11

-27

-21

  • Overlay shows locations typically removed.
  • Note: we can not determine whether a region correlates with outcome if it is virtually always or virtually never resected.

324 of 482

324

-15

-30

-20

0

3.66

Statistics - hippocampus and entorhinal cortex is crucial

  • Resection locations that predict good outcome.
  • Future work: does removal of entorhinal cortex have behavioral consequences (e.g. poor memory).

325 of 482

Resting State MRI

(Grigori Yourganov)

326 of 482

Protocol of resting-state fMRI scanning

  • Unlike task fMRI, resting-state fMRI is not associated with performing any task
  • Duration: 10-20 minutes
  • TR: 0.8 – 1.5 seconds
  • Voxel size: 2-3 mm3
  • If possible, acquire an additional scan with reversed phase (to reduce special distortions)
  • The subjects have to keep their heads still
    • If possible, eyes closed

327 of 482

Resting-state functional connectivity

  • Brain is organized into a hierarchy of networks
  • The activity within these networks is correlated, even during rest
  • Resting-state fMRI offers a great way to study the network organization of the brain
  • Seed-based analysis: pick a location in the brain (“seed”), compute the correlation of BOLD signal between the seed and all other voxels

328 of 482

Example: correlations and anticorrelations during resting state

Seed: posterior cingulate cortex (PCC)

Fox et al., PNAS, 2005

329 of 482

Anticorrelated networks: default mode network and task-positive network

  • Fox et al., PNAS 2005
  • Combining correlations from 6 seeds

330 of 482

Resting-state connectivity with Independent Component Analysis (ICA)

  • Every voxel, in turn, serves as a seed
  • We compute the correlation of BOLD signal between every pair of voxels
  • We end up with a HUGE matrix of voxelwise functional connectivity
  • This matrix is decomposed into most salient patterns of connectivity (independent components)
    • Beckmann & Smith, IEEE Transactions on Medical Imaging, 2004

331 of 482

Correspondence between networks during resting state and task

Default mode

Auditory perception

Cognition /

memory

Pain

Smith et al., PNAS, 2009

Left: resting-state network

Right: task network

332 of 482

Preprocessing of resting-state fMRI data

  • Spatial smoothing
  • Slice scan timing correction (if needed)
  • Motion correction
    • Motion scrubbing: if there is a large amount of motion between a volume and the previous volume, discard the volume
  • Bandpass filtering: keep the signal within the 0.01-0.1 Hz band
  • Regression of mean signal of white matter and of cerebro-spinal fluid

333 of 482

Parcellation according to AICHA atlas

384 grey-matter ROIs

Joliot et al., Journal of Neuroscience Methods, 2015

334 of 482

Graph theory

  • Nodes: brain areas
  • Edges: strength of functional connectivity (correlation of BOLD signal between a pair of nodes)

Functional connectivity

335 of 482

Resting-state connectivity and hubs

  • “Shortest” paths: paths that have the highest summary value of functional connectivity
  • Hubs: nodes which are included in a large number of shortest paths

van den Heuvel & Sporns, Trends in cognitive sciences, 2013

  • Hubs play an important role in integrating the information across domains
  • Damage to hubs has a big impact on the brain

336 of 482

Functional connectivity in stroke

  • Stroke has a big impact on functional connectivity
  • The lesioned areas are destroyed, and their functional connectivity to other areas is greatly diminished
  • There is a drop in functional connectivity between the hemispheres
  • The white-matter tracts connected to lesioned areas might degenerate over time
  • On the other hand, white-matter fibers between intact areas might strengthen during recovery

337 of 482

Associations between functional connections and post-stroke speech/language impairment

Auditory comprehension

Speech Fluency

Yourganov et al., Neuroimage, 2018

338 of 482

Effect of left-hemisphere stroke on right-hemisphere functional connectivity

339 of 482

Effect of left-hemisphere stroke on right-hemisphere functional connectivity

340 of 482

GUEST LECTURE:

White Matter Hyperintensities

341 of 482

342 of 482

GUEST LECTURE:

Brain Stimulation

343 of 482

Transcranial Magnetic Stimulation:� �Noninvasively Assaying and Affecting the Function of the Human Brain�Roger D. Newman-Norlund, Ph.D., Chip Epstein, Ph.D. and Christopher Rorden, Ph.D.�Department of Psychology,University of South Carolina

344 of 482

43 AD – Scribonious Largus

From: Alvaro Pascual-Leone and Timothy Wagner, 2007

TMS Background

345 of 482

1755 – Leroy

1804 – Aldini

When Aldini applied conducting rods, connected to a large battery, to Forster's face, "the jaw began to quiver, the adjoining muscles were horribly contorted, and the left eye actually opened". The climax of the performance came as Aldini probed Forster's rectum, causing his clenched fist to punch the air, as if in fury, his legs to kick and his back to arch violently."

From: Alvaro Pascual-Leone and Timothy Wagner, 2007

TMS Background

346 of 482

Williamsburg, VA, circa 1840

From: Alvaro Pascual-Leone and Timothy Wagner, 2007

TMS Background

347 of 482

TMS Background

Perception of magnetophosphenes (retina or Olobe) via ‘head stimulation’

1910

1974

First TMS system (1985) for focused stimulation of the brain.

348 of 482

TMS Background

349 of 482

TMS : Anatomy of the System

Stimulating Coil

Multi-DOF Arm

Cooling System

MagPro X100

MagStim

Control Panel

350 of 482

TMS : Anatomy of the System

351 of 482

TMS : Let’s Accessorize

  • Different coils for different types of stimulation protocols.

352 of 482

TMS : Let’s Accessorize

  • AP sham coil developed together with

Mark George, MD (MUSC).

353 of 482

� Biophysics of TMS (1)��An electric field E is induced by a changing magnetic field B, according to �� E ~ dB/dt��The magnetic field passes through tissue, but the induced electric field does the work.

TMS Biophysics

354 of 482

Biophysics of TMS (2)�The induced electric field is strongly constrained to an orientation parallel with the brain surface.��The neural elements most easily activated by TMS are myelinated axons:�— usually belonging to interneurons�— probably at branch points and bends�—at a level near the gray-white junction

TMS Biophysics

355 of 482

TMS Biophysics

356 of 482

Electric and Magnetic Fields with Figure 8 TMS Coil

TMS Biophysics

357 of 482

Area of Induced E-field Beneath a Figure-8 Coil

TMS Biophysics

358 of 482

359 of 482

360 of 482

The intersection of the curves is the only depth at which the same electric field magnitude is associated with the same effect (motor threshold.)

361 of 482

The site of stimulation appears to be close to the the level of the gray-white matter junction for superficial gyri.

362 of 482

The fundamental TMS circuit.

TMS Biophysics

363 of 482

TMS Biophysics

364 of 482

TMS coil current

8kA

Magnetic field pulse

2.5T

Rate of change of

magnetic field

30kT/s

Induced tissue current

15mA/cm2

Induced electric field

500v/m

TMS Biophysics

365 of 482

TMS Efficiency and Pulse Shape:Monophasic designs tend to waste energy at the end of�the pulse. Biphasic waveforms can reduce energy requirements by almost 2:1, with further gains from charge recovery.�

TMS Biophysics

366 of 482

Biphasic pulses stimulate cortex in two directions, a fraction of a ms apart.

TMS Biophysics

367 of 482

TMS Biophysics

Typical magnetic stimulators produce pulses of up to 8000 Amperes & up to 3000 Volts.�That’s 20,000,000 Watts for .0001 sec.�Doing this 10 times a second, things tend to heat up!�(And less than 1/1,000,000 of this energy gets into the brain. . .)

How much dB/dT do you need?

368 of 482

TMS and Neural Activity

Frequency and strength dictate effects

Low frequency stimulation (1-5 pulses per second) depress brain activity

Higher frequency stimulation (25 or more pulses per second) increases excitability.

Single pulse simply interferes with processing in a given area.

Sub-threshold stimulation does not lead to MEP whereas suber-threshold stimulation does.

369 of 482

Video

TMS “Unlocking the brain”:

370 of 482

Designing experiments for TMS

371 of 482

Design considerations

  1. Type of stimulation

  • Choosing control conditions

  • Targeting stimulation

  • Choosing parameters

  • Ethical considerations

372 of 482

Choosing a type of TMS

On-line stimulation occurs while the subject performs a task and the effects last for approximately the duration of stimulation.

Eg: Virtual lesions

Chronometrics

Functional connectivity

Off-line stimulation occurs without a task and the length of effect is typically measured in minutes.

Eg: 1Hz stimulation

Theta burst

373 of 482

Repetitive or chronometric?

Repetitive stimulation typically involves trains of three or more pulses (e.g 10Hz for 500msec)

    • Effect lasts approx. duration of stimulation
    • Don’t need to know exactly when to stimulate
    • Lots of pulses

Chronometric studies use either single or paired-pulses to examine the processing time course in a region

    • Requires far more trials!!!
    • Subjects tolerate stimulation better
    • How to best order trials?

374 of 482

Design considerations

  1. Type of stimulation

  • Choosing control conditions
    • Control sites
    • Control tasks
    • Control stimuli
    • Sham stimulation

  • Targeting stimulation

  • Choosing parameters

  • Ethical considerations

375 of 482

Control site: Vertex

376 of 482

Choosing another control site

377 of 482

Control task(s)

378 of 482

Sham TMS…

…is a sham

379 of 482

Design considerations

  1. Type of stimulation

  • Choosing control conditions

  • Targeting stimulation
    • Functional localizers
    • Anatomically guided: MRI based stereotaxy
    • Heuristics

  • Choosing parameters

  • Ethical considerations

380 of 482

fMRI-guided TMS

vOTC

LOC

381 of 482

Identifying corresponding positions on the subject and subject’s

MRI scan for registration

Frameless stereotaxy

382 of 482

Functionally localize w/ TMS

Rostral site

Task: Same category?

potato

+

turnip

Caudal site

Task: Rhyme?

vein

+

pane

41 ms*

52 ms*

383 of 482

Scalp coordinates

Rostral: -52, 35, -7 4 × 6cm

Caudal: -52, 15, 8 2 × 3cm

⇒ Mean distance in cortex of 2.3cm apart

⇒ Sites on scalp separated by 3.5cm, on average

MNI coordinates

Relative to C-T line

384 of 482

Design considerations

  1. Type of stimulation

  • Choosing control conditions

  • Targeting stimulation

  • Choosing parameters

  • Ethical considerations

385 of 482

Choosing parameters

  • Stimulation intensity / duration / rate
  • Inter-stimulation interval
  • Type of coil
  • Type of stimulation / stimulator
  • Accessibility
  • Number of trials per condition?
  • Number of subjects in a study?
  • Analysis methods?

386 of 482

Design considerations: MT

How do we set the Dial (Power Output):

Motor Threshold = Gauge

387 of 482

Design considerations: MT

Resting Motor Threshold

🡪 Typically ook for visible motor twitch 3/5 times

🡪 Custom program to find motor threshold (MUSC)

388 of 482

Design considerations: MT

Active Motor Threshold

The AMT is defined as the lowest stimulation intensity able to produce at least 5/10 MEPs greater than or equal to a 200 µV amplitude (above baseline).

Target Force

389 of 482

Design considerations: MT

Problem: Motor threshold is not ideal for calculating necessary stimulator output in other areas (i.e. PFC, IFG, Occ, etc.) (Kozel et al., 2000)

Luckily, MT technique overestimates necessary stimulation (not as sensitive as AMT), so researchers almost always ‘hit’ something…

390 of 482

Design considerations

  1. Type of stimulation

  • Choosing control conditions

  • Targeting stimulation

  • Choosing parameters

  • Safety / Ethical considerations

391 of 482

Practical Aspects of TMS Studies, Safety:�

What are the possible negative effects of TMS:

🡪 Headache, general Discomfort

🡪 Temporary hearing loss (140 dB sound)

🡪 Structural brain changes (inconsistent)

🡪 Histotoxicity (inconsistent)

🡪 Biologically transient effects (changes in TSH and blood lactate levels?)

392 of 482

Practical Aspects of TMS Studies, Safety:�

Suicide

🡪 This fear grew out of the depression literature, seems to just be coincidence.

Siezures

🡪 Out of 3000 studies in past 10 years, only 17 have reported seizures (12 had protocols that exceed safety guidelines, 4 neurological impairments, 1 jetlag?).

🡪 No recurring seizure disorder following Tx.

🡪 Scalp burns (occasionally with rTMS)

393 of 482

Practical Aspects of TMS Studies, Safety:�

Single Pulse vs multi-pulse TMS vs…

394 of 482

Practical Aspects of TMS Studies, Safety:�

395 of 482

Ethics of TMS

Although the risk is small, it is always present, so there is an obligation on the experimenter to always consider the value of a given experiment

    • How can you minimize risk & discomfort?
    • What is the minimal stimulation necessary?
    • Is the TMS information clear and consent informed?
    • Are subjects always screened?
    • Are the experimenters safety trained?
    • Are emergency procedures clear & in place?
    • Would YOU do this experiment?

Practical Aspects of TMS Studies, Safety:�

396 of 482

Ethics of TMS Article Break

397 of 482

A Highly-Eclectic Set of Examples from Different Areas of Neuroscience.

398 of 482

Estrogen level and MEP threshold (late follicular phase)�--courtesy of Eric Wassermann, NINCDS*

30

35

40

45

50

55

60

65

70

0

50

100

150

200

250

300

350

Estradiol concentration

MEP threshold

Effects of Endogenous Steroids:

399 of 482

Effects of Anticonvulsants on TMS Motor Responses

Sodium channel blockers raise the threshold to stimulation (depolarization gets harder):

Phenytoin, carbamazepine, valproate, lamotrigine

GABA augmenters increase intracortical inhibition: benzodiazepines, barbiturates, vigabatrin, baclofen, ethanol

400 of 482

Mean TMS motor thresholds for patients with primary generalized epilepsy before and after valproate Rx. Adapted from Reutens et al, Neurology 1993.

401 of 482

TMS can work where drugs don’t (Epilepsy, Depression, ??):

402 of 482

TMS Measuring Plasticity

403 of 482

TMS Hand Motor Cortex

Measure Thumb Movement

Train for 30 minutes…

Re-measure TMS evoked

Movements…

Change

Default

Thumb

Mvmt.

Dir.

404 of 482

Pascual-Leone et al., 1995:

Participants learn 5-finger piano exercise over 5 days

Motor Imagery 🡪 Leads to comparable (but non-equivalent) changes in

MI representations. (Pascual-Leon, 1995)

Liepert et al., 1999:

Paired repeated movement of thumb and foot

405 of 482

Enhancing Plasticity

406 of 482

  • Huang et al. (2005)
  • LTP
    • Intermittent theta burst stimulation (iTBS)
    • Three pulses at 50 Hz every 200 ms, simulating a theta like-rhythm.
    • Intensity = 80% resting MEP
    • 10 bursts grouped and repeated every 10 s
    • Total duration = 191.84 s resulting in 20 trains with 600 pulses.
    • Aprox. 60 min. of cortical excitation

  • LTD
    • Constant theta bust stimulation (cTBS)
    • 200 bursts delivered without interruption for duration of 40.04 s (600 pulses).
    • Suppresses MEP amplitudes and decreases intracortical facilitation for 20 - 60 min

Trans-cranial Magnetic Stimulation: Role of LTP/LDP in Plasticity

407 of 482

  • Jacobs and Donoghue (1991)
    • Horizontal connections between motor columns may allow coordinated movements of multiple muscles via:
      • Disynaptic inhibition
        • GABA-ergic inhibitory interneurons
      • Monsynaptic excitation
        • Glutamatergic excitatory pyramidal cells

The Role of Connections Intrinsic to M1 on Plasticity

408 of 482

  • Pyramidal Cells
    • Excitatory via glutamate
  • GABAergic interneurons
    • Inhibitory via GABA
  • GABA-A receptors
    • Located on pyramidal cell bodies and axons
    • Short intra-cortical inhibition of pyramidal cells lasting < 5ms
    • Competition between glutamate and GABA usually won by glutamate > 3ms
  • GABA-B receptors
    • Located on GABAergic interneurons
    • Long intra-cortical inhibition of GABAergic interneurons <150ms
    • Competition between GABA-B and GABA-A usually won by GABA-B, disinhibiting GABA-A.

Huerta and Volpe, 2009

Intracortical Inhibition of Motor Cortex: A Closer Look

409 of 482

  • Intra-cortical inhibition via GABAergic inhibitory inter-neurons within M1
    • Cash et al. (2009)
      • Disinhibtion of GABA-A receptors with GABA-B recepts

Priming

Stimulation

Intra-cortical

Inhibition

Intra-cortical

Disinhibition

Plasticity of M1: Intracortical Inhibition

410 of 482

411 of 482

Paired-Pulse and ICI/ICF

In humans, paired-pulse transcranial magnetic stimulation

(TMS) provides a noninvasive means of studying intracortical (cortico-cortical) excitability.

When a suprathreshold magnetic test stimulus is preceded by a subthreshold magnetic conditioning stimulus, the resulting motor evoked potential (MEP) may be inhibited or facilitated, depending on the interstimulus interval (ISI) (Kujirai et al. 1993). Generally, short ISIs (1–5 ms) produce inhibition of the test MEP, while longer ISIs (10–15 ms) produce facilitation of the test MEP (Kujirai et al. 1993; Chen et al. 1998).

SICI = Short interval intracotrical inhibition

LICI = Long interval intracorical inhibition ) (2 suprathreshold, 100 ms aart)

412 of 482

413 of 482

Paired-Pulse and ICI/ICF

414 of 482

415 of 482

Intracortical inhibition is fundamental in modulating motor cortex output during fine movements (Liepert et al., 1998; Stinear and Byblow, 2004), and excessive intracortical inhibition following stroke is believed to limit recovery by reducing dexterity (Butefisch et al., 2003; Butefisch et al., 2006).

Additionally, an abnormal balance between inhibitory and excitatory mechanisms has been implicated in neurodegenerative diseases that manifest with disordered movement (Ridding et al., 1995a; Hanajima et al., 1996; Hanajima and Ugawa, 2000).

Accordingly, it is probable that the age-associated changes in cortical excitability results in functional consequences, but further work is needed to better understand the specific implications of these the neurophysiologic changes.

416 of 482

417 of 482

  • Knowing the direction degree of intracortical plasticity may be advantageous…
  • Liepert et al. (2006)
    1. Optimal coil position for measuring MEPs at first dorsal interosseous muscle found
    2. Paired pulse of MT + 10% in all directions until no detectable MEP
    3. Center of Gravity (CoG) found
    4. Motor threshold calculated 1 cm anterior, posterior, medial, and lateral of CoG in order to measure intracortical inhibition (ICI).
    5. 4 weeks of Physical therapy
    6. CoG recalculated

***Plasticity consistently in direction of least ICI.

Light Box – Most ICI

Dark Box – Least ICI

Can we “lead” the plasticity in

Order to optimize functional

Recovery?

Plasticity of M1: Intracortical Inhibition

418 of 482

TMS Causality & Troublemaking (i.e. disruption)

419 of 482

Lateralized Speech Disruption�by rTMS�(See videos at http://sites.google.com/site/chipstein/videos )

420 of 482

The areas where TMS produces muscle contraction and speech disruption are plotted on a grid, along with their centers of gravity.

421 of 482

The grid is projected onto anatomic MRI.

422 of 482

H = hand area, F = face area, S = specch arrest area

423 of 482

Conclusion: The easiest site for magnetic speech arrest is at or near the motor cortex.��Although TMS speech effects have also been reported (once) at a more anterior site, possibly compatible with Broca’s area, they were substantially more difficult to obtain.

424 of 482

The McGurk Effect

425 of 482

426 of 482

Tools:

•Neuronavigation to area of fMRI BOLD activation

•Single-Pulse TMS (more elegant, more tolerable, and less regulated than fast repetitive TMS (rTMS).

427 of 482

428 of 482

429 of 482

430 of 482

There is a tight linkage between sensory (auditory) and motor . Especially during music processing (i.e. learning).

TMS Studies – single pulse vs paired-pulse

431 of 482

TMS in Rodents:

Luft et al., 2001

432 of 482

TMS in Rodents:

cTMS vs iTMS(TBS)

Intermittent better for learning, results in neural changes.

433 of 482

TMS in Rodents:

…reduction in paired-pulse inhibition seen normally with short (15 msec) interpulse intervals and an increase in paired-pulse potentiation seen with longer interpulse intervals. (net result is increase in excitability)

…reduction in paired-pulse inhibition seen normally with short (15 msec) interpulse intervals and an increase in paired-pulse potentiation seen with longer interpulse intervals. (net result is decrease in excitability)

7 Day treatment regimen…

434 of 482

1960 - Bindman

tDCS Background

Demonstrates that 0.25 microamp/mm^2 applied via surface electrodes could influence spontaneous activity in the evoked response of neurons for hours following just minutes of stimulation (in rats)

435 of 482

tDCS Anatomy of a System

Transcranial Direct Current Stimulation (tDCS)

(1mA Electric Currents)

Reduction in Resting Membrane Potential

(Anode Stimulation)

OR

Increase in Resting Membrane Potential

(cathode Stimulation)

436 of 482

Our tDCS units

  • Our tDCS units designed for iontophoresis.
  • Can deliver up to 4mA: contemporary studies do not exceed 2mA.
  • Disposable sponge electrodes.
  • Optional USB system can ensure double blind research.

436

437 of 482

Common aspects of these treatments

  • Suprathreshold stimulation
  • Phenomenological effects

DCS

  • Subthreshold stimulation
  • Physiological effects

438 of 482

Cortical DC-stimulation of the rat: effects during stimulation …

Bindman et al. 1964

Baseline

cathodal

anodal

439 of 482

…and after-effects

440 of 482

After-effects are protein synthesis-dependent

Gartside 1968

cycloheximide

tetracycline

441 of 482

…and accompanied by specific biochemical alterations

dark neurones

3 h stimulation

30 min

Islam et al. 1995,

Hattori

et al. 1990

442 of 482

50% of transcranially applied direct currents reach the brain

- calculations on realistic head models, validation in animal experiments (Rush & Driscoll 1968)

- validation in humans (Dymond et al. 1975)

443 of 482

Polarity-dependent excitability-modulation during tDCS

Nitsche & Paulus 2000

m-

cS

kS

prm

m

pom

oc

Electrode positions

:

m = motor

cortex

;

prm

=

premotor

cortex

;

pom

= post-motor

cortex

;

oc

=

occipital

; cS = contralateral

forehead

; cm =

kontralateral

motor

cortex

cm

0.5

0.75

1.0

1.25

1.5

MEP-Amplitude

with

/

without

tDCS

*

*

anodal

stimulation

cathodal

stimulation

444 of 482

Enhanced motor cortico-spinal excitability after anodal stimulation

Nitsche & Paulus 2001

13 min

5 min

0.9

1.1

1.3

1.5

1.7

1

5

10

15

20

25

30

35

40

45

50

55

60

90

120

150 min

9 min

7 min

11 min

Stimulation

duration

MEP

size

after

current stmulation

/

baseline

Time after

current

stimulation

445 of 482

Reduced motor cortico-spinal excitability after cathodal tDCS

0.4

0.6

0.8

1.2

1

5

10

15

20

25

30

35

40

45

50

55

60

90

120 min

1.0

Nitsche et al. 2003

Time after

current

stimulation

MEP

size

after

current stmulation

/

baseline

446 of 482

tDCS How it Works: Neural Effects / Changes

  • Transcranial Direct Current Stimulation
    • Fritsch et al. (2010)
      • No intracortical disinhibition
      • Early Vs. Late LTP
        • NMDA Mg block
        • BDNF release and TrkB activation
          • BDNF dependent
          • Activity Dependent
    • Carvalho et al. (2008)
      • BDNF
        • Phosphorylates NMDA receptors
        • Catalyzes production of Plasticity proteins

447 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

448 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

449 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

450 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

451 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

452 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

453 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

454 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

455 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

456 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

457 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

458 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

459 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

460 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

461 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

462 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

463 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

464 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

465 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

466 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

467 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

468 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

469 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

470 of 482

High Definition tDCS and Tissue Modeling:�

Bikson, CUNY New York

471 of 482

Typical design

  • Convention is to conduct behavioral task during and/or immediately after stimulation.
  • E.G. Dockery reports that prefrontal tDCS polarity influences learning of Tower of London task – with effects seen 6-12 months later.

Dockery et al. (2009)

472 of 482

Theoretical safety concerns

  • Potential side effects with tDCS
      • electrode-tissue interface could lead to skin irritation and damage.
      • Stimulations could lead to excitotoxic firing rates.
      • Tissue damage due to heating.
  • Rat studies suggest injury only when current density is several orders of magnitude beyond those used in humans (Liebetanz et al. 2009).
  • Standard doses in humans does not appear to alter serum neuron specific enolase (NSE), a sensitive marker of neuronal damage (Nitsche et al, 2003).
  • Datta (2009) heating in humans is negligible.

472

473 of 482

Practical safety concerns

  • Subtle but common side effects
      • Nitsche et at. (2003) reports that in more than 500 participants the only side effects are initial scalp tingling or sensation of a light flash.
      • Some studies suggest that higher current densities can lead to skin irritation.
      • If cognitive effects are prolonged, perhaps we should warn participants about driving or other hazardous tasks after a treatment session.
        • Koenigs (2009) note one neurologically healthy participant reported a couple hours dysphoria following cathodal tDCS.

474 of 482

tDCS vs TMS

  • Transcranial magnetic stimulation
    • Relatively expensive (~$50,000).
    • Moderate sized effects (e.g. mild speech arrest).
    • Safe, but there are reports of inducing seizures when high amplitude and frequency are combined.
    • Causes resting neurons to fire.
      • Very brief pulse stops interrupts processing for ~30ms, �can be used repetitively.
      • Depending on frequency, sustained TMS �can induce excitability reduction (long-term �depression) or enhancements (long-term �potentiation) that can persist for hours or days.

474

475 of 482

tDCS vs TMS

  • Transcranial direct current stimulation
    • Very inexpensive (~$250 for iontophoresis unit).
    • Believed to be exceptionally safe.
    • Does not cause resting neurons to fire (Purpura and McMurtry, 1965; Terzuolo and Bullock,1956).
    • Believed to modulate the firing rate of active neurons.
      • Depending on polarity, tDCS can induce cortical excitability reduction or enhancement can �persists for hours.

475

476 of 482

tDCS vs TENS

  • Transcutaneous Electrical Nerve Stimulation systems are used to treat pain.
  • TENS pulsed 2-160Hz, 5-80 mA.

476

  • At slow frequency and high amplitude TENS induces muscle contraction.
  • In contrast, tDCS uses constant 1-2mA.

477 of 482

Combining TMS and TDCS: Differing Plastic Mechanisms

478 of 482

tDCS of the parietal cortex did not change the excitability of the motor cortex at rest, ipsi- and contralaterally to its application.

Such a lack of effects may be due to a state-dependent factor of the parieto-motor network: … its susceptibility to neuromodulatory effects of tDCS, may emerge only for specific processes such as motor imagery and, to a lesser extent, action observation, which definitely engage this area.

479 of 482

New functional architecture (EARLY)

New functional architecture (LATE)

NORMALIZE as much as possible

Enhanced Plasticity

Modulate Plasticity

PT

+/-

+/-

Dendritic branching

Synaptogenesis

hyperexcitability

PT

Motor Learning and Functional Recovery

480 of 482

Rehabilitation and Brain Plasticity Enhancing Interventions

10 days 17 days 24 days 31 days 90 days

Normal

Functional recovery is accompanied by changes in Brain activity which can be measured using fMRI.

Motor Learning and Functional Recovery

481 of 482

GUEST LECTURE:

Artificial Intelligence in Neuroimaging

For PowerPoint Click → HERE

For PDF Click → HERE

482 of 482

Questions?