1 of 7

Programming in R

Data Analysis in Genome Biology

GEN242

1

Thomas Girke

April 19, 2018

GEN

242

2 of 7

Outline

Some Unfinished topics

GitHub from command-line and RStudio

Running R in a terminal on local and remote systems

Programming in R

2

GEN

242

3 of 7

Some Unfinished Topics

These are some shorter topics we need to finish up:

  • Git & GitHub from command-line - URL
  • Git & GitHub from RStudio - URL
  • Running R in a terminal on local or remote systems - URL

3

GEN

242

4 of 7

Outline

Some Unfinished topics

GitHub from command-line and RStudio

Running R in a terminal on local and remote systems

Programming in R

4

GEN

242

5 of 7

Why Programming in R?

  • Powerful statistical environment and programming language
  • Facilitates reproducible research
  • Efficient data structures make programming very easy
  • Ease of implementing custom functions
  • Powerful graphics
  • Access to fast growing number of analysis packages
  • Most widely used language in bioinformatics
  • Is standard for data mining and biostatistical analysis
  • Technical advantages: free, open-source, available for all OSs

5

GEN

242

6 of 7

Online Programming Tutorial

  • Continue here!

6

GEN

242

7 of 7

References and Books

Huber W, Carey VJ, Gentleman R, Anders S, Carlson M, Carvalho BS, …, et al (2015) Orchestrating high-throughput genomic analysis with Bioconductor. Nat Methods 12: 115–121

7

GEN

242