Published using Google Docs
Savonen CV 2024
Updated automatically every 5 minutes

Savonen page

Oct 16th 2024

Candace Savonen, M.S.

Data Scientist

Website: cansavvy.com 

Code-based projects: github.com/cansavvy 

csavonen@fredhutch.org

SKILLS

EXPERIENCE

Fred Hutchinson Cancer Center - Data Scientist

June 2022 - PRESENT

Johns Hopkins University - Research Associate

April 2021 - June 2021

Childhood Cancer Data Lab - Biological Data Analyst

Sept 2018 - April 2021

Michigan State University - Masters of Science

June 2015 - Aug 2018

Wayne State University - Research Assistant

May 2013 - June 2015


PUBLICATION SUMMARY

PUBLICATIONS

1.         Savonen C, Wright C, Hoffman A, Humphries E, Cox K, Tan F, et al. Motivation, inclusivity, and realism should drive data science education. F1000Research. 2024;12: 1240.

2.         Afiaz A, Ivanov A, Chamberlin J, Hanauer D, Savonen C, Goldman MJ, et al. Evaluation of software impact designed for biomedical research: Are we measuring what’s meaningful? arXiv [cs.SE]. 2023. Available: http://arxiv.org/abs/2306.03255

3.         Afiaz A, Ivanov AA, Chamberlin J, Hanauer D, Savonen CL, Goldman MJ, et al. Best practices to evaluate the impact of biomedical research software-metric collection beyond citations. Bioinformatics. 2024;40. doi:10.1093/bioinformatics/btae469

4.         Humphries EM, Wright C, Hoffman AM, Savonen C, Leek JT. What’s the best chatbot for me? Researchers put LLMs through their paces. Nature. 2023 [cited 16 Oct 2024]. doi:10.1038/d41586-023-03023-4

5.         Shapiro JA, Gaonkar KS, Savonen CL, Spielman SJ, Bethell CJ, Jin R, et al. OpenPBTA: An Open Pediatric Brain Tumor Atlas. 2022. doi:10.1101/2022.09.13.507832

6.         Dang MT, Gonzalez MV, Gaonkar KS, Rathi KS, Young P, Arif S, et al. Macrophages in SHH subgroup medulloblastoma display dynamic heterogeneity that varies with treatment modality. Cell Reports. 2021;34: 108917.

7.         Casey S. Greene, Dongbo Hu, Richard W. W. Jones, Stephanie Liu, David S. Mejia, Rob Patro, Stephen R. Piccolo, Ariel Rodriguez Romero, Hirak Sarkar, Candace L. Savonen, Jaclyn N. Taroni, William E. Vauclain, Deepashree Venkatesh Prasad, Kurt G. Wheeler. Refine.Bio. In: Refine.bio [Internet]. 2018 [cited 16 Oct 2024]. Available: https://www.refine.bio/

8.         Kochmanski J, Savonen C, Bernstein AI. A Novel Application of Mixed Effects Models for Reconciling Base-Pair Resolution 5-Methylcytosine and 5-Hydroxymethylcytosine Data in Neuroepigenetics. Frontiers in Genetics. 2019;10. doi:10.3389/fgene.2019.00801

9.         Saad MH, Savonen CL, Rumschlag M, Todi SV, Schmidt CJ, Bannon MJ. Opioid Deaths: Trends, Biomarkers, and Potential Drug Interactions Revealed by Decision Tree Analyses. Frontiers in Neuroscience. 2018;12: 728.

10.         Saad MH, Rumschlag M, Guerra MH, Savonen CL, Jaster AM, Olson PD, et al. Differentially expressed gene networks, biomarkers, long noncoding RNAs, and shared responses with cocaine identified in the midbrains of human opioid abusers. Scientific Reports. 2019;9. doi:10.1038/s41598-018-38209-8

11.         Bannon MJ, Savonen CL, Hartley ZJ, Johnson MM, Schmidt CJ. Investigating the potential influence of cause of death and cocaine levels on the differential expression of genes associated with cocaine abuse. PLoS One. 2015;10: e0117580.

12.         Bannon MJ, Savonen CL, Jia H, Dachet F, Halter SD, Schmidt CJ, et al. Identification of long noncoding RNAs dysregulated in the midbrain of human cocaine abusers. J Neurochem. 2015;135: 50–59.

OTHER MEDIA

COMMITTEES

SOFTWARE

TEACHING

        github: https://github.com/fhdsl/Tools_for_Reproducible_Workflows_in_R 

The ITN is a collaborative effort of researchers to catalyze informatics research through training opportunities (NCI UE5CA254170). ITN Courses are published on Leanpub and Coursera and on their own Bookdown websites.

Published ITN Courses:

A course to cover the basics of creating documentation and tutorials to maximize the usability of informatics tools.

github: https://github.com/jhudsl/Documentation_and_Usability 

Equip learners with reproducibility skills they can apply to their existing analyses scripts and projects. This course opts for an “ease into it” approach.

github: https://github.com/jhudsl/Adv_Reproducibility_in_Cancer_Informatics 

To equip learners with a deeper knowledge of the capabilities of reproducibility tools and how they can apply to their existing analyses scripts and projects.

github: https://github.com/jhudsl/Reproducibility_in_Cancer_Informatics

 

To help learners find resources and tools to help them process and interpret their genomic data.

github: https://github.com/fhdsl/Choosing_Genomics_Tools 

This course walks through why’s and the how’s for using automation to boost scientific software development process. It’s meant for folks who already have a basic familiarity with GitHub but would like to automate more of their software dev work.

                github: https://github.com/fhdsl/GitHub_Automation_for_Scientists 

This course walks through why’s and the how’s for using containers (e.g. Docker) to boost reproducibility of scientific analysis.

github:https://github.com/fhdsl/Containers_for_Scientists 

A self-guided tutorial to help users analyze processed gene expression data from refine.bio repository.

github: https://github.com/AlexsLemonade/refinebio-examples 

A short format workshop (3 – 5 days) to introduce pediatric cancer researchers to the basics of single-cell and bulk RNA-seq data analysis.

github: https://github.com/AlexsLemonade/training-modules

PRESENTATIONS 

2024

2023

2022

2021

2020

2019

POSTER PRESENTATIONS 

PERSONAL STATEMENT 

My interest is in making data science tools more easily attainable to those who are looking to impactfully apply them to their areas of knowledge and background. I am passionate about creating educational materials which emphasize reproducibility and using scalable methods to disseminate educational materials. I believe that as a part of the data science community, we need to work to become a more inclusive work environment. This would not only create better science, but would widen the circle for individuals who are currently underrepresented in data science. I have been involved in creating and delivering bioinformatic education materials for cancer genomics. My neuroscience background has helped me empathize with researchers who are looking to bridge the data science knowledge gap.

Keywords

Bioinformatics, Data Science, Data Analysis, Education, Reproducibility, Gene Expression, Genomics