Early Career Data Scientist¶
Description: A collection of videos, tutorials, training materials, and exercises targeted towards any entry-level, early-career trainee interested in learning basic skills in data science.
Preparation: no advance preparation is required.
1. Data Science Ethics¶
Videos¶
6 videos available here
2. Overview: What is Data Science¶
Videos¶
- IBM OpenDS4All What is Data Science? with Yucen Wang - Part I
- IBM OpenDS4All What is Data Science? with Yucen Wang - Part II
3. Understand and Appreciate Open and FAIR Data¶
Article to read¶
Exercises¶
- Create an ORCID
- Create wikidata entry about yourself and link to other projects if applicable
- Share past work on FigShare/Zenodo, etc
4. Learn GitHub¶
Getting started¶
- Create a GitHub account, see https://docs.github.com/en/get-started/signing-up-for-github/signing-up-for-a-new-github-account
- Download and install GitHub Desktop
Tutorials¶
Introduction to GitHub¶
GitHub Issues¶
Exercises¶
- Help improve this pathway! Make edits to this OBO Academy page and make a pull request. (For example, find typos to fix, add or revise content to this document, etc.)
- Create a GitHub website by forking this repository: https://github.com/laderast/academic_site_workshop
5. Learn command line¶
Tutorials¶
Note: for the tutorials below PC users need to install ODK (instructions are linked from the tutorial)
Alternatively, PC users can download Git Bash
- Tutorial: Very (!) short introduction to the command line for ontology curators and semantic engineers: Part 1
- Tutorial: Very (!) short introduction to the command line for ontology curators and semantic engineers: Part 2
6. Introduction to Ontologies¶
Articles to read¶
Videos¶
- An Introduction to Ontologies by Mark Musen, Stanford University (~15 min)
- Introduction to Biomedical Ontologies #1: What is an Ontology?, by Jennifer Smith, Rat Genome Database (~15 min)
- Using ontologies to standardize rare disease data collection, by Nicole Vasilevsky, C-Path (1 hr)
Tutorials¶
7. Basic Data Management¶
Videos¶
- Data Preparation and Planning
- https://dmice.ohsu.edu/bd2k/demo/BDK12-2/presentation_html5.html
- https://dmice.ohsu.edu/bd2k/demo/BDK12-3/presentation_html5.html
- Data sharing snafu: Data Sharing and Management Snafu in 3 Short Acts
Article to read¶
- 10 Simple Rules for the Care and Feeding of Scientific Data
- Big Data: The Future of Biocuration
- A primer on data sharing
- Identifiers for the 21st century: How to design, provision, and reuse persistent identifiers to maximize utility and impact of life science data
- Reproducible and reusable research: Are journal data sharing policies meeting the mark?
Exercise¶
8. Preparing your CV and Tracking Your Contributions¶
Video¶
Workshop from Biocuration: Workshop - Careers In Biocuration
Articles¶
Is authorship sufficient for today’s collaborative research? A call for contributor roles