We facilitate and develop lessons for Data Carpentry workshops. These lessons are distributed under the CC-BY and are free for re-use or adaptation, with attribution. We’ve had people use the lessons in courses, to build new lessons or use them for self-guided learning.

Data Carpentry workshops are domain-specific, so that we are teaching researchers the skills most relevant to their domain and using examples from their type of work. Therefore we have several types of workshops and curriculum is organized by domain.

Workshop materials

Workshop materials under development or consideration

Semester materials


Ecology Curriculum

This workshop uses a tabular ecology dataset from the Portal Project Teaching Database and teaches data cleaning, management, analysis and visualization. There are no pre-requisites, and the materials assume no prior knowledge about the tools. We use a single dataset throughout the workshop to model the data management and analysis workflow that a researcher would use.

The Ecology workshop can be taught using R or Python as the base language.

Lessons

Lesson Site Repository Reference Instructor Guide Maintainer(s)
Ecology Workshop Home Page     Karen Cranston, Aleksandra Pawlik, Karthik Ram, Tracy Teal, Ethan White
Data Organization in Spreadsheets Christie Bahlai, Tracy Teal, Peter R. Hoyt
Data Cleaning with OpenRefine Deborah Paul, Cam Macdonell
Data Management with SQL Timothée Poisot, Rémi Rampin, Donal Heidenblad
Data Analysis and Visualization in R François Michonneau, Auriel Fournier, Ana Costa Conrado, Brian Seok
Data Analysis and Visualization in Python April Wright, Tania Allard

Genomics Curriculum

Overview

The focus of this workshop is on working with genomics data and data management and analysis for genomics research. It covers data management and analysis for genomics research including: best practices for organization of bioinformatics projects and data, use of command line utilities, use of command line tools to analyze sequence quality and perform variant calling, and connecting to and using cloud computing.

Please note that workshop materials for working with Genomics data in R are under development and will become available in June 2018.

Lessons

Lesson Site Repository Reference Instructor Guide Maintainer(s)
Genomics Workshop Home Page     Erin Becker
Project organization and management Roselyn Lemus, Yujuan Gui, Mateusz Kuzak, Rayna Harris, Peter Hoyt
Introduction to the command line Shichen Wang, Anita Schürch, Bastian Greshak, Sue McClatchy
Data wrangling and processing Josh Herr, Ming Tang, Fotis Psomopoulos, Malvika Sharan
Introduction to cloud computing for genomics Bob Freeman, Darya Vanichkina, Kevin Buckley, Amanda Charbonneau
Data analysis and visualization in R *beta* Naupaka Zimmerman, Ahmed Moustafa, Krzysztof Poterlowicz, Jason Williams

Materials under development

Social Science Curriculum

These materials are scheduled for release and will be available for teaching in May 2018.

Lessons

Lesson Site Repository Reference Instructor Guide Maintainer(s)
Social Science Workshop Homepage     TBA
Data Organization in Spreadsheets for Social Scientists Chris Prener
Data Cleaning with OpenRefine for Social Scientists Geoff LaFlair
Data Management with SQL for Social Scientists Peter Smyth
Data Analysis and Visualization with R for Social Scientists Juan Fung
Data Analysis and Visualization with Python for Social Scientists Stephen Childs

Geospatial Data Workshop

These materials are scheduled for release and will be available for teaching in July 2018.

Overview

This workshop is co-developed with the National Ecological Observatory Network (NEON). It focuses on working with geospatial data - managing and understanding spatial data formats, understanding coordinate reference systems, and working with Raster and Vector data in R for analysis and visualization.

Join the geospatial curriculum email list to get updates and be involved in conversations about this curriculum.

Lessons

Lesson Site Repository Reference Instructor Guide Maintainer(s)
Geospatial Workshop Homepage     Leah Wasser, Joseph Stachelek, Tyson Swetnam, Lauren O'Brien, Janani Selvaraj, Lachlan Deer, Chris Prener, Juan Fung
Geospatial Project Organization and Management   Leah Wasser, Joseph Stachelek, Tyson Swetnam, Lauren O'Brien, Janani Selvaraj, Lachlan Deer, Chris Prener, Juan Fung
Introduction to R for Geospatial Data   Leah Wasser, Joseph Stachelek, Tyson Swetnam, Lauren O'Brien, Janani Selvaraj, Lachlan Deer, Chris Prener, Juan Fung
R for Raster and Vector Data   Leah Wasser, Joseph Stachelek, Tyson Swetnam, Lauren O'Brien, Janani Selvaraj, Lachlan Deer, Chris Prener, Juan Fung

Materials in Early development

These materials are at the initial stages of development, identifying the core concepts to teach and piloting materials.

Digital Humanities Curriculum

Many groups are piloting different versions of this curriculum. There is not yet one set of lessons under active development.

If you are interested in following or being involved in development of this curriculum, please sign up for the dh-curriculum email list

Image analysis Curriculum

Groups at Stanford, Doane College and attendees of the ImageXD meeting have piloted ideas for curriculum in teaching image analysis. There is not yet one set of lessons under active development. Development is planned for 2018.

If you are interested in following or being involved in development of this curriculum, please sign up for the image-analysis-curriculum email list

Economics Curriculum

There is initial interest on economics curriculum. Development is planned for 2018.

If you are interested in following or being involved in development of this curriculum, please sign up for the economics-curriculum email list

Astronomy Curriculum

Development of a Data Carpentry lesson immediately aimed at astronomy, but which can easily be extended to other physics based disciplines. American Institute of Physics/Member Society Venture Partnership funding is supporting the development and testing of the lesson. Lesson development will begin the AAS hack day and will continue throughout the next two years. If you are interested in contributing in any way, please join the astronomy-curriculum email list. We would especially like to encourage anyone who is part of an AIP member society (Acoustical Society of America, American Association of Physicists in Medicine, American Association of Physics Teachers, American Astronomical Society, American Crystallographic Association, American Meteorological Society, American Physics Society, AVS: Science & Technology of Materials, Interfaces, and Processing, The Optical Society, and the Society of Rheology) to join as we are eager to develop lessons that can be easily extended into these sub-fields.

Other curriculum

If you are interested in developing other lessons or getting updates on other topics, see the lessons ideas github repository for topics under consideration or discussion, or to propose new ideas.

Semester materials

Biology Semester-long Course

The Biology Semester-long Course was developed and piloted at the University of Florida in Fall 2015. Course materials include, readings, lectures, exercises and assignments that expand on the material presented at workshops focusing on SQL and R. The course is accessible to: