Data Carpentry workshops are for any researcher who has data they want to analyze , and no prior computational experience is required. This hands-on workshop teaches basic concepts, skills and tools for working more effectively with data.
The focus of this workshop will be on working with genomics data and data management and analysis for genomics research. We will cover metadata organization in spreadsheets, data organization, connecting to and using cloud computing, the command line for sequence quality control and bioinformatics workflows, and R for data analysis and visualization. We will not be teaching any particular bioinformatics tools, but the foundational skills that will allow you to conduct any analysis and analyze the output of a genomics pipeline.
Participants should bring their laptops and plan to participate actively. By the end of the workshop learners should be able to more effectively manage and analyze data and be able to apply the tools and approaches directly to their ongoing research.
Data Carpentry's aim is to teach researchers basic concepts, skills, and tools for working with data so that they can get more done in less time, and with less pain.
One dataset will be used throughout the workshop. We will start by introducing the dataset and the steps we'll go through for analysis.
In this workshop we're using data from Blount et al 2012 paper from Dr. Richard Lenski's Long Term Evolution Experiment.
All the software and data used in the workshop is on an Amazon AMI.
If you want to run your instance of the server used for this workshop, launch a t2.medium instance with AMI in the N. Virginia region ami-6516b30e, available under "Community AMIs" in the Amazon EC2 Management Console.
Module 1: Workshop Introduction
Module 2: Data tidiness
Module 3: Using cloud computing for genomics
Module 4: Introduction to the command line
Module 5: Data wrangling and processing
Module 5: R for data analysis and visualization
Data Carpentry's teaching is hands-on, so participants are encouraged to use their own computers to insure the proper setup of tools for an efficient workfl\ ow. These lessons assume no prior knowledge of the skills or tools, but working through this lesson requires working copies of the software described. To most effectively use these materials, please make sure to install everything before working through this workshop.