Amplicon sequencing pipeline¶
This document describes the process of going from raw 16S or ITS data to
processed data (OTU tables, oligotypes, etc.) using the scripts in the
Alm lab’s processing pipeline.
Most of the 16S and ITS processing steps are orchestrated by
raw2otu.py. However, the larger pipeline platform is designed in such
a manner that the user interfaces with a single script,
Master.py, which takes
as input the path to a folder containing the dataset and a machine-readable file
called a summary file. The summary file tells
Master.py what type of
processing to do (e.g. 16S or ITS) and what steps to do for each type of processing.
Currently, the pipeline can only process 16S and ITS data.
The format of these summary files, and all files required for 16S or ITS processing,
are described in Preparing your dataset for processing.
- Preparing your dataset for processing
- Running the pipeline
- Pipeline output files and directories
- Description of pipeline OTU tables
- Source code