Lindsay Clark edited this page Dec 9, 2015 · 1 revision

barcode_splitter.py is a program for splitting one FASTQ file into multiple files according to barcode.

Although tagdigger_interactive.py works with libraries from any enzyme combination, barcode_splitter.py currently supports only PstI-MspI and NsiI-MspI. Available adapter sequences include those from Poland et al. (2012) and those used in the Sacks lab, designed by Megan Hall. New enzyme combinations and adapters can be added by editing tagdigger_fun.py, which I am happy to do upon request.

Input

Two input types are required for barcode_splitter.py:

(1) FASTQ files

(2) Barcode key: Similar to the format for tagdigger_interactive.py, but with column headers "Input File", "Barcode", and "Output File".
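As an illustration of the barcode key layout, here is a minimal sketch that parses a key with the three column headers named above. It assumes the key is a comma-separated file; the file names, barcodes, and the `read_barcode_key` helper are hypothetical and are not part of barcode_splitter.py.

```python
import csv
import io
from collections import defaultdict

# Hypothetical key contents; file names and barcodes are made up for illustration.
EXAMPLE_KEY = """Input File,Barcode,Output File
lane1.fastq,ACGT,sample_A.fastq
lane1.fastq,TGCAA,sample_B.fastq
lane2.fastq,ACGT,sample_C.fastq
"""

def read_barcode_key(handle):
    """Return {input file: {barcode: output file}} from a CSV barcode key."""
    key = defaultdict(dict)
    for row in csv.DictReader(handle):
        # Upper-case the barcode so that lookups are case-insensitive (an assumption).
        key[row["Input File"]][row["Barcode"].upper()] = row["Output File"]
    return dict(key)

key = read_barcode_key(io.StringIO(EXAMPLE_KEY))
print(key["lane1.fastq"])
```

Grouping rows by input file reflects the idea that each input FASTQ file is split into several output files, one per barcode.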

Output

The output FASTQ files are uncompressed, with barcodes, adapter sequence, and potentially chimeric sequence clipped out. The comment line for each read has the barcode appended to it.
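To make the barcode clipping and comment-line behavior concrete, the sketch below handles a single read. It is not the code used by barcode_splitter.py: the `split_read` function is hypothetical, it assumes the "comment line" is the first line of the FASTQ record and that the barcode is appended after a space, and it leaves out adapter and chimera clipping entirely.

```python
def split_read(record, barcodes):
    """Assign one FASTQ record to a barcode and clip the barcode off.

    record   -- (header, sequence, plus, quality) tuple of strings
    barcodes -- iterable of barcode strings
    Returns (barcode, clipped_record), or None if no barcode matches.
    """
    header, seq, plus, qual = record
    # Try longer barcodes first so a short barcode that is a prefix
    # of a longer one does not win by accident (a design assumption).
    for bc in sorted(barcodes, key=len, reverse=True):
        if seq.startswith(bc):
            n = len(bc)
            # Clip the barcode from both sequence and quality string,
            # and append it to the header (separator is an assumption).
            return bc, (header + " " + bc, seq[n:], plus, qual[n:])
    return None

read = ("@read1", "ACGTTGCAGGG", "+", "IIIIIIIIIII")
print(split_read(read, ["ACGT", "TTTT"]))
```

Keeping the barcode in the header means the sample of origin can still be traced after the barcode has been removed from the sequence.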

Optionally, the barcode splitter can also generate a CSV file listing MD5 checksums for each output FASTQ file.
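A checksum listing of this kind could be produced, or independently verified, with the Python standard library. This is a sketch only: the function names and the "File" and "MD5" column headers are assumptions, and the CSV actually written by barcode_splitter.py may be laid out differently.

```python
import csv
import hashlib

def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large FASTQ files need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

def write_md5_csv(fastq_paths, csv_path):
    """Write one row per FASTQ file: its path and its MD5 checksum."""
    with open(csv_path, "w", newline="") as out:
        writer = csv.writer(out)
        writer.writerow(["File", "MD5"])  # hypothetical column names
        for path in fastq_paths:
            writer.writerow([path, md5_of_file(path)])
```

Comparing these digests against a listing made earlier is a quick way to confirm that files were not altered or truncated during transfer.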
