kreeq
The standard notation for using kreeq is as follows:
It accepts multiple read files as input, separated by space. The two modes we will be using today arevalidate
and union
.
To check out all options and flags use:
Let's get some test files first:
We will test some typical usage with the files moved from the testFiles
folder, e.g.:
kreeq validate -f random1.fasta -r random1.fastq
kreeq validate -f random2.fasta -r random1.fastq random2.fastq
Importantly, the kreeq database can only be computed once on the read set, and reused for multiple analyses to save runtime:
Similarly, kreeq databases can be generated separately for multiple inputs and combined, with increased performance in HPC environments:
kreeq validate -r random1.fastq -o random1.kreeq
kreeq validate -r random2.fastq -o random2.kreeq
kreeq union -d random1.kreeq random2.kreeq -o union.kreeq
kreeq validate -f random1.fasta -d union.kreeq
Now working with real sequencing data:
Let's start by running gfastats
to get a sense of what we are evaluating:
Now we are ready to run kreeq: