Setting | Description |
---|---|
Keyword Length [ ] Bases | The minimum length for keywords. The default value is 16 bases. |
Long Keyword >= [x] Bases | When a keyword is long because of sequence region without a homopolymer (three or more identical nucleotides), then the keyword can be divided into a smaller size. If the keyword length exceeds the specified value (60 bases is the default value), then it is parsed into multiple keywords at locations with base sequences of AAT or ATT. |
Frequency <= [x] Counts and <= [y%] or [z%] | Indicates the count and percentage at which a variation between reads within a single cluster is corrected. If there are less than “x” reads and less than y% of the reads show a variation, then the variation is corrected. If there are more than “x” reads that contain the variation, then the frequency of the variation must be below z% to be corrected. |
Combine Both Forward and Reverse | Allows the Error Correction Tool to use reverse complement sequences to calculate variation frequencies. Selecting this option helps to distinguish true SNPs from instrument errors. |