Skip to main content

Command Line Reference

pipeline

This is the subcommand that will kick off the tool. This will always follow directly after the call to the Lancet executable

./Lancet2 pipeline --help
./Lancet2 pipeline -t /path/to/tumor.bam -n /path/to/normal.bam -r /path/to/ref.fasta -o /path/to/out_prefix

General Options:

-h, --help

Bring up the standard help output. This will give examples of options the user can change to customize a run.

-v, --version

Print the version information for the build

Required Arguments:

A standard run of Lancet will provide a tumor bam file, a normal bam file, the reference fasta, and an output path to place the outputted vcf file.

./lancet pipeline -t /path/to/tumor.bam -n /path/to/normal.bam -r /path/to/ref.fasta -o /path/to/out.vcf

-t, --tumor

Provide the path to the tumor BAM/CRAM file(s). Index for this file should also be in the same directory

-n, --normal

Provide the path to normal BAM/CRAM file(s). Index for this file should also be in the same directory

-r, --reference

Provide the path to the reference fasta file. Index for this file should also be in the same directory

-o, --out-vcfgz

Output path to the compressed VCF file

Optional Arguments:

These arguments allow for more fine-tuned control of the tool. If not provided, default values will be assigned

--graphs-dir

This tag allows you to define the output path for dumping serialized graphs from a run. If this option is not utilized, there will be no outputted graphs.

Regions

These options will allow you to play around with what the tool looks at.

-R, --region

Allows the user to define what region(s) the tool should run on. Should be of format chr:start_pos-end_pos and multiple regions will be separated by a comma. For example, this will indicate that the tool will run on chromosome 2 between positions 33091000 and 33092000 as well as chromosome 18 between positions 5729000 and 5759000:

... --region 2:33091000-33092000,18:5729000-5759000 ...

If no region(s) specified, the tool will default to run on the whole genome provided

-b, --bed-file

Path to bed file that will be used to define which region the tool will run on. Bed file should be a tab delimited file in which the first column is the chromosome, the second column is the start position, and the third column is the end position. For example, a bed file like this specifies chromosomes 1 and 2 between positions 4 and 239410 and 892 and 1029348 respectively:

1   4   239410
2 892 1029348

If no region(s) specified, the tool will default to run on the whole genome provided

-P, --padding

Padding to be applied to all input regions. By default, there will be a padding of 500 bp.

-w, --window-size

This tag allows you to define the window size used for each microassembly task. By default, the window will be 500 bp.

-p, --pct-overlap

Allows the user to define how much overlap there should be between windows. If not specified, the tool will default to 20% overlap

Parameters

These options allow you to define certain parameters for how the tool performs its variant calling

-T, --num-threads

This allows you to define how many async worker threads are used by the tool. If not specified, the tool will default to running with 2 threads.

-k, --min-kmer

This allows you to define the minimum length kmers should be for graph nodes. If no length specified, the min kmer length defaults to 31 bp.

-K, --max-kmer

This allows you to define the maximum length kmers should be for graph nodes. If no length specified, the max kmer length defaults to 133 bp.

-s, `--kmer-step

This allows you to define the kmer step sizes to try for graph nodes. If not specified, the kmer step size defaults to 4

--min-anchor-cov

Minimum coverage for anchor nodes (source & sink). Default is 5

--min-node-cov

Minimum coverage for all nodes in the graph. Default is 2

--max-sample-cov

Maximum per sample coverage before downsampling. Default is 500

--min-alt-qual

Minimum phred quality supporting ALT allele. Default is 20

Filters

These options let you apply different filters to apply to the variant caller

--min-nml-cov

Minimum normal coverage. Default is 20

--min-tmr-cov

Minimum tumor coverage. Default is 20

--min-odds-ratio

Minimum VAF odds in tumor vs normal. Default is 10

--min-snv-fisher

Minimum phred scaled fisher score for SNVs

--min-indel-fisher

Minimum phred scaled fisher score for InDels

--min-str-fisher

Minimum phred scaled fisher score for STRs

Flags

Use these flags to toggle certain portions of the code to apply different features. By default, these features are off, but by using tags (with no argument following), you can toggle the feature on.

--verbose

Turn on verbose logging for more detailed messages

--extract-pairs

Extract all useful read pairs

--no-active-region

Force assemble all windows

--no-contig-check

Skip contig check with reference