Annotate a VCF File 📄
Before annotating a VCF file using Genome Nexus, it must be first converted to a MAF file. This is because the pipeline is specifically designed to annotate MAF files, which provide a standardized format for storing mutation data.
VCF to MAF conversion
To convert a Variant Call Format (VCF) file to Mutation Annotation Format (MAF), we recommend using the vcf2maf-lite Python tool. It is a lightweight Python adaptation of the vcf2maf Perl tool, that converts the VCF to MAF format without adding variant annotations.
Minimal Example
This command converts the VCF files in /vcf folder to MAF format.
The
--input-data
option is used to specify either a single VCF file or a directory containing multiple VCF files (separated by commas). This option supports passing multiple input files or directories at once.The
--output-directory
option allows you to specify the directory where the MAF files will be saved. If no output path is provided, the default output directoryvcf2maf_output
will be used in the current working directory.The
--tumor-id
option allows you to specify the ID of the tumor sample used in the genotype columns of the VCF file. If the option is not used, the script will automatically identify the tumor ID from either thetumor_sample
keyword in the meta data lines or the sample columns from VCF header.The
--normal-id
option allows you to specify the ID of the normal sample used in the genotype columns of the VCF file. If the option is not used, the script will automatically identify the normal ID from either thenormal_sample
keyword in the meta data lines or the sample columns from VCF header.The
--retain-info
option allows you to specify the INFO fields to be retained as additional columns in the MAF. If the option is not used, standard MAF columns are included by default.The
--retain-fmt
option allows you to specify the FORMAT fields to be retained as additional columns in the MAF. If the option is not used, standard MAF columns are included by default.
Convert with Docker
vcf2maf-lite is available in DockerHub at https://hub.docker.com/r/genomenexus/vcf2maf-lite
Output can be found in the /maf directory.
Annotate a MAF file:
Once the VCF file has been converted to Mutation Annotation Format (MAF), the MAF file can be annotated using Genome Nexus. Refer to the Annotate a MAF File section for detailed instructions.
Last updated