Annotate a VCF File 📄
Last updated
Was this helpful?
Last updated
Was this helpful?
Before annotating a VCF file using Genome Nexus, it must be first converted to a MAF file. This is because the pipeline is specifically designed to annotate MAF files, which provide a standardized format for storing mutation data.
To convert a Variant Call Format (VCF) file to Mutation Annotation Format (MAF), we recommend using the . It is a lightweight Python adaptation of the , that converts the VCF to MAF format without adding variant annotations.
This command converts the VCF files in /vcf folder to MAF format.
The --input-data
option is used to specify either a single VCF file or a directory containing multiple VCF files (separated by commas). This option supports passing multiple input files or directories at once.
The --output-directory
option allows you to specify the directory where the MAF files will be saved. If no output path is provided, the default output directory vcf2maf_output
will be used in the current working directory.
The --tumor-id
option allows you to specify the ID of the tumor sample used in the genotype columns of the VCF file. If the option is not used, the script will automatically identify the tumor ID from either the tumor_sample
keyword in the meta data lines or the sample columns from VCF header.
The --normal-id
option allows you to specify the ID of the normal sample used in the genotype columns of the VCF file. If the option is not used, the script will automatically identify the normal ID from either the normal_sample
keyword in the meta data lines or the sample columns from VCF header.
The --retain-info
option allows you to specify the INFO fields to be retained as additional columns in the MAF. If the option is not used, standard MAF columns are included by default.
The --retain-fmt
option allows you to specify the FORMAT fields to be retained as additional columns in the MAF. If the option is not used, standard MAF columns are included by default.
vcf2maf-lite is available in DockerHub at https://hub.docker.com/r/genomenexus/vcf2maf-lite
Output can be found in the /maf directory.
Once the VCF file has been converted to Mutation Annotation Format (MAF), the MAF file can be annotated using Genome Nexus. Refer to the section for detailed instructions.