|
The options in this table affect the way that the program filters the input data. Some of the options provide direct control over which samples and SNPs get included in the analysis, while others set rules for how the program should behave when faced with certain filtering choices. These options are designed to make filtering more flexible, so that it is easy to apply any desired set of filters to a single underlying genotype file. Some of these options apply to the dataset as a whole while others apply only to specific panels. The flag name for each panel-specific option ends in the command-line symbol for the file on which it operates; e.g., to exclude SNPs from the -g file you should use -exclude_snps_g, and to exclude SNPs from the -g_ref file you should use -exclude_snps_g_ref. |
Flag | Default | Description |
|
none |
This option provides flexible variant filtering in the reference panel via "filter rules", which are based on annotation columns in a To filter variants based on the numeric annotation values in the It is very important that you enclose each filtering string in single quotes, as shown above. Otherwise, the command-line environment may interpret symbols like You can develop annotations yourself and add them to the For an illustration of using |
|
none |
List of SNPs to exclude from the
|
|
none |
Same as |
|
Specifies that SNPs excluded from the study dataset via the
|
|
|
none |
List of reference-panel-only SNPs to impute. If you do not want the program to impute all of the reference SNPs in the region you are analyzing, you can use this list to specify a subset of SNPs to impute; all other SNPs will be ignored unless they have data in the
This option does not have any effect on SNPs in the |
|
none |
File of sample IDs for the individuals in the NOTE: Currently, the only reason to provide a sample file is if you want to exclude some individuals via the |
|
none |
Same as |
|
none |
List of samples to exclude from the NOTE: Part of the IMPUTE2 algorithm involves pooling information across the individuals in your study dataset. Samples with systematically aberrant genotypes (due, e.g., to degraded assay DNA) can confuse this part of the model; you should take care to identify such samples ahead of time and exclude them either manually or with this option. |
|
none |
Same as |