META
is a program for the meta analysis of genome-wide association
studies. The program is designed to synthesizing the evidence from
different association studies. Particularly, the program is able
to work seamlessly with the output of SNPTEST.
This program was used in the meta analysis of the genome-wide
association studies of smoking related traits [1]
Home | Input File Formats | Output Summaries | Running META | Options | Perl Script |
Contributors |
Download |
Version History | References |
Contact Information |
The following
people have contributed to the development of the software for
META:
Jason Liu, Jonathan Marchini
Pre-compiled
versions of the program and example files can be downloaded from
the links below. If you intend to run META on a machine
running an old kernel then you probably want to use the dynamic
version. If you have any problems getting the program to work on
your machine please contact me.
Platform |
File |
Linux (x86_64) Dynamic
Executable |
meta_v1.7_x86_64_dynamic.tgz |
Linux (x86_64) Static Executable | meta_v1.7_x86_64_static.tgz |
Mac OSX |
meta_v1.7_Mac_OSX.tgz |
tar zxvf meta_vX.X_x86_64.tgz |
META
reads plain text files at input, with each line of each file
represeting the information of a SNP. Although the format is quite
flexible, following column names must be provided:
rsid | SNP id. |
pos | Base-pair position of SNP. |
allele_A | non-coded allele (a.k.a non-effect allele, non-reference allele). |
allele_B | coded allele (a.k.a effect allele, reference allele). |
info | imputation quality
score (RSQR_HAT column in MACH; INFO column in PLINK; PROPER_INFO column in SNPTEST). |
P_value | p-value of each SNP. |
beta | effect size of each snp. |
se | standard error of effect size. |
chr rsid pos allele_A allele_B P_value info beta
se 1 rs16969968 76669980 A G 0.027185 0.99025 -0.12571 0.056914 1 rs518425 76670868 A G 0.012406 0.98888 -0.15238 0.060954 2 rs514743 76671282 A T 0.91281 0.9997 0.0061483 0.056075 3 rs615470 76673043 C T 0.90384 0.99988 0.0067651 0.055996 6 rs12899226 76674493 G T 0.69283 0.99717 -0.050464 0.12883 6 rs660652 76674887 A G 0.90419 0.99943 -0.0067418 0.056007 6 rs472054 76675049 A G 0.90419 0.99943 -0.0067418 0.056007 15 rs8029939 76675404 C T 0.96537 0.91413 -0.027428 0.63172 15 rs578776 76675455 A G 0.013069 0.98698 0.1537 0.061882 15 rs6495307 76677376 C T 0.77301 0.99926 0.016023 0.055553 15 rs12910984 76678682 A G 0.0032279 0.98707 -0.19692 0.066864 15 rs1051730 76681394 A G 0.030083 0.96504 -0.12551 0.058043 ... |
META can use the output files of SNPTEST as its
input files because all the information mentioned above is already
included in the output of SNPTEST. See SNPTEST Mode for how to read
them and here
for the details of output of SNPTEST.
The output file
of META contains a line for each SNP and there is a header
line which specifies the contents of each column. The following
table give a description of each of the entries in this file.
chr |
Chromosome number (if you
specified the chromosome in input files) |
rsid | SNP id (taken from input files). |
pos | Base-pair position of SNP (taken from input files). |
allele_A | non-coded allele (taken from input files). |
allele_B | coded allele (taken from input files). |
P_value | combined p-value. |
beta | combined effect size. |
se | combined standard error of effect size. |
Q | Cochran's Q statistics. |
P_heterogeneity | p-value for heterogeneity. |
I2 | percentage of total variation across studies that is due to heterogeneity. |
P_cohort_1, ..., P_cohort_n | p-values of cohort 1, ..., cohort n. |
To run META
and see the parameters it requires, simply type:
./meta |
./meta \ --method 1 \ --cohort examples/example1.txt examples/example2.txt \ --output meta.txt |
./meta \ --method 1 \ --cohort examples/example1.txt.gz examples/example2.txt.gz \ --output meta.txt |
The following is
an example using input files that have a chr column
./meta \ --method 1 \ --cohort examples/example3.txt examples/example4.txt \ --output meta.txt |
When
combining data across cohorts it is crucial that the information
about the alleles at each SNP is consistent. There are a few
reasons why this might not be the case. Two (or more cohorts)
may differ
(a) There might be differences between cohorts in the strand of
the human reference sequence that is used to define the alleles
a SNP.
(b) The order of the alleles at a SNP in the input files may
differ between cohorts.
(c) There might be real inconsistencies in the alleles reported
by each cohort.
META will
try to align the allele information across cohorts at a SNP. If
it finds inconsistencies at a SNP that cannot be rectified then
the SNP will be removed. For example SNP rs16969968 has
alleles A and G in the example file example1.txt
and alleles A and T in example2.txt. This means the SNP
has inconsistent alleles and is removed. When you run the
following command you will see that the screen output reports
that 1 SNP has been removed.
./meta \ --method 1 \ --cohort examples/example1.txt examples/example2.txt \ --output meta.txt |
./meta \ --method 1 \ --threshold 0.9 \ --cohort examples/example1.txt examples/example2.txt \ --output meta.txt |
./meta \ --method 3 \ --sample-size 100 120 \ --cohort examples/example1.txt examples/example2.txt \ --output meta.txt |
./meta \ --method 1 \ --lambda 1.05 1.08 \ --cohort examples/example1.txt examples/example2.txt \ --output meta.txt |
./meta \ --method 1 \ --cohort examples/example1.txt examples/example2.txt \ --rsid rs1051730 rs16969968 \ --output meta.txt |
./meta \ --method 1 \ --cohort examples/example1.txt examples/example2.txt \ --interval 76500000 77000000 \ --output meta.txt |
./meta \ --method 1 \ --cohort examples/example1.txt examples/example2.txt \ --top-snp 5 \ --output meta.txt |
./meta \ --snptest \ --method 1 \ --cohort examples/example7.txt examples/example8.txt \ --output meta.txt |
./meta
\ --snptest \ --method 1 \ --cohort examples/example5.txt examples/example6.txt \ --output meta.txt |
./meta \ --snptest \ --use_info_col \ --method 1 \ --cohort examples/example5.txt examples/example6.txt \ --output meta.txt |
A complete set
of options is given in the following table :
Parameters | Type | Description |
--method | Number (1 to 3) | Three different methods used to combine p-values: 1 = inverse variance method (based on fixed-effects model); 2 = inverse variance method (based on random-effects model); 3 = z-statistics combination method (based on fixed-effects model). |
--cohort | Files | A vector of formatted files. |
--output | File | Output file. |
--snptest |
Flag |
Optional, use the output of
SNPTEST as input files. |
--use_info_col |
Flag |
Optional, specify use of
info column in SNPTEST outpur files (use with --snptest) |
--sample-size |
Numbers |
Optional, a vector of
sample sizes for each cohort. To use z-statistics combination method (method = 3), sample sizes have to be given. |
--lambda |
Numbers |
Optional, a vector of
genomic control lambdas for each cohort. |
--threshold |
Number (between 0 and 1) |
Optional, define a
threshold of imputation quality score (between 0 and 1),
default value = 0.5. |
--rsid | RSIDs | Optional, RSIDs of SNP of interest. |
--interval | Two numbers | Optional, define a subset of SNPs by position (in basepairs) in the range start ≤ position ≤ end. |
--top-snp | Number | Optional, define the number of most significant SNPs that will be output. |
Version |
Release Time |
Description |
1.0 |
11-3-2010 |
First version made available:
|
1.1 |
20-9-2010 |
Changes from META v1.0
|
1.2 |
20-11-2010 |
Changes from META v1.1
|
1.3 |
07-06-2011 |
Changes from META v1.2
|
1.3.1 |
16-08-2011 |
Changes from META v1.3
|
1.3.2 |
22-08-2011 |
Changes from META v1.3.1
|
1.4 |
19-12-2011 |
|
1.5 |
29-03-2013 |
|
1.6 |
16-09-2014 |
|
1.7 |
17-08-2014 |
|
If you have a
question please send a mail to our maillist
http://www.jiscmail.ac.uk/OXSTATGEN
You will need to subscribe to the maillist to do this.
IMPORTANT : If you are
having a problem with one of the programs please include details
of the following when you email.
(a) the version number of the program and the type of computer you
are running the program on e.g. SNPTEST v2.1.0 Mac OSX 10.6
(b) include the precise command line(s) you have used
(c) include any log file and/or screen output from the program
(d) sometimes it may be necessary for us to obtain a copy of the
data you have so please be prepared to supply this. Otherwise, we
may not be able to diagnose the problem.