GCTA-LD: searching for SNPs in LD with a target SNP

GCTA-LD: searching for SNPs in LD with a target SNP Oct 10, 2016 1:25:36 GMT

Post by Jian Yang on Oct 10, 2016 1:25:36 GMT

For each target SNP, GCTA uses the simple regression approach to search for SNPs that are in significant LD with the target SNP.

--ld ld.snplist
Specify a list of SNPs.

--ld-wind 5000
Search for SNPs in LD with a target SNP within d Kb (e.g. 5000 Kb) region in either direction by simple regression test.

--ld-sig 0.05
Threshold p-value for regression test, e.g. 0.05.

Example
gcta64 --bfile test --ld ld.snplist --ld-wind 5000 --ld-sig 0.05 --out test

Output files
1) test.rsq.ld, summary of LD structure with each row corresponding to each target SNP. The columns are target SNP

length of LD block
two flanking SNPs of the LD block
total number of SNPs within the LD block
mean r2
median r2
maximum r2
SNP in highest LD with the target SNP

2) test.r.ld, the correlations (r) between the target SNP and all the SNPs in the LD block.
3) test.snp.ld, the names of all the SNPs in the LD with the target SNP.
Note: LD block is defined as a region where SNPs outside this region are not in significant LD with the target SNP. According to this definition, the length of LD block depends on user-specified window size and significance level.

References

Method for LD calculation: Yang et al. (2011) Genomic inflation factors under polygenic inheritance. Eur J Hum Genet. 19(7): 807-812. [Pubmed ID: 21407268]

GCTA software: Yang J, Lee SH, Goddard ME and Visscher PM. GCTA: a tool for Genome-wide Complex Trait Analysis. Am J Hum Genet. 2011 Jan 88(1): 76-82. [PubMed ID: 21167468]