TY - JOUR
T1 - Within-family outliers
T2 - Segregating alleles or environmental effects? A linkage analysis of height from 5815 sibling pairs
AU - Benyamin, Beben
AU - Perola, Markus
AU - Cornes, Belinda K.
AU - Madden, Pamela A.F.
AU - Palotie, Aarno
AU - Nyholt, Dale R.
AU - Montgomery, Grant W.
AU - Peltonen, Leena
AU - Martin, Nicholas G.
AU - Visscher, Peter M.
N1 - Funding Information:
We thank the twins and their families for their participation. For the ongoing data collection, recruitment and organisation of the studies in which the phenotypes were collected, we thank Marlene Grace and Ann Eldridge who collected the adolescent data; Dixie Statham who supervised collection of much of the adult data; Anjali Henders and Megan Campbell for managing sample processing; David Smyth, Scott Gordon and Harry Beeby for IT support. The genome scans of adolescents were supported by the Australian NHMRC’s Program in Medical Genomics (NHMRC-219178) and a grant to Dr Jeff Trent from the Center for Inherited Disease Research at Johns Hopkins University. CIDR is fully funded through a federal contract from the National Institutes of Health to The Johns Hopkins University, Contract Number N01-HG-65403. For genome scans of adults, we acknowledge and thank the Mammalian Genotyping Service, Marshfield WI (Director: Dr James Weber) for genotyping under grants to Drs Daniel T O’Connor, David Duffy, Patrick Sullivan and Dale Nyholt; Drs Eline Slagboom, Bas Heijmans and Dorret Boomsma for the Leiden genome scan; Dr Peter Reed for the Gemini genome scan; and Dr Jeff Hall for the Sequana genome scan. This research was supported in part by Grants from NIAAA (USA) AA007535, AA013320, AA013326, AA014041, AA07728, AA10249, AA11998, the GenomEUtwin project, supported by the European Union contract number QLRT-2001-01254, and NHMRC (Australia) 941177, 951023, 950998, 981339, 241916, 941944 and 389892. We thank Jonathan Hansen, Allan McRae, Sarah Medland and Gu Zhu for discussions; Bill Hill and Sri Shekar for their comments on the earlier draft of the article. Beben Benyamin thanks the School of Biological Sciences, University of Edinburgh and the Overseas Research Student Award for providing his PhD scholarship.
PY - 2008/4
Y1 - 2008/4
N2 - Most information in linkage analysis for quantitative traits comes from pairs of relatives that are phenotypically most discordant or concordant. Confounding this, within-family outliers from non-genetic causes may create false positives and negatives. We investigated the influence of within-family outliers empirically, using one of the largest genome-wide linkage scans for height. The subjects were drawn from Australian twin cohorts consisting of 8447 individuals in 2861 families, providing a total of 5815 possible pairs of siblings in sibships. A variance component linkage analysis was performed, either including or excluding the within-family outliers. Using the entire dataset, the largest LOD scores were on chromosome 15q (LOD 2.3) and 11q (1.5). Excluding within-family outliers increased the LOD score for most regions, but the LOD score on chromosome 15 decreased from 2.3 to 1.2, suggesting that the outliers may create false negatives and false positives, although rare alleles of large effect may also be an explanation. Several regions suggestive of linkage to height were found after removing the outliers, including 1q23.1 (2.0), 3q22.1 (1.9) and 5q32 (2.3). We conclude that the investigation of the effect of within-family outliers, which is usually neglected, should be a standard quality control measure in linkage analysis for complex traits and may reduce the noise for the search of common variants of modest effect size as well as help identify rare variants of large effect and clinical significance. We suggest that the effect of within-family outliers deserves further investigation via theoretical and simulation studies.
AB - Most information in linkage analysis for quantitative traits comes from pairs of relatives that are phenotypically most discordant or concordant. Confounding this, within-family outliers from non-genetic causes may create false positives and negatives. We investigated the influence of within-family outliers empirically, using one of the largest genome-wide linkage scans for height. The subjects were drawn from Australian twin cohorts consisting of 8447 individuals in 2861 families, providing a total of 5815 possible pairs of siblings in sibships. A variance component linkage analysis was performed, either including or excluding the within-family outliers. Using the entire dataset, the largest LOD scores were on chromosome 15q (LOD 2.3) and 11q (1.5). Excluding within-family outliers increased the LOD score for most regions, but the LOD score on chromosome 15 decreased from 2.3 to 1.2, suggesting that the outliers may create false negatives and false positives, although rare alleles of large effect may also be an explanation. Several regions suggestive of linkage to height were found after removing the outliers, including 1q23.1 (2.0), 3q22.1 (1.9) and 5q32 (2.3). We conclude that the investigation of the effect of within-family outliers, which is usually neglected, should be a standard quality control measure in linkage analysis for complex traits and may reduce the noise for the search of common variants of modest effect size as well as help identify rare variants of large effect and clinical significance. We suggest that the effect of within-family outliers deserves further investigation via theoretical and simulation studies.
UR - http://www.scopus.com/inward/record.url?scp=41049090634&partnerID=8YFLogxK
U2 - 10.1038/sj.ejhg.5201992
DO - 10.1038/sj.ejhg.5201992
M3 - Article
C2 - 18197190
AN - SCOPUS:41049090634
SN - 1018-4813
VL - 16
SP - 516
EP - 524
JO - European Journal of Human Genetics
JF - European Journal of Human Genetics
IS - 4
ER -