ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Correspondence

Issues with data transformation in genome-wide association studies for phenotypic variability

[version 1; peer review: 2 approved]
PUBLISHED 02 Oct 2013
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

Abstract

The purpose of this correspondence is to discuss and clarify a few points about data transformation used in genome-wide association studies, especially for phenotypic variability. By commenting on the recent publication by Sun et al. in the American Journal of Human Genetics, we emphasize the importance of statistical power in detecting functional loci and the real meaning of the scale of the phenotype in practice.

Correspondence

Recently, Sun et al.1 raised an interesting suggestion concerning the use of variance-stabilization transformations in genome-wide association studies (GWAS) for phenotypic variability. Specifically, Sun et al. revisited Yang et al.’s2 results on the variability-controlling locus FTO for human body mass index (BMI) and claimed that the underlying variability across genotypes might not be as large as Yang et al. had seen. Although it was an important point that Sun et al. discussed, especially when quantitatively studying phenotypic variability has become such a hot topic, it is our opinion that there are some issues with the transformation approach that Sun et al. proposed.

First of all, if we take Sun et al.’s transformation according to Yang et al.’s phenotypic mean and variance per FTO genotype class, i.e. a one-to-one map through an inverse hyperbolic sine function, the BMI scale will become rather different compared with the ordinary measurement that we normally use (Figure 1). On the transformed scale of BMI, the difference between two persons who have a BMI of 24 and 25 kg/m2 is much larger than that between two BMIs of 20 and 21 kg/m2, which is strange in reality since the original BMI scale is what we commonly use and also what we care about. Sun et al.’s main argument here is that nearly all the measurement units are manmade. However, considering one of the traits of most interest, e.g. height, why should we regard the difference between 160cm and 170cm different from 170cm and 180cm? Although the definitions of most units can be arbitrary, some measurement scales do have meaning in real life.

5cae9a17-3bb9-41c8-b012-df2677941c58_figure1.gif

Figure 1. Comparison of the original scale of body mass index (BMI) and the transformed scale using Sun et al.’s1 transformation.

The transformation was determined by the phenotypic distribution across FTO genotypes reported by Yang et al.2.

Secondly, a key problem with Sun et al.’s transformation in practice is that such a transformation is marker-specific. Namely, when performing a GWAS, one needs to transform the phenotypic records differently for different markers, according to the phenotypic distribution across the genotypes per marker. This does not make much sense in practical analyses, because if there is a "best" scale of the phenotype, it should be used for all the markers across the genome, before testing the association between the phenotype and the markers. Using the tested marker to determine the transformation of the phenotype is strange. If a marker-specific transformation can be estimated, one should estimate a genome-specific transformation for GWAS, instead of doing different transformations marker-by-marker.

Thirdly, if the transformation of the phenotype is determined by one marker showing a significant effect on the phenotypic variability before testing the other markers, another significant effect on the phenotypic variability might be created due to such a transformation. In such a situation, it is problematic to decide which phenotypic scale we should choose.

Fourthly, several recent studies discussed that gene-gene or gene-environment interactions could cause significant variance heterogeneity across genotypes36, which makes testing variance-controlling loci a powerful tool to reveal potential interaction effects. Reducing the difference in variance across genotypes using a marker-specific variance-stabilization transformation would dramatically reduce such power. Regarding the biological sense of genetically regulated variance heterogeneity, empirical evidence has shown that a single causal locus could show a much higher significant effect on variance compared with the mean6. In a particular population, such a locus may only be mappable through testing the variability rather than the magnitude of the phenotype.

The above issues cause us to question Sun et al.’s transformation in practice. The scale of the phenotype is certainly an important concern when interpreting an effect on phenotypic variability7. However, one needs to be careful for the points above before applying any transformation on the data. In particular, the statistical power in detecting functional loci and the real meaning of the scale used should be emphasized.

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 02 Oct 2013
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Shen X and Rönnegård L. Issues with data transformation in genome-wide association studies for phenotypic variability [version 1; peer review: 2 approved]. F1000Research 2013, 2:200 (https://doi.org/10.12688/f1000research.2-200.v1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 02 Oct 2013
Views
40
Cite
Reviewer Report 01 Nov 2013
Yurii Aulchenko, Institute of Cytology and Genetics, Siberian Division of the Russian Academy of Sciences, Novosibirsk, Russian Federation 
Yakov Tsepilov, Novosibirsk State University, Novosibirsk, Russian Federation 
Sodbo Sharapov, Novosibirsk State University, Novosibirsk, Russian Federation 
Approved
VIEWS 40
We agree with criticism raised by Shen and Ronnegard in their points 2 and 3 concerning the application of the transformation of Sun et al. in the context of whole-genome scans. Indeed, applying this transformation in SNP-specific manner is difficult ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Aulchenko Y, Tsepilov Y and Sharapov S. Reviewer Report For: Issues with data transformation in genome-wide association studies for phenotypic variability [version 1; peer review: 2 approved]. F1000Research 2013, 2:200 (https://doi.org/10.5256/f1000research.2505.r1948)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
33
Cite
Reviewer Report 07 Oct 2013
 William G. Hill, Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, UK 
Ian White, University of Edinburgh 
Approved
VIEWS 33
Shen and Rönnegård (SR) comment critically and succinctly on the paper by Sun et al. published in AJHG which advocates that, before any claim of differences in variance among genotypes in a GWAS or similar study, a check should first ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Hill  G and White I. Reviewer Report For: Issues with data transformation in genome-wide association studies for phenotypic variability [version 1; peer review: 2 approved]. F1000Research 2013, 2:200 (https://doi.org/10.5256/f1000research.2505.r1949)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 02 Oct 2013
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.