<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.9202.1</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Research Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                    <subj-group>
                        <subject>Neuronal Signaling Mechanisms</subject>
                    </subj-group>
                    <subj-group>
                        <subject>Statistical Methodologies &amp; Health Informatics</subject>
                    </subj-group>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Does sadness impair color perception? Flawed evidence and faulty methods</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 1; peer review: 2 approved]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Holcombe</surname>
                        <given-names>Alex O.</given-names>
                    </name>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Brown</surname>
                        <given-names>Nicholas J. L.</given-names>
                    </name>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Goodbourn</surname>
                        <given-names>Patrick T.</given-names>
                    </name>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a3">3</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Etz</surname>
                        <given-names>Alexander</given-names>
                    </name>
                    <xref ref-type="aff" rid="a4">4</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Geukes</surname>
                        <given-names>Sebastian</given-names>
                    </name>
                    <xref ref-type="aff" rid="a5">5</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>School of Psychology, University of Sydney, New South Wales, 2006, Australia</aff>
                <aff id="a2">
                    <label>2</label>Department of Health Sciences, University Medical Center, University of Groningen, Groningen, 9713 GZ, The Netherlands</aff>
                <aff id="a3">
                    <label>3</label>School of Psychological Sciences, University of Melbourne, Victoria, 3010, Australia</aff>
                <aff id="a4">
                    <label>4</label>Department of Cognitive Sciences, University of California, Irvine, CA, 92697-5100, USA</aff>
                <aff id="a5">
                    <label>5</label>Institut f&#x00fc;r Psychologie, Westf&#x00e4;lische Wilhelms-Universit&#x00e4;t M&#x00fc;nster, M&#x00fc;nster, 48149, Germany</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:nick.brown@free.fr">nick.brown@free.fr</email>
                </corresp>
                <fn fn-type="con">
                    <p>AOH coordinated the team and wrote most of the main section of the article. NJLB started the discussion (on Twitter), wrote the R code to perform the detailed analysis of the dataset, and wrote most of the sections describing this analysis. PTG contributed several points and 
                        <xref ref-type="fig" rid="f4">Figure 4</xref> as well as contributing to the writing. AE contributed to the discussion of stimuli problems and (lack of) necessary stimulus sampling. SG contributed to the analysis, contributed early versions of some of the R code, and helped revise the manuscript. All authors agreed to the final content of the article.</p>
                </fn>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>AOH is an associate editor for a journal, <italic toggle="yes">Perspectives on Psychological Science</italic>, that is published by the organization (the Association for Psychological Science) that also publishes the journal in which Thorstenson <italic toggle="yes">et al.</italic> (2015) appears.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>21</day>
                <month>7</month>
                <year>2016</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2016</year>
            </pub-date>
            <volume>5</volume>
            <elocation-id>1778</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>13</day>
                    <month>7</month>
                    <year>2016</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Holcombe AO et al.</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/5-1778/pdf"/>
            <abstract>
                <p>In their 2015 paper, Thorstenson, Pazda, and Elliot offered evidence from two experiments that perception of colors on the blue&#x2013;yellow axis was impaired if the participants had watched a sad movie clip, compared to participants who watched clips designed to induce a happy or neutral mood. Subsequently, these authors retracted their article, citing a mistake in their statistical analyses and a problem with the data in one of their experiments. Here, we discuss a number of other methodological problems with Thorstenson 
                    <italic toggle="yes">et al.</italic>&#x2019;s experimental design, and also demonstrate that the problems with the data go beyond what these authors reported. We conclude that repeating one of the two experiments, with the minor revisions proposed by Thorstenson 
                    <italic toggle="yes">et al.</italic>, will not be sufficient to address the problems with this work.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>Mood</kwd>
                <kwd>perception</kwd>
                <kwd>color</kwd>
                <kwd>open data</kwd>
                <kwd>reanalysis</kwd>
            </kwd-group>
            <funding-group>
                <funding-statement>This work was supported by internal university grants to AOH and PTG from the University of Sydney and the University of Melbourne, respectively. No other authors received any private or public funding to support their involvement in this work.</funding-statement>
                <funding-statement>
                    <italic>The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</italic>
                </funding-statement>
            </funding-group>
        </article-meta>
    </front>
    <body>
        <sec sec-type="intro">
            <title>Introduction</title>
            <p>Based on two experiments, 
                <xref ref-type="bibr" rid="ref-13">Thorstenson 
                    <italic toggle="yes">et al.</italic> (2015a)</xref> claimed that a state of sadness&#x2014;induced by watching a short film clip&#x2014;impairs performance on a specific perceptual task: discrimination of colors along the blue&#x2013;yellow axis, but not the red&#x2013;green axis. This conclusion is interesting because it is specific to a single dimension of color space; poor performance on tasks generally, or low willingness to cooperate with an experimenter, would not be a surprising effect of sadness.</p>
            <p>In their retraction notice (
                <xref ref-type="bibr" rid="ref-14">Thorstenson 
                    <italic toggle="yes">et al.</italic>, 2015b</xref>), the authors acknowledged that their data did not justify their conclusion that impairment was specific to one aspect of color space. They also described an anomaly in the histogram of the data of Experiment 2. In our sections below entitled &#x201c;A confounded comparison&#x201d; and &#x201c;Perceptual impairment or change in bias?&#x201d;, we detail other problems with the way the experiments were conducted, the choice of stimuli, and the measures chosen. It is these problems that lead us to believe that even the revised Experiment 2 (proposed by 
                <xref ref-type="bibr" rid="ref-14">Thorstenson 
                    <italic toggle="yes">et al.</italic>, 2015b</xref>) will not justify their original conclusion. In our &#x201c;Reanalysis of data&#x201d; section, we describe a number of statistical issues that go beyond the problem mentioned in the retraction notice. In our final section, &#x201c;Anomalies and strange patterns in the data&#x201d;, we report some other anomalies with the dataset, which further undermine confidence in the way the experiments were carried out and in the resulting findings. We offer this analysis not only to improve the state of the literature on the perceptual effects of watching sad film clips, but also in the hope that it will lead to better work in this area in the future.</p>
        </sec>
        <sec>
            <title>A confounded comparison</title>
            <p>When designing an experiment comparing two conditions, one strives to make the factor of interest the 
                <italic toggle="yes">only</italic> difference between the conditions. 
                <xref ref-type="bibr" rid="ref-13">Thorstenson 
                    <italic toggle="yes">et al.</italic> (2015a)</xref> contrasted two film clips, one of which was intended to cause the participants to feel sad. The clips ought to have been chosen to avoid any other differences (on average) in their effect on the participants. The &#x201c;sadness&#x201d; clip of Experiment 1 is an excerpt from the animated Disney movie 
                <italic toggle="yes">The Lion King</italic>, with unusual lighting that gives the impression of daylight filtered through dust, while the &#x201c;happiness&#x201d; clip is a warmly lit, indoor recording of the comedian Bill Cosby. The &#x201c;sadness&#x201d; clip used in Experiment 2 is the same <italic toggle="yes">Lion King</italic> excerpt, converted from color to grayscale, and the &#x201c;neutral&#x201d; clip is a film of sticks appearing on top of one another at different orientations (also converted from color to grayscale).</p>
            <p>Unfortunately, differences in the mean color and color variability of these clips may have differentially affected subsequent perception of blue and yellow versus red and green. For example, the contrast along the blue&#x2013;yellow axis might have been greater in the sadness clips. Such a difference would result in reduced sensitivity to blue&#x2013;yellow (
                <xref ref-type="bibr" rid="ref-9">Krauskopf 
                    <italic toggle="yes">et al.</italic>, 1982</xref>). The use of grayscale clips does not eliminate this issue. An analysis of the Hue, Saturation, Brightness (HSB) values in the movie files posted online indicates that the mean color of the grayscale clips is bluish-reddish, with some saturations approaching 5%. These grayscale clips, therefore, may have had differences in average color as well as color contrast that were uncontrolled. To resolve the issue, the colors displayed on the laboratory screen must be measured with a colorimeter. The authors should have made these measurements and reported them in their paper, in order to provide an indication of whether simple contrast adaptation specific to each color axis would occur when viewing the clips. In the absence of a report of these measurements or any mention of the issue, it appears that 
                <xref ref-type="bibr" rid="ref-13">Thorstenson 
                    <italic toggle="yes">et al.</italic> (2015a)</xref> did not take the appropriate steps to eliminate the possibility that a classic process in color perception could explain the results.</p>
            <p>Blue, yellow, green and red are all defined relative to a white point that, in the human visual system, is quite flexible. Just as one can adjust the white balance of a camera to fit scenes with different illumination, for humans the point considered to be the center of color space changes depending on the palette of colors that confronts us (
                <xref ref-type="bibr" rid="ref-16">Webster &amp; Leonard, 2008</xref>). Unfortunately, Thorstenson 
                <italic toggle="yes">et al.</italic> displayed their test stimuli in a manner unsuited to controlling the participant&#x2019;s white point. Color perception experiments typically use a neutral grey or white background to provide a white-point reference for participants, alongside the test stimulus. Thorstenson 
                <italic toggle="yes">et al.</italic>&#x2019;s use of full-field color without a simultaneous reference stimulus makes categorization of desaturated patches problematic. In such circumstances, the participants&#x2019; white points may be more dependent on the color content of the movie clip they viewed previously, which as mentioned above appears to have been uncontrolled. In addition, the lack of a grey or white reference stimulus may cause participants to be completely unable to judge the stimulus color more often. In such circumstances, responses may be particularly prone to influence by cognitive factors or by priming (
                <xref ref-type="bibr" rid="ref-6">Garc&#x00ed;a-P&#x00e9;rez &amp; Alcal&#x00e1;-Quintana, 2013</xref>).</p>
            <p>In addition to color, there may be other confounding differences between the two types of clips. The clips likely differed in interest, action, and other features. Unfortunately, it is difficult to know whether such features might have affected participants&#x2019; color perception. It certainly is possible for such differences to bias the participants&#x2019; responses when they are uncertain of the stimulus&#x2019;s color. Of course, it is almost impossible to avoid featural differences between any two particular clips. Because of this, a good experimental design would utilize a large set of clips, assess the various featural differences between the stimuli, and either match the two groups of clips carefully on their features, or model them as random effects in a mixed-effects model (
                <xref ref-type="bibr" rid="ref-17">Wells &amp; Windschitl, 1993</xref>).</p>
        </sec>
        <sec>
            <title>Perceptual impairment or change in bias?</title>
            <p>
                <xref ref-type="bibr" rid="ref-13">Thorstenson 
                    <italic toggle="yes">et al.</italic> (2015a)</xref> concluded that sadness &#x201c;impair[s] color perception on the blue&#x2013;yellow color axis&#x201d; (p. 1). But signal detection theory, which was not used, would be necessary to show whether the decrease in accuracy found was indeed due to an impairment in color perception (i.e., a decline in sensitivity along the blue&#x2013;yellow axis), or whether the judgments of the sadness group were instead biased away from blue and yellow. For decades, studies of perception have used signal detection theory to distinguish between a change in perceptual ability and a change in, say, cognitive bias to press the blue or yellow button rather than the red or green one (
                <xref ref-type="bibr" rid="ref-7">Green &amp; Swets, 1966</xref>). Unfortunately, Thorstenson 
                <italic toggle="yes">et al.</italic>&#x2019;s plan to simply repeat Experiment 2 with minor revisions would not allow for the appropriate analysis. In Experiment 2, participants were tested in only two trials for each stimulus. Much more data would be needed to discern between a decline in the participants&#x2019; ability to discriminate the colors from a decline in the participants&#x2019; bias toward pressing the blue or yellow button (instead of the red or green). For the four-alternative categorization task used by Thorstenson 
                <italic toggle="yes">et al.</italic>, a multivariate extension of signal-detection theory should be used (such as general-recognition theory, 
                <xref ref-type="bibr" rid="ref-2">Ashby &amp; Townsend, 1986</xref>).</p>
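            <p>The distinction between sensitivity and bias can be illustrated with a minimal sketch of the simplest (yes/no) signal detection model. This is not the multivariate analysis that the four-alternative task would require, and the two observers and their response rates are hypothetical values chosen for illustration, not data from either experiment. The sketch assumes equal-variance Gaussian signal and noise distributions and equally frequent signal and noise trials.</p>
            <code language="python"><![CDATA[
```python
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal distribution (mean 0, sd 1)


def sensitivity_and_bias(hit_rate, false_alarm_rate):
    """Return (d_prime, criterion) for a yes/no detection task."""
    z_hit = STD_NORMAL.inv_cdf(hit_rate)
    z_fa = STD_NORMAL.inv_cdf(false_alarm_rate)
    d_prime = z_hit - z_fa            # sensitivity
    criterion = -(z_hit + z_fa) / 2   # response bias (0 = unbiased)
    return d_prime, criterion


def proportion_correct(hit_rate, false_alarm_rate):
    """Accuracy when signal and noise trials are equally frequent."""
    return (hit_rate + (1 - false_alarm_rate)) / 2


# Hypothetical observers: identical sensitivity (d' = 2), different bias.
neutral_observer = (STD_NORMAL.cdf(1.0), STD_NORMAL.cdf(-1.0))
cautious_observer = (STD_NORMAL.cdf(0.5), STD_NORMAL.cdf(-1.5))

for label, rates in (("neutral", neutral_observer),
                     ("cautious", cautious_observer)):
    d_prime, criterion = sensitivity_and_bias(*rates)
    accuracy = proportion_correct(*rates)
    print(label, round(d_prime, 3), round(criterion, 3), round(accuracy, 3))
```
]]></code>
            <p>In this sketch the two hypothetical observers have the same sensitivity, yet the more conservative one has lower accuracy. A drop in proportion correct therefore cannot, by itself, establish an impairment in perception.</p>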
            <p>The analyses of 
                <xref ref-type="bibr" rid="ref-13">Thorstenson 
                    <italic toggle="yes">et al.</italic> (2015a)</xref>, and also the improved analyses that we have suggested above, assume stochastic independence of participants&#x2019; accuracies on the red&#x2013;green stimuli and the blue&#x2013;yellow stimuli. Unfortunately, however, this assumption may be unjustified. Participants&#x2019; accuracy on one axis might affect their guessing strategy on the other. In Experiment 1, for example, accuracy was very high on the blue&#x2013;yellow axis, suggesting that many participants may have had a clear color percept of the blue or yellow stimuli, but were less certain about the red and green stimuli. If so, when an unclear patch came up and they guessed, they may have been unlikely to guess blue or yellow, in an effort to balance their responses across the available options (many participants may have correctly guessed that the stimuli were roughly equally distributed among the four categories). This would artifactually improve performance on the red and green stimuli. Modeling this phenomenon, however, would be difficult. Even if we had access to the raw responses (rather than the summary data provided by Thorstenson 
                <italic toggle="yes">et al.</italic>), it would be difficult to estimate the participants&#x2019; guessing strategy. To avoid this problem in a future version of this experiment, we suggest that Thorstenson and colleagues should consider adopting the two-alternative forced choice design (
                <xref ref-type="bibr" rid="ref-4">Fechner, 1860/1966</xref>) commonly used in psychophysics.</p>
            <p>
                <xref ref-type="bibr" rid="ref-13">Thorstenson 
                    <italic toggle="yes">et al.</italic> (2015a)</xref> are not the only researchers to have used bias-prone measures of perception to support claims that some non-perceptual state can influence perception. 
                <xref ref-type="bibr" rid="ref-5">Firestone &amp; Scholl (2015)</xref> provide many other examples, with useful discussion.</p>
        </sec>
        <sec>
            <title>Reanalysis of data</title>
            <p>There are issues with the dataset (
                <xref ref-type="bibr" rid="ref-15">Thorstenson 
                    <italic toggle="yes">et al.</italic>, 2015c</xref>) that were not described in the retraction (
                <xref ref-type="bibr" rid="ref-14">Thorstenson 
                    <italic toggle="yes">et al.</italic>, 2015b</xref>) of the article. Some of these issues affect Experiment 1, which Thorstenson 
                <italic toggle="yes">et al.</italic> indicated that they plan to re-publish. We describe and discuss these issues in this section, as well as the next section, &#x201c;Anomalies and strange patterns in the data&#x201d;.</p>
            <p>The most important empirical claim made by 
                <xref ref-type="bibr" rid="ref-13">Thorstenson 
                    <italic toggle="yes">et al.</italic> (2015a)</xref> was that there was a difference in performance between their two measures, namely color perception along the blue&#x2013;yellow axis and color perception along the red&#x2013;green axis. However, these authors provided no statistical test of a difference in the effect of the film clip on blue&#x2013;yellow compared to red&#x2013;green. This problem was discussed widely on blogs and on 
                <xref ref-type="bibr" rid="ref-10">PubPeer (2016)</xref>, and was acknowledged by 
                <xref ref-type="bibr" rid="ref-14">Thorstenson 
                    <italic toggle="yes">et al.</italic> (2015b)</xref> in their retraction notice. 
                <xref ref-type="bibr" rid="ref-13">Thorstenson 
                    <italic toggle="yes">et al.</italic> (2015a, p. 4)</xref> noted that the possible difference between red&#x2013;green and blue&#x2013;yellow color perception, such that &#x201c;sadness influenced chromatic judgments about colors on the blue&#x2013;yellow axis, but not those on the red&#x2013;green axis,&#x201d; is critical to ruling out &#x201c;the possibility that sadness simply led to less effort, arousal, attention, or task engagement.&#x201d; Such a difference implies a statistical interaction between the &#x201c;emotion condition&#x201d; and &#x201c;color axis&#x201d; factors. However, the authors did not report a statistical test for this interaction in either of their experiments. When we (and the authors of various blogs, such as 
                <xref ref-type="bibr" rid="ref-1">Areshenkoff, 2015</xref>) tested this interaction with the published data, we found (code at: 
                <xref ref-type="bibr" rid="ref-8">Holcombe 
                    <italic toggle="yes">et al.</italic>, 2016</xref>) that it was not statistically significant: Experiment 1, 
                <italic toggle="yes">F</italic>(1, 125) = 3.51, 
                <italic toggle="yes">p</italic> = .06; Experiment 2, 
                <italic toggle="yes">F</italic>(1, 128) = 0.40, 
                <italic toggle="yes">p</italic> = .52. In their retraction notice, 
                <xref ref-type="bibr" rid="ref-14">Thorstenson 
                    <italic toggle="yes">et al.</italic> (2015b)</xref> reported a 
                <italic toggle="yes">z</italic> test to test the same issue (for unknown reasons, they did not use a conventional statistical interaction, but instead followed a procedure described in 
                <xref ref-type="bibr" rid="ref-11">Rosenthal &amp; Rosnow, 1991</xref>), which also did not yield statistical significance.</p>
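            <p>For readers unfamiliar with this kind of test, the following is a minimal sketch of how an interaction in a design with one two-level between-participants factor and one two-level within-participants factor can be computed. It relies on the standard result that the interaction <italic toggle="yes">F</italic> in such a design equals the square of a pooled-variance <italic toggle="yes">t</italic> statistic comparing the per-participant difference scores of the two groups. It is not the R code used for the analysis reported here; the function name and the toy scores are hypothetical, and no <italic toggle="yes">p</italic> value is computed.</p>
            <code language="python"><![CDATA[
```python
from statistics import mean, variance


def interaction_f(by_a, rg_a, by_b, rg_b):
    """F statistic and degrees of freedom for the group x axis interaction.

    by_a, rg_a: blue-yellow and red-green scores of group A (same order).
    by_b, rg_b: the same for group B.
    """
    # Within-participant difference between the two color axes.
    diff_a = [by - rg for by, rg in zip(by_a, rg_a)]
    diff_b = [by - rg for by, rg in zip(by_b, rg_b)]
    n_a, n_b = len(diff_a), len(diff_b)
    df_error = n_a + n_b - 2
    pooled_var = ((n_a - 1) * variance(diff_a)
                  + (n_b - 1) * variance(diff_b)) / df_error
    std_error = (pooled_var * (1 / n_a + 1 / n_b)) ** 0.5
    t = (mean(diff_a) - mean(diff_b)) / std_error
    return t * t, (1, df_error)


# Hypothetical toy scores (correct responses out of 24), for illustration only.
sad_by, sad_rg = [12, 14, 10, 13], [17, 18, 16, 19]
neutral_by, neutral_rg = [15, 16, 13, 14], [18, 17, 17, 19]

f_value, dfs = interaction_f(sad_by, sad_rg, neutral_by, neutral_rg)
print(round(f_value, 2), dfs)
```
]]></code>
            <p>With <italic toggle="yes">N</italic> participants in total, the error degrees of freedom are <italic toggle="yes">N</italic> &#x2212; 2, which matches the form of the <italic toggle="yes">F</italic>(1, 125) and <italic toggle="yes">F</italic>(1, 128) values reported above.</p>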
            <p>An additional potential source of error is that 
                <xref ref-type="bibr" rid="ref-13">Thorstenson 
                    <italic toggle="yes">et al.</italic> (2015a)</xref> did not record the color-perception performance of their participants before the film clips were shown. It was apparently considered sufficient to randomize the participants to watch one of two film clips; presumably the reasoning was that this randomization made it unlikely that the two groups differed much in baseline performance. However, even if this assumption were to be confirmed, the two groups would likely differ somewhat at baseline, even if by only a small amount, and such a difference could have an effect on the outcome given the relatively small sample sizes involved (
                <xref ref-type="bibr" rid="ref-12">Saint-Mont, 2015</xref>). It would have been useful for these differences to be measured and included in the subsequent analyses, given that Thorstenson 
                <italic toggle="yes">et al.</italic>&#x2019;s hypothesis was that sadness would &#x201c;impair&#x201d; (i.e., reduce, compared to a previous state) participants&#x2019; color perception. In addition, using a change score for each participant can increase statistical power by reducing the contribution of variation among participants to the error term.</p>
            <p>Finally, we note that 
                <xref ref-type="bibr" rid="ref-13">Thorstenson 
                    <italic toggle="yes">et al.</italic>&#x2019;s (2015a)</xref> experimental design assumes the complete independence of participants&#x2019; accuracy on the two sets of stimuli (red&#x2013;green and blue&#x2013;yellow). We discuss a possible violation of this assumption in our &#x201c;Perceptual impairment or change in bias?&#x201d; section above.</p>
        </sec>
        <sec>
            <title>Anomalies and strange patterns in the data</title>
            <sec>
                <title>Large numbers of participants with identical scores</title>
                <p>We observed a strange pattern in the data for the blue&#x2013;yellow axis in 
                    <xref ref-type="bibr" rid="ref-13">Thorstenson 
                        <italic toggle="yes">et al.</italic>&#x2019;s (2015a)</xref> Experiment 2. Specifically, a very large number of participants (53 out of 130) had a score of exactly 50%, corresponding to 12 out of 24 correct responses, with every other number of correct responses (10, 11, 13, 14, etc.) being achieved by a much smaller number of participants. This is illustrated in 
                    <xref ref-type="fig" rid="f1">Figure 1</xref>, where the spike at the 50% level is clearly visible. This issue was one of the reasons given by 
                    <xref ref-type="bibr" rid="ref-14">Thorstenson 
                        <italic toggle="yes">et al.</italic> (2015b)</xref> for retracting their article.</p>
                <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                    <label>Figure 1. </label>
                    <caption>
                        <title>Distribution of standardized blue&#x2013;yellow axis scores recorded for Experiment 2.</title>
                        <p>The histogram shows the number of occurrences of each score, calculated by 
                            <xref ref-type="bibr" rid="ref-13">Thorstenson 
                                <italic toggle="yes">et al.</italic> (2015a)</xref> as the proportion of correct responses in the 24 trials in which either a blue or yellow patch was presented. The pairs of adjacent bars near 0.55 and 0.8 correspond to cases that are not compatible with correct rounding.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/9905/08d186c6-0334-465e-b4f5-9f9b6e37b3ff_figure1.gif"/>
                </fig>
                <p>Upon our request, Christopher Thorstenson provided us with 
                    <italic toggle="yes">per-color patch data</italic>. The per-color patch data consists of two Excel files (one per experiment), with each cell containing a combined score for the participants&#x2019; two responses for each color and saturation level. The score for each case is either 0.0, 0.5, or 1.0, corresponding to 0, 1, or 2 correct responses (see 
                    <ext-link ext-link-type="uri" xlink:href="https://osf.io/sbhn9/">https://osf.io/sbhn9/</ext-link>).</p>
                <p>Closer examination of the per-color patch data shows that of the 53 participants scoring exactly 50%, 49 (i.e., 37.7% of all participants in Experiment 2) had identical scores for both colors, namely 6.0 (100%) for blue and 0.0 (0%) for yellow (in their patch data files, each correct observation counts for a half-point, so that scores for each color range from 0.0 to 6.0 in increments of 0.5; thus, a score of 6.0 corresponds to 12 correct responses out of 12). We are at a loss to explain this phenomenon, which affected both experimental conditions (26 of the 49 participants with this 12&#x2013;0 split were in the neutral condition, with 23 of the 49 being in the sadness condition). There seems no reason to suppose that the undergraduate participants in this experiment would have been markedly less sensitive to yellow than those in Experiment 1. However, even if their ability to distinguish the color yellow was affected by some environmental factor, or if they had been accidentally (perhaps due to a software problem) shown, say, a gray patch instead of a yellow one, their expected score for yellow would be 1.5 (i.e., three correct identifications out of 12 attempts) by chance alone.</p>
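                <p>The chance-level figure can be reproduced with a short sketch. It assumes, purely for illustration, that a participant who cannot see the color guesses independently and uniformly among the four response options on each of the 12 yellow trials; actual guessing behavior may well differ.</p>
                <code language="python"><![CDATA[
```python
from math import comb


def binomial_pmf(successes, trials, p):
    """Probability of exactly `successes` successes in `trials` independent trials."""
    return comb(trials, successes) * p**successes * (1 - p)**(trials - successes)


TRIALS = 12        # yellow patches per participant
P_GUESS = 0.25     # assumption: uniform guessing among four colors
POINTS_PER_CORRECT = 0.5

expected_correct = TRIALS * P_GUESS
expected_score = expected_correct * POINTS_PER_CORRECT
p_zero_correct = binomial_pmf(0, TRIALS, P_GUESS)

print(expected_correct, expected_score, round(p_zero_correct, 4))
```
]]></code>
                <p>Under this assumption, a single guessing participant would score exactly zero on yellow only about 3% of the time, so guessing alone seems an unlikely account of the 49 participants with a 12&#x2013;0 split.</p>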
            </sec>
            <sec>
                <title>Inconsistent calculation of percentages</title>
                <p>A further concern with the summary data (
                    <xref ref-type="bibr" rid="ref-15">Thorstenson 
                        <italic toggle="yes">et al.</italic>, 2015c</xref>) is that the conversion of color perception values from counts of responses to percentages of correct attempts for both axes in Experiment 2 appears to be inconsistent. These percentage values, reported to two decimal places, ought to be the result of dividing the number of successful attempts on each axis (i.e., the total number of correct identifications of red or green patches for the red&#x2013;green axis, and the total number of correct identifications of blue or yellow patches for the blue&#x2013;yellow axis) by 24. For example, an examination of the patch scores shows that participants #4 and #5 both scored a total of 6.5 for blue and yellow patches combined, corresponding to 13 correct identifications out of 24 on the blue&#x2013;yellow axis. However, in the published dataset file, participant #4 has a value of 0.54 for the corresponding percentage variable BY_ACC (blue&#x2013;yellow accuracy), whereas participant #5 has a value of 0.55 for the same variable (the true value of 13/24 being 
                    <mml:math display="inline" id="math1">
                        <mml:mrow>
                            <mml:mn>0.5416</mml:mn>
                            <mml:mover accent="true">
                                <mml:mn>6</mml:mn>
                                <mml:mo stretchy="true">&#x00af;</mml:mo>
                            </mml:mover>
                        </mml:mrow>
                    </mml:math>). Christopher Thorstenson (personal communication, December 1, 2015) has explained to us that these percentages resulted from taking the mean of the individual percentages of correct attempts for each color of the axis in question (e.g., red and green), with these individual percentages having first been rounded to two decimal places. It is not clear whether this explains all the anomalies in the data for Experiment 2; in any case, it serves as a reminder that, in order to avoid loss of information, rounding should be avoided during an analysis and only applied, if necessary, during the final reporting of results.</p>
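                <p>The effect of rounding before averaging can be illustrated with a short sketch that compares direct calculation with the two-step procedure described above, for every way of splitting 13 correct responses between two colors. The per-color splits are enumerated hypothetically (they are not the recorded splits of participants #4 and #5), and the sketch assumes that halves are rounded up at each step, which may not match the software that was actually used.</p>
                <code language="python"><![CDATA[
```python
from decimal import Decimal, ROUND_HALF_UP

TWO_PLACES = Decimal("0.01")


def round2(value):
    """Round to two decimal places, rounding halves up (an assumption)."""
    return value.quantize(TWO_PLACES, rounding=ROUND_HALF_UP)


def direct_accuracy(correct_a, correct_b, trials_per_color=12):
    """Total correct divided by total trials, rounded once at the end."""
    return round2(Decimal(correct_a + correct_b) / Decimal(2 * trials_per_color))


def two_step_accuracy(correct_a, correct_b, trials_per_color=12):
    """Round each color's proportion first, then average the rounded values."""
    n = Decimal(trials_per_color)
    prop_a = round2(Decimal(correct_a) / n)
    prop_b = round2(Decimal(correct_b) / n)
    return round2((prop_a + prop_b) / 2)


# Every split of 13 correct responses across two colors (12 trials each).
results = {(a, 13 - a): two_step_accuracy(a, 13 - a) for a in range(1, 13)}

for split, value in results.items():
    print(split, value, direct_accuracy(*split))
```
]]></code>
                <p>Under these assumptions, direct calculation always gives 0.54, whereas the two-step procedure gives 0.54 for some splits and 0.55 for others, which is the kind of discrepancy seen between participants #4 and #5.</p>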
            </sec>
            <sec>
                <title>Large differences in skewness between experiments</title>
                <p>An examination of the distribution of the scores for the two color axes reveals considerable differences between Experiment 1 and Experiment 2. In Experiment 1, the distribution for both axes was substantially negatively skewed, with the majority of participants correctly identifying almost all of the patches for all four colors (
                    <xref ref-type="fig" rid="f2">Figure 2a</xref>). In Experiment 2, the score distribution was different for each axis. For the red&#x2013;green axis (
                    <xref ref-type="fig" rid="f2">Figure 2b</xref>, top panel) the scores were approximately normally distributed: roughly similar numbers of participants achieved each possible score, with a small number having very low or very high scores. In contrast, the blue&#x2013;yellow axis was positively skewed, displaying the &#x201c;spike&#x201d; discussed previously (
                    <xref ref-type="fig" rid="f2">Figure 2b</xref>, bottom panel).</p>
                <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                    <label>Figure 2. </label>
                    <caption>
                        <p>Distribution of blue&#x2013;yellow and red&#x2013;green axis scores in (
                            <bold>a</bold>) Experiment 1 and (
                            <bold>b</bold>) Experiment 2. Histograms show the number of occurrences of each score for the red&#x2013;green axis (top panels) and blue&#x2013;yellow axis (bottom panels). The range of scores on the 
                            <italic toggle="yes">x</italic>-axis is 0&#x2013;12, reflecting 
                            <xref ref-type="bibr" rid="ref-13">Thorstenson 
                                <italic toggle="yes">et al.</italic>&#x2019;s (2015a)</xref> scoring scheme of 0.5 points per correct answer, with 12 trials per color and two colors per axis. Note the discontinuity in the 
                            <italic toggle="yes">y</italic>-axis for blue&#x2013;yellow in Experiment 2 (bottom-right panel), added to accommodate the surprisingly high peak at 6.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/9905/08d186c6-0334-465e-b4f5-9f9b6e37b3ff_figure2.gif"/>
                </fig>
                <p>Using the patch-level data, we broke the two-color axis scores down into individual colors, as shown in 
                    <xref ref-type="fig" rid="f3">Figure 3</xref>. For Experiment 1, the per-color data more or less followed the pattern of the two-color axis of which each color was a part (
                    <xref ref-type="fig" rid="f3">Figure 3a</xref>); this was also true for the red&#x2013;green axis in Experiment 2 (
                    <xref ref-type="fig" rid="f3">Figure 3b</xref>, left panels). However, an even stranger pattern emerged for the blue&#x2013;yellow axis in Experiment 2 (
                    <xref ref-type="fig" rid="f3">Figure 3b</xref>, right panels). Of the 130 participants, 106 (81.5%) scored a maximum 6.0 (corresponding to 12 correct responses) for blue, while 56 (43.1%) scored zero for yellow. The observed &#x201c;spike&#x201d; at 50% (i.e., 12 out of a possible 24 correct responses) for the blue&#x2013;yellow axis is thus mostly explained by people who had a perfect score (12 out of 12) for blue, while completely failing to recognize yellow patches at any saturation and thus obtaining a score of 0.</p>
                <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                    <label>Figure 3. </label>
                    <caption>
                        <p>Distribution of color patch scores in (
                            <bold>a</bold>) Experiment 1 and (
                            <bold>b</bold>) Experiment 2. Histograms show the number of occurrences of each score for the red, green, blue and yellow color patches. The range of scores on the 
                            <italic toggle="yes">x</italic>-axis is 0&#x2013;6, reflecting 
                            <xref ref-type="bibr" rid="ref-13">Thorstenson 
                                <italic toggle="yes">et al.</italic>&#x2019;s (2015a)</xref> scoring scheme of 0.5 points per correct answer, with 12 trials per color. Note the discontinuity in the 
                            <italic toggle="yes">y</italic>-axis for blue patches in Experiment 2 (bottom row, second panel from right), added to accommodate the surprisingly high peak at 6.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/9905/08d186c6-0334-465e-b4f5-9f9b6e37b3ff_figure3.gif"/>
                </fig>
                <p>
                    <xref ref-type="fig" rid="f4">Figure 4</xref> plots the participants&#x2019; performance for each color, broken down further into the proportion of correct responses for each saturation level (recall that participants were asked to identify colors at each of six different levels of saturation). In Experiment 1, this revealed what appears to be a ceiling effect: mean accuracy already reached 90% or more at the third-lowest color saturation level (.10) and leveled off as saturation increased thereafter (
                    <xref ref-type="fig" rid="f4">Figure 4a</xref>). In Experiment 2, the ceiling effect disappeared for the red&#x2013;green axis, for which scores on both colors improved approximately linearly with increasing color saturation (
                    <xref ref-type="fig" rid="f4">Figure 4b</xref>, left panels); however, on the blue&#x2013;yellow axis, the effect of the split between the two colors is once again clear. The ceiling effect is even more pronounced for blue here than in Experiment 1, while scores for yellow are low even at the highest color saturation level (
                    <xref ref-type="fig" rid="f4">Figure 4b</xref>, right panels).</p>
                <fig fig-type="figure" id="f4" orientation="portrait" position="float">
                    <label>Figure 4. </label>
                    <caption>
                        <p>Color categorization as a function of saturation in (
                            <bold>a</bold>) Experiment 1 and (
                            <bold>b</bold>) Experiment 2. Each panel shows the proportion of correct responses for each of the six saturation levels for a given color. Mean performance in the 
                            <italic toggle="yes">sadness</italic> condition is represented by triangles joined by solid lines, and mean performance in the 
                            <italic toggle="yes">happiness</italic> (Experiment 1) and 
                            <italic toggle="yes">neutral</italic> (Experiment 2) conditions is represented by circles joined by dashed lines. Error bars show parametric 95% confidence intervals on the means. (Details of the color calibration procedure were not stated by 
                            <xref ref-type="bibr" rid="ref-13">Thorstenson 
                                <italic toggle="yes">et al.</italic> (2015a)</xref>, so it is not clear how to interpret these saturation values.)</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/9905/08d186c6-0334-465e-b4f5-9f9b6e37b3ff_figure4.gif"/>
                </fig>
                <p>It is difficult to imagine what might have caused these results in Experiment 2. The Method section for this experiment suggests that the only change that was made from Experiment 1 was the nature of the film clips that were shown to participants. The differences for both axes (and, indeed, for all four colors) between Experiments 1 and 2&#x2014;regardless of the film clip watched by participants&#x2014;are puzzling, given that both samples were drawn from the same population of undergraduates and hence ought not to differ widely in their physiological characteristics. Because the color characteristics of the two sets of film clips were apparently not well-controlled, one possible explanation for this discrepancy is differential adaptation of the color mechanisms in the visual system, which adds to our concern about possible confounds (see our section &#x201c;A confounded comparison?&#x201d;). But we have difficulty believing that this, on its own, could account for such a substantial difference between the two experiments.</p>
                <p>Given that the extreme blue&#x2013;yellow scores in Experiment 2 were obtained from participants in both the neutral and sadness conditions, a further possibility is that simply watching grayscale film clips for a few minutes was sufficient to substantially distort participants&#x2019; color vision (on the blue&#x2013;yellow axis only). However, if Thorstenson and colleagues had noticed such a finding, they would presumably have mentioned it in their article (and perhaps alerted colleagues in the field of physiology to this remarkable discovery). Otherwise, we are left with two possible conclusions: either around 40% of the participants in 
                    <xref ref-type="bibr" rid="ref-13">Thorstenson 
                        <italic toggle="yes">et al.</italic>&#x2019;s (2015a)</xref> Experiment 2 all had the same problem with their vision (which was not shared by any of the participants in Experiment 1), or some form of equipment failure or other technical problem caused this unusual pattern of values to be recorded. In any case, it seems likely that Thorstenson 
                    <italic toggle="yes">et al.</italic> failed to notice this anomaly when examining their data prior to performing their statistical analyses.</p>
            </sec>
        </sec>
        <sec sec-type="conclusions">
            <title>Conclusion</title>
            <p>While we strongly support the retraction by 
                <xref ref-type="bibr" rid="ref-14">Thorstenson 
                    <italic toggle="yes">et al.</italic> (2015b)</xref> of their article (
                <xref ref-type="bibr" rid="ref-13">Thorstenson 
                    <italic toggle="yes">et al.</italic>, 2015a</xref>) on the basis of the problems they noted with Experiment 2, we maintain that the basic methodology of both of their experiments is flawed. As Thorstenson and colleagues move forward, together with others who seek to assess whether mood and other factors can influence perception, they should bring their work up to modern standards of statistics and psychophysics. Doing so for experiments like those of Thorstenson 
                <italic toggle="yes">et al.</italic> would involve: (1) careful control of the visual differences between the movie clips, or, better, mood induction via non-visual stimuli such as an audio recording of a story; (2) the use of many movie clips or recordings, and mixed-effects analysis to address differences that cannot be eliminated between any two clips or recordings; (3) a baseline measurement of color perception; and (4) an analysis based on signal-detection theory.</p>
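            <p>As a rough illustration of point (4) above, the following Python sketch computes the standard signal-detection measures of sensitivity (<italic toggle="yes">d</italic>&#x2032;) and response criterion (<italic toggle="yes">c</italic>) from a table of yes/no responses. The counts, the function name sdt_measures, and the yes/no framing of the task (e.g., "was this patch red?") are hypothetical choices made for illustration and are not taken from the design or data of Thorstenson 
                <italic toggle="yes">et al.</italic>; the log-linear correction for hit or false-alarm rates of 0 or 1 is likewise an assumption of this sketch.</p>
            <code language="python">
```python
from statistics import NormalDist


def sdt_measures(hits, misses, false_alarms, correct_rejections):
    """Return (d_prime, criterion) for a yes/no detection task.

    A log-linear correction (add 0.5 to each count, 1 to each total)
    keeps the rates away from 0 and 1, where the z-transform is undefined.
    """
    hit_rate = (hits + 0.5) / (hits + misses + 1)
    fa_rate = (false_alarms + 0.5) / (false_alarms + correct_rejections + 1)
    z = NormalDist().inv_cdf  # inverse of the standard normal CDF
    d_prime = z(hit_rate) - z(fa_rate)
    criterion = -0.5 * (z(hit_rate) + z(fa_rate))
    return d_prime, criterion


# Hypothetical counts for one participant and one color (12 signal trials,
# 12 non-signal trials); these numbers are invented for illustration.
d_prime, criterion = sdt_measures(hits=9, misses=3,
                                  false_alarms=3, correct_rejections=9)
print(round(d_prime, 2), round(criterion, 2))
```
            </code>
            <p>Separating sensitivity from criterion in this way makes it possible to ask whether a mood manipulation changes what participants can discriminate, or merely how willing they are to give a particular response.</p>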
        </sec>
        <sec>
            <title>Data availability</title>
            <p>Open Science Framework: Reanalysis of Thorstenson 
                <italic toggle="yes">et al.</italic>&#x2019;s (2015) &#x201c;Sadness Impairs Color Perception&#x201d;, doi 
                <ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.17605/osf.io/kwuq4">10.17605/osf.io/kwuq4</ext-link> (
                <xref ref-type="bibr" rid="ref-3">Brown 
                    <italic toggle="yes">et al.</italic>, 2016</xref>).</p>
            <p>We have archived the R code that we used to analyze the data and generate our figures at the Open Science Framework (OSF; doi: 
                <ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.17605/osf.io/kwuq4">10.17605/osf.io/kwuq4</ext-link>). This code works with the original dataset files uploaded to OSF (
                <xref ref-type="bibr" rid="ref-15">Thorstenson 
                    <italic toggle="yes">et al.</italic>, 2015c</xref>), together with the patch data files that Christopher Thorstenson sent us (by &#x201c;patch data&#x201d;, we mean data broken down to the individual color patches tested) and that we posted at 
                <ext-link ext-link-type="uri" xlink:href="https://osf.io/sbhn9/">https://osf.io/sbhn9/</ext-link> (Thorstenson subsequently asked us to delete two of the files, which we did).</p>
        </sec>
    </body>
    <back>
        <ack>
            <title>Acknowledgements</title>
            <p>We thank Christopher Thorstenson for sharing the per-color patch data, and for his comments on a previous version of this manuscript.</p>
        </ack>
        <ref-list>
            <ref id="ref-1">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Areshenkoff</surname>
                            <given-names>CN</given-names>
                        </name>
					</person-group>:
                    <article-title>On the importance of plotting; or &#x2014; Psych. Science will publish anything</article-title>.  [weblog post].<year>2015</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://areshenk-research-notes.com/on-the-importance-of-plotting-or-psych-science-will-publish-anything/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-2">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Ashby</surname>
                            <given-names>FG</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Townsend</surname>
                            <given-names>JT</given-names>
                        </name>
					</person-group>:
                    <article-title>Varieties of perceptual independence.</article-title>
                    <source>
						
                        <italic toggle="yes">Psychol Rev.</italic>
					</source>
                    <year>1986</year>;<volume>93</volume>(<issue>2</issue>):<fpage>154</fpage>&#x2013;<lpage>179</lpage>.
                    <pub-id pub-id-type="pmid">3714926</pub-id>
                    <pub-id pub-id-type="doi">10.1037/0033-295X.93.2.154</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-3">
                <mixed-citation publication-type="data">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Brown</surname>
                            <given-names>NJL</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Holcombe</surname>
                            <given-names>AO</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Etz</surname>
                            <given-names>A</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Reanalysis of Thorstenson 
                        <italic toggle="yes">et al.</italic>&#x2019;s (2015) &#x201c;Sadness Impairs Color Perception&#x201d;</article-title>.
                    <source>
						
                        <italic toggle="yes">Open Science Framework.</italic>
					</source>
                    <year>2016</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.17605/osf.io/kwuq4">Data Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-4">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Fechner</surname>
                            <given-names>GT</given-names>
                        </name>
					</person-group>:
                    <article-title>Elements of psychophysics</article-title>. New York, NY: Holt, Rinehart and Winston. (Original work published 1860).<year>1966</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://www.worldcat.org/title/elements-of-psychophysics-vol-1/oclc/650196798?ht=edition&amp;referer=br">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-5">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Firestone</surname>
                            <given-names>C</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Scholl</surname>
                            <given-names>BJ</given-names>
                        </name>
					</person-group>:
                    <article-title>Cognition does not affect perception: Evaluating the evidence for &#x2018;top-down&#x2019; effects.</article-title>
                    <source>
						
                        <italic toggle="yes">Behav Brain Sci.</italic>
					</source>
                    <year>2015</year>;<volume>20</volume>:<fpage>1</fpage>&#x2013;<lpage>77</lpage>.
                    <pub-id pub-id-type="pmid">26189677</pub-id>
                    <pub-id pub-id-type="doi">10.1017/S0140525X15000965</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-6">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Garc&#x00ed;a-P&#x00e9;rez</surname>
                            <given-names>MA</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Alcal&#x00e1;-Quintana</surname>
                            <given-names>R</given-names>
                        </name>
					</person-group>:
                    <article-title>Shifts of the psychometric function: distinguishing bias from perceptual effects.</article-title>
                    <source>
						
                        <italic toggle="yes">Q J Exp Psychol (Hove).</italic>
					</source>
                    <year>2013</year>;<volume>66</volume>(<issue>2</issue>):<fpage>319</fpage>&#x2013;<lpage>37</lpage>.
                    <pub-id pub-id-type="pmid">22950887</pub-id>
                    <pub-id pub-id-type="doi">10.1080/17470218.2012.708761</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-7">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Green</surname>
                            <given-names>DM</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Swets</surname>
                            <given-names>JA</given-names>
                        </name>
					</person-group>:
                    <article-title>Signal Detection Theory and Psychophysics</article-title>. New York, NY: Wiley.<year>1966</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://andrei.gorea.free.fr/Teaching_fichiers/SDT%20and%20Psytchophysics.pdf">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-8">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Holcombe</surname>
                            <given-names>AO</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Brown</surname>
                            <given-names>NJL</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Etz</surname>
                            <given-names>A</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Code</article-title>.<year>2016</year>.
                    <pub-id pub-id-type="doi">10.17605/osf.io/kwuq4</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-9">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Krauskopf</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Williams</surname>
                            <given-names>DR</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Heeley</surname>
                            <given-names>DW</given-names>
                        </name>
					</person-group>:
                    <article-title>Cardinal directions of color space.</article-title>
                    <source>
						
                        <italic toggle="yes">Vision Res.</italic>
					</source>
                    <year>1982</year>;<volume>22</volume>(<issue>9</issue>):<fpage>1123</fpage>&#x2013;<lpage>1131</lpage>.
                    <pub-id pub-id-type="pmid">7147723</pub-id>
                    <pub-id pub-id-type="doi">10.1016/0042-6989(82)90077-3</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-10">
                <mixed-citation publication-type="book">
                    <collab>PubPeer</collab>:
                    <article-title>Sadness impairs color perception</article-title>.<year>2016</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://pubpeer.com/publications/989FBE60680F308F7BF98BB22F3C50">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-11">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Rosenthal</surname>
                            <given-names>R</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Rosnow</surname>
                            <given-names>RL</given-names>
                        </name>
					</person-group>:
                    <article-title>Essentials of Behavioral Research: Methods and Data Analysis</article-title>. New York, NY: McGraw Hill.<year>1991</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://www.abebooks.com/servlet/BookDetailsPL?bi=11787574184">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-12">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Saint-Mont</surname>
                            <given-names>U</given-names>
                        </name>
					</person-group>:
                    <article-title>Randomization Does Not Help Much, Comparability Does.</article-title>
                    <source>
						
                        <italic toggle="yes">PLoS One.</italic>
					</source>
                    <year>2015</year>;<volume>10</volume>(<issue>7</issue>):<fpage>e0132102</fpage>.
                    <pub-id pub-id-type="pmid">26193621</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0132102</pub-id>
                    <pub-id pub-id-type="pmcid">4507867</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-13">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Thorstenson</surname>
                            <given-names>CA</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Pazda</surname>
                            <given-names>AD</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Elliot</surname>
                            <given-names>AJ</given-names>
                        </name>
					</person-group>:
                    <article-title>Sadness impairs color perception.</article-title>
                    <source>
						
                        <italic toggle="yes">Psychol Sci.</italic>
					</source>
                    <year>2015a</year>.
                    <pub-id pub-id-type="doi">10.1177/0956797615597672</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-14">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Thorstenson</surname>
                            <given-names>CA</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Pazda</surname>
                            <given-names>AD</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Elliot</surname>
                            <given-names>AJ</given-names>
                        </name>
					</person-group>:
                    <article-title>Retraction of &#x201c;Sadness impairs color perception&#x201d;.</article-title>
                    <source>
						
                        <italic toggle="yes">Psychol Sci.</italic>
					</source>
                    <year>2015b</year>;<volume>26</volume>(<issue>11</issue>):<fpage>1822</fpage>. [The publisher has given the retraction the same DOI as the original article, perhaps accidentally.]
                    <pub-id pub-id-type="doi">10.1177/0956797615597672</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-15">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Thorstenson</surname>
                            <given-names>CA</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Pazda</surname>
                            <given-names>AD</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Elliot</surname>
                            <given-names>AJ</given-names>
                        </name>
					</person-group>:
                    <article-title>Sadness impairs color perception</article-title>. [Data and stimuli set]. Republished by Holcombe
                    <italic toggle="yes">et al.</italic> (2016).<year>2015c</year>.
                    <pub-id pub-id-type="doi">10.17605/OSF.IO/ZSNXB</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-16">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Webster</surname>
                            <given-names>MA</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Leonard</surname>
                            <given-names>D</given-names>
                        </name>
					</person-group>:
                    <article-title>Adaptation and perceptual norms in color vision.</article-title>
                    <source>
						
                        <italic toggle="yes">J Opt Soc Am A Opt Image Sci Vis.</italic>
					</source>
                    <year>2008</year>;<volume>25</volume>(<issue>11</issue>):<fpage>2817</fpage>&#x2013;<lpage>2825</lpage>.
                    <pub-id pub-id-type="pmid">18978861</pub-id>
                    <pub-id pub-id-type="doi">10.1364/JOSAA.25.002817</pub-id>
                    <pub-id pub-id-type="pmcid">2657039</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-17">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Wells</surname>
                            <given-names>GL</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Windschitl</surname>
                            <given-names>PD</given-names>
                        </name>
					</person-group>:
                    <article-title>What's in a question?</article-title>
                    <source>
						
                        <italic toggle="yes">Contemp Psychol.</italic>
					</source>
                    <year>1993</year>;<volume>38</volume>(<issue>4</issue>):<fpage>383</fpage>&#x2013;<lpage>385</lpage>.
                    <pub-id pub-id-type="doi">10.1037/033227</pub-id>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report15140">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.9905.r15140</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Crognale</surname>
                        <given-names>Michael A</given-names>
                    </name>
                    <xref ref-type="aff" rid="r15140a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r15140a1">
                    <label>1</label>Department of Psychology, University of Nevada, Reno, NV, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>8</day>
                <month>8</month>
                <year>2016</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Crognale MA</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport15140" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.9202.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The manuscript by Holcombe 
                <italic>et al. </italic>as a comment on Thorstenson 
                <italic>et al.'</italic>s prior manuscript is well written and reasoned.&#x00a0; The problems with the Thorstenson 
                <italic>et al</italic>. article are many and most of them have been well addressed here. However, since the manuscript by Thorstenson
                <italic> et al.</italic> has been retracted, reportedly for other reasons, it does not seem particularly fruitful to critique the original manuscript further unless a resubmission has been published.&#x00a0; On the other hand, pointing out the additional methodological and statistical problems with the original manuscript may be instructive for those following up on this work, as suggested by Holcombe 
                <italic>et al. </italic>In particular, there does seem to be a disturbing trend to attribute results to factors that have not been well established as uniquely or even likely causal (e.g., in the present case, that "sadness" caused the observed trends in the data). Holcombe 
                <italic>et al.'</italic>s manuscript also illustrates the importance of scrutinizing the raw data for evidence of faulty methodology and ceiling/floor effects. Publishing the present manuscript may therefore be worthwhile, as it is a potentially valuable instructional tool.</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report15370">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.9905.r15370</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>von Stumm</surname>
                        <given-names>Sophie</given-names>
                    </name>
                    <xref ref-type="aff" rid="r15370a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r15370a1">
                    <label>1</label>Department of Psychology, Goldsmiths, University of London, London, UK</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>2</day>
                <month>8</month>
                <year>2016</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 von Stumm S</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport15370" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.9202.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The article reviews a previous publication that was based on two flawed experiments and has since been retracted. The authors thoroughly analyse the experimental methods that were employed in the retracted study, and they support the retraction. In addition, they highlight further problems with the experimental design and they make specific recommendations for improving future studies in this area.</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
</article>
