<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.51192.1</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Research Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Measure of unevenness in human genomes, described as a self-affine phase transition in a 'spin-chain&#x2019; model</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 1; peer review: 1 not approved]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Feranchuk</surname>
                        <given-names>Sergey</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Investigation</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-2774-4179</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Department of Physics, Smolensk State University, Smolensk, Russian Federation</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:feranchuk@gmail.com">feranchuk@gmail.com</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>25</day>
                <month>2</month>
                <year>2021</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2021</year>
            </pub-date>
            <volume>10</volume>
            <elocation-id>149</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>18</day>
                    <month>2</month>
                    <year>2021</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2021 Feranchuk S</copyright-statement>
                <copyright-year>2021</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/10-149/pdf"/>
            <abstract>
                <p>
                    <bold>Background:</bold> Non-Gaussian distribution of polymorphic positions across a genome can substantially influence the results of any approach to molecular evolution based on a 'classical' probability model. The infinite dispersion of non-Gaussian perturbations is a challenge in an attempt to accept it in a probability-based model of evolution.</p>
                <p>
                    <bold>Methods:</bold> Here a model is proposed where non-Gaussian distribution is introduced to an exact solution of the 'Ising model'; it describes a behavior of one-dimensional chain of spins in an approaching to a phase transition. The distribution of fragments which are identical between two genomes is similar to distribution of islands of spins with the same orientation, in the model where non-integer dimension is introduced.</p>
                <p>
                    <bold>Results:</bold> Application of this model allows us to compare the relative contributions of non-Gaussian perturbations for pairs of human genomes from different ethnic groups. An evolution of the three human races in a most compact presentation is considered, rates of development on the separated stages of the evolution are assumed to be proportional to a value of relative unevenness between the appropriate groups of genomes. In the resolved model, the meaningful details of the separation between Asian and European races are clarified, in a period around ten thousand years ago; a particular viewpoint to the separation of the African race is also presented.</p>
                <p>
                    <bold>Conclusion:</bold> The proposed approximation of non-Gaussian perturbations in human genomes allows to support the statements which are otherwise missed in the scientific investigations of the early history of modern humans.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>mutation rate</kwd>
                <kwd>self-affine development</kwd>
                <kwd>human evolution</kwd>
                <kwd>phase transition</kwd>
                <kwd>genome coverage</kwd>
                <kwd>Landau-Zener transition</kwd>
            </kwd-group>
            <funding-group>
                <funding-statement>The author(s) declared that no grants were involved in supporting this work.</funding-statement>
            </funding-group>
        </article-meta>
    </front>
    <body>
        <sec sec-type="intro">
            <title>Introduction</title>
            <p>The issues about an unevenness of a genome arose in particular in a distribution of coverage of sequencing reads mapped to a genome
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>
                </sup>. There, the deviations of a reads&#x2019; coverage from a 'classical' Lange-Waterman model, which was constructed following a Poisson distribution for short genome fragments, reflects some features of self-affinity for most frequent genome fragments in addition to the previously observed over-dispersion of a ''Poisson'' peak. There, the effect was observed stably in several analyzed genomes and is to be treated as a robust enough phenomenon to be discovered further and in depth.</p>
            <p>The features of self-affinity in DNA sequences were detected at the very early days of genomics, in a classical work of Peng 
                <italic toggle="yes">et al.</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref-2">2</xref>
                </sup>; a definition of the fractal dimension for the one-dimensional series was proposed there, and DNA sequences were a model of ''fractal''-like series.</p>
            <p>The self-affinity features in a phenomenon imply the influence of perturbations with infinite dispersion, or the presence of a so-called 'fat-tail' in their probability distribution, and these features are difficult to detect and describe.</p>
            <p>Here, an approach is proposed to detect and apply a measure to these 'perturbations' focusing on the phenomenon mentioned, which was observed in coverage distributions. The relevance of a proposed research project is demonstrated on a model of evolution of humans restored from some of the present-day human genomes; confusions which are accumulated in solving of this scientific problem were in fact a motif to drive out the research.</p>
        </sec>
        <sec sec-type="methods">
            <title>Methods</title>
            <p>Self-affinity features are a property of 'transitional' period, and a description of these features is borrowed from approaches to a so-called 'Ising model', the model where a phase transition in a mutual orientation of spins in a crystal is explained. The heating of magnetic crystal leads to abrupt disappearance of magnetic momentum, and an approximation of this phenomenon is simulated in the Ising model. In a very simplified form, the interactions in crystal are presented there as an increase of energy if two spins in a linear chain are oriented in the same direction, and a critical temperature of phase transition (
                <italic toggle="yes">T
                    <sub>c</sub>
                </italic>) is derived from a strength of these interactions ('coupling constant').</p>
            <p>The cooling led in turn to a sudden appearance of magnetic momentum, and a decrease of a temperature close to a critical temperature leads to accumulation of 'islands', long enough fragments where spins are oriented in the same order. Ordered fragments in a one-dimensional spin chain can be compared to identical fragments of genome sequences, and a distribution of these fragments allows to describe there the features of self-affinity.</p>
            <p>In the frames of statistical physics, an expression for the probability of island of length 
                <italic toggle="yes">k</italic> was obtained by Dziamagra
                <sup>
                    <xref ref-type="bibr" rid="ref-3">3</xref>
                </sup>, as an applied case of the so-called 'Landau-Zener' transition'':</p>
            <p>
                <disp-formula>
                    <mml:math display="inline" id="math">
                        <mml:mrow>
                            <mml:mi>p</mml:mi>
                            <mml:mo stretchy="false">(</mml:mo>
                            <mml:mi>k</mml:mi>
                            <mml:mo>,</mml:mo>
                            <mml:mi>&#x03c4;</mml:mi>
                            <mml:mo stretchy="false">)</mml:mo>
                            <mml:mo>~</mml:mo>
                            <mml:msup>
                                <mml:mi>e</mml:mi>
                                <mml:mrow>
                                    <mml:mi>&#x03c4;</mml:mi>
                                    <mml:msup>
                                        <mml:mi>k</mml:mi>
                                        <mml:mn>2</mml:mn>
                                    </mml:msup>
                                </mml:mrow>
                            </mml:msup>
                        </mml:mrow>
                    </mml:math>
                </disp-formula>
            </p>
            <p>This is a point where a distribution with infinite dispersion can be introduced, assuming that a power coefficient 2 in this Gaussian-like distribution is substituted to some floating power coefficient D &lt; 2, a dimension of 'intrinsic' self-affinity of the underlaid process. The model constructed above depends on the two flexible parameters; intrinsic dimension 
                <italic toggle="yes">D</italic> and a parameter 
                <italic toggle="yes">&#x03c4;</italic>, a rate of cooling, or a rate of approaching a transitional phase. This model allows us to explain over-dispersion of the genome coverage distributions mentioned above (
                <xref ref-type="fig" rid="f1">Figure 1</xref>) and to fit the parameters to a measure of unevenness of human genomes, trying to reconstruct their evolution most precisely . The distribution of 'islands' for human genomes can be obtained as distribution of lengths of completely identical fragments in the genomes; lists of polymorphic positions (SNP) from ''1000 genomes'' project
                <sup>
                    <xref ref-type="bibr" rid="ref-4">4</xref>
                </sup> were used as a representation of genomes. A similar distribution of fragment sizes is observed for this data (
                <xref ref-type="fig" rid="f2">Figure 2A</xref>; 
                <xref ref-type="table" rid="T1">Table 1</xref>). The clusters for the three races are clearly seen for both genetic distance and for tails of 'island' sizes in genome-genome comparison (
                <xref ref-type="fig" rid="f2">Figure 2B</xref>). To interpret this, higher unevenness relative to same genetic distance means higher 'equilibration rate', higher mutation rates, and a lesser slope of a fitting line.</p>
            <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                <label>Figure 1. </label>
                <caption>
                    <p>(
                        <bold>A</bold>) Simulation of over-dispersion effect in a genome following modified Ising model of islands in a chain of spins. Here, D = 1.9, &#x03c4; = 0.0005, number of spins N = 200. The dashed line emulates an ordinary Poisson distribution, &#x03c4; is the same. (
                        <bold>B</bold>) Dependency between estimated fitting coefficient, the average of distribution, and underlying closeness to 'phase transition'.</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/54328/b5070e06-bbc3-4e52-963f-dd1f9b93aa18_figure1.gif"/>
            </fig>
            <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                <label>Figure 2. </label>
                <caption>
                    <p>(
                        <bold>A</bold>) Distribution of fragments&#x2019; sizes in human genomes in a pairwise comparison.  Measures of closeness between genomes: genetic distance (
                        <bold>B</bold>) and unevenness of fragment sizes (
                        <bold>C</bold>). Here races are, from left to right, from bottom to top: Asia, Mexico, Europe, India, Africa.</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/54328/b5070e06-bbc3-4e52-963f-dd1f9b93aa18_figure2.gif"/>
            </fig>
            <table-wrap id="T1" orientation="portrait" position="anchor">
                <label>Table 1. </label>
                <caption>
                    <title>Approximated averaged similarities between human races and within a race, according to the detailed chart in 
                        <xref ref-type="fig" rid="f2">Figure 2</xref>.</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="center" colspan="1" rowspan="1"> </th>
                            <th align="left" colspan="1" rowspan="1"> </th>
                            <th align="left" colspan="1" rowspan="1" valign="top">&#x2019;unevenness&#x2019; m</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">genetic
                                <break/>distance p</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Europe</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Europe</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">0.0138</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">0.25</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Europe</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Asia</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">0.0144</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">0.27</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Asia</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Asia</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">0.0137</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">0.22</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Europe</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Africa</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">0.0152</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">0.29</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Asia</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Africa</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">0.0153</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">0.29</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Africa</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Africa</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">0.0131</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">0.24</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
        </sec>
        <sec sec-type="results">
            <title>Results</title>
            <p>For the two independent populations, the distance does not depend on heterogeneity in populations; a simple model of exponential development can provide a dependency of average genetic distance within population.</p>
            <p>
                <disp-formula id="e1">
                    <mml:math display="inline" id="math1">
                        <mml:mrow>
                            <mml:mtable columnalign="right">
                                <mml:mtr columnalign="right">
                                    <mml:mtd columnalign="right">
                                        <mml:mrow>
                                            <mml:msub>
                                                <mml:mi>p</mml:mi>
                                                <mml:mi>x</mml:mi>
                                            </mml:msub>
                                            <mml:mo>&#x2248;</mml:mo>
                                            <mml:msub>
                                                <mml:mi>p</mml:mi>
                                                <mml:mrow>
                                                    <mml:mn>0</mml:mn>
                                                    <mml:mi>x</mml:mi>
                                                </mml:mrow>
                                            </mml:msub>
                                            <mml:mo>+</mml:mo>
                                            <mml:mi>t</mml:mi>
                                            <mml:msub>
                                                <mml:mi>r</mml:mi>
                                                <mml:mi>x</mml:mi>
                                            </mml:msub>
                                            <mml:mfrac>
                                                <mml:mn>1</mml:mn>
                                                <mml:mrow>
                                                    <mml:msub>
                                                        <mml:mi>N</mml:mi>
                                                        <mml:mn>0</mml:mn>
                                                    </mml:msub>
                                                </mml:mrow>
                                            </mml:mfrac>
                                            <mml:mstyle displaystyle="true">
                                                <mml:munderover>
                                                    <mml:mo>&#x2211;</mml:mo>
                                                    <mml:mi>n</mml:mi>
                                                    <mml:mrow>
                                                        <mml:msub>
                                                            <mml:mi>N</mml:mi>
                                                            <mml:mi>g</mml:mi>
                                                        </mml:msub>
                                                    </mml:mrow>
                                                </mml:munderover>
                                                <mml:mrow>
                                                    <mml:mo stretchy="false">(</mml:mo>
                                                    <mml:mi>n</mml:mi>
                                                    <mml:msup>
                                                        <mml:mi>&#x03b1;</mml:mi>
                                                        <mml:mi>n</mml:mi>
                                                    </mml:msup>
                                                    <mml:mo stretchy="false">)</mml:mo>
                                                </mml:mrow>
                                            </mml:mstyle>
                                        </mml:mrow>
                                    </mml:mtd>
                                </mml:mtr>
                                <mml:mtr columnalign="right">
                                    <mml:mtd columnalign="right">
                                        <mml:mrow>
                                            <mml:mi>k</mml:mi>
                                            <mml:mo>~</mml:mo>
                                            <mml:msup>
                                                <mml:mi>e</mml:mi>
                                                <mml:mi>m</mml:mi>
                                            </mml:msup>
                                            <mml:mo>&#x2248;</mml:mo>
                                            <mml:mn>1</mml:mn>
                                            <mml:mo>+</mml:mo>
                                            <mml:mi>m</mml:mi>
                                        </mml:mrow>
                                    </mml:mtd>
                                </mml:mtr>
                                <mml:mtr columnalign="right">
                                    <mml:mtd columnalign="right">
                                        <mml:mrow>
                                            <mml:msub>
                                                <mml:mi>d</mml:mi>
                                                <mml:mi>x</mml:mi>
                                            </mml:msub>
                                            <mml:mo>=</mml:mo>
                                            <mml:msub>
                                                <mml:mi>p</mml:mi>
                                                <mml:mn>0</mml:mn>
                                            </mml:msub>
                                            <mml:mo>+</mml:mo>
                                            <mml:mi>t</mml:mi>
                                            <mml:msub>
                                                <mml:mi>r</mml:mi>
                                                <mml:mi>x</mml:mi>
                                            </mml:msub>
                                            <mml:msub>
                                                <mml:mi>N</mml:mi>
                                                <mml:mi>g</mml:mi>
                                            </mml:msub>
                                            <mml:mo>+</mml:mo>
                                            <mml:mi>t</mml:mi>
                                            <mml:msub>
                                                <mml:mi>r</mml:mi>
                                                <mml:mi>y</mml:mi>
                                            </mml:msub>
                                            <mml:msub>
                                                <mml:mi>N</mml:mi>
                                                <mml:mi>g</mml:mi>
                                            </mml:msub>
                                        </mml:mrow>
                                    </mml:mtd>
                                </mml:mtr>
                            </mml:mtable>
                        </mml:mrow>
                    </mml:math>
                </disp-formula>
            </p>
            <p>The simplified model of evolution of human ethnic groups is shown in 
                <xref ref-type="fig" rid="f3">Figure 3</xref>, and for further consideration it was reduced to just clarify a separation between three human races. In this case, the model can be further reduced to a system of equations (2). The events which are assumed here as events of separation between races are (a) separation between modern Asians and modern Europeans, which happened nearly just after an expansion to America, about ten thousand years ago; and (b) expansion of modern humans to Eurasia, about fifty thousand years ago.</p>
            <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                <label>Figure 3. </label>
                <caption>
                    <title>Simplified presentation of human evolution.</title>
                    <p>
                        <italic toggle="yes">p
                            <sub>a,</sub> p
                            <sub>e,</sub>
                        </italic> ... - heterogeneity of genomes within a communicating group; 
                        <italic toggle="yes">d
                            <sub>ae,</sub>
                        </italic> ... - distance between genomes of separated groups; 
                        <italic toggle="yes">b
                            <sub>e,</sub> b
                            <sub>a,</sub>
                        </italic> ... relative heterogeneity of a group in the events of separation between groups. 
                        <italic toggle="yes">k
                            <sub>a,</sub> k
                            <sub>e,</sub>
                        </italic> ... - rates of 'exponential' development.</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/54328/b5070e06-bbc3-4e52-963f-dd1f9b93aa18_figure3.gif"/>
            </fig>
            <p>
                <disp-formula id="e2">
                    <mml:math display="inline" id="math2">
                        <mml:mrow>
                            <mml:mtable columnalign="right">
                                <mml:mtr columnalign="right">
                                    <mml:mtd columnalign="right">
                                        <mml:mrow>
                                            <mml:msub>
                                                <mml:mi>p</mml:mi>
                                                <mml:mi>a</mml:mi>
                                            </mml:msub>
                                            <mml:mo>=</mml:mo>
                                            <mml:msub>
                                                <mml:mi>b</mml:mi>
                                                <mml:mi>a</mml:mi>
                                            </mml:msub>
                                            <mml:msub>
                                                <mml:mi>p</mml:mi>
                                                <mml:mrow>
                                                    <mml:mi>a</mml:mi>
                                                    <mml:mi>e</mml:mi>
                                                </mml:mrow>
                                            </mml:msub>
                                            <mml:mo>+</mml:mo>
                                            <mml:mi>t</mml:mi>
                                            <mml:msub>
                                                <mml:mi>k</mml:mi>
                                                <mml:mi>a</mml:mi>
                                            </mml:msub>
                                            <mml:mi>r</mml:mi>
                                        </mml:mrow>
                                    </mml:mtd>
                                </mml:mtr>
                                <mml:mtr columnalign="right">
                                    <mml:mtd columnalign="right">
                                        <mml:mrow>
                                            <mml:msub>
                                                <mml:mi>p</mml:mi>
                                                <mml:mi>e</mml:mi>
                                            </mml:msub>
                                            <mml:mo>=</mml:mo>
                                            <mml:msub>
                                                <mml:mi>b</mml:mi>
                                                <mml:mi>e</mml:mi>
                                            </mml:msub>
                                            <mml:msub>
                                                <mml:mi>p</mml:mi>
                                                <mml:mrow>
                                                    <mml:mi>a</mml:mi>
                                                    <mml:mi>e</mml:mi>
                                                </mml:mrow>
                                            </mml:msub>
                                            <mml:mo>+</mml:mo>
                                            <mml:mi>t</mml:mi>
                                            <mml:msub>
                                                <mml:mi>k</mml:mi>
                                                <mml:mi>e</mml:mi>
                                            </mml:msub>
                                            <mml:mi>r</mml:mi>
                                        </mml:mrow>
                                    </mml:mtd>
                                </mml:mtr>
                                <mml:mtr columnalign="right">
                                    <mml:mtd columnalign="right">
                                        <mml:mrow>
                                            <mml:msub>
                                                <mml:mi>d</mml:mi>
                                                <mml:mrow>
                                                    <mml:mi>a</mml:mi>
                                                    <mml:mi>e</mml:mi>
                                                </mml:mrow>
                                            </mml:msub>
                                            <mml:mo>=</mml:mo>
                                            <mml:msub>
                                                <mml:mi>p</mml:mi>
                                                <mml:mrow>
                                                    <mml:mi>a</mml:mi>
                                                    <mml:mi>e</mml:mi>
                                                </mml:mrow>
                                            </mml:msub>
                                            <mml:mo>+</mml:mo>
                                            <mml:mn>2</mml:mn>
                                            <mml:mi>t</mml:mi>
                                            <mml:mi>r</mml:mi>
                                        </mml:mrow>
                                    </mml:mtd>
                                </mml:mtr>
                                <mml:mtr columnalign="right">
                                    <mml:mtd columnalign="right">
                                        <mml:mrow>
                                            <mml:msub>
                                                <mml:mi>p</mml:mi>
                                                <mml:mi>A</mml:mi>
                                            </mml:msub>
                                            <mml:mo>=</mml:mo>
                                            <mml:msub>
                                                <mml:mi>b</mml:mi>
                                                <mml:mi>A</mml:mi>
                                            </mml:msub>
                                            <mml:msub>
                                                <mml:mi>p</mml:mi>
                                                <mml:mn>0</mml:mn>
                                            </mml:msub>
                                            <mml:mo>+</mml:mo>
                                            <mml:mo stretchy="false">(</mml:mo>
                                            <mml:mi>t</mml:mi>
                                            <mml:mo>+</mml:mo>
                                            <mml:mi>T</mml:mi>
                                            <mml:mo stretchy="false">)</mml:mo>
                                            <mml:msub>
                                                <mml:mi>k</mml:mi>
                                                <mml:mi>A</mml:mi>
                                            </mml:msub>
                                            <mml:mi>r</mml:mi>
                                        </mml:mrow>
                                    </mml:mtd>
                                </mml:mtr>
                                <mml:mtr columnalign="right">
                                    <mml:mtd columnalign="right">
                                        <mml:mrow>
                                            <mml:msub>
                                                <mml:mi>p</mml:mi>
                                                <mml:mrow>
                                                    <mml:mi>a</mml:mi>
                                                    <mml:mi>e</mml:mi>
                                                </mml:mrow>
                                            </mml:msub>
                                            <mml:mo>=</mml:mo>
                                            <mml:msub>
                                                <mml:mi>b</mml:mi>
                                                <mml:mrow>
                                                    <mml:mi>a</mml:mi>
                                                    <mml:mi>e</mml:mi>
                                                </mml:mrow>
                                            </mml:msub>
                                            <mml:msub>
                                                <mml:mi>p</mml:mi>
                                                <mml:mn>0</mml:mn>
                                            </mml:msub>
                                            <mml:mo>+</mml:mo>
                                            <mml:mi>T</mml:mi>
                                            <mml:msub>
                                                <mml:mi>k</mml:mi>
                                                <mml:mrow>
                                                    <mml:mi>a</mml:mi>
                                                    <mml:mi>e</mml:mi>
                                                </mml:mrow>
                                            </mml:msub>
                                            <mml:mi>r</mml:mi>
                                        </mml:mrow>
                                    </mml:mtd>
                                </mml:mtr>
                                <mml:mtr columnalign="right">
                                    <mml:mtd columnalign="right">
                                        <mml:mrow>
                                            <mml:msub>
                                                <mml:mi>d</mml:mi>
                                                <mml:mrow>
                                                    <mml:mi>A</mml:mi>
                                                    <mml:mi>e</mml:mi>
                                                </mml:mrow>
                                            </mml:msub>
                                            <mml:mo>=</mml:mo>
                                            <mml:msub>
                                                <mml:mi>p</mml:mi>
                                                <mml:mn>0</mml:mn>
                                            </mml:msub>
                                            <mml:mo>+</mml:mo>
                                            <mml:mn>2</mml:mn>
                                            <mml:mo stretchy="false">(</mml:mo>
                                            <mml:mi>T</mml:mi>
                                            <mml:mo>+</mml:mo>
                                            <mml:mi>t</mml:mi>
                                            <mml:mo stretchy="false">)</mml:mo>
                                            <mml:mi>r</mml:mi>
                                        </mml:mrow>
                                    </mml:mtd>
                                </mml:mtr>
                            </mml:mtable>
                        </mml:mrow>
                    </mml:math> </disp-formula>
            </p>
            <p>The mutation rate 10
                <sup>-9</sup> per nucleotide per year can be transformed as 3 per genome per year = 3000 per genome per thousand years, 3000 mutations per 700000 SNP, so that 
                <italic toggle="yes">r</italic> in the equations above should be about 0.004.</p>
            <p>Having a requirement that 
                <italic toggle="yes">p
                    <sub>0</sub>
                </italic> &#x2265; 0, the 
                <italic toggle="yes">r</italic> should be less than 0.0029. Rates of development are assumed to be unknown, what is only known is a dependency between a rate of development and a linear coefficient 
                <italic toggle="yes">m</italic>. Values of 
                <italic toggle="yes">b
                    <sub>a</sub>, b
                    <sub>e</sub>, b
                    <sub>A</sub>, p
                    <sub>ae</sub>, k
                    <sub>ae</sub>
                </italic> which are attributes of a passed history are also assumed to be unknown.</p>
            <p>For a marginal but the most confident assumption
                <italic toggle="yes">, p
                    <sub>0</sub> = 0, p
                    <sub>ae</sub> =</italic> 0.21
                <italic toggle="yes">, k
                    <sub>A</sub> =</italic> 1.65 and 
                <italic toggle="yes">k
                    <sub>ae</sub> =</italic> 1.81. Unevenness in a comparison between groups is a weighted average of evolutionary paths from a time of separation, so that if 
                <italic toggle="yes">kA ~ mA</italic>, 
                <italic toggle="yes">k
                    <sub>e</sub> ~ m
                    <sub>e</sub>
                </italic>, then 
                <italic toggle="yes">k
                    <sub>ae</sub> ~ (m
                    <sub>Ae</sub>T + m
                    <sub>A</sub>(T + t) + m
                    <sub>e</sub> t)/2(T + t)</italic>; 
                <italic toggle="yes">k
                    <sub>ae</sub>
                </italic> &#x2248; 0.0141. Following a log-linear approximation, 
                <italic toggle="yes">k
                    <sub>a</sub>
                </italic> and 
                <italic toggle="yes">k
                    <sub>e</sub>
                </italic>, for modern Asians and Europeans, should be about 1.74 and 1.76.</p>
            <p>The genetic diversity of Asians in a time of separation is than estimated as 0.15, much lower than a pool of genotypes just before the separation (
                <italic toggle="yes">p
                    <sub>ae</sub>
                </italic> = 0.21). For Europeans, the pool of genotypes was wider, about 0.20.</p>
            <p>What is known is that Eurasia is a continent with good communications, and that it was populated mostly by ancestors of present-day Asian race before the time of last separation, or 'crash', in better words. It is also known that ancestors of modern Europeans began to expand to Europe mostly after that 'crash'; the founders of the latest expansion originated from tribes which developed slowly for a long time somewhere in an area of central Asia mountains. The wide expansion to Eurasia for the ancient pre-Asian race was characterized, instead, by a substantive increase of genome unevenness. Some of it now is lost, some are kept in native Americans, and, for modern Asians, the instabilities in diverged genomes were neutralized by a long enough period of stable slow development after the crash.</p>
        </sec>
        <sec sec-type="conclusions">
            <title>Conclusions</title>
            <p>Selection of individuals was almost the same as in Schiffels and Durbin
                <sup>
                    <xref ref-type="bibr" rid="ref-5">5</xref>
                </sup>, the difference with that model is that non-Gaussian features in genomes are considered here explicitly. This has a substantial influence on a reconstructed history of the three human races. Dealing with self-affine phenomena is difficult and risky, but it by no way can be ignored in any of valuable challenges to a present-day science.</p>
        </sec>
        <sec>
            <title>Data availability</title>
            <sec>
                <title>Underlying data</title>
                <p>Zenodo: Measure of unevenness in human genomes: supplement software utilities and intermediate data files, 
                    <ext-link ext-link-type="uri" xlink:href="http://doi.org/10.5281/zenodo.4495444">http://doi.org/10.5281/zenodo.4495444</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-6">6</xref>
                    </sup>.</p>
                <p>This project contains the utilities to convert, pre-process and compare genotypes.</p>
                <p>Data are available under the terms of the 
                    <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/publicdomain/zero/1.0/">Creative Commons Zero "No rights reserved" data waiver</ext-link> (CC0 1.0 Public domain dedication).</p>
                <p>The International Genome Sample Resource: 1000 Genomes phase 3 release, 
                    <ext-link ext-link-type="uri" xlink:href="https://www.internationalgenome.org/data-portal/data-collection/phase-3">https://www.internationalgenome.org/data-portal/data-collection/phase-3</ext-link>
                </p>
            </sec>
        </sec>
    </body>
    <back>
        <ref-list>
            <ref id="ref-1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kuzmin</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Feranchuk</surname>
                            <given-names>SI</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sharov</surname>
                            <given-names>VV</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Stepwise large genome assembly approach: a case of Siberian larch (
                        <italic toggle="yes">Larix sibirica Ledeb</italic>).</article-title>
                    <source>

                        <italic toggle="yes">BMC Bioinformatics.</italic>
</source>
                    <year>2019</year>;<volume>20</volume>(<issue>Suppl 1</issue>):<fpage>37</fpage>.
                    <pub-id pub-id-type="pmid">30717661</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s12859-018-2570-y</pub-id>
                    <pub-id pub-id-type="pmcid">6362582</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-2">
                <label>2</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Peng</surname>
                            <given-names>CK</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Buldyrev</surname>
                            <given-names>SV</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Havlin</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Mosaic organization of DNA nucleotides.</article-title>
                    <source>

                        <italic toggle="yes">Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics.</italic>
</source>
                    <year>1994</year>;<volume>49</volume>(<issue>2</issue>):<fpage>1685</fpage>&#x2013;<lpage>9</lpage>.
                    <pub-id pub-id-type="pmid">9961383</pub-id>
                    <pub-id pub-id-type="doi">10.1103/physreve.49.1685</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-3">
                <label>3</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Dziarmaga</surname>
                            <given-names>J</given-names>
                        </name>
</person-group>:
                    <article-title>Dynamics of quantum phase transition: exact solution in quantum Ising model.</article-title>
                    <source>

                        <italic toggle="yes">arxiv.org.</italic>
</source>
                    <year>2005</year>;<volume>95</volume>(<issue>24</issue>).
                    <pub-id pub-id-type="doi">10.1103/PhysRevLett.95.245701</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-4">
                <label>4</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>1000 Genomes Project Consortium, Abecasis</surname>
                            <given-names>GR</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Auton</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>An integrated map of genetic variation from 1,092 human genomes.</article-title>
                    <source>

                        <italic toggle="yes">Nature.</italic>
</source>
                    <year>2012</year>;<volume>491</volume>(<issue>7422</issue>):<fpage>56</fpage>&#x2013;<lpage>65</lpage>.
                    <pub-id pub-id-type="pmid">23128226</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nature11632</pub-id>
                    <pub-id pub-id-type="pmcid">3498066</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Schiffels</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Durbin</surname>
                            <given-names>R</given-names>
                        </name>
</person-group>:
                    <article-title>Inferring human population size and separation history from multiple genome sequences.</article-title>
                    <source>

                        <italic toggle="yes">Nat Genet.</italic>
</source>
                    <year>2014</year>;<volume>46</volume>(<issue>8</issue>):<fpage>919</fpage>&#x2013;<lpage>25</lpage>.
                    <pub-id pub-id-type="pmid">24952747</pub-id>
                    <pub-id pub-id-type="doi">10.1038/ng.3015</pub-id>
                    <pub-id pub-id-type="pmcid">4116295</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-6">
                <label>6</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Sergey</surname>
                            <given-names>F</given-names>
                        </name>
</person-group>:
                    <article-title>"Measure of unevenness in human genomes":  supplement software utilities and intermediate data files (Version 1.5) [Data set].</article-title>
                    <source>

                        <italic toggle="yes">Zenodo.</italic>
</source>
                    <year>2020</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://www.doi.org/10.5281/zenodo.4495444">http://www.doi.org/10.5281/zenodo.4495444</ext-link>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report127304">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.54328.r127304</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Ji</surname>
                        <given-names>Ying</given-names>
                    </name>
                    <xref ref-type="aff" rid="r127304a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-5691-1303</uri>
                </contrib>
                <aff id="r127304a1">
                    <label>1</label>Department of Molecular Physiology &amp; Biophysics, Vanderbilt University, Nashville, TN, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>4</day>
                <month>4</month>
                <year>2022</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2022 Ji Y</copyright-statement>
                <copyright-year>2022</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport127304" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.51192.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>reject</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The author presented a model to approximate non-Gaussian perturbations in human genomes, and provided reconstructed history of Europe, Asia, and Africa based on the model results.</p>
            <p> </p>
            <p> Human evolution history is very interesting and important problem to work, and I appreciate the authors for the efforts to tackle this problem. However, I found it challenging to follow the study design and analysis from this article. 
                <list list-type="bullet">
                    <list-item>
                        <p>Introduction</p>
                    </list-item>
                </list> I found it challenging to figure out what scientific gap this article address. For example,&#x00a0;the specific shortcoming of "without explicit consider the non-Gaussian features" are not clearly stated, and the definition of "self-affinity", 'Ising model' are not clearly described.&#x00a0; 
                <list list-type="bullet">
                    <list-item>
                        <p>Results</p>
                    </list-item>
                </list> It's unclear to me how the models are designed, and what underlying assumptions or approximations were made. An example is that the model equations on page 3 were presented without stating what each parameter mean.&#x00a0;And I'd recommend adding more details on captions for tables and figures, it'll help the readers understand the data analysis process and interpretation of results.&#x00a0; 
                <list list-type="bullet">
                    <list-item>
                        <p>Conclusions</p>
                    </list-item>
                </list> The authors mentioned dealing with the problem can not be ignored, it'll be helpful if the authors could elaborate more on the importance of the discovery.</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Partly</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Partly</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Partly</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Partly</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>statistical genetics</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to state that I do not consider it to be of an acceptable scientific standard, for reasons outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment8061-127304">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Feranchuk</surname>
                            <given-names>Sergey</given-names>
                        </name>
                        <aff>Forest Research Institute, Belarus</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>6</day>
                    <month>4</month>
                    <year>2022</year>
                </pub-date>
            </front-stub>
            <body>
                <p>a foreword.</p>
                <p> </p>
                <p> I appreciate the efforts of a reviewer who is a specialist in a statistical genetics.</p>
                <p> In my reply, first, I would like to point out the meaning and a definition of a term "statistics".</p>
                <p> A lot of efforts, including the contribution of the reviewer, were put into a development of a statistics in a "classical" meaning.</p>
                <p> This meaning is, historically, started from the subjects like "probability theory", rules of gambling and Gaussian distribution.</p>
                <p> These methods are good to answer to some challenges and are inadequate in some other challenges.</p>
                <p> A man can not know the everything in the mathematics, so the incompetence in the areas beyond the "classical statistics" is not a "sin".</p>
                <p> Anyway, complex developing systems, and genetic codes in particular, are beyond the applicability of a classical statistics.</p>
                <p> I will not prove this statement here, as it can be treated as a provocation. But, having education in a classical statistics, I had a need to "migrate" towards the "fractal" statistics. It is more complex and challenging. But the methods there are suitable for the object under study.</p>
                <p> I became to feel myself a fool when I was trying to apply the statistical methods to a system which is not suitable to the basic assumption of the statistical theory.</p>
                <p> </p>
                <p> "I found it challenging to figure out what scientific gap this article address. For example, the specific shortcoming of "without explicit consider the non-Gaussian features" are not clearly stated, and the definition of "self-affinity", 'Ising model' are not clearly described."</p>
                <p> Speaking about a "fractal methods", there is a choice, either to explain everything, or to assume that the reader is know what I'm speaking about. In a format of a submitted publication, I choose the second. I'm unable to explain what I mean speaking "non-Gaussian" without reminding the whole context around the "fat-tailed distributions".</p>
                <p> </p>
                <p> "Ising model" is a term known for physicists, it is a most complex of physical problems which can be solved exactly.</p>
                <p> The model explain a "collective behavior", mathematicians are not too familiar with it. I intended to put an attention to power of this model in a case of fractal mathematics. That is, to bring a gap to physics, but not to explain physics.</p>
                <p> </p>
                <p> "It's unclear to me how the models are designed, and what underlying assumptions or approximations were made. An example is that the model equations on page 3 were presented without stating what each parameter mean. And I'd recommend adding more details on captions for tables and figures, it'll help the readers understand the data analysis process and interpretation of results."</p>
                <p> </p>
                <p> A man wrote a text by different reasons. And a scientific text can be motivated by different needs.</p>
                <p> The science is an universal way to prove falsehood or truth of the statement. Anyway, there are scholar texts, grant proposals, popular explanations, and the reporting of the results. In a present-day flow of scientific information, I choose to separate a style of the text from the "popular-style" explanation for "newbies" with precise definitions and detailed captions.</p>
                <p> </p>
                <p> "The authors mentioned dealing with the problem can not be ignored, it'll be helpful if the authors could elaborate more on the importance of the discovery."</p>
                <p> </p>
                <p> My bid is to the changed times. The results are anyway contradictory and difficult to disseminate. Either the importance of the discovery would be revealed anyway, or all the efforts to elaborate these results will be anyway in vain.</p>
            </body>
        </sub-article>
    </sub-article>
</article>
